; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g07250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g07250
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionSOUL heme-binding protein
Genome locationchr6:5242145..5245857
RNA-Seq ExpressionMoc06g07250
SyntenyMoc06g07250
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]1.2e-19287.08Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPT     RP KSG LT   L PRLL+SRT   KP+ +NSKW VR +LVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPE IFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRT  YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEE+S
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia]1.7e-226100Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSF
        DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSF
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSF

XP_022930662.1 uncharacterized protein LOC111437064 isoform X1 [Cucurbita moschata]3.0e-19185.86Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAALQ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFDEQGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GY FNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRT NYEVRKY PF+VVETSGDKL+GSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_023530707.1 uncharacterized protein LOC111793169 isoform X1 [Cucurbita pepo subsp. pepo]2.3e-19185.86Show/hide
Query:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAAL+ SLQN   LSTP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFD+QGIDRTAY
Subjt:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GY FNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]6.4e-19487.6Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA  QLSLQNF STPT GFG RP +SG LT   LPPRL K+RT  FKP ++NSKW VRLSLVDQSPPKS VDV RLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPIT HDTISGY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLR+YKTP LESPKY ILKRTANYEVRKY  F+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKV IQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD IGLRKVEG IAAVLKFSGKPTE++VQEKAKELRS LIKDGLKPS GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein4.2e-19186.05Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA LQLSLQNF STPT     RP KSG +T   LPPRLL SRT  FKP  +NSKW VR +LVDQ PPKS +DV RLVDFL+EDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPE +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRTA YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD +GLRKVEGGIAAVLKFSGKP E++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X15.8e-19387.08Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPT     RP KSG LT   L PRLL+SRT   KP+ +NSKW VR +LVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGY FNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPE IFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRT  YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEE+S
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X18.0e-227100Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  HVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSF
        DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSF
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSF

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X11.4e-19185.86Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAALQ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFDEQGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GY FNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRT NYEVRKY PF+VVETSGDKL+GSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A6J1KHA6 uncharacterized protein LOC111495248 isoform X13.5e-19085.09Show/hide
Query:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAAL+ SLQN   LSTP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLY+DL HLFDEQGIDRTAY
Subjt:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GY FNISLLRELF+PEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNSAKEKI MTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAK+LRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic2.2e-1932.66Show/hide
Query:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI
        F   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI

Query:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE
        N              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR  L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein5.1e-0831.65Show/hide
Query:  LESPKYEILKRTANYEVRKYTPFVVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDP-E
        +E P YE++     YE+R+Y   V V T      S    + +A F   A YI GKN   +KI MT PV +Q   S+ P       +       + PDP  
Subjt:  LESPKYEILKRTANYEVRKYTPFVVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDP-E

Query:  QDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCL
         + + ++K      AV +FSG  ++D + E+A  L S L
Subjt:  QDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCL

AT2G37970.1 SOUL heme-binding family protein7.9e-1734.45Show/hide
Query:  LESPKYEILKRTANYEVRKYTPFVVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ----------TFDSESPK---------
        +E+PKY + K    YE+R+Y P V  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+              ES K         
Subjt:  LESPKYEILKRTANYEVRKYTPFVVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ----------TFDSESPK---------

Query:  -----------VSIQIVLPS-DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSF--IM
                   V++Q +LPS  K     P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P   W+     
Subjt:  -----------VSIQIVLPS-DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSF--IM

Query:  RNEVLIWLE
         NEV+I +E
Subjt:  RNEVLIWLE

AT3G10130.1 SOUL heme-binding family protein1.5e-2032.66Show/hide
Query:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI
        F   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI

Query:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE
        N              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR  L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE

AT5G20140.1 SOUL heme-binding family protein3.1e-14675Show/hide
Query:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG
        S V+++ LV FLYEDL HLFD+QGID+TAYDE V+FRDPITKHDTISGY FNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPE +FTG
Subjt:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PF+VVET GDKLSGS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD++SLP P ++ + L+K+EGG AA +KFSGKPTED+VQ K  ELRS L KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP

Query:  GRTWSFIMRNEVLIWLEEFS
        GRTW+FIMRNEV+IWLE+FS
Subjt:  GRTWSFIMRNEVLIWLEEFS

AT5G20140.2 SOUL heme-binding family protein2.8e-13974.68Show/hide
Query:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG
        S V+++ LV FLYEDL HLFD+QGID+TAYDE V+FRDPITKHDTISGY FNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPE +FTG
Subjt:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PF+VVET GDKLSGS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD++SLP P ++ + L+K+EGG AA +KFSGKPTED+VQ K  ELRS L KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAAATTTCCTCTCAACCCCAACAGCCGGTTTCGGTTTCCGGCCATGGAAGTCCGGCGGACTAACAGTAGCCGGCCTCCCA
CCACGTCTACTCAAAAGCAGGACTGTAGATTTTAAACCCGACGCCCGAAATTCTAAGTGGGCTGTTCGATTAAGCTTGGTGGATCAAAGCCCCCCCAAATCGGCG
GTTGATGTAGACCGATTGGTAGATTTCCTGTACGAAGATCTTCGTCATCTCTTTGATGAACAGGGGATTGATCGGACGGCGTACGACGAGCATGTGAGATTTCGG
GACCCCATTACCAAGCACGATACCATTAGCGGGTATTCGTTTAATATTTCCCTATTGCGAGAGCTCTTCAGGCCTGAGTTCTTCTTGCACTGGGTTAAACAGACA
GGACCTTATGAGATTACCACAAGATGGACTATGGTAATGAAGTTTGTCCTTCTGCCATGGAAACCAGAATTTATTTTTACGGGAAATTCCATCATGGGTATTAAT
CCAGAGACGGGCAAGTTCTGTAGCCATGTGGATCTCTGGGATTCAATACAAAATAATGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAACAGCTCCGG
TTTTATAAGACACCAGAATTGGAATCACCCAAATATGAGATATTGAAAAGGACTGCAAACTATGAGGTAAGGAAATATACACCATTTGTAGTGGTAGAAACAAGT
GGAGACAAGCTCTCTGGGTCTGCTGGATTCAATACGGTTGCTGGGTACATATTTGGGAAGAACTCTGCAAAGGAGAAGATACCCATGACCACCCCTGTGTTCACC
CAGACATTTGACTCTGAATCACCCAAAGTATCCATCCAAATAGTTCTTCCTTCAGACAAAGATATTAACAGTTTGCCAGATCCTGAACAAGACACAATAGGCTTG
AGAAAGGTGGAAGGAGGTATTGCTGCAGTGCTGAAGTTCAGTGGAAAACCTACTGAAGATATGGTGCAAGAGAAGGCAAAAGAATTGCGGTCTTGTCTTATAAAG
GATGGCCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTACAATGACCCTGGCCGGACGTGGAGCTTTATAATGAGAAATGAGGTGCTTATATGGCTTGAAGAA
TTCTCATTCATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAAATTTCCTCTCAACCCCAACAGCCGGTTTCGGTTTCCGGCCATGGAAGTCCGGCGGACTAACAGTAGCCGGCCTCCCA
CCACGTCTACTCAAAAGCAGGACTGTAGATTTTAAACCCGACGCCCGAAATTCTAAGTGGGCTGTTCGATTAAGCTTGGTGGATCAAAGCCCCCCCAAATCGGCG
GTTGATGTAGACCGATTGGTAGATTTCCTGTACGAAGATCTTCGTCATCTCTTTGATGAACAGGGGATTGATCGGACGGCGTACGACGAGCATGTGAGATTTCGG
GACCCCATTACCAAGCACGATACCATTAGCGGGTATTCGTTTAATATTTCCCTATTGCGAGAGCTCTTCAGGCCTGAGTTCTTCTTGCACTGGGTTAAACAGACA
GGACCTTATGAGATTACCACAAGATGGACTATGGTAATGAAGTTTGTCCTTCTGCCATGGAAACCAGAATTTATTTTTACGGGAAATTCCATCATGGGTATTAAT
CCAGAGACGGGCAAGTTCTGTAGCCATGTGGATCTCTGGGATTCAATACAAAATAATGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAACAGCTCCGG
TTTTATAAGACACCAGAATTGGAATCACCCAAATATGAGATATTGAAAAGGACTGCAAACTATGAGGTAAGGAAATATACACCATTTGTAGTGGTAGAAACAAGT
GGAGACAAGCTCTCTGGGTCTGCTGGATTCAATACGGTTGCTGGGTACATATTTGGGAAGAACTCTGCAAAGGAGAAGATACCCATGACCACCCCTGTGTTCACC
CAGACATTTGACTCTGAATCACCCAAAGTATCCATCCAAATAGTTCTTCCTTCAGACAAAGATATTAACAGTTTGCCAGATCCTGAACAAGACACAATAGGCTTG
AGAAAGGTGGAAGGAGGTATTGCTGCAGTGCTGAAGTTCAGTGGAAAACCTACTGAAGATATGGTGCAAGAGAAGGCAAAAGAATTGCGGTCTTGTCTTATAAAG
GATGGCCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTACAATGACCCTGGCCGGACGTGGAGCTTTATAATGAGAAATGAGGTGCTTATATGGCTTGAAGAA
TTCTCATTCATGTAG
Protein sequenceShow/hide protein sequence
MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFR
DPITKHDTISGYSFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLR
FYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDPEQDTIGL
RKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSFM