; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0640 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0640
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionSOUL heme-binding protein
Genome locationMC06:5237371..5249266
RNA-Seq ExpressionMC06g0640
SyntenyMC06g0640
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]2.36e-24587.34Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPT     RP KSG LT   L PRLL+SRT   KP+ +NSKW VR +LVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPE IFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRT  YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEE+S
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia]3.09e-28799.74Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        HVRFRDPITKHDTISGY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_022930662.1 uncharacterized protein LOC111437064 isoform X1 [Cucurbita moschata]1.37e-24386.12Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAALQ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFDEQGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GYLFNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRT NYEVRKY PF+VVETSGDKL+GSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_023530707.1 uncharacterized protein LOC111793169 isoform X1 [Cucurbita pepo subsp. pepo]9.62e-24486.12Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAAL+ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFD+QGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GYLFNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]4.98e-24787.86Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA  QLSLQNF STPT GFG RP +SG LT   LPPRL K+RT  FKP ++NSKW VRLSLVDQSPPKS VDV RLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPIT HDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLR+YKTP LESPKY ILKRTANYEVRKY  F+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKV IQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD IGLRKVEG IAAVLKFSGKPTE++VQEKAKELRS LIKDGLKPS GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein3.12e-24386.3Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA LQLSLQNF STPT     RP KSG +T   LPPRLL SRT  FKP  +NSKW VR +LVDQ PPKS +DV RLVDFL+EDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPE +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRTA YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD +GLRKVEGGIAAVLKFSGKP E++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X11.14e-24587.34Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPT     RP KSG LT   L PRLL+SRT   KP+ +NSKW VR +LVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPE IFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRT  YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DI+SLPDPEQD IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEE+S
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X11.50e-28799.74Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        HVRFRDPITKHDTISGY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  HVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
Subjt:  DINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X16.62e-24486.12Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAALQ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFDEQGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GYLFNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRT NYEVRKY PF+VVETSGDKL+GSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

A0A6J1KHA6 uncharacterized protein LOC111495248 isoform X14.45e-24285.35Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAAL+ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLY+DL HLFDEQGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI+GYLFNISLLRELF+PEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNSAKEKI MTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS
        +KD+ SLPDPEQDTIGLRKVEGG AAVLKFSGKPTE++VQEKAK+LRS LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic2.4e-1932.66Show/hide
Query:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI
        F   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI

Query:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE
        N              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR  L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein5.6e-0831.65Show/hide
Query:  LESPKYEILKRTANYEVRKYTPFVVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDP-E
        +E P YE++     YE+R+Y   V V T      S    + +A F   A YI GKN   +KI MT PV +Q   S+ P       +       + PDP  
Subjt:  LESPKYEILKRTANYEVRKYTPFVVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDP-E

Query:  QDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCL
         + + ++K      AV +FSG  ++D + E+A  L S L
Subjt:  QDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCL

AT2G37970.1 SOUL heme-binding family protein8.7e-1734.45Show/hide
Query:  LESPKYEILKRTANYEVRKYTPFVVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ----------TFDSESPK---------
        +E+PKY + K    YE+R+Y P V  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+              ES K         
Subjt:  LESPKYEILKRTANYEVRKYTPFVVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ----------TFDSESPK---------

Query:  -----------VSIQIVLPS-DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSF--IM
                   V++Q +LPS  K     P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P   W+     
Subjt:  -----------VSIQIVLPS-DKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSF--IM

Query:  RNEVLIWLE
         NEV+I +E
Subjt:  RNEVLIWLE

AT3G10130.1 SOUL heme-binding family protein1.7e-2032.66Show/hide
Query:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI
        F   P+LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTANYEVRKYTPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI

Query:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE
        N              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR  L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  N--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLE

AT5G20140.1 SOUL heme-binding family protein6.9e-14775.31Show/hide
Query:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG
        S V+++ LV FLYEDL HLFD+QGID+TAYDE V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPE +FTG
Subjt:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PF+VVET GDKLSGS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD++SLP P ++ + L+K+EGG AA +KFSGKPTED+VQ K  ELRS L KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP

Query:  GRTWSFIMRNEVLIWLEEFS
        GRTW+FIMRNEV+IWLE+FS
Subjt:  GRTWSFIMRNEVLIWLEEFS

AT5G20140.2 SOUL heme-binding family protein4.8e-14075Show/hide
Query:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG
        S V+++ LV FLYEDL HLFD+QGID+TAYDE V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPE +FTG
Subjt:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y PF+VVET GDKLSGS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD++SLP P ++ + L+K+EGG AA +KFSGKPTED+VQ K  ELRS L KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAAATTCGAACACGTAGATTGTTAATGCGATCTGGTCGGTTCCTCACCACAAAAAAATTCTACACTACACAATCACTCCTCTCCCACTACTTCCAATGCCCACC
AAATCTTGAGGCTCGAATGGCCGCTCTTCAACTTTCCCTCCAAAATTTCCTCTCAACCCCAACAGCCGGTTTCGGTTTCCGGCCATGGAAGTCCGGCGGACTAACAGTAG
CCGGCCTCCCACCACGTCTACTCAAAAGCAGGACTGTAGATTTTAAACCCGACGCCCGAAATTCTAAGTGGGCTGTTCGATTAAGCTTGGTGGATCAAAGCCCCCCCAAA
TCGGCGGTTGATGTAGACCGATTGGTAGATTTCCTGTACGAAGATCTTCGTCATCTCTTTGATGAACAGGGGATTGATCGGACGGCGTACGACGAGCATGTGAGATTTCG
GGACCCCATTACCAAGCACGATACCATTAGCGGGTATTTGTTTAATATTTCCCTATTGCGAGAGCTCTTCAGGCCTGAGTTCTTCTTGCACTGGGTTAAACAGACAGGAC
CTTATGAGATTACCACAAGATGGACTATGGTAATGAAGTTTGTCCTTCTGCCATGGAAACCAGAATTTATTTTTACGGGAAATTCCATCATGGGTATTAATCCAGAGACG
GGCAAGTTCTGTAGCCATGTGGATCTCTGGGATTCAATACAAAATAATGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAACAGCTCCGGTTTTATAAGACACC
AGAATTGGAATCACCCAAATATGAGATATTGAAAAGGACTGCAAACTATGAGGTAAGGAAATATACACCATTTGTAGTGGTAGAAACAAGTGGAGACAAGCTCTCTGGGT
CTGCTGGATTCAATACGGTTGCTGGGTACATATTTGGGAAGAACTCTGCAAAGGAGAAGATACCCATGACCACCCCTGTGTTCACCCAGACATTTGACTCTGAATCACCC
AAAGTATCCATCCAAATAGTTCTTCCTTCAGACAAAGATATTAACAGTTTGCCAGATCCTGAACAAGACACAATAGGCTTGAGAAAGGTGGAAGGAGGTATTGCTGCAGT
GCTGAAGTTCAGTGGAAAACCTACTGAAGATATGGTGCAAGAGAAGGCAAAAGAATTGCGGTCTTGTCTTATAAAGGATGGCCTCAAACCCAGTAAGGGCTGTTTGCTTG
CTCGGTACAATGACCCTGGCCGGACGTGGAGCTTTATAATGAGAAATGAGGTGCTAATATGGCTTGAAGAATTCTCA
mRNA sequenceShow/hide mRNA sequence
GTTAATATGTGCAAGAATGAACCAGTAAAACTCTGATATGAACTAGTTAATGCATCTTACTTGTAAGTGGTTCAATATCTCATTCACACGTCTACTTTAATTATTGCTGA
ACTTCGCGTTATGAAAGCGTTAAAATTTAAACAACCAAATTTGTAATCAAGCATAATTTGATGCCAACAATTACCCATTTTATTTCATCGGAATGGTTCAAATTCGAACA
CGTAGATTGTTAATGCGATCTGGTCGGTTCCTCACCACAAAAAAATTCTACACTACACAATCACTCCTCTCCCACTACTTCCAATGCCCACCAAATCTTGAGGCTCGAAT
GGCCGCTCTTCAACTTTCCCTCCAAAATTTCCTCTCAACCCCAACAGCCGGTTTCGGTTTCCGGCCATGGAAGTCCGGCGGACTAACAGTAGCCGGCCTCCCACCACGTC
TACTCAAAAGCAGGACTGTAGATTTTAAACCCGACGCCCGAAATTCTAAGTGGGCTGTTCGATTAAGCTTGGTGGATCAAAGCCCCCCCAAATCGGCGGTTGATGTAGAC
CGATTGGTAGATTTCCTGTACGAAGATCTTCGTCATCTCTTTGATGAACAGGGGATTGATCGGACGGCGTACGACGAGCATGTGAGATTTCGGGACCCCATTACCAAGCA
CGATACCATTAGCGGGTATTTGTTTAATATTTCCCTATTGCGAGAGCTCTTCAGGCCTGAGTTCTTCTTGCACTGGGTTAAACAGACAGGACCTTATGAGATTACCACAA
GATGGACTATGGTAATGAAGTTTGTCCTTCTGCCATGGAAACCAGAATTTATTTTTACGGGAAATTCCATCATGGGTATTAATCCAGAGACGGGCAAGTTCTGTAGCCAT
GTGGATCTCTGGGATTCAATACAAAATAATGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAACAGCTCCGGTTTTATAAGACACCAGAATTGGAATCACCCAA
ATATGAGATATTGAAAAGGACTGCAAACTATGAGGTAAGGAAATATACACCATTTGTAGTGGTAGAAACAAGTGGAGACAAGCTCTCTGGGTCTGCTGGATTCAATACGG
TTGCTGGGTACATATTTGGGAAGAACTCTGCAAAGGAGAAGATACCCATGACCACCCCTGTGTTCACCCAGACATTTGACTCTGAATCACCCAAAGTATCCATCCAAATA
GTTCTTCCTTCAGACAAAGATATTAACAGTTTGCCAGATCCTGAACAAGACACAATAGGCTTGAGAAAGGTGGAAGGAGGTATTGCTGCAGTGCTGAAGTTCAGTGGAAA
ACCTACTGAAGATATGGTGCAAGAGAAGGCAAAAGAATTGCGGTCTTGTCTTATAAAGGATGGCCTCAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTACAATGACCCTG
GCCGGACGTGGAGCTTTATAATGAGAAATGAGGTGCTAATATGGCTTGAAGAATTCTCA
Protein sequenceShow/hide protein sequence
MVQIRTRRLLMRSGRFLTTKKFYTTQSLLSHYFQCPPNLEARMAALQLSLQNFLSTPTAGFGFRPWKSGGLTVAGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPK
SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEFIFTGNSIMGINPET
GKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYTPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESP
KVSIQIVLPSDKDINSLPDPEQDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFS