; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS003783 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS003783
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSOUL heme-binding protein
Genome locationscaffold127:415962..418050
RNA-Seq ExpressionMS003783
SyntenyMS003783
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]3.6e-18686.93Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPT     RP KSG L  T L PRLL+SRT   KP+ +NSKW VR +LVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTI GYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPELIFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRT  YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPE+D IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

XP_022144956.1 uncharacterized protein LOC111014503 isoform X1 [Momordica charantia]2.2e-21598.4Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPTAGFGFRPWKSGGLTV GLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        HVRFRDPITKHDTI GY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPE IFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DINSLPDPE+DTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
Subjt:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

XP_022930662.1 uncharacterized protein LOC111437064 isoform X1 [Cucurbita moschata]2.0e-18485.68Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAALQ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFDEQGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI GYLFNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+L+FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRT NYEVRKY PF+VVETSGDKL+GSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        +KD+ SLPDPE+DTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

XP_023530707.1 uncharacterized protein LOC111793169 isoform X1 [Cucurbita pepo subsp. pepo]1.5e-18485.68Show/hide
Query:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAAL+ SLQN   LSTP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFD+QGIDRTAY
Subjt:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI GYLFNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+L+FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        +KD+ SLPDPE+DTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]7.3e-18787.2Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA  QLSLQNF STPT GFG RP +SG L  T LPPRL K+RT  FKP ++NSKW VRLSLVDQSPPKS VDV RLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPIT HDTI GYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLR+YKTP LESPKY ILKRTANYEVRKY  F+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKV IQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPE+D IGLRKVEG IAAVLKFSGKPTE++VQEKAKELRS LIKDGLKPS GCLLARYNDPGRTW+FIM
Subjt:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein2.8e-18485.6Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MA LQLSLQNF STPT     RP KSG   +T LPPRLL SRT  FKP  +NSKW VR +LVDQ PPKS +DV RLVDFL+EDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTI GYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPEL+FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRTA YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPE+D +GLRKVEGGIAAVLKFSGKP E++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X11.7e-18686.93Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPT     RP KSG L  T L PRLL+SRT   KP+ +NSKW VR +LVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
         VRFRDPITKHDTI GYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPELIFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GL DVFKQLRFYKTPELESPKY ILKRT  YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSESPKVSIQIVLPS+K
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DI+SLPDPE+D IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X11.1e-21598.4Show/hide
Query:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
        MAALQLSLQNFLSTPTAGFGFRPWKSGGLTV GLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDE

Query:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
        HVRFRDPITKHDTI GY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPE IFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE
Subjt:  HVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
        GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDK

Query:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        DINSLPDPE+DTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
Subjt:  DINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X19.6e-18585.68Show/hide
Query:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAALQ SLQN L  STP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLYEDL HLFDEQGIDRTAY
Subjt:  MAALQLSLQNFL--STPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI GYLFNISLLRELFRPEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+L+FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRT NYEVRKY PF+VVETSGDKL+GSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        +KD+ SLPDPE+DTIGLRKVEGG AAVLKFSGKPTE++VQEKAKELRS LIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

A0A6J1KHA6 uncharacterized protein LOC111495248 isoform X12.4e-18384.88Show/hide
Query:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY
        MAAL+ SLQN   LSTP+ GFGFRP  SG L        + +SRTV  KP  RNSKW VRLSLVDQ+PPKS VDVD+LVDFLY+DL HLFDEQGIDRTAY
Subjt:  MAALQLSLQN--FLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAY

Query:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
        D+ VRFRDPITKHDTI GYLFNISLLRELF+PEF LHWVK+TG YEITTRWTMVMKFVLLPWKP+L+FTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS
Subjt:  DEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFS

Query:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNSAKEKI MTTPVFTQTFDSESPKVSIQIVLPS
Subjt:  LEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPS

Query:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM
        +KD+ SLPDPE+DTIGLRKVEGG AAVLKFSGKPTE++VQEKAK+LRS LIKDGLKP  GCLLARYNDPGRTW+FIM
Subjt:  DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIM

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic1.5e-1731.87Show/hide
Query:  FYKTPELESPKYEILKRTANYEVRKYKPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI
        F   P+LE+  + +L RT  YE+R+ +P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTANYEVRKYKPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI

Query:  N--------------SLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDP
        N              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR  L  D    ++      +A+YN P
Subjt:  N--------------SLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDP

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein6.5e-0831.65Show/hide
Query:  LESPKYEILKRTANYEVRKYKPFVVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDP-E
        +E P YE++     YE+R+Y   V V T      S    + +A F   A YI GKN   +KI MT PV +Q   S+ P       +       + PDP  
Subjt:  LESPKYEILKRTANYEVRKYKPFVVVET------SGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDP-E

Query:  RDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCL
         + + ++K      AV +FSG  ++D + E+A  L S L
Subjt:  RDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCL

AT2G37970.1 SOUL heme-binding family protein6.5e-1634.74Show/hide
Query:  LESPKYEILKRTANYEVRKYKPFVVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ----------TFDSESPK---------
        +E+PKY + K    YE+R+Y P V  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+              ES K         
Subjt:  LESPKYEILKRTANYEVRKYKPFVVVETSGD----KLSGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ----------TFDSESPK---------

Query:  -----------VSIQIVLPS-DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP
                   V++Q +LPS  K     P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P
Subjt:  -----------VSIQIVLPS-DKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP

AT3G10130.1 SOUL heme-binding family protein1.1e-1831.87Show/hide
Query:  FYKTPELESPKYEILKRTANYEVRKYKPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI
        F   P+LE+  + +L RT  YE+R+ +P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTANYEVRKYKPFVVV------ETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ESPKVSIQIVLPSDKDI

Query:  N--------------SLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDP
        N              +LP P+  ++ +++V   I AV+ FSG  T++ ++ + +ELR  L  D    ++      +A+YN P
Subjt:  N--------------SLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKD---GLKPSKGCLLARYNDP

AT5G20140.1 SOUL heme-binding family protein1.9e-14075Show/hide
Query:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTG
        S V+++ LV FLYEDL HLFD+QGID+TAYDE V+FRDPITKHDTI GYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPEL+FTG
Subjt:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y+PF+VVET GDKLSGS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD++SLP P  + + L+K+EGG AA +KFSGKPTED+VQ K  ELRS L KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM

AT5G20140.2 SOUL heme-binding family protein1.9e-14075Show/hide
Query:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTG
        S V+++ LV FLYEDL HLFD+QGID+TAYDE V+FRDPITKHDTI GYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF+ LPWKPEL+FTG
Subjt:  SAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITKHDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+NNDYFSLEGL+DVFKQLR YKTP+LE+PKY+ILKRTANYEVR Y+PF+VVET GDKLSGS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESPKYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP
         EKIPMTTPVFTQT D++ S  VS+QIV+PS KD++SLP P  + + L+K+EGG AA +KFSGKPTED+VQ K  ELRS L KDGL+  KGC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSE-SPKVSIQIVLPSDKDINSLPDPERDTIGLRKVEGGIAAVLKFSGKPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCTTCAACTTTCCCTGCAAAATTTCCTCTCAACCCCAACAGCCGGTTTCGGTTTCCGGCCATGGAAATCCGGCGGACTAACAGTAACCGGCCTCCCACCACG
TCTACTCAAAAGCAGGACTGTAGATTTTAAACCCGACGCCCGAAATTCTAAGTGGGCTGTTCGATTAAGCTTGGTGGATCAAAGCCCCCCCAAATCGGCGGTTGATGTAG
ACCGATTGGTGGATTTCCTGTACGAAGATCTTCGTCATCTCTTTGATGAACAGGGGATTGATCGGACGGCGTACGACGAGCATGTGAGATTTCGGGACCCCATTACCAAG
CACGATACCATTGGCGGGTATTTGTTTAATATTTCCCTATTGCGAGAGCTCTTCAGGCCTGAGTTCTTCTTGCACTGGGTTAAACAGACAGGACCTTATGAGATTACCAC
AAGATGGACTATGGTAATGAAGTTTGTCCTTCTGCCATGGAAACCAGAATTGATTTTTACGGGAAATTCCATCATGGGTATTAATCCAGAGACGGGCAAGTTCTGTAGCC
ATGTGGATCTCTGGGATTCAATACAAAATAATGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAACAGCTCCGGTTTTATAAGACACCAGAATTGGAATCACCC
AAATATGAGATATTGAAAAGGACTGCAAACTATGAGGTAAGGAAATATAAACCATTTGTAGTGGTAGAAACAAGTGGAGACAAGCTCTCTGGGTCTGCTGGATTCAATAC
GGTTGCTGGGTACATATTTGGGAAGAACTCTGCAAAGGAGAAGATACCCATGACCACCCCTGTGTTCACCCAGACATTTGACTCTGAATCACCCAAAGTATCCATCCAAA
TAGTTCTTCCTTCAGACAAAGATATTAACAGTTTGCCAGATCCTGAACGAGACACAATAGGCTTGAGAAAGGTGGAAGGAGGTATTGCTGCAGTGCTGAAGTTCAGTGGA
AAACCTACTGAAGATATGGTGCAAGAGAAGGCAAAAGAATTGCGGTCTTGTCTTATAAAGGATGGCCTTAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTACAATGACCC
TGGCCGGACGTGGAGCTTTATAATGGTA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCTTCAACTTTCCCTGCAAAATTTCCTCTCAACCCCAACAGCCGGTTTCGGTTTCCGGCCATGGAAATCCGGCGGACTAACAGTAACCGGCCTCCCACCACG
TCTACTCAAAAGCAGGACTGTAGATTTTAAACCCGACGCCCGAAATTCTAAGTGGGCTGTTCGATTAAGCTTGGTGGATCAAAGCCCCCCCAAATCGGCGGTTGATGTAG
ACCGATTGGTGGATTTCCTGTACGAAGATCTTCGTCATCTCTTTGATGAACAGGGGATTGATCGGACGGCGTACGACGAGCATGTGAGATTTCGGGACCCCATTACCAAG
CACGATACCATTGGCGGGTATTTGTTTAATATTTCCCTATTGCGAGAGCTCTTCAGGCCTGAGTTCTTCTTGCACTGGGTTAAACAGACAGGACCTTATGAGATTACCAC
AAGATGGACTATGGTAATGAAGTTTGTCCTTCTGCCATGGAAACCAGAATTGATTTTTACGGGAAATTCCATCATGGGTATTAATCCAGAGACGGGCAAGTTCTGTAGCC
ATGTGGATCTCTGGGATTCAATACAAAATAATGACTACTTTTCTCTAGAAGGCCTGTTGGATGTATTTAAACAGCTCCGGTTTTATAAGACACCAGAATTGGAATCACCC
AAATATGAGATATTGAAAAGGACTGCAAACTATGAGGTAAGGAAATATAAACCATTTGTAGTGGTAGAAACAAGTGGAGACAAGCTCTCTGGGTCTGCTGGATTCAATAC
GGTTGCTGGGTACATATTTGGGAAGAACTCTGCAAAGGAGAAGATACCCATGACCACCCCTGTGTTCACCCAGACATTTGACTCTGAATCACCCAAAGTATCCATCCAAA
TAGTTCTTCCTTCAGACAAAGATATTAACAGTTTGCCAGATCCTGAACGAGACACAATAGGCTTGAGAAAGGTGGAAGGAGGTATTGCTGCAGTGCTGAAGTTCAGTGGA
AAACCTACTGAAGATATGGTGCAAGAGAAGGCAAAAGAATTGCGGTCTTGTCTTATAAAGGATGGCCTTAAACCCAGTAAGGGCTGTTTGCTTGCTCGGTACAATGACCC
TGGCCGGACGTGGAGCTTTATAATGGTA
Protein sequenceShow/hide protein sequence
MAALQLSLQNFLSTPTAGFGFRPWKSGGLTVTGLPPRLLKSRTVDFKPDARNSKWAVRLSLVDQSPPKSAVDVDRLVDFLYEDLRHLFDEQGIDRTAYDEHVRFRDPITK
HDTIGGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELIFTGNSIMGINPETGKFCSHVDLWDSIQNNDYFSLEGLLDVFKQLRFYKTPELESP
KYEILKRTANYEVRKYKPFVVVETSGDKLSGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSESPKVSIQIVLPSDKDINSLPDPERDTIGLRKVEGGIAAVLKFSG
KPTEDMVQEKAKELRSCLIKDGLKPSKGCLLARYNDPGRTWSFIMV