; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10007991 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10007991
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSOUL heme-binding protein
Genome locationChr10:18362469..18368309
RNA-Seq ExpressionHG10007991
SyntenyHG10007991
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN65404.1 hypothetical protein Csa_019846 [Cucumis sativus]2.4e-18291.45Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTL   LRPPKSGR+T L PRLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFA+LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

TYK11984.1 SOUL heme-binding family protein isoform 1 [Cucumis melo var. makuwa]4.1e-18289.32Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT L PRLL+SRT AFKP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFS EGLWDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP

Query:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        KVSIQIVLPS KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
Subjt:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]2.3e-18592.88Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT+L PRLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

XP_011648491.1 uncharacterized protein LOC101206063 [Cucumis sativus]2.4e-18291.45Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTL   LRPPKSGR+T L PRLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFA+LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]2.8e-18693.16Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MA  QLSLQNFPSTPTLGFGLRPP+SGRLT L PRL K+RT AFKP +QNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPIT +DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF +LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLR+YKTPALESPKYLILKRTANYEVRKYA FIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE+PKV IQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        DSLPDPEQDIIGLRKVEG IAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein1.2e-18291.45Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTL   LRPPKSGR+T L PRLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFA+LPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKELRSSLIKDGL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X11.1e-18592.88Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT+L PRLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLWDSIQNNDYFS EGL
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGL

Query:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI
        WDVFKQLRFYKTP LESPKYLILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE PKVSIQIVLPS KDI
Subjt:  WDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDI

Query:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
Subjt:  DSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

A0A5A7TMX2 SOUL heme-binding family protein isoform 12.9e-18188.77Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT+L PRLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFS EGLWDVFKQLRFYKTP LESPKYLILKRT  YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP

Query:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        KVSIQIVLPS KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
Subjt:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

A0A5D3CLD5 SOUL heme-binding family protein isoform 12.0e-18289.32Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL   LRPPKSGRLT L PRLL+SRT AFKP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLYEDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW
        RFRDPITK+DTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFA+LPWKPEL+FTGTSIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFS EGLWDVFKQLRFYKTP LESPKYLILKRTA YEVRKYAPFIVVETSGDKLAGS GFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP

Query:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        KVSIQIVLPS KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
Subjt:  KVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X19.0e-17588.1Show/hide
Query:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLT--DLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDE
        MAALQLSLQNF STPT GFG RP KSG LT   L PRLLKSRT+ FKP  +NSKW VRLSLVDQSPPKS VDV RLVDFLYEDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFPSTPTLGFGLRPPKSGRLT--DLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDE

Query:  QVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTE
         VRFRDPITK+DTISGY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF +LPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS E
Subjt:  QVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTE

Query:  GLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAK
        GL DVFKQLRFYKTP LESPKY ILKRTANYEVRKY PF+VVETSGDKL+GS GFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE PKVSIQIVLPS K
Subjt:  GLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAK

Query:  DIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
        DI+SLPDPEQD IGLRKVEGGIAAVLKFSGKPTE++VQEKAKELRS LIKDGL
Subjt:  DIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic4.8e-1633.74Show/hide
Query:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI
        F   P LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++   AKD 
Subjt:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI

Query:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD
        +              +LP P+   + +++V   I AV+ FSG  T+E ++ + +ELR +L  D
Subjt:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein1.2e-0629.71Show/hide
Query:  LESPKYLILKRTANYEVRKYAPFIVVETS-----GDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDIDSLPDP-EQ
        +E P Y ++     YE+R+Y   + V T          A  T F  +  YI GKN   +KI MT PV +Q   S+ P       +       + PDP   
Subjt:  LESPKYLILKRTANYEVRKYAPFIVVETS-----GDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDIDSLPDP-EQ

Query:  DIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL
        + + ++K      AV +FSG  +++ + E+A  L SSL
Subjt:  DIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein1.5e-1233.52Show/hide
Query:  ALESPKYLILKRTANYEVRKYAPFIVVETSGD----KLAGSTGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ------------TFDSELPK------
        A+E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE  +      
Subjt:  ALESPKYLILKRTANYEVRKYAPFIVVETSGD----KLAGSTGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ------------TFDSELPK------

Query:  ------------VSIQIVLPSA-KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDG
                    V++Q +LPS  K  +  P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG
Subjt:  ------------VSIQIVLPSA-KDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDG

AT3G10130.1 SOUL heme-binding family protein3.4e-1733.74Show/hide
Query:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI
        F   P LE+  + +L RT  YE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++   AKD 
Subjt:  FYKTPALESPKYLILKRTANYEVRKYAPFIVV------ETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSAKDI

Query:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD
        +              +LP P+   + +++V   I AV+ FSG  T+E ++ + +ELR +L  D
Subjt:  D--------------SLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKD

AT5G20140.1 SOUL heme-binding family protein2.6e-12675.52Show/hide
Query:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG
        STV++  LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITK+DTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG

Query:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST
         SIM +NPET KFCSH+DLWDSI+NNDYFS EGL DVFKQLR YKTP LE+PKY ILKRTANYEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL

AT5G20140.2 SOUL heme-binding family protein2.6e-12675.52Show/hide
Query:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG
        STV++  LV FLYEDL HLFD+QGID+TAYDE+V+FRDPITK+DTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTG

Query:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST
         SIM +NPET KFCSH+DLWDSI+NNDYFS EGL DVFKQLR YKTP LE+PKY ILKRTANYEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  TSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKYLILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++VQ K  ELRSSL KDGL
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKPTEEIVQEKAKELRSSLIKDGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAAACTTCCCCTCAACCCCAACACTCGGTTTCGGTCTCCGGCCACCGAAATCCGGCAGACTAACCGACCTCGCGCCCCGTCTACT
TAAAAGCAGAACTCTAGCTTTCAAACCCCAAACCCAAAATTCTAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGAT
TGGTGGATTTCTTGTATGAAGATCTTTGCCATCTCTTCGATGAACAGGGGATTGATCGAACGGCGTACGACGAACAAGTGCGATTTCGGGACCCCATTACCAAGTACGAT
ACGATTAGTGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTTTTCAGGCCTGAATTCTTCTTGCATTGGGTTAAACAGACAGGACCATATGAAATAACTACGAGATG
GACTATGGTAATGAAGTTTGCCGTTCTACCATGGAAACCAGAATTAGTTTTCACGGGAACTTCCATCATGGGTATCAATCCAGAGACCGGCAAGTTCTGTAGTCATGTGG
ATCTCTGGGATTCAATACAAAACAACGACTACTTTTCTACAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGGTTCTATAAGACTCCAGCATTGGAATCACCCAAGTAT
CTGATTCTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTGGAAACAAGTGGAGACAAGCTCGCTGGGTCTACAGGATTCAATACAGTTGC
TGGGTATATATTTGGGAAGAATTCTACAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATTACCTAAAGTCTCCATTCAAATAGTTC
TTCCTTCAGCGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCGGTGTTGAAATTCAGTGGGAAACCT
ACAGAAGAGATTGTGCAAGAGAAGGCAAAAGAACTGCGGTCTAGTCTCATAAAGGATGGATTGTGGTGGTCACCTAATCAAGTTGTGATGAGGGTTGAAGGTGAAGCAGC
TAGATCATTACGCAATAAGTACTTGAGGAATGATGCAGCAGAGGTTATTCACATTGGTACCTTAGAATTACTATACAAGGAGATGGAGTTCAAAAATGACATGGATGCCG
TAAAAAGGACATTGGTCTACTACATTGAGTTGGCGATGATAGGGAAAGAGAAGACGAGGGGCAATATTGACAAGACTTTATTGAATGGTGTGGAGGATTTGGAATACTTC
AATAAACTCGATTGTGGACATGTGTTATGGGAGAAGACATTACAAGAACTACAGAAAGCATTTAACGTTAAGGCTAAAACGTACAAGAAGAGGTCAAAGAACAATAAGAA
CTACATTGTCAAGTATAATTTACCGGGGTTTCCTCATGCATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAAACTTCCCCTCAACCCCAACACTCGGTTTCGGTCTCCGGCCACCGAAATCCGGCAGACTAACCGACCTCGCGCCCCGTCTACT
TAAAAGCAGAACTCTAGCTTTCAAACCCCAAACCCAAAATTCTAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGAT
TGGTGGATTTCTTGTATGAAGATCTTTGCCATCTCTTCGATGAACAGGGGATTGATCGAACGGCGTACGACGAACAAGTGCGATTTCGGGACCCCATTACCAAGTACGAT
ACGATTAGTGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTTTTCAGGCCTGAATTCTTCTTGCATTGGGTTAAACAGACAGGACCATATGAAATAACTACGAGATG
GACTATGGTAATGAAGTTTGCCGTTCTACCATGGAAACCAGAATTAGTTTTCACGGGAACTTCCATCATGGGTATCAATCCAGAGACCGGCAAGTTCTGTAGTCATGTGG
ATCTCTGGGATTCAATACAAAACAACGACTACTTTTCTACAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGGTTCTATAAGACTCCAGCATTGGAATCACCCAAGTAT
CTGATTCTGAAAAGGACTGCAAATTATGAGGTGAGGAAATATGCACCATTTATAGTGGTGGAAACAAGTGGAGACAAGCTCGCTGGGTCTACAGGATTCAATACAGTTGC
TGGGTATATATTTGGGAAGAATTCTACAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATTACCTAAAGTCTCCATTCAAATAGTTC
TTCCTTCAGCGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCGGTGTTGAAATTCAGTGGGAAACCT
ACAGAAGAGATTGTGCAAGAGAAGGCAAAAGAACTGCGGTCTAGTCTCATAAAGGATGGATTGTGGTGGTCACCTAATCAAGTTGTGATGAGGGTTGAAGGTGAAGCAGC
TAGATCATTACGCAATAAGTACTTGAGGAATGATGCAGCAGAGGTTATTCACATTGGTACCTTAGAATTACTATACAAGGAGATGGAGTTCAAAAATGACATGGATGCCG
TAAAAAGGACATTGGTCTACTACATTGAGTTGGCGATGATAGGGAAAGAGAAGACGAGGGGCAATATTGACAAGACTTTATTGAATGGTGTGGAGGATTTGGAATACTTC
AATAAACTCGATTGTGGACATGTGTTATGGGAGAAGACATTACAAGAACTACAGAAAGCATTTAACGTTAAGGCTAAAACGTACAAGAAGAGGTCAAAGAACAATAAGAA
CTACATTGTCAAGTATAATTTACCGGGGTTTCCTCATGCATTCTAG
Protein sequenceShow/hide protein sequence
MAALQLSLQNFPSTPTLGFGLRPPKSGRLTDLAPRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYEDLCHLFDEQGIDRTAYDEQVRFRDPITKYD
TISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFAVLPWKPELVFTGTSIMGINPETGKFCSHVDLWDSIQNNDYFSTEGLWDVFKQLRFYKTPALESPKY
LILKRTANYEVRKYAPFIVVETSGDKLAGSTGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPKVSIQIVLPSAKDIDSLPDPEQDIIGLRKVEGGIAAVLKFSGKP
TEEIVQEKAKELRSSLIKDGLWWSPNQVVMRVEGEAARSLRNKYLRNDAAEVIHIGTLELLYKEMEFKNDMDAVKRTLVYYIELAMIGKEKTRGNIDKTLLNGVEDLEYF
NKLDCGHVLWEKTLQELQKAFNVKAKTYKKRSKNNKNYIVKYNLPGFPHAF