; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy2G042180 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy2G042180
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionSOUL heme-binding family protein
Genome locationchrH02:23035049..23038180
RNA-Seq ExpressionChy2G042180
SyntenyChy2G042180
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN65404.1 hypothetical protein Csa_019846 [Cucumis sativus]4.14e-28197.67Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLL SRTP+FKPHTKNSKWVVR NLVDQ PPKST+DVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSH+DLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSES KVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI

Query:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

TYK11984.1 SOUL heme-binding family protein isoform 1 [Cucumis melo var. makuwa]1.28e-26790.77Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL+S+LRPPKSGR+THL PRLLQSRTP+FKP+T+NSKWVVRFNLVDQSPPKSTVDVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLW
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSH+DLW
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESA
        DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SES 
Subjt:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESA

Query:  KVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL
        KVSIQIVLPSEKDIDSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEE+SL
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL

Query:  E
        E
Subjt:  E

XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]9.59e-27193.8Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL+S+LRPPKSGR+T+L PRLLQSRTP+ KP+T+NSKWVVRFNLVDQSPPKSTVDVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSH+DLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SES KVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI

Query:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

XP_011648491.1 uncharacterized protein LOC101206063 [Cucumis sativus]2.47e-27997.67Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLL SRTP+FKPHTKNSKWVVR NLVDQ PPKST+DVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSH+DLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSES KVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI

Query:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]1.90e-26190.96Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MA  QLSLQNFPSTPTL   LRPP+SGR+THLPPRL ++RTP+FKPH++NSKWVVR +LVDQSPPKSTVDVGRLVDFL+EDL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL
        RFRDPIT HDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPETGKFCSH+DLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI
        WDVFKQLR+YKTP LESPKYLILKRTA YEVRKYA FIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE  KV IQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI

Query:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDI+GLRKVEG IAAVLKFSGKP EEIVQEKAKEL SSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein1.6e-21997.67Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MA LQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLL SRTP+FKPHTKNSKWVVR NLVDQ PPKST+DVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSH+DLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSES KVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI

Query:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X11.2e-21193.8Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL+S+LRPPKSGR+T+L PRLLQSRTP+ KP+T+NSKWVVRFNLVDQSPPKSTVDVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQTGPYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSH+DLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI
        WDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SES KVSIQIVLPSEKDI
Subjt:  WDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDI

Query:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

A0A5A7TMX2 SOUL heme-binding family protein isoform 13.2e-19989.92Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL+S+LRPPKSGR+T+L PRLLQSRTP+ KP+T+NSKWVVRFNLVDQSPPKSTVDVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLW
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSH+DLW
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESA
        DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRT KYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SES 
Subjt:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESA

Query:  KVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIM
        KVSIQIVLPSEKDIDSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIM
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIM

A0A5D3CLD5 SOUL heme-binding family protein isoform 12.0e-20990.77Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV
        MAALQLSLQNF STPTL+S+LRPPKSGR+THL PRLLQSRTP+FKP+T+NSKWVVRFNLVDQSPPKSTVDVGRLVDFL+EDLSHLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLW
        RFRDPITKHDTISGYLFNISLLRE+FRPEFFLHWVKQ                PYEITTRWTM+MKFALLPWKPEL+FTG SIMGINPETGKFCSH+DLW
Subjt:  RFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQT--------------GPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESA
        DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SES 
Subjt:  DSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESA

Query:  KVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL
        KVSIQIVLPSEKDIDSLPDPEQDI+GLRKVEGGIAAVLKFSGKP EEIVQEKAKEL SSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEE+SL
Subjt:  KVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL

Query:  E
        E
Subjt:  E

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X11.2e-19086.05Show/hide
Query:  MAALQLSLQNFPSTPTLSSLLRPPKSGRIT--HLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDE
        MAALQLSLQNF STPT     RP KSG +T   LPPRLL+SRT  FKP  +NSKW VR +LVDQSPPKS VDV RLVDFL+EDL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQNFPSTPTLSSLLRPPKSGRIT--HLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVE
         VRFRDPITKHDTISGY FNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPE +FTGNSIMGINPETGKFCSH+DLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVE

Query:  GLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEK
        GL DVFKQLRFYKTPELESPKY ILKRTA YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQ F+SES KVSIQIVLPS+K
Subjt:  GLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEK

Query:  DIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFS
        DI+SLPDPEQD +GLRKVEGGIAAVLKFSGKP E++VQEKAKEL S LIKDGLKP  GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFS

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic2.8e-1935Show/hide
Query:  FYKTPELESPKYLILKRTAKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVS-------------
        F   P+LE+  + +L RT KYE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+K  S   K+              
Subjt:  FYKTPELESPKYLILKRTAKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVS-------------

Query:  ----IQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE
            +  V+PS K   +LP P+   V +++V   I AV+ FSG   +E ++ + +EL  +L  D   + R+G    +A+YN P  T  F+ RNEV + +E
Subjt:  ----IQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE

Arabidopsis top hitse value%identityAlignment
AT2G37970.1 SOUL heme-binding family protein1.3e-1634.45Show/hide
Query:  LESPKYLILKRTAKYEVRKYAPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQK----------FNSESAK---------
        +E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T++             ES K         
Subjt:  LESPKYLILKRTAKYEVRKYAPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQK----------FNSESAK---------

Query:  -----------VSIQIVLPS-EKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNF--IM
                   V++Q +LPS  K  +  P P  + V +++  G    V+KFSG   E +V EK K+L S L KDG K     +LARYN P   W      
Subjt:  -----------VSIQIVLPS-EKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNF--IM

Query:  RNEVLIWLE
         NEV+I +E
Subjt:  RNEVLIWLE

AT3G10130.1 SOUL heme-binding family protein2.0e-2035Show/hide
Query:  FYKTPELESPKYLILKRTAKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVS-------------
        F   P+LE+  + +L RT KYE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+K  S   K+              
Subjt:  FYKTPELESPKYLILKRTAKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVS-------------

Query:  ----IQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE
            +  V+PS K   +LP P+   V +++V   I AV+ FSG   +E ++ + +EL  +L  D   + R+G    +A+YN P  T  F+ RNEV + +E
Subjt:  ----IQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKD-GLKPRNGCL--LARYNDPGRTWNFIMRNEVLIWLE

AT5G20140.1 SOUL heme-binding family protein1.7e-14474.84Show/hide
Query:  STVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG
        STV++  LV FL+EDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG

Query:  NSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST
         SIM +NPET KFCSHLDLWDSI+NNDYFS+EGL DVFKQLR YKTP+LE+PKY ILKRTA YEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  NSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQKFNSE-SAKVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDP
         EKIPMTTPVFTQ  +++ S+ VS+QIV+PS KD+ SLP P ++ V L+K+EGG AA +KFSGKP E++VQ K  EL SSL KDGL+ + GC+LARYNDP
Subjt:  KEKIPMTTPVFTQKFNSE-SAKVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDP

Query:  GRTWNFIMRNEVLIWLEEFSLE
        GRTWNFIMRNEV+IWLE+FSL+
Subjt:  GRTWNFIMRNEVLIWLEEFSLE

AT5G20140.2 SOUL heme-binding family protein7.6e-13774.68Show/hide
Query:  STVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG
        STV++  LV FL+EDL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +F P+F LHW KQTGPYEITTRWTMVMKF  LPWKPELVFTG
Subjt:  STVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTG

Query:  NSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST
         SIM +NPET KFCSHLDLWDSI+NNDYFS+EGL DVFKQLR YKTP+LE+PKY ILKRTA YEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  NSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKYLILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQKFNSE-SAKVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDP
         EKIPMTTPVFTQ  +++ S+ VS+QIV+PS KD+ SLP P ++ V L+K+EGG AA +KFSGKP E++VQ K  EL SSL KDGL+ + GC+LARYNDP
Subjt:  KEKIPMTTPVFTQKFNSE-SAKVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKPIEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDP

Query:  GRTWNFIM
        GRTWNFIM
Subjt:  GRTWNFIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAAACTTCCCCTCAACCCCAACACTCAGTTCCCTTCTCCGCCCACCGAAATCCGGCAGAATAACCCACCTCCCACCTCGTCTACT
TCAGTCCAGAACTCCATCTTTTAAACCCCATACCAAAAATTCTAAGTGGGTTGTTCGATTCAACTTGGTTGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGAT
TGGTGGATTTCTTGCATGAAGATCTTTCCCATCTTTTCGATGAACAGGGGATTGATCGAACGGCGTACGACGAACAAGTGAGATTTCGTGACCCCATTACTAAGCACGAT
ACGATTAGTGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAATTCTTCTTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATG
GACTATGGTAATGAAGTTTGCCCTTCTACCATGGAAACCAGAATTAGTTTTCACAGGAAATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCACTTGG
ATCTCTGGGATTCGATACAAAACAACGACTACTTTTCAGTAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGTTTTTATAAGACTCCAGAATTGGAGTCACCCAAGTAT
CTGATTCTGAAAAGGACTGCAAAGTATGAGGTGAGGAAATATGCTCCATTTATAGTGGTCGAAACAAGTGGAGACAAGCTCGCTGGATCTGCAGGATTCAATACAGTTGC
TGGGTATATATTTGGGAAGAACTCTACAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAAAATTTAACTCTGAATCAGCCAAAGTCTCCATTCAAATAGTTC
TTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACATAGTTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCAGTTTTGAAATTCAGTGGGAAACCT
ATTGAAGAGATTGTGCAAGAGAAGGCAAAAGAACTGCATTCTAGTCTCATAAAGGATGGTCTCAAACCCAGGAACGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAG
AACATGGAACTTTATAATGAGAAATGAAGTGCTAATATGGCTTGAAGAGTTCTCATTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAAACTTCCCCTCAACCCCAACACTCAGTTCCCTTCTCCGCCCACCGAAATCCGGCAGAATAACCCACCTCCCACCTCGTCTACT
TCAGTCCAGAACTCCATCTTTTAAACCCCATACCAAAAATTCTAAGTGGGTTGTTCGATTCAACTTGGTTGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGAT
TGGTGGATTTCTTGCATGAAGATCTTTCCCATCTTTTCGATGAACAGGGGATTGATCGAACGGCGTACGACGAACAAGTGAGATTTCGTGACCCCATTACTAAGCACGAT
ACGATTAGTGGGTATTTGTTTAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGAATTCTTCTTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTACAAGATG
GACTATGGTAATGAAGTTTGCCCTTCTACCATGGAAACCAGAATTAGTTTTCACAGGAAATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCACTTGG
ATCTCTGGGATTCGATACAAAACAACGACTACTTTTCAGTAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGTTTTTATAAGACTCCAGAATTGGAGTCACCCAAGTAT
CTGATTCTGAAAAGGACTGCAAAGTATGAGGTGAGGAAATATGCTCCATTTATAGTGGTCGAAACAAGTGGAGACAAGCTCGCTGGATCTGCAGGATTCAATACAGTTGC
TGGGTATATATTTGGGAAGAACTCTACAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAAAATTTAACTCTGAATCAGCCAAAGTCTCCATTCAAATAGTTC
TTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACATAGTTGGCTTGAGAAAGGTTGAAGGAGGTATTGCTGCAGTTTTGAAATTCAGTGGGAAACCT
ATTGAAGAGATTGTGCAAGAGAAGGCAAAAGAACTGCATTCTAGTCTCATAAAGGATGGTCTCAAACCCAGGAACGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAG
AACATGGAACTTTATAATGAGAAATGAAGTGCTAATATGGCTTGAAGAGTTCTCATTGGAGTAG
Protein sequenceShow/hide protein sequence
MAALQLSLQNFPSTPTLSSLLRPPKSGRITHLPPRLLQSRTPSFKPHTKNSKWVVRFNLVDQSPPKSTVDVGRLVDFLHEDLSHLFDEQGIDRTAYDEQVRFRDPITKHD
TISGYLFNISLLRELFRPEFFLHWVKQTGPYEITTRWTMVMKFALLPWKPELVFTGNSIMGINPETGKFCSHLDLWDSIQNNDYFSVEGLWDVFKQLRFYKTPELESPKY
LILKRTAKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQKFNSESAKVSIQIVLPSEKDIDSLPDPEQDIVGLRKVEGGIAAVLKFSGKP
IEEIVQEKAKELHSSLIKDGLKPRNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE