; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G010600 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G010600
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionSOUL heme-binding family protein
Genome locationCG_Chr06:22866111..22870636
RNA-Seq ExpressionClCG06G010600
SyntenyClCG06G010600
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN65404.1 hypothetical protein Csa_019846 [Cucumis sativus]1.7e-19989.92Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQ FPSTP L   LRPPK  R+T L  RLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL++DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRELF PEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI
        WDVFKQLR+YKTP LESPKYLILKRTA YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE P+VSIQIVLPSEKDI
Subjt:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDI+GLRKVEGG AAVLKFSGKP EEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

TYK11984.1 SOUL heme-binding family protein isoform 1 [Cucumis melo var. makuwa]3.2e-19887.53Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQ F STP L   LRPPK  RLT L  RLL+SRT AFKP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLY+DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQT--------------GPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLRE+F PEFFLHWVKQ                PYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQT--------------GPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFSVEGLWDVFKQLR+YKTP LESPKYLILKRTA YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP

Query:  RVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL
        +VSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGG AAVLKFSGKPTEEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEE+SL
Subjt:  RVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL

Query:  E
        E
Subjt:  E

XP_008463332.1 PREDICTED: uncharacterized protein LOC103501513 isoform X1 [Cucumis melo]1.8e-20190.7Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQ F STP L   LRPPK  RLT+L  RLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLY+DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRE+F PEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI
        WDVFKQLR+YKTP LESPKYLILKRT  YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P+VSIQIVLPSEKDI
Subjt:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDIIGLRKVEGG AAVLKFSGKPTEEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

XP_011648491.1 uncharacterized protein LOC101206063 [Cucumis sativus]1.7e-19989.92Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQ FPSTP L   LRPPK  R+T L  RLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL++DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRELF PEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI
        WDVFKQLR+YKTP LESPKYLILKRTA YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE P+VSIQIVLPSEKDI
Subjt:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDI+GLRKVEGG AAVLKFSGKP EEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

XP_038879422.1 uncharacterized protein LOC120071301 isoform X1 [Benincasa hispida]9.4e-20692.51Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MA  QLSLQ FPSTP LGFGLRPP+  RLT L  RL K+RT AFKP +QNSKWVVRLSLVDQSPPKSTVDVGRLVDFLY+DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPIT HDTISGYLFNISLLRELF PEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI
        WDVFKQLRYYKTP+LESPKYLILKRTA+YEVRKY  FIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE+P+V IQIVLPSEKDI
Subjt:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDIIGLRKVEG  AAVLKFSGKPTEEIV EKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

TrEMBL top hitse value%identityAlignment
A0A0A0LWP3 Uncharacterized protein8.3e-20089.92Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MA LQLSLQ FPSTP L   LRPPK  R+T L  RLL SRT AFKP T+NSKWVVR +LVDQ PPKST+DVGRLVDFL++DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRELF PEFFLHWVKQTGPYEITTRWTMVMKF LLPWKPELVFTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI
        WDVFKQLR+YKTP LESPKYLILKRTA YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQ F+SE P+VSIQIVLPSEKDI
Subjt:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDI+GLRKVEGG AAVLKFSGKP EEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
Subjt:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X18.9e-20290.7Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQ F STP L   LRPPK  RLT+L  RLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLY+DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
        RFRDPITKHDTISGYLFNISLLRE+F PEFFLHWVKQTGPYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGL

Query:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI
        WDVFKQLR+YKTP LESPKYLILKRT  YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P+VSIQIVLPSEKDI
Subjt:  WDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDI

Query:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE
        DSLPDPEQDIIGLRKVEGG AAVLKFSGKPTEEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEE+SLE
Subjt:  DSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE

A0A5A7TMX2 SOUL heme-binding family protein isoform 12.3e-18986.82Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQ F STP L   LRPPK  RLT+L  RLL+SRT A KP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLY+DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQT--------------GPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLRE+F PEFFLHWVKQ                PYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQT--------------GPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFSVEGLWDVFKQLR+YKTP LESPKYLILKRT  YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP

Query:  RVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIM
        +VSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGG AAVLKFSGKPTEEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIM
Subjt:  RVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIM

A0A5D3CLD5 SOUL heme-binding family protein isoform 11.6e-19887.53Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV
        MAALQLSLQ F STP L   LRPPK  RLT L  RLL+SRT AFKP TQNSKWVVR +LVDQSPPKSTVDVGRLVDFLY+DL HLFDEQGIDRTAYDEQV
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQV

Query:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQT--------------GPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLW
        RFRDPITKHDTISGYLFNISLLRE+F PEFFLHWVKQ                PYEITTRWTM+MKF LLPWKPEL+FTG SIMGINPETGKFCSHVDLW
Subjt:  RFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQT--------------GPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLW

Query:  DSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP
        DSIQNNDYFSVEGLWDVFKQLR+YKTP LESPKYLILKRTA YEVRKY PFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSE P
Subjt:  DSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELP

Query:  RVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL
        +VSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGG AAVLKFSGKPTEEIV EKAKELRSSLIKDGLKP NGCLLARYNDPGRTWNFIMRNEVLIWLEE+SL
Subjt:  RVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSL

Query:  E
        E
Subjt:  E

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X19.2e-19186.56Show/hide
Query:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSL--RLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDE
        MAALQLSLQ F STP  GFG RP K   LT   L  RLLKSRT+ FKP  +NSKW VRLSLVDQSPPKS VDV RLVDFLY+DL HLFDEQGIDRTAYDE
Subjt:  MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSL--RLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDE

Query:  QVRFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVE
         VRFRDPITKHDTISGY FNISLLRELF PEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPE +FTG SIMGINPETGKFCSHVDLWDSIQNNDYFS+E
Subjt:  QVRFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVE

Query:  GLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEK
        GL DVFKQLR+YKTP LESPKY ILKRTA+YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNS KEKIPMTTPVFTQTFDSE P+VSIQIVLPS+K
Subjt:  GLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEK

Query:  DIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFS
        DI+SLPDPEQD IGLRKVEGG AAVLKFSGKPTE++V EKAKELRS LIKDGLKPS GCLLARYNDPGRTW+FIMRNEVLIWLEEFS
Subjt:  DIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFS

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic3.4e-1732.16Show/hide
Query:  YYKTPSLESPKYLILKRTADYEVRKYEPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPRVSIQIVLPSEKDI
        +   P LE+  + +L RT  YE+R+ EP+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E   ++  ++    KD 
Subjt:  YYKTPSLESPKYLILKRTADYEVRKYEPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPRVSIQIVLPSEKDI

Query:  D--------------SLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKD---GLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLE
        +              +LP P+   + +++V     AV+ FSG  T+E +  + +ELR +L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  D--------------SLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKD---GLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLE

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein8.1e-0628.99Show/hide
Query:  LESPKYLILKRTADYEVRKYEPFIVVETS-----GDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDIDSLPDP-EQ
        +E P Y ++     YE+R+Y   + V T          A    F  +  YI GKN   +KI MT PV +Q   S+ P       +       + PDP   
Subjt:  LESPKYLILKRTADYEVRKYEPFIVVETS-----GDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDIDSLPDP-EQ

Query:  DIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSL
        + + ++K      AV +FSG  +++ + E+A  L SSL
Subjt:  DIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein7.8e-1733.81Show/hide
Query:  SLESPKYLILKRTADYEVRKYEPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ------------TFDSE---------
        ++E+PKY + K    YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE         
Subjt:  SLESPKYLILKRTADYEVRKYEPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSTKEKIPMTTPVFTQ------------TFDSE---------

Query:  ---------LPRVSIQIVLPS-EKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNF--I
                    V++Q +LPS  K  +  P P  + + +++  G    V+KFSG  +E +V EK K+L S L KDG K +   +LARYN P   W     
Subjt:  ---------LPRVSIQIVLPS-EKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNF--I

Query:  MRNEVLIWLE
          NEV+I +E
Subjt:  MRNEVLIWLE

AT3G10130.1 SOUL heme-binding family protein2.4e-1832.16Show/hide
Query:  YYKTPSLESPKYLILKRTADYEVRKYEPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPRVSIQIVLPSEKDI
        +   P LE+  + +L RT  YE+R+ EP+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E   ++  ++    KD 
Subjt:  YYKTPSLESPKYLILKRTADYEVRKYEPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDS--ELPRVSIQIVLPSEKDI

Query:  D--------------SLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKD---GLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLE
        +              +LP P+   + +++V     AV+ FSG  T+E +  + +ELR +L  D    ++      +A+YN P  T  F+ RNEV + +E
Subjt:  D--------------SLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKD---GLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLE

AT5G20140.1 SOUL heme-binding family protein3.6e-14775.78Show/hide
Query:  STVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG
        STV++  LV FLY+DL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +FTP+F LHW KQTGPYEITTRWTMVMKF+ LPWKPELVFTG
Subjt:  STVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG

Query:  ISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNST
        +SIM +NPET KFCSH+DLWDSI+NNDYFS+EGL DVFKQLR YKTP LE+PKY ILKRTA+YEVR YEPFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  ISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSELPR-VSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++V  K  ELRSSL KDGL+   GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPR-VSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDP

Query:  GRTWNFIMRNEVLIWLEEFSLE
        GRTWNFIMRNEV+IWLE+FSL+
Subjt:  GRTWNFIMRNEVLIWLEEFSLE

AT5G20140.2 SOUL heme-binding family protein1.2e-13975.65Show/hide
Query:  STVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG
        STV++  LV FLY+DL HLFD+QGID+TAYDE+V+FRDPITKHDTISGYLFNI+ L+ +FTP+F LHW KQTGPYEITTRWTMVMKF+ LPWKPELVFTG
Subjt:  STVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQVRFRDPITKHDTISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTG

Query:  ISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNST
        +SIM +NPET KFCSH+DLWDSI+NNDYFS+EGL DVFKQLR YKTP LE+PKY ILKRTA+YEVR YEPFIVVET GDKL+GS+GFN VAGYIFGKNST
Subjt:  ISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKYLILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNST

Query:  KEKIPMTTPVFTQTFDSELPR-VSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KD+ SLP P ++ + L+K+EGG AA +KFSGKPTE++V  K  ELRSSL KDGL+   GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPR-VSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKPTEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDP

Query:  GRTWNFIM
        GRTWNFIM
Subjt:  GRTWNFIM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCTTCAACTTTCCCTCCAAACCTTCCCCTCAACTCCAGCACTCGGTTTCGGTCTCCGGCCACCGAAACCCGCCAGACTAACCGACCTCTCGCTCCGTCTACT
TAAAAGCAGAACTCTAGCTTTCAAACCCCAAACCCAAAATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGAT
TGGTGGATTTCTTGTATAAAGATCTTTGCCATCTCTTCGATGAACAGGGGATTGATCGAACGGCATACGACGAACAAGTGAGATTTCGGGACCCCATTACCAAGCACGAT
ACGATTAGTGGGTATTTGTTTAACATTTCCCTCTTGCGAGAACTCTTCACACCTGAATTCTTCTTGCACTGGGTTAAACAGACAGGACCATACGAAATAACTACAAGATG
GACTATGGTAATGAAGTTCGTCCTTCTACCATGGAAGCCGGAATTAGTTTTCACTGGAATTTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGG
ATCTGTGGGATTCAATACAAAACAACGACTACTTTTCTGTAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGCTATTATAAGACTCCATCATTGGAATCACCCAAGTAT
CTGATTCTGAAAAGGACAGCAGATTATGAGGTGAGGAAATATGAGCCATTTATAGTGGTGGAAACAAGTGGAGACAAACTCGCTGGGTCTGCAGGATTCAATACAGTTGC
TGGGTATATATTTGGGAAGAACTCTACAAAGGAGAAGATACCAATGACCACTCCTGTATTCACCCAAACATTTGACTCAGAGTTACCCAGAGTATCCATTCAAATAGTTC
TTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCTGAACAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTACTGCTGCAGTGTTGAAATTCAGTGGGAAACCT
ACTGAAGAGATTGTGCTAGAGAAGGCAAAAGAACTGCGGTCTAGTCTCATAAAAGATGGTCTCAAACCCAGTAACGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAG
AACATGGAACTTCATAATGAGAAATGAGGTGCTAATATGGCTTGAAGAATTCTCATTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
AATCAGTCCTCGTCCACTACCTCCAAATCCCACTTCCCCAATGACACTCCGCCGGCTGAGGCTGAGGCTCAAATGGCCGCTCTTCAACTTTCCCTCCAAACCTTCCCCTC
AACTCCAGCACTCGGTTTCGGTCTCCGGCCACCGAAACCCGCCAGACTAACCGACCTCTCGCTCCGTCTACTTAAAAGCAGAACTCTAGCTTTCAAACCCCAAACCCAAA
ATTCCAAGTGGGTTGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTAGGCCGATTGGTGGATTTCTTGTATAAAGATCTTTGCCATCTCTTC
GATGAACAGGGGATTGATCGAACGGCATACGACGAACAAGTGAGATTTCGGGACCCCATTACCAAGCACGATACGATTAGTGGGTATTTGTTTAACATTTCCCTCTTGCG
AGAACTCTTCACACCTGAATTCTTCTTGCACTGGGTTAAACAGACAGGACCATACGAAATAACTACAAGATGGACTATGGTAATGAAGTTCGTCCTTCTACCATGGAAGC
CGGAATTAGTTTTCACTGGAATTTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGGATCTGTGGGATTCAATACAAAACAACGACTACTTTTCT
GTAGAAGGCCTTTGGGATGTTTTCAAGCAGCTTCGCTATTATAAGACTCCATCATTGGAATCACCCAAGTATCTGATTCTGAAAAGGACAGCAGATTATGAGGTGAGGAA
ATATGAGCCATTTATAGTGGTGGAAACAAGTGGAGACAAACTCGCTGGGTCTGCAGGATTCAATACAGTTGCTGGGTATATATTTGGGAAGAACTCTACAAAGGAGAAGA
TACCAATGACCACTCCTGTATTCACCCAAACATTTGACTCAGAGTTACCCAGAGTATCCATTCAAATAGTTCTTCCTTCAGAGAAAGATATAGACAGTTTACCAGATCCT
GAACAAGACATAATTGGCTTGAGAAAGGTTGAAGGAGGTACTGCTGCAGTGTTGAAATTCAGTGGGAAACCTACTGAAGAGATTGTGCTAGAGAAGGCAAAAGAACTGCG
GTCTAGTCTCATAAAAGATGGTCTCAAACCCAGTAACGGCTGTTTGCTTGCTCGGTATAACGACCCTGGAAGAACATGGAACTTCATAATGAGAAATGAGGTGCTAATAT
GGCTTGAAGAATTCTCATTGGAGTAGAGAACCATTCAAACAGAGCTTATTTTCTTCATAAAGCCTCGAGTCCTTGTTGATATTCTGTATATTTTGACTTTTCTGTGATTA
ATAAGTTTAATAGATGATTTATTTAGTCGGGTATTAGAAAGCAGAAGATACTTTTTACTTTTTGCTTTGATAATGTTGTTTGCTAGTAGTTATAGGTTAGCAAACCAAAG
TTTCTTGAGATTATGAACAGATTCCATTCGATAGCTTTGCTTTCCCATATTTTGTGATTTTTATTGTAAAAAG
Protein sequenceShow/hide protein sequence
MAALQLSLQTFPSTPALGFGLRPPKPARLTDLSLRLLKSRTLAFKPQTQNSKWVVRLSLVDQSPPKSTVDVGRLVDFLYKDLCHLFDEQGIDRTAYDEQVRFRDPITKHD
TISGYLFNISLLRELFTPEFFLHWVKQTGPYEITTRWTMVMKFVLLPWKPELVFTGISIMGINPETGKFCSHVDLWDSIQNNDYFSVEGLWDVFKQLRYYKTPSLESPKY
LILKRTADYEVRKYEPFIVVETSGDKLAGSAGFNTVAGYIFGKNSTKEKIPMTTPVFTQTFDSELPRVSIQIVLPSEKDIDSLPDPEQDIIGLRKVEGGTAAVLKFSGKP
TEEIVLEKAKELRSSLIKDGLKPSNGCLLARYNDPGRTWNFIMRNEVLIWLEEFSLE