; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0011406 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0011406
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionSOUL heme-binding family protein
Genome locationchr02:18004459..18012768
RNA-Seq ExpressionPI0011406
SyntenyPI0011406
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK14496.1 uncharacterized protein E5676_scaffold15G00070 [Cucumis melo var. makuwa]1.7e-16183.77Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK
        MAPA VLS+PT+S GFRPRKSDG T TIAA+ TQ+P    HNHN+N VVGSKLAAAD+QY RPTKSRVDVDRLVK LYDDLHHVFDEQGIDP+AYDEE++
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK

Query:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITK+DDIRGYLLNIALLRQFF PQIILHWVKKTGPYEITTRWTAVMKF LLPW PECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK
        DVFKQFRFYE  ELELPKYQ L RTANYEVRKYGPFAVAERSGENLFGCV  NSVG WGDCKEDDRIMKLRNKEGGIAAVL FSGKATEE VKNK KEL+
Subjt:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK

Query:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
        H LK DGL+S+ NNSCLL              RNEVLIWLQDFSI
Subjt:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

XP_004151455.2 uncharacterized protein LOC101205468 [Cucumis sativus]5.9e-17588.41Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK
        MAPAQVLSIPT SFGFR R SDGPT TIAA+ TQKP  HNHNHN++LVVGSKLAAADH + RPTKSRVDVD+LVK LYDDLHHVFDEQGIDPTAYDEE++
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK

Query:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITKY DIRGYLLNIALLRQFF PQIILHWVKKTGPYEITTRWTA MKFALLPW PECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK
        DVFKQFRFYETPELELPKYQ LKRT NYEVRKYGPFA AERSGENLF CV  NS+G WGDCKEDDRIM+LRNK GGIAAVL FSGKATEEKVKNK KEL+
Subjt:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK

Query:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
        H LK DGLKS+NNNSCLLVRYN+S+ TWSFVMRNEVLIWLQDFSI
Subjt:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

XP_008464979.1 PREDICTED: uncharacterized protein LOC103502719 [Cucumis melo]1.4e-15288.16Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK
        MAPA VLS+PT+S GFRPRKSDG T TIAA+ TQ+P    HNHN+N VVGSKLAAADHQY RPTKSRVDVDRLVK LYDDLHHVFDEQGIDPTAYDEE++
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK

Query:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITK+DDIRGYLLNIALLRQFF PQIILHWVKKTGPYEITTRWTAVMKF LLPW PECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK
        DVFKQFRFYE  ELELPKYQ L RTANYEVRKYGPFAVAERSGENLFGCV  NSVG WGDCKEDDRIMKLRNKEGGIAAVL FSGKATEE VKNK KEL+
Subjt:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK

Query:  HCLK
        H LK
Subjt:  HCLK

XP_038879853.1 uncharacterized protein LOC120071584 isoform X1 [Benincasa hispida]2.8e-13773.07Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTG--TIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEE
        MAP Q LSIP + FGF PRKS GPT     AA+   KP    H+HN+N  V SK+   DHQ  RP KS VDV+RLV+ LY+DLHHVFDEQGID TAYDEE
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTG--TIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEE

Query:  VKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEG
        ++FRDPITKYD+I GYLLNIALL  FFRP++ILHWVKKTGPYEITTRWTAVMKF +LPW PE V+TGTSIM INP+TGKFC HVDLWDSVQNNDYFSIEG
Subjt:  VKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEG

Query:  LWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGC--VGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKT
        LWDVFKQ RFYETPELE PKYQILKRTANYEVRKY P  VAE SGENL GC    F+ VG W +CKE D    LR  EGGIAAVLKFSGK+T+++V+NK 
Subjt:  LWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGC--VGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKT

Query:  KELKHCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
        K+L+H LK DGLK INN+S LL RYNNS  TWSFVMRNEVLIWL+DFSI
Subjt:  KELKHCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

XP_038879854.1 uncharacterized protein LOC120071584 isoform X2 [Benincasa hispida]5.0e-13472.21Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTG--TIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEE
        MAP Q LSIP + FGF PRKS GPT     AA+   KP    H+HN+N  V SK+   DHQ  RP KS VDV+RLV+ LY+DLHHVFDEQGID TAYDEE
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTG--TIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEE

Query:  VKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEG
        ++FRDPITKYD+I GYLLNIALL  FFRP++ILHW   TGPYEITTRWTAVMKF +LPW PE V+TGTSIM INP+TGKFC HVDLWDSVQNNDYFSIEG
Subjt:  VKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEG

Query:  LWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGC--VGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKT
        LWDVFKQ RFYETPELE PKYQILKRTANYEVRKY P  VAE SGENL GC    F+ VG W +CKE D    LR  EGGIAAVLKFSGK+T+++V+NK 
Subjt:  LWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGC--VGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKT

Query:  KELKHCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
        K+L+H LK DGLK INN+S LL RYNNS  TWSFVMRNEVLIWL+DFSI
Subjt:  KELKHCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

TrEMBL top hitse value%identityAlignment
A0A0A0LU04 Uncharacterized protein2.8e-17588.41Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK
        MAPAQVLSIPT SFGFR R SDGPT TIAA+ TQKP  HNHNHN++LVVGSKLAAADH + RPTKSRVDVD+LVK LYDDLHHVFDEQGIDPTAYDEE++
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK

Query:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITKY DIRGYLLNIALLRQFF PQIILHWVKKTGPYEITTRWTA MKFALLPW PECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK
        DVFKQFRFYETPELELPKYQ LKRT NYEVRKYGPFA AERSGENLF CV  NS+G WGDCKEDDRIM+LRNK GGIAAVL FSGKATEEKVKNK KEL+
Subjt:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK

Query:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
        H LK DGLKS+NNNSCLLVRYN+S+ TWSFVMRNEVLIWLQDFSI
Subjt:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

A0A1S3CN82 uncharacterized protein LOC1035027196.8e-15388.16Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK
        MAPA VLS+PT+S GFRPRKSDG T TIAA+ TQ+P    HNHN+N VVGSKLAAADHQY RPTKSRVDVDRLVK LYDDLHHVFDEQGIDPTAYDEE++
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK

Query:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITK+DDIRGYLLNIALLRQFF PQIILHWVKKTGPYEITTRWTAVMKF LLPW PECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK
        DVFKQFRFYE  ELELPKYQ L RTANYEVRKYGPFAVAERSGENLFGCV  NSVG WGDCKEDDRIMKLRNKEGGIAAVL FSGKATEE VKNK KEL+
Subjt:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK

Query:  HCLK
        H LK
Subjt:  HCLK

A0A5A7URG2 SOUL heme-binding family protein isoform 33.4e-12888.28Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK
        MAPA VLS+PT+S GFRPRKSDG T TIAA+ TQ+P    HNHN+N VVGSKLAAADHQY RPTKSRVDVDRLVK LYDDLHHVFDEQGIDPTAYDEE++
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK

Query:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITK+DDIRGYLLNIALLRQFF PQIILHWVKKTGPYEITTRWTAVMKF LLPW PECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVG
        DVFKQFRFYE  ELELPKYQ L RTANYEVRKYGPFAVAERSGENLFGCV  NSVG
Subjt:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVG

A0A5D3CVR4 Uncharacterized protein8.0e-16283.77Show/hide
Query:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK
        MAPA VLS+PT+S GFRPRKSDG T TIAA+ TQ+P    HNHN+N VVGSKLAAAD+QY RPTKSRVDVDRLVK LYDDLHHVFDEQGIDP+AYDEE++
Subjt:  MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVK

Query:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW
        FRDPITK+DDIRGYLLNIALLRQFF PQIILHWVKKTGPYEITTRWTAVMKF LLPW PECVLTGTSIMT+NPNTGKFCRHVDLWDSVQNNDYFSIEGLW
Subjt:  FRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLW

Query:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK
        DVFKQFRFYE  ELELPKYQ L RTANYEVRKYGPFAVAERSGENLFGCV  NSVG WGDCKEDDRIMKLRNKEGGIAAVL FSGKATEE VKNK KEL+
Subjt:  DVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELK

Query:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
        H LK DGL+S+ NNSCLL              RNEVLIWLQDFSI
Subjt:  HCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

A0A6J1HKM5 uncharacterized protein LOC1114650222.1e-12568Show/hide
Query:  MAPAQV-----LSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAY
        MA AQV     LSIPT+ FG RPRKS GPT   A S T  P         N     +   AD ++ +PT   VDVDRLV  +YDDL HVFDEQGID TAY
Subjt:  MAPAQV-----LSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAY

Query:  DEEVKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFS
        DEEV+FRDPITKYD I GY+LNIALLR+FFRP+IILHWVKKTGPYEITTRWTAVMKF LLPW PE VLTGTSIM INP TGKFC HVDLWDS+QNNDYFS
Subjt:  DEEVKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFS

Query:  IEGLWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNK
        +E LWDVFKQFRFYETPELE PKYQILKRTANYEVRKY PF V ER+G  +    GFN VG + D K++D  M +R  EGGI AVLKFSG  TE+  + K
Subjt:  IEGLWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNK

Query:  TKELKHCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
         KEL+  LK DGLK I  N CLL RYN+S +TW FVMRNEV+IWLQ+FSI
Subjt:  TKELKHCLKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein3.2e-9451.6Show/hide
Query:  NLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTR
        +L VG ++A+A         S V+++ LV  LY+DL H+FD+QGID TAYDE VKFRDPITK+D I GYL NIA L+  F PQ  LHW K+TGPYEITTR
Subjt:  NLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTR

Query:  WTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGEN
        WT VMKF  LPW PE V TG SIM +NP T KFC H+DLWDS++NNDYFS+EGL DVFKQ R Y+TP+LE PKYQILKRTANYEVR Y PF V E  G+ 
Subjt:  WTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGEN

Query:  LFGCVGFNSVG--CWGDCKEDDRI-----------------------------------------MKLRNKEGGIAAVLKFSGKATEEKVKNKTKELKHC
        L G  GFN+V    +G     ++I                                         + L+  EGG AA +KFSGK TE+ V+ K  EL+  
Subjt:  LFGCVGFNSVG--CWGDCKEDDRI-----------------------------------------MKLRNKEGGIAAVLKFSGKATEEKVKNKTKELKHC

Query:  LKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI
        L  DGL++     C+L RYN+  +TW+F+MRNEV+IWL+DFS+
Subjt:  LKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQDFSI

AT5G20140.2 SOUL heme-binding family protein2.2e-8750Show/hide
Query:  NLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTR
        +L VG ++A+A         S V+++ LV  LY+DL H+FD+QGID TAYDE VKFRDPITK+D I GYL NIA L+  F PQ  LHW K+TGPYEITTR
Subjt:  NLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVKFRDPITKYDDIRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTR

Query:  WTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGEN
        WT VMKF  LPW PE V TG SIM +NP T KFC H+DLWDS++NNDYFS+EGL DVFKQ R Y+TP+LE PKYQILKRTANYEVR Y PF V E  G+ 
Subjt:  WTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQILKRTANYEVRKYGPFAVAERSGEN

Query:  LFGCVGFNSVG--CWGDCKEDDRI-----------------------------------------MKLRNKEGGIAAVLKFSGKATEEKVKNKTKELKHC
        L G  GFN+V    +G     ++I                                         + L+  EGG AA +KFSGK TE+ V+ K  EL+  
Subjt:  LFGCVGFNSVG--CWGDCKEDDRI-----------------------------------------MKLRNKEGGIAAVLKFSGKATEEKVKNKTKELKHC

Query:  LKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQD
        L  DGL++     C+L RYN+  +TW+F+M ++VL +  D
Subjt:  LKNDGLKSINNNSCLLVRYNNSDQTWSFVMRNEVLIWLQD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCCTGCCCAAGTTCTCTCAATACCAACCCTTAGCTTTGGTTTCAGACCAAGGAAATCGGACGGACCAACCGGAACCATAGCCGCCTCCACAACTCAAAAGCCTCA
CTATCACAATCATAATCACAACAAAAATTTGGTTGTTGGATCAAAACTAGCAGCAGCAGATCATCAATATCACAGGCCAACGAAATCAAGGGTGGACGTTGACCGATTGG
TGAAAATCTTATACGATGATCTCCACCACGTGTTCGATGAACAAGGGATTGATCCGACGGCTTACGATGAAGAAGTAAAATTTCGAGATCCAATTACAAAATATGATGAC
ATTAGAGGATATTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGGCCTCAGATCATCTTGCATTGGGTTAAAAAGACAGGACCATATGAGATAACAACAAGATGGAC
TGCAGTAATGAAGTTTGCTCTTCTACCATGGAATCCAGAATGTGTTTTGACAGGAACTTCAATCATGACTATTAATCCAAACACTGGCAAATTTTGTAGACATGTGGATC
TTTGGGATTCAGTACAAAATAATGACTACTTTTCTATAGAAGGTCTTTGGGATGTATTTAAACAGTTTCGTTTTTACGAGACACCAGAATTGGAATTGCCCAAATATCAG
ATATTGAAAAGGACTGCAAATTATGAGGTGAGAAAATATGGTCCATTTGCTGTGGCAGAAAGAAGTGGAGAGAACTTGTTTGGGTGTGTTGGATTCAATAGCGTTGGCTG
TTGGGGTGATTGTAAAGAAGACGACAGAATCATGAAGTTGAGAAATAAGGAAGGAGGAATTGCTGCAGTGTTGAAATTCAGTGGAAAAGCTACAGAAGAAAAAGTGAAAA
ACAAAACCAAAGAATTAAAGCATTGTCTAAAAAACGATGGTCTCAAAAGCATTAATAATAATAGCTGTTTACTTGTACGTTACAACAATTCCGACCAAACATGGAGTTTC
GTAATGAGAAATGAGGTGCTAATATGGCTTCAAGATTTCTCAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATTTAATTCAAAGATTCACCATTCAATTCCAAATTCCAATGGCTCCTGCCCAAGTTCTCTCAATACCAACCCTTAGCTTTGGTTTCAGACCAAGGAAATCGGACGGACCA
ACCGGAACCATAGCCGCCTCCACAACTCAAAAGCCTCACTATCACAATCATAATCACAACAAAAATTTGGTTGTTGGATCAAAACTAGCAGCAGCAGATCATCAATATCA
CAGGCCAACGAAATCAAGGGTGGACGTTGACCGATTGGTGAAAATCTTATACGATGATCTCCACCACGTGTTCGATGAACAAGGGATTGATCCGACGGCTTACGATGAAG
AAGTAAAATTTCGAGATCCAATTACAAAATATGATGACATTAGAGGATATTTGTTAAATATTGCTCTCTTGCGACAATTCTTTAGGCCTCAGATCATCTTGCATTGGGTT
AAAAAGACAGGACCATATGAGATAACAACAAGATGGACTGCAGTAATGAAGTTTGCTCTTCTACCATGGAATCCAGAATGTGTTTTGACAGGAACTTCAATCATGACTAT
TAATCCAAACACTGGCAAATTTTGTAGACATGTGGATCTTTGGGATTCAGTACAAAATAATGACTACTTTTCTATAGAAGGTCTTTGGGATGTATTTAAACAGTTTCGTT
TTTACGAGACACCAGAATTGGAATTGCCCAAATATCAGATATTGAAAAGGACTGCAAATTATGAGGTGAGAAAATATGGTCCATTTGCTGTGGCAGAAAGAAGTGGAGAG
AACTTGTTTGGGTGTGTTGGATTCAATAGCGTTGGCTGTTGGGGTGATTGTAAAGAAGACGACAGAATCATGAAGTTGAGAAATAAGGAAGGAGGAATTGCTGCAGTGTT
GAAATTCAGTGGAAAAGCTACAGAAGAAAAAGTGAAAAACAAAACCAAAGAATTAAAGCATTGTCTAAAAAACGATGGTCTCAAAAGCATTAATAATAATAGCTGTTTAC
TTGTACGTTACAACAATTCCGACCAAACATGGAGTTTCGTAATGAGAAATGAGGTGCTAATATGGCTTCAAGATTTCTCAATTTAGTACAACATGCAACTATGCAAGTCC
AAACACGGGTTTATGAACAACTCCAATTTTTTTATGTTTTATTGGCACTCAAATCTTGCGACCGATCGAAGCAGAGGAGGTTGACGGTGGCGTCTGAGGATCTGCAAGGC
GCGGCAGAAAAATTAAAAAGAAAAAAGAAAACCTACCTAAACGGTTGAAGGGGCGATGAAAAAAGATTTGGAAAAGGGTAGAAAAGAAAATAATAATAATTCTTATCTAA
TATTATCTTAAATCACTGATTGACCTATCACACTACAAAAAAGTACATCTCTTGACGCATATAATCGTCGAGAAATATTGAAACAATGTTGGGAGTTCACACTAGGATTA
GAAATTTGAAGGCATTGGAGGATTGCCCATTGTAGTTAAGGTTATTTTGCTAAGCTCTGGGTAAGGTTTAGGATAATTATATTGAGTTTTCTTAGAGTTCTTGAATGTCC
ATTAAGTTTAAGTGTTAAAATTAAATTACCCATGATTTTCAGGTTCTAGAGTCGAATTTGAGAGAAAAAAATGGTCTTCCATTAGCTTTGGAGATCGTTTGGATGTTGTT
CTACAAGGTCCGAAACATAAAATAGATTAAGTGATCTTATGAGGCGTTTAAGGTTGATTTACTCATATTGATATAATGGATCTAAGTATCTAAAATTATTAATATTTGGT
ATAAGAGCAAATGGAATTAAAAGACCCACTAAGTTTGATTTAAGTTTGGATGCTTATGGTATTAAGAGTTTAAATTCGAAAGGGAATGGTTCAATTTAAACCTTGGAAGT
GAGAAATAATTGGAGTCAATGAAACGTAAGATAGTTTGAGTTGACATTATGAAGTGTTTTGGAAATAGTTGAGTCATCTTGACATCTGAAGATAACTGAGTGACCTTAAT
CATTGTTTCAGTCCAAATCGAAGTGGGAAAAGCTTATAATCTTGGGGGATCGAGTCAGTGGTTGTGAGTGGTTAATTTTCAAGCATTTTCATTATAATCTATATATGTTA
TTATAATACGCTATGATTGGGAAAATTTCTAGTATGACTGATTTATGAACCATGTGATAGGAATGTTTTCAGTTATGATTTCTAGTTTTCACGAACCCTTATATTTCAAA
AATGTGGTTGACCCCAAACTTATTAGGCATGATTTAGATGACAACTGATTTAAAAAATACTCTTTGATTTGTTATTAAACTTGGATGGTTTGAATAGACTGTTATTCTTT
AAGCTGTGTTGATGTCTAAGGATTGAAAATGACTTTATGAAAATGTGTTTCTGACCATGTTTGATTTTTTTTAAACATGAAATATGTGATTTTTATGAAAATAAGTTGGT
GATTTACCCACAAACTACGCGAATGTGTTTCCATAGCCACATCTAGGTTGTTAGGGGATTTTGGTGCTGAAAGAGGTTGTGACTAGAGATCTGAGAACTTGAGCCTAGAC
AAGGAAATGGATGGAGATTGCAATTTTGGCTTCAGGCCTAGACTAATCTAGGGAGGGTGGCTCTATGTGCATATAGGAGCACCAATATTGAGGATGTGAATATGGTTTTC
AAGTTACCACAGTGGTAGGGACTGAATATGTGTGAGCTCAGCTTGGAAATTAAAATATTTTTGAAAAATGTTTTTTGAAAAGTGCTTTCATGCCTCGGTCTAGGTTAGAG
GAGTAATCTGGAACAGAGTTTTACAAATATGGTACCAAAGCCAGGTTTTGAGGAACCTAAGGGTTTATAAGTATGATTTTGGTTAAAGTTTGAGGGTATGCATTAGAGTG
TTGGTATGAGTCATGGTAGCACGTGTAAGTTGCAATGAAACACGCCTCATCGTGGCAGACGGCTTAGGTAGCCACTGAGGAACTCGTGACGCTTCTGGTGGTTTGGAACT
TGGGGTTGAGGAATTTGAAGCTAAAACCAACCACCACCATGTTGAAAAGAACTTAGGAAATGTGCTTTTGAAGGCTCTAGGAATGACAACATTTTCAGCGATAGCTAATC
TCGCTGCATAAAGTTGCTTAGAGATGGGGAGGAATGTTTTATGGTTATAAGATGTCCAGGTCATAGGAAAGTGGTGGTGTTTAACTAAGTTATGCCACGATAGTAGACCA
ATTAGTTGGAAAGAATCCAAGCCAAAGAGCTAAAACTTGAGTTCTCTATTGATTTAAAAATTTTGTTGACTGGAGACTTTATAGTTCAGTACAAATAATCTGGAAAAATT
GTAAGAAGTGAAACTTGAACCTGCGTTTAGAGGGGGCTAGGAGACTGTTTTACCACTTGGTTAGCTTCTAGTATTGATACTTTTTTAGTTATATATAAAATAGCATAAAA
TTGAGGGGGGGCTTGCCTATACTAAATTCGTTGGTGATCTCAGAATTGAACAACTATACGCTCAAAGGTCCAAAGTAAATTAAGCTCACATTTGATCATAAAACATTGTC
ACTGGCAAAATAATTGAAGTCATCACCACTAAGACGTGATTTGTTTGAGCTACTTCATCTAGACGTAACTCAATTTTATTTTCACTAGTCAACCTTAAAATTAAGTCTTT
CAACACAAAGCACTTTTTAGTTGGATGACTGATGACCAGATGATATTTGTAGAAGTTAGGATCATTTACACTTTTGACATGATCCAGTCATTTTCATTCTGTAGTTGAGT
GAGTTGCTTTTCTAACAATTACTTCAAAATGTCTATAATATCAAAGTAAAAAATTGGATAAACTTTTTCCTGCTTTCTAATGCCTATTGCAATTGTTTTTCTTCTAAATT
TTGGTTTCTTTCCTTTTGGAAGAAAATTTCTATAGGGACATATTGATGTCCATTTATTCTTTTGTTTCACTCTTCAAAATATTTTTAGTGCCTTTTGTTTCCTTTTTACC
ATTTATTTATTTAAGGACTAGGAACCCCTTAGTTTCCCTATTGGTGTTGTTCAATTGTATATCATGAACATGAGTTGCCAACAGTTCAAATGTTCATGGTTTTATTCCTT
ATAAGATGTTGAGAAATCCCCAACGTATGTCTTAGTTGCACATCTACGTTGTTGATAATTCAATGAATCTATCTTTTAAGATAAAACTCAAAACTCTCCATTGGTTGATA
TAGTCAATGGTTGGTTTTTCCTTCAGCTATTTGGAATTCATCAACTCCATCATACCAATAGTGCATTTGGCGTTGTAGAAGCAATTTAGGAACTTTTTTTAGTTATTTGC
AACTGTCAATCACTTTGGGCTCTAGATCAGTATACCAATCAAAAAGTGATCCCCTTCAGGTCGTTAGCCTTGACTAGCAAGTCTTTTATGGTTCCCACGTTTTCACAGAT
TTTAATAAATTAAGTAATGCGTTGCTTTGAATTGTTATTTCCATCAAACAACTGGAATTTGGGTGTTGCTAGCTAGCAAACATTCTCAAGTTATTGGTTCTCTTAGTATA
TGGATTGGATTGCATAAAAAAAAGTTTGAGATGGTGTCATATTGAGGTATTATAGAGTTCATAATCATATCATGTAACTACCAAACTGACAACGAGGAGACAAAAGTGTA
CTGTTGTGTTGTTATTGCCTTTGACAACAAAAGTTTGACTTAACTCGACATTTTCTCAAGTCTTCATGGTCTCGCTCCTCGACAACTTTGATGAAAATCTCATCTCTAAG
AGATTTTATCATCTATTTTCAATTTTTTTTACGGAATGATCGGAGTTTGTTCTTATCCATACAAGATTTCTTTTGAGCAATTGTAGGTGATTGGTCCAAGTGCAAGAGTT
GCTTGTAGCAGTGGTTTTCGGATGCAACAATTTCTTATGTTATTGTTTGGTTTGAAAAACAAGAGGGAGATGTGAAGTAGAGAGGTCTCACCGAGCATGCCAAATTTTCT
AATAAGGCTTTTAGATAAACGTGTCTCATGGAGTTGAACTTGTGTTTATGTTGATTTATGTAAATGTAGTTCGATTTCTAGTTCTTGTAATTTTCTCTATTAAATGTGTA
CACTTGATTTGTAGAAGCGAAACACATGGATGTTCTTGAATTTTGGGTCTTTATCTTTAGAATATTTTTTATCTTAAGGAACATGGTTAACT
Protein sequenceShow/hide protein sequence
MAPAQVLSIPTLSFGFRPRKSDGPTGTIAASTTQKPHYHNHNHNKNLVVGSKLAAADHQYHRPTKSRVDVDRLVKILYDDLHHVFDEQGIDPTAYDEEVKFRDPITKYDD
IRGYLLNIALLRQFFRPQIILHWVKKTGPYEITTRWTAVMKFALLPWNPECVLTGTSIMTINPNTGKFCRHVDLWDSVQNNDYFSIEGLWDVFKQFRFYETPELELPKYQ
ILKRTANYEVRKYGPFAVAERSGENLFGCVGFNSVGCWGDCKEDDRIMKLRNKEGGIAAVLKFSGKATEEKVKNKTKELKHCLKNDGLKSINNNSCLLVRYNNSDQTWSF
VMRNEVLIWLQDFSI