CuGenDBv2

ClCG06G010680 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG06G010680
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: SOUL heme-binding family protein
Genome location: CG_Chr06:22991939..22997317
RNA-Seq Expression: ClCG06G010680
Synteny: ClCG06G010680
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains:
    IPR006917 - SOUL haem-binding protein
    IPR011256 - Regulatory factor, effector binding domain superfamily
    IPR018790 - Protein of unknown function DUF2358
    IPR032710 - NTF2-like domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004151455.2 uncharacterized protein LOC101205468 [Cucumis sativus]; e value: 1.1e-141; % identity: 73.68
Query:  MAPAQALSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNH---WGVGSKMG----DHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR
        MAPAQ LSIPT  FGFR R S+GPTRT+AAA   KP NHNH     VGSK+      H +P KS VDV++LV+FLYDDLHHVFDEQGID TAYDEE+ FR
Subjt:  MAPAQALSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNH---WGVGSKMG----DHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR

Query:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV
        DP+TKY DI GYLLNIALLR+FF PQ+ILHWVKKTGP+EITTRWTA MKF LLPWKPE VLTGTSIM INP+TGKFC HVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV

Query:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL
         KQFRFYETPELE PKYQ LKRT NYEVR+Y PF  AE  G N F C   N +G W DCKED  +   +N+GGIAAVL FSGK TE  V+NKAKELR  L
Subjt:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL

Query:  KKDGLKPI-NNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        KKDGLK + NNSCLL RYN+S  TWSF+MRNEVLIWL+DFSI
Subjt:  KKDGLKPI-NNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]; e value: 1.2e-135; % identity: 73.31
Query:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHWGVGSKMGD--HHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR
        MA AQ      LSIPTV FG R RKS GPTR   AA +     +  W + S + D  H KP   TVDV+RLV+F+YDDL HVFDEQGIDRTAYDEEVRFR
Subjt:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHWGVGSKMGD--HHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR

Query:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV
        DP+TKYD I+GY+LNIALLR+FFRP++ILHWVKKTGP+EITTRWTAVMKFILLPWKPE VLTGTSIMGINP TGKFCSHVDLWDS+QNNDYFS+E LWDV
Subjt:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV

Query:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL
         KQFRFYETPELESPKYQILKRTANYEVR+YAPF V E  GH     AGFNRVGS+ D K++  M+ R+ EGGI AVLKFSG PTE+M Q KAKELR  L
Subjt:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL

Query:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        KKDGLKPI N CLLARYN+S RTW F+MRNEV+IWL++FSI
Subjt:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]; e value: 2.6e-135; % identity: 73.31
Query:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHW--GVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR
        MA AQ      LSIPTV FG R RKS GPTR     AA       +W   + S + D  +  K TVDV+RLV+F+YDDL HVFDEQGIDRTAYDEEVRFR
Subjt:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHW--GVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR

Query:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV
        DP+TKYD I+GY+LNIALLR+FFRP++I HWVKKTGP+EITTRWTAVMKFILLPWKPE VLTGTSIMGINP TGKFCSHVD+WDS+QNNDYFS+E LWDV
Subjt:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV

Query:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL
         KQFRFYETPELESPKYQILKRTANYEVR+YAPF V E  GH     AGFNRVGS++D K++  M+ R+ EGGI AVLKFSG PTE+M Q KAKELR  L
Subjt:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL

Query:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        KKDGLKPI N CLLARYNNS RTWSF+MRNEVLIWLE+FSI
Subjt:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

XP_038879853.1 uncharacterized protein LOC120071584 isoform X1 [Benincasa hispida]; e value: 2.8e-158; % identity: 82.6
Query:  MAPAQALSIPTVGFGFRLRKSEGPT-RTV-AAAAAHKPQNHN-HWGVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLT
        MAP QALSIP VGFGF  RKS GPT RT+ AAAAA+KP +HN +WGV SKMGDH +P KSTVDVERLVEFLY+DLHHVFDEQGIDRTAYDEE+RFRDP+T
Subjt:  MAPAQALSIPTVGFGFRLRKSEGPT-RTV-AAAAAHKPQNHN-HWGVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLT

Query:  KYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQF
        KYD+I GYLLNIALL  FFRP+MILHWVKKTGP+EITTRWTAVMKF++LPWKPEFV+TGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV KQ 
Subjt:  KYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQF

Query:  RFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGC--AGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKK
        RFYETPELESPKYQILKRTANYEVR+Y P  VAE  G N  GC  A F+RVGSWA+CKED   N RKNEGGIAAVLKFSGK T++ VQNKAK+LR  LKK
Subjt:  RFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGC--AGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKK

Query:  DGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        DGLKPINNS LLARYNNSY TWSF+MRNEVLIWLEDFSI
Subjt:  DGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

XP_038879854.1 uncharacterized protein LOC120071584 isoform X2 [Benincasa hispida]; e value: 5.0e-155; % identity: 81.71
Query:  MAPAQALSIPTVGFGFRLRKSEGPT-RTV-AAAAAHKPQNHN-HWGVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLT
        MAP QALSIP VGFGF  RKS GPT RT+ AAAAA+KP +HN +WGV SKMGDH +P KSTVDVERLVEFLY+DLHHVFDEQGIDRTAYDEE+RFRDP+T
Subjt:  MAPAQALSIPTVGFGFRLRKSEGPT-RTV-AAAAAHKPQNHN-HWGVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLT

Query:  KYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQF
        KYD+I GYLLNIALL  FFRP+MILHW   TGP+EITTRWTAVMKF++LPWKPEFV+TGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV KQ 
Subjt:  KYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQF

Query:  RFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGC--AGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKK
        RFYETPELESPKYQILKRTANYEVR+Y P  VAE  G N  GC  A F+RVGSWA+CKED   N RKNEGGIAAVLKFSGK T++ VQNKAK+LR  LKK
Subjt:  RFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGC--AGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKK

Query:  DGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        DGLKPINNS LLARYNNSY TWSF+MRNEVLIWLEDFSI
Subjt:  DGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LU04 Uncharacterized protein; e value: 5.2e-142; % identity: 73.68
Query:  MAPAQALSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNH---WGVGSKMG----DHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR
        MAPAQ LSIPT  FGFR R S+GPTRT+AAA   KP NHNH     VGSK+      H +P KS VDV++LV+FLYDDLHHVFDEQGID TAYDEE+ FR
Subjt:  MAPAQALSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNH---WGVGSKMG----DHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR

Query:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV
        DP+TKY DI GYLLNIALLR+FF PQ+ILHWVKKTGP+EITTRWTA MKF LLPWKPE VLTGTSIM INP+TGKFC HVDLWDSVQNNDYFSIEGLWDV
Subjt:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV

Query:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL
         KQFRFYETPELE PKYQ LKRT NYEVR+Y PF  AE  G N F C   N +G W DCKED  +   +N+GGIAAVL FSGK TE  V+NKAKELR  L
Subjt:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL

Query:  KKDGLKPI-NNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        KKDGLK + NNSCLL RYN+S  TWSF+MRNEVLIWL+DFSI
Subjt:  KKDGLKPI-NNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

A0A5D3CVR4 Uncharacterized protein; e value: 8.9e-134; % identity: 71.18
Query:  MAPAQALSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHN-HWGVGSKMG----DHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDP
        MAPA  LS+PTV  GFR RKS+G T+T+AAA   +P NHN +W VGSK+      + +P KS VDV+RLV+FLYDDLHHVFDEQGID +AYDEE+ FRDP
Subjt:  MAPAQALSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHN-HWGVGSKMG----DHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDP

Query:  LTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLK
        +TK+DDI GYLLNIALLR+FF PQ+ILHWVKKTGP+EITTRWTAVMKF+LLPWKPE VLTGTSIM +NP+TGKFC HVDLWDSVQNNDYFSIEGLWDV K
Subjt:  LTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLK

Query:  QFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKE-DKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLK
        QFRFYE  ELE PKYQ L RTANYEVR+Y PF VAE  G N FGC   N VG W DCKE D+IM  R  EGGIAAVL FSGK TE MV+NKAKELR  LK
Subjt:  QFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKE-DKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLK

Query:  KDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        KDGL+ +NNSCLL              RNEVLIWL+DFSI
Subjt:  KDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X2; e value: 4.9e-124; % identity: 70.34
Query:  LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHWGVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLTKYDDIAGYL
        LSIPTVG GFR +KS   T         + +      V S++ D   P KSTVDV+RLV+FLY+DL HVFD QGID TAYDE VRFRDP+TKY+ I GY+
Subjt:  LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHWGVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLTKYDDIAGYL

Query:  LNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQFRFYETPELE
        LNIALLR+ FRPQ +LHWVKKTGP+EITTRWTAVMKF+LLPWKPE VLTGTSIM I+P TGKFC+HVDLWDSVQNN+YFS+EGLWD+ KQFRFYETPELE
Subjt:  LNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQFRFYETPELE

Query:  SPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKKDGLKPINNSCL
        SP+YQILKRTANYEVR+YAPF   ETG    +G A FNRV  + D K+D I + R  +GGIAAVLKFSGKP+ENMVQ KAKELR  L KDGLKPI   CL
Subjt:  SPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKKDGLKPINNSCL

Query:  LARYNNSYRTWSFLMRNEVLIWLEDFS
        LARYN+  RTWSF+MRNEVLIWLE+FS
Subjt:  LARYNNSYRTWSFLMRNEVLIWLEDFS

A0A6J1EZQ2 uncharacterized protein LOC111440839; e value: 6.8e-134; % identity: 72.73
Query:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHW--GVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR
        MA AQ      LSIPTV  G R RKS GPTR     AA       +W   + S + D  +  K TVDV+RLV+F+YDDL HVFDEQGIDRTAYD+EVRFR
Subjt:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHW--GVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR

Query:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV
        DP+TKYD I+GY+LNIALLR+FFRP++ILHWVKKTGP+EITTRWTA+MKFILLPWKPE VLTGTSIMGINP TGKFCSHVDLWDS+QNNDYFS+E LWDV
Subjt:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV

Query:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL
         KQFRFYETPELESPKYQILKRTANYEVR+YAPF V E  GH     AGFNRVGS +D K++  M+ R+ EGGI AVLKFSG PTE+M Q KAKELR  L
Subjt:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL

Query:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        KKDGLKPI N CLLARYN+S RTWSF+MRNEVLIWLE+FSI
Subjt:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

A0A6J1HKM5 uncharacterized protein LOC111465022; e value: 5.6e-136; % identity: 73.31
Query:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHWGVGSKMGD--HHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR
        MA AQ      LSIPTV FG R RKS GPTR   AA +     +  W + S + D  H KP   TVDV+RLV+F+YDDL HVFDEQGIDRTAYDEEVRFR
Subjt:  MAPAQA-----LSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHWGVGSKMGD--HHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFR

Query:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV
        DP+TKYD I+GY+LNIALLR+FFRP++ILHWVKKTGP+EITTRWTAVMKFILLPWKPE VLTGTSIMGINP TGKFCSHVDLWDS+QNNDYFS+E LWDV
Subjt:  DPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDV

Query:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL
         KQFRFYETPELESPKYQILKRTANYEVR+YAPF V E  GH     AGFNRVGS+ D K++  M+ R+ EGGI AVLKFSG PTE+M Q KAKELR  L
Subjt:  LKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCL

Query:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
        KKDGLKPI N CLLARYN+S RTW F+MRNEV+IWL++FSI
Subjt:  KKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT2G37970.1 SOUL heme-binding family protein; e value: 3.9e-04; % identity: 44.29
Query:  KNEGGIA-AVLKFSGKPTENMVQNKAKELRQCLKKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLE
        K EGG    V+KFSG  +E++V  K K+L   L+KDG K I    +LARYN  +    F   NEV+I +E
Subjt:  KNEGGIA-AVLKFSGKPTENMVQNKAKELRQCLKKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLE

AT5G20140.1 SOUL heme-binding family protein; e value: 2.6e-101; % identity: 55.56
Query:  KMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILL
        ++G     A STV++E LV FLY+DL H+FD+QGID+TAYDE V+FRDP+TK+D I+GYL NIA L+  F PQ  LHW K+TGP+EITTRWT VMKFI L
Subjt:  KMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILL

Query:  PWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRV
        PWKPE V TG SIM +NP T KFCSH+DLWDS++NNDYFS+EGL DV KQ R Y+TP+LE+PKYQILKRTANYEVR Y PF V ET G    G +GFN V
Subjt:  PWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRV

Query:  GSWADCK---------------------------------------------EDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKKDGLKPI
          +   K                                             E+K+ N +K EGG AA +KFSGKPTE++VQ K  ELR  L KDGL+  
Subjt:  GSWADCK---------------------------------------------EDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKKDGLKPI

Query:  NNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI
           C+LARYN+  RTW+F+MRNEV+IWLEDFS+
Subjt:  NNSCLLARYNNSYRTWSFLMRNEVLIWLEDFSI

AT5G20140.2 SOUL heme-binding family protein; e value: 4.0e-94; % identity: 53.64
Query:  KMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILL
        ++G     A STV++E LV FLY+DL H+FD+QGID+TAYDE V+FRDP+TK+D I+GYL NIA L+  F PQ  LHW K+TGP+EITTRWT VMKFI L
Subjt:  KMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLTKYDDIAGYLLNIALLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILL

Query:  PWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRV
        PWKPE V TG SIM +NP T KFCSH+DLWDS++NNDYFS+EGL DV KQ R Y+TP+LE+PKYQILKRTANYEVR Y PF V ET G    G +GFN V
Subjt:  PWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQFRFYETPELESPKYQILKRTANYEVREYAPFTVAETGGHNPFGCAGFNRV

Query:  GSWADCK---------------------------------------------EDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKKDGLKPI
          +   K                                             E+K+ N +K EGG AA +KFSGKPTE++VQ K  ELR  L KDGL+  
Subjt:  GSWADCK---------------------------------------------EDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKKDGLKPI

Query:  NNSCLLARYNNSYRTWSFLMRNEVLIWLED
           C+LARYN+  RTW+F+M ++VL +  D
Subjt:  NNSCLLARYNNSYRTWSFLMRNEVLIWLED


Sequences
CDS sequence
ATGGCTCCTGCCCAAGCTCTCTCAATCCCAACCGTTGGTTTTGGTTTCCGACTAAGGAAATCCGAGGGACCAACCAGAACCGTAGCCGCCGCAGCAGCTCATAAACCTCA
GAATCACAACCATTGGGGTGTTGGATCAAAAATGGGAGATCATCATAAGCCAGCGAAATCGACGGTGGACGTAGAGAGATTGGTGGAATTCTTATACGATGATCTCCACC
ACGTGTTCGATGAGCAAGGGATTGATCGGACGGCTTACGACGAAGAAGTGAGATTTCGAGATCCACTTACTAAATACGATGACATTGCGGGGTATTTGCTAAATATTGCC
CTCTTGCGAAAATTCTTTAGGCCTCAGATGATCTTGCACTGGGTCAAAAAGACTGGACCATTTGAGATAACTACAAGATGGACTGCAGTAATGAAGTTTATCCTTCTACC
ATGGAAACCAGAATTTGTTTTGACAGGAACTTCCATTATGGGTATTAATCCACACACTGGCAAGTTTTGTAGCCATGTGGATCTTTGGGATTCAGTACAAAATAATGATT
ACTTTTCTATAGAAGGTCTCTGGGATGTATTAAAACAGTTTCGTTTTTATGAAACTCCAGAGTTGGAATCGCCCAAATATCAAATATTGAAAAGGACTGCAAATTATGAG
GTGAGAGAATATGCACCATTTACAGTGGCAGAAACAGGTGGACACAACCCCTTTGGGTGTGCTGGATTTAATCGTGTTGGCAGTTGGGCAGATTGCAAAGAGGACAAAAT
TATGAACTCGAGAAAGAATGAAGGAGGCATTGCTGCAGTGTTGAAATTCAGTGGAAAACCCACAGAAAATATGGTGCAAAACAAAGCCAAAGAATTGAGACAGTGTCTCA
AAAAAGATGGTCTTAAACCCATTAATAATAGCTGTTTGCTTGCACGGTACAACAATTCCTACCGAACATGGAGTTTTCTAATGAGAAATGAAGTGCTAATATGGCTTGAA
GATTTCTCAATTTAG
mRNA sequence
TTTAGGATTCATTCCATTCCAATGGCTCCTGCCCAAGCTCTCTCAATCCCAACCGTTGGTTTTGGTTTCCGACTAAGGAAATCCGAGGGACCAACCAGAACCGTAGCCGC
CGCAGCAGCTCATAAACCTCAGAATCACAACCATTGGGGTGTTGGATCAAAAATGGGAGATCATCATAAGCCAGCGAAATCGACGGTGGACGTAGAGAGATTGGTGGAAT
TCTTATACGATGATCTCCACCACGTGTTCGATGAGCAAGGGATTGATCGGACGGCTTACGACGAAGAAGTGAGATTTCGAGATCCACTTACTAAATACGATGACATTGCG
GGGTATTTGCTAAATATTGCCCTCTTGCGAAAATTCTTTAGGCCTCAGATGATCTTGCACTGGGTCAAAAAGACTGGACCATTTGAGATAACTACAAGATGGACTGCAGT
AATGAAGTTTATCCTTCTACCATGGAAACCAGAATTTGTTTTGACAGGAACTTCCATTATGGGTATTAATCCACACACTGGCAAGTTTTGTAGCCATGTGGATCTTTGGG
ATTCAGTACAAAATAATGATTACTTTTCTATAGAAGGTCTCTGGGATGTATTAAAACAGTTTCGTTTTTATGAAACTCCAGAGTTGGAATCGCCCAAATATCAAATATTG
AAAAGGACTGCAAATTATGAGGTGAGAGAATATGCACCATTTACAGTGGCAGAAACAGGTGGACACAACCCCTTTGGGTGTGCTGGATTTAATCGTGTTGGCAGTTGGGC
AGATTGCAAAGAGGACAAAATTATGAACTCGAGAAAGAATGAAGGAGGCATTGCTGCAGTGTTGAAATTCAGTGGAAAACCCACAGAAAATATGGTGCAAAACAAAGCCA
AAGAATTGAGACAGTGTCTCAAAAAAGATGGTCTTAAACCCATTAATAATAGCTGTTTGCTTGCACGGTACAACAATTCCTACCGAACATGGAGTTTTCTAATGAGAAAT
GAAGTGCTAATATGGCTTGAAGATTTCTCAATTTAGTACAGCACGCAAGTCAAACACGGCTTTACGAAAATCCCAATCCCTTGTTCATATATATTGTTTTTTTTCCTAAA
ATTAGTTGTGAATAGATAAGTTATTTGGTGTAATATCAGAAAGTCAATATGTTTG
Protein sequence
MAPAQALSIPTVGFGFRLRKSEGPTRTVAAAAAHKPQNHNHWGVGSKMGDHHKPAKSTVDVERLVEFLYDDLHHVFDEQGIDRTAYDEEVRFRDPLTKYDDIAGYLLNIA
LLRKFFRPQMILHWVKKTGPFEITTRWTAVMKFILLPWKPEFVLTGTSIMGINPHTGKFCSHVDLWDSVQNNDYFSIEGLWDVLKQFRFYETPELESPKYQILKRTANYE
VREYAPFTVAETGGHNPFGCAGFNRVGSWADCKEDKIMNSRKNEGGIAAVLKFSGKPTENMVQNKAKELRQCLKKDGLKPINNSCLLARYNNSYRTWSFLMRNEVLIWLE
DFSI