; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi06G009240 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi06G009240
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSOUL heme-binding family protein
Genome locationchr06:19063873..19067979
RNA-Seq ExpressionLsi06G009240
SyntenyLsi06G009240
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata]3.5e-7076.24Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF++LPWKPE VLTGTSIM INP+TGKFCSHVDLWDS+QNNDYFSVE LWD FKQFRFYET ELESPKYQILKRTANYEVRKYAPF VVE +G  +S  
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
        A FN VGS  D K++  M +R+ EGGI AVLKFSG PT+D  Q KAKELR SLKKDGLK I N CLLARYN+  RTWSFVM
Subjt:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]4.6e-7075.69Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF++LPWKPE VLTGTSIM INP+TGKFCSHVDLWDS+QNNDYFSVE LWD FKQFRFYET ELESPKYQILKRTANYEVRKYAPF VVE +G  +S  
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
        A FN VGS+ D K++  M +R+ EGGI AVLKFSG PT+D  Q KAKELR SLKKDGLK I N CLLARYN+  RTW FVM
Subjt:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]1.6e-7075.69Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF++LPWKPE VLTGTSIM INP+TGKFCSHVD+WDS+QNNDYFS+E LWD FKQFRFYET ELESPKYQILKRTANYEVRKYAPF VVE +G  +S  
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
        A FN VGS+ D K++  M +R+ EGGI AVLKFSG PT+D  Q KAKELR SLKKDGLK I N CLLARYNN  RTWSFVM
Subjt:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

XP_038879853.1 uncharacterized protein LOC120071584 isoform X1 [Benincasa hispida]1.7e-8082.51Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKFMVLPWKPEFV+TGTSIM INP TGKFCSHVDLWDSVQNNDYFS+EGLWD FKQ RFYET ELESPKYQILKRTANYEVRKY P  V E SG+NL GC
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  --AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
          A F+ VGSW + KEDT   LRKNEGGIAAVLKFSGK TKD+VQNKAK+LRHSLKKDGLK INNS LLARYNN Y TWSFVM
Subjt:  --AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

XP_038879854.1 uncharacterized protein LOC120071584 isoform X2 [Benincasa hispida]1.7e-8082.51Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKFMVLPWKPEFV+TGTSIM INP TGKFCSHVDLWDSVQNNDYFS+EGLWD FKQ RFYET ELESPKYQILKRTANYEVRKY P  V E SG+NL GC
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  --AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
          A F+ VGSW + KEDT   LRKNEGGIAAVLKFSGK TKD+VQNKAK+LRHSLKKDGLK INNS LLARYNN Y TWSFVM
Subjt:  --AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

TrEMBL top hitse value%identityAlignment
A0A0A0LU04 Uncharacterized protein6.5e-7074.86Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF +LPWKPE VLTGTSIM+INP TGKFC HVDLWDSVQNNDYFS+EGLWD FKQFRFYET ELE PKYQ LKRT NYEVRKY PF   E SG+NL  C
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKE-DTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLI-NNSCLLARYNNPYRTWSFVM
           NS+G W D KE D IM+LR N+GGIAAVL FSGK T++KV+NKAKELRH LKKDGLK + NNSCLL RYN+   TWSFVM
Subjt:  AAFNSVGSWVDWKE-DTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLI-NNSCLLARYNNPYRTWSFVM

A0A5D3CVR4 Uncharacterized protein5.1e-6776.79Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF++LPWKPE VLTGTSIM++NP TGKFC HVDLWDSVQNNDYFS+EGLWD FKQFRFYE SELE PKYQ L RTANYEVRKY PF V E SG+NL GC
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKE-DTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLL
           NSVG W D KE D IMKLR  EGGIAAVL FSGK T++ V+NKAKELRH LKKDGL+ +NNSCLL
Subjt:  AAFNSVGSWVDWKE-DTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLL

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X22.1e-6874.59Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF++LPWKPE VLTGTSIM I+P TGKFC+HVDLWDSVQNN+YFS+EGLWD FKQFRFYET ELESP+YQILKRTANYEVRKYAPF  VET  D L G 
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
        A FN V  + D K+D I  LR  +GGIAAVLKFSGKP+++ VQ KAKELR+SL KDGLK I   CLLARYN+P RTWSFVM
Subjt:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

A0A6J1EZQ2 uncharacterized protein LOC1114408391.7e-7076.24Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF++LPWKPE VLTGTSIM INP+TGKFCSHVDLWDS+QNNDYFSVE LWD FKQFRFYET ELESPKYQILKRTANYEVRKYAPF VVE +G  +S  
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
        A FN VGS  D K++  M +R+ EGGI AVLKFSG PT+D  Q KAKELR SLKKDGLK I N CLLARYN+  RTWSFVM
Subjt:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

A0A6J1HKM5 uncharacterized protein LOC1114650222.2e-7075.69Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF++LPWKPE VLTGTSIM INP+TGKFCSHVDLWDS+QNNDYFSVE LWD FKQFRFYET ELESPKYQILKRTANYEVRKYAPF VVE +G  +S  
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM
        A FN VGS+ D K++  M +R+ EGGI AVLKFSG PT+D  Q KAKELR SLKKDGLK I N CLLARYN+  RTW FVM
Subjt:  AAFNSVGSWVDWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein6.0e-6052.44Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF+ LPWKPE V TG SIM +NP T KFCSH+DLWDS++NNDYFS+EGL D FKQ R Y+T +LE+PKYQILKRTANYEVR Y PF VVET GD LSG 
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMK--------------------------------------------LRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKD
        + FN+V  ++  K  T+ K                                            L+K EGG AA +KFSGKPT+D VQ K  ELR SL KD
Subjt:  AAFNSVGSWVDWKEDTIMK--------------------------------------------LRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKD

Query:  GLKLINNSCLLARYNNPYRTWSFVM
        GL+     C+LARYN+P RTW+F+M
Subjt:  GLKLINNSCLLARYNNPYRTWSFVM

AT5G20140.2 SOUL heme-binding family protein4.6e-6052.21Show/hide
Query:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC
        MKF+ LPWKPE V TG SIM +NP T KFCSH+DLWDS++NNDYFS+EGL D FKQ R Y+T +LE+PKYQILKRTANYEVR Y PF VVET GD LSG 
Subjt:  MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGC

Query:  AAFNSVGSWVDWKEDTIMK--------------------------------------------LRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKD
        + FN+V  ++  K  T+ K                                            L+K EGG AA +KFSGKPT+D VQ K  ELR SL KD
Subjt:  AAFNSVGSWVDWKEDTIMK--------------------------------------------LRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKD

Query:  GLKLINNSCLLARYNNPYRTWSFVMA
        GL+     C+LARYN+P RTW+F+M+
Subjt:  GLKLINNSCLLARYNNPYRTWSFVMA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTTATGGTTCTTCCATGGAAACCAGAATTTGTTTTGACAGGAACTTCCATCATGAGTATTAATCCACGTACTGGCAAGTTCTGTAGCCATGTGGATCTTTGGGA
TTCAGTACAAAACAATGACTACTTTTCTGTAGAAGGCCTCTGGGATGCATTTAAACAGTTTCGGTTTTATGAGACTTCAGAATTGGAATCACCCAAATATCAAATATTGA
AAAGGACTGCAAATTATGAGGTGAGAAAATATGCACCATTTACGGTGGTAGAGACAAGTGGAGATAACCTTTCTGGGTGTGCTGCATTTAATAGTGTTGGCAGTTGGGTA
GATTGGAAAGAGGACACAATTATGAAGTTGAGAAAGAATGAAGGAGGCATTGCTGCAGTGTTGAAATTCAGTGGGAAACCCACAAAAGATAAAGTGCAAAACAAGGCCAA
AGAATTGAGGCATAGTCTCAAAAAAGATGGTCTTAAACTCATTAATAATAGCTGTTTGCTTGCACGCTATAACAATCCCTACCGAACATGGAGCTTTGTAATGGCA
mRNA sequenceShow/hide mRNA sequence
CAACATTAAAATTGATCTTACACATATTCTAATTGATTGAGTTGATATTGTAGAGTGGACCATATGAGATAATACAAGATGGACTGCAGTAATGAAGTTTATGGTTCTTC
CATGGAAACCAGAATTTGTTTTGACAGGAACTTCCATCATGAGTATTAATCCACGTACTGGCAAGTTCTGTAGCCATGTGGATCTTTGGGATTCAGTACAAAACAATGAC
TACTTTTCTGTAGAAGGCCTCTGGGATGCATTTAAACAGTTTCGGTTTTATGAGACTTCAGAATTGGAATCACCCAAATATCAAATATTGAAAAGGACTGCAAATTATGA
GGTGAGAAAATATGCACCATTTACGGTGGTAGAGACAAGTGGAGATAACCTTTCTGGGTGTGCTGCATTTAATAGTGTTGGCAGTTGGGTAGATTGGAAAGAGGACACAA
TTATGAAGTTGAGAAAGAATGAAGGAGGCATTGCTGCAGTGTTGAAATTCAGTGGGAAACCCACAAAAGATAAAGTGCAAAACAAGGCCAAAGAATTGAGGCATAGTCTC
AAAAAAGATGGTCTTAAACTCATTAATAATAGCTGTTTGCTTGCACGCTATAACAATCCCTACCGAACATGGAGCTTTGTAATGGCA
Protein sequenceShow/hide protein sequence
MKFMVLPWKPEFVLTGTSIMSINPRTGKFCSHVDLWDSVQNNDYFSVEGLWDAFKQFRFYETSELESPKYQILKRTANYEVRKYAPFTVVETSGDNLSGCAAFNSVGSWV
DWKEDTIMKLRKNEGGIAAVLKFSGKPTKDKVQNKAKELRHSLKKDGLKLINNSCLLARYNNPYRTWSFVMA