; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G005480 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G005480
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionSOUL heme-binding protein
Genome locationCmo_Chr11:2669897..2671670
RNA-Seq ExpressionCmoCh11G005480
SyntenyCmoCh11G005480
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587902.1 hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sororia]9.9e-18897.29Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
        MATAQVSFQNFLSIPTVD GVRPRKSCGPTRAAQSRTGSPNWKSSIRSTL DQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYD+EVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTGTSIMGINP+TGKFCSHVDLWDSLQNNDYFS+EALWDVFKQ RF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGL PI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI

Query:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
        NGCLLARYN SGRTWSFVMRNEVLIWLEEFSI
Subjt:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

KAG7021789.1 hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma]7.6e-18897.29Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
        MATAQVSFQNFLSIPTVD GVRPRKSCGPTRAAQSRTGSPNWKSSIRSTL DQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYD+EVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFS+EALWDVFKQ RF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGL PI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI

Query:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
        NGCLLARYN SGRTWSFVMRNEVLIWL+EFSI
Subjt:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata]1.3e-192100Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
        MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI

Query:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
        NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
Subjt:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]1.9e-18395.78Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
        MATAQVSFQNFLSIPTVD GVRPRKS GPTRAAQSRT SPNWK SIRSTLADQ  QKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYD+EVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGS  DAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI

Query:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
        NGCLLARYN SGRTW FVMRNEV+IWL+EFSI
Subjt:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]4.2e-18696.99Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
        MATAQVSFQNFLSIPTVD GVRPRKS GPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYD+EVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
        DGISGYMLNIALLREFFRPEII HWVKKTGPYEITTRWTA+MKFILLPWKPELVLTGTSIMGINPQTGKFCSHVD+WDSLQNNDYFS+EALWDVFKQFRF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGS SDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI

Query:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
        NGCLLARYN+S RTWSFVMRNEVLIWLEEFSI
Subjt:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

TrEMBL top hitse value%identityAlignment
A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X12.3e-12662.44Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPT----RAAQSRTGS--PNWKSS---IRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEV
        MA  Q+S QNFLS PT+ S +RP KS   T    R  QSRT +  PN ++S   +R  L DQS  K TVDV RLVDF+Y+DL H+FDEQGIDRTAYD++V
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPT----RAAQSRTGS--PNWKSS---IRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEV

Query:  RFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEAL
        RFRDPITK+D ISGY+ NI+LLRE FRPE  LHWVK+TGPYEITTRWT IMKF LLPWKPEL+ TGTSIMGINP+TGKFCSHVDLWDS+QNNDYFSVE L
Subjt:  RFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEAL

Query:  WDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-------------------------------------
        WDVFKQ RFY+TPELESPKY ILKRT  YEVRKYAPF+VVE +G ++  SAGFN V      K                                     
Subjt:  WDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-------------------------------------

Query:  ------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
              ++D + +R++EGGI AVLKFSG PTE++ Q+KAKELR SL KDGLKP NGCLLARYN  GRTW+F+MRNEVLIWLEE+S+
Subjt:  ------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X12.8e-12762.27Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCG------PTRAAQSRT-----GSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDD
        MA  Q+S QNFLS PT   G RP KS G      P R  +SRT      + N K ++R +L DQS  K  VDVDRLVDF+Y+DLRH+FDEQGIDRTAYD+
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCG------PTRAAQSRT-----GSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDD

Query:  EVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVE
         VRFRDPITK+D ISGY  NI+LLRE FRPE  LHWVK+TGPYEITTRWT +MKF+LLPWKPE + TG SIMGINP+TGKFCSHVDLWDS+QNNDYFS+E
Subjt:  EVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVE

Query:  ALWDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-----------------------------------
         L DVFKQ RFY+TPELESPKY+ILKRTANYEVRKY PFVVVE +G ++  SAGFN V      K                                   
Subjt:  ALWDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-----------------------------------

Query:  --------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSGRTWSFVMRNEVLIWLEEFS
                ++DT+ +R++EGGI AVLKFSG PTEDM Q+KAKELR  L KDGLKP  GCLLARYN  GRTWSF+MRNEVLIWLEEFS
Subjt:  --------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSGRTWSFVMRNEVLIWLEEFS

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X23.1e-13473.29Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKS---CGP-TRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDP
        M   QVS QNFLSIPTV  G RP+KS    GP  R  +SRT     K  +RS LAD+S  K TVDVDRLVDF+Y+DLRHVFD QGID TAYD+ VRFRDP
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKS---CGP-TRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDP

Query:  ITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFK
        ITKY+GI GYMLNIALLR+ FRP+ +LHWVKKTGPYEITTRWTA+MKF+LLPWKPELVLTGTSIM I+P+TGKFC+HVDLWDS+QNN+YFS+E LWD+FK
Subjt:  ITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFK

Query:  QFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKK
        QFRFYETPELESP+YQILKRTANYEVRKYAPF+ VE    ++  SA FNRV    D KQ D +S+R ++GGI AVLKFSG P+E+M Q+KAKELR SL K
Subjt:  QFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKK

Query:  DGLKPINGCLLARYNHSGRTWSFVMRNEVLIWLEEFS
        DGLKPI GCLLARYN   RTWSFVMRNEVLIWLEEFS
Subjt:  DGLKPINGCLLARYNHSGRTWSFVMRNEVLIWLEEFS

A0A6J1EZQ2 uncharacterized protein LOC1114408396.5e-193100Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
        MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI

Query:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
        NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
Subjt:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

A0A6J1HKM5 uncharacterized protein LOC1114650229.4e-18495.78Show/hide
Query:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY
        MATAQVSFQNFLSIPTVD GVRPRKS GPTRAAQSRT SPNWK SIRSTLADQ  QKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYD+EVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGS  DAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPI

Query:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI
        NGCLLARYN SGRTW FVMRNEV+IWL+EFSI
Subjt:  NGCLLARYNHSGRTWSFVMRNEVLIWLEEFSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein3.0e-10255.31Show/hide
Query:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGT
        TV+++ LV F+Y+DL H+FD+QGID+TAYD+ V+FRDPITK+D ISGY+ NIA L+  F P+  LHW K+TGPYEITTRWT +MKFI LPWKPELV TG 
Subjt:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGT

Query:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------
        SIM +NP+T KFCSH+DLWDS++NNDYFS+E L DVFKQ R Y+TP+LE+PKYQILKRTANYEVR Y PF+VVE  G ++  S+GFN V           
Subjt:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------

Query:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSG
                                          S      E+ ++++++EGG  A +KFSG PTED+ Q K  ELR SL KDGL+   GC+LARYN  G
Subjt:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSG

Query:  RTWSFVMRNEVLIWLEEFSI
        RTW+F+MRNEV+IWLE+FS+
Subjt:  RTWSFVMRNEVLIWLEEFSI

AT5G20140.2 SOUL heme-binding family protein3.6e-9554.4Show/hide
Query:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGT
        TV+++ LV F+Y+DL H+FD+QGID+TAYD+ V+FRDPITK+D ISGY+ NIA L+  F P+  LHW K+TGPYEITTRWT +MKFI LPWKPELV TG 
Subjt:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGT

Query:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------
        SIM +NP+T KFCSH+DLWDS++NNDYFS+E L DVFKQ R Y+TP+LE+PKYQILKRTANYEVR Y PF+VVE  G ++  S+GFN V           
Subjt:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------

Query:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSG
                                          S      E+ ++++++EGG  A +KFSG PTED+ Q K  ELR SL KDGL+   GC+LARYN  G
Subjt:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSG

Query:  RTWSFVM
        RTW+F+M
Subjt:  RTWSFVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACAGCCCAAGTTTCCTTCCAAAACTTCCTTTCAATCCCAACCGTTGATTCTGGTGTCCGGCCAAGGAAATCCTGCGGACCGACCAGGGCCGCACAAAGCAGAAC
CGGAAGCCCAAATTGGAAGTCGTCTATTCGATCAACATTGGCAGATCAAAGCCGTCAGAAACCAACGGTGGACGTGGACCGACTGGTGGATTTCATGTACGACGATCTCC
GGCACGTATTCGACGAGCAGGGGATTGATCGGACGGCGTACGACGATGAAGTTAGATTTCGAGACCCGATTACAAAGTATGACGGAATTAGTGGGTATATGCTGAATATT
GCCCTGTTGCGAGAATTCTTCAGGCCGGAGATCATCTTGCACTGGGTCAAAAAGACTGGGCCATATGAGATAACTACAAGATGGACTGCGATAATGAAGTTTATCCTTCT
GCCATGGAAACCAGAGTTAGTGTTGACGGGAACTTCCATCATGGGTATCAATCCACAAACCGGCAAGTTCTGTAGCCATGTGGATCTTTGGGATTCACTGCAAAATAATG
ACTACTTTTCTGTAGAAGCCTTGTGGGATGTGTTTAAACAGTTTCGGTTTTATGAGACTCCAGAATTGGAATCGCCCAAATATCAAATACTGAAAAGGACTGCAAATTAT
GAGGTGAGAAAATATGCGCCATTTGTTGTGGTTGAAAGAAATGGACACCAGATTTCTGCCGGATTCAATAGGGTTGGTAGTTGCTCAGATGCTAAACAGGAGGACACAAT
GAGCATAAGAGAGATGGAAGGGGGCATTGGTGCAGTGTTGAAATTCAGTGGAGATCCCACAGAGGATATGGCTCAGCAAAAGGCAAAAGAATTACGATGTAGTCTAAAAA
AGGATGGGCTTAAACCCATAAATGGCTGTTTGCTTGCTCGCTACAACCACTCTGGCCGAACATGGAGCTTTGTAATGAGAAATGAGGTGCTAATATGGCTGGAAGAATTC
TCAATTTAG
mRNA sequenceShow/hide mRNA sequence
AATTTTCTATCCTTCTCCCAGCATTTAAAAAAATGTAGAAGTTGACAATGGCAGGCATTTCCCCCAAATTCTGAGAAGTTAAGCTCCGGTTGCCAATGGCGACAGCCCAA
GTTTCCTTCCAAAACTTCCTTTCAATCCCAACCGTTGATTCTGGTGTCCGGCCAAGGAAATCCTGCGGACCGACCAGGGCCGCACAAAGCAGAACCGGAAGCCCAAATTG
GAAGTCGTCTATTCGATCAACATTGGCAGATCAAAGCCGTCAGAAACCAACGGTGGACGTGGACCGACTGGTGGATTTCATGTACGACGATCTCCGGCACGTATTCGACG
AGCAGGGGATTGATCGGACGGCGTACGACGATGAAGTTAGATTTCGAGACCCGATTACAAAGTATGACGGAATTAGTGGGTATATGCTGAATATTGCCCTGTTGCGAGAA
TTCTTCAGGCCGGAGATCATCTTGCACTGGGTCAAAAAGACTGGGCCATATGAGATAACTACAAGATGGACTGCGATAATGAAGTTTATCCTTCTGCCATGGAAACCAGA
GTTAGTGTTGACGGGAACTTCCATCATGGGTATCAATCCACAAACCGGCAAGTTCTGTAGCCATGTGGATCTTTGGGATTCACTGCAAAATAATGACTACTTTTCTGTAG
AAGCCTTGTGGGATGTGTTTAAACAGTTTCGGTTTTATGAGACTCCAGAATTGGAATCGCCCAAATATCAAATACTGAAAAGGACTGCAAATTATGAGGTGAGAAAATAT
GCGCCATTTGTTGTGGTTGAAAGAAATGGACACCAGATTTCTGCCGGATTCAATAGGGTTGGTAGTTGCTCAGATGCTAAACAGGAGGACACAATGAGCATAAGAGAGAT
GGAAGGGGGCATTGGTGCAGTGTTGAAATTCAGTGGAGATCCCACAGAGGATATGGCTCAGCAAAAGGCAAAAGAATTACGATGTAGTCTAAAAAAGGATGGGCTTAAAC
CCATAAATGGCTGTTTGCTTGCTCGCTACAACCACTCTGGCCGAACATGGAGCTTTGTAATGAGAAATGAGGTGCTAATATGGCTGGAAGAATTCTCAATTTAGTCCAAC
AAGTCCAACTTCATTTTATCCGCACCAATCAAAATAAGAGAGACAACTATAATCTGATTGATAATTTTACAAACTTTTAAATCTATATTCAATTATTAATAAGAATTACA
TTGTAACAGTCTTGCCTAATTGACATTTTTACCCAAGTTTAGTTTCTCG
Protein sequenceShow/hide protein sequence
MATAQVSFQNFLSIPTVDSGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLADQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDDEVRFRDPITKYDGISGYMLNI
ALLREFFRPEIILHWVKKTGPYEITTRWTAIMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSVEALWDVFKQFRFYETPELESPKYQILKRTANY
EVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLKPINGCLLARYNHSGRTWSFVMRNEVLIWLEEF
SI