; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg12134 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg12134
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionSOUL heme-binding family protein
Genome locationCarg_Chr11:2392969..2395074
RNA-Seq ExpressionCarg12134
SyntenyCarg12134
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587902.1 hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sororia]3.0e-19299.4Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
        MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINP+TGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI

Query:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
        NGCLLARYNDSGRTWSFVMRNEVLIWL+EFSI
Subjt:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

KAG7021789.1 hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma]4.6e-193100Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
        MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI

Query:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
        NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
Subjt:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata]2.6e-18897.29Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
        MATAQVSFQNFLSIPTVD GVRPRKSCGPTRAAQSRTGSPNWKSSIRSTL DQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYD+EVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFS+EALWDVFKQ RF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGL PI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI

Query:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
        NGCLLARYN SGRTWSFVMRNEVLIWL+EFSI
Subjt:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]2.3e-18496.08Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
        MATAQVSFQNFLSIPTVDFGVRPRKS GPTRAAQSRT SPNWK SIRSTL DQ  QKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFS+EALWDVFKQ RF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGS  DAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGL PI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI

Query:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
        NGCLLARYNDSGRTW FVMRNEV+IWLQEFSI
Subjt:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]3.2e-18696.99Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
        MATAQVSFQNFLSIPTVDFGVRPRKS GPTRAAQSRTGSPNWKSSIRSTL DQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
        DGISGYMLNIALLREFFRPEII HWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVD+WDSLQNNDYFSLEALWDVFKQ RF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGS SDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGL PI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI

Query:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
        NGCLLARYN+S RTWSFVMRNEVLIWL+EFSI
Subjt:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

TrEMBL top hitse value%identityAlignment
A0A1S3CJ12 uncharacterized protein LOC103501513 isoform X15.2e-12661.92Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPT----RAAQSRTGS--PNWKSS---IRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEV
        MA  Q+S QNFLS PT+   +RP KS   T    R  QSRT +  PN ++S   +R  L DQS  K TVDV RLVDF+Y+DL H+FDEQGIDRTAYDE+V
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPT----RAAQSRTGS--PNWKSS---IRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEV

Query:  RFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEAL
        RFRDPITK+D ISGY+ NI+LLRE FRPE  LHWVK+TGPYEITTRWT +MKF LLPWKPEL+ TGTSIMGINP+TGKFCSHVDLWDS+QNNDYFS+E L
Subjt:  RFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEAL

Query:  WDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-------------------------------------
        WDVFKQLRFY+TPELESPKY ILKRT  YEVRKYAPF+VVE +G ++  SAGFN V      K                                     
Subjt:  WDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-------------------------------------

Query:  ------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
              ++D + +R++EGGI AVLKFSG PTE++ Q+KAKELR SL KDGL P NGCLLARYND GRTW+F+MRNEVLIWL+E+S+
Subjt:  ------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X11.3e-12963.31Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCG------PTRAAQSRT-----GSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDE
        MA  Q+S QNFLS PT  FG RP KS G      P R  +SRT      + N K ++R +L DQS  K  VDVDRLVDF+Y+DLRH+FDEQGIDRTAYDE
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCG------PTRAAQSRT-----GSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDE

Query:  EVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLE
         VRFRDPITK+D ISGY  NI+LLRE FRPE  LHWVK+TGPYEITTRWT VMKF+LLPWKPE + TG SIMGINP+TGKFCSHVDLWDS+QNNDYFSLE
Subjt:  EVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLE

Query:  ALWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-----------------------------------
         L DVFKQLRFY+TPELESPKY+ILKRTANYEVRKY PFVVVE +G ++  SAGFN V      K                                   
Subjt:  ALWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAK-----------------------------------

Query:  --------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSGRTWSFVMRNEVLIWLQEFS
                ++DT+ +R++EGGI AVLKFSG PTEDM Q+KAKELR  L KDGL P  GCLLARYND GRTWSF+MRNEVLIWL+EFS
Subjt:  --------QEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSGRTWSFVMRNEVLIWLQEFS

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X24.0e-13473.29Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKS---CGP-TRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDP
        M   QVS QNFLSIPTV  G RP+KS    GP  R  +SRT     K  +RS L D+S  K TVDVDRLVDF+Y+DLRHVFD QGID TAYDE VRFRDP
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKS---CGP-TRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDP

Query:  ITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFK
        ITKY+GI GYMLNIALLR+ FRP+ +LHWVKKTGPYEITTRWTAVMKF+LLPWKPELVLTGTSIM I+P+TGKFC+HVDLWDS+QNN+YFSLE LWD+FK
Subjt:  ITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFK

Query:  QLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKK
        Q RFYETPELESP+YQILKRTANYEVRKYAPF+ VE    ++  SA FNRV    D KQ D +S+R ++GGI AVLKFSG P+E+M Q+KAKELR SL K
Subjt:  QLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKK

Query:  DGLNPINGCLLARYNDSGRTWSFVMRNEVLIWLQEFS
        DGL PI GCLLARYND  RTWSFVMRNEVLIWL+EFS
Subjt:  DGLNPINGCLLARYNDSGRTWSFVMRNEVLIWLQEFS

A0A6J1EZQ2 uncharacterized protein LOC1114408391.3e-18897.29Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
        MATAQVSFQNFLSIPTVD GVRPRKSCGPTRAAQSRTGSPNWKSSIRSTL DQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYD+EVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFS+EALWDVFKQ RF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGL PI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI

Query:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
        NGCLLARYN SGRTWSFVMRNEVLIWL+EFSI
Subjt:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

A0A6J1HKM5 uncharacterized protein LOC1114650221.1e-18496.08Show/hide
Query:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
        MATAQVSFQNFLSIPTVDFGVRPRKS GPTRAAQSRT SPNWK SIRSTL DQ  QKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY
Subjt:  MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKY

Query:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF
        DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFS+EALWDVFKQ RF
Subjt:  DGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRF

Query:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI
        YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGS  DAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGL PI
Subjt:  YETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPI

Query:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI
        NGCLLARYNDSGRTW FVMRNEV+IWLQEFSI
Subjt:  NGCLLARYNDSGRTWSFVMRNEVLIWLQEFSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein5.6e-10456.56Show/hide
Query:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGT
        TV+++ LV F+Y+DL H+FD+QGID+TAYDE V+FRDPITK+D ISGY+ NIA L+  F P+  LHW K+TGPYEITTRWT VMKFI LPWKPELV TG 
Subjt:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGT

Query:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------
        SIM +NP+T KFCSH+DLWDS++NNDYFSLE L DVFKQLR Y+TP+LE+PKYQILKRTANYEVR Y PF+VVE  G ++  S+GFN V           
Subjt:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------

Query:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSG
                                          S      E+ ++++++EGG  A +KFSG PTED+ Q K  ELR SL KDGL    GC+LARYND G
Subjt:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSG

Query:  RTWSFVMRNEVLIWLQEFSI
        RTW+F+MRNEV+IWL++FS+
Subjt:  RTWSFVMRNEVLIWLQEFSI

AT5G20140.2 SOUL heme-binding family protein3.0e-9756.03Show/hide
Query:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGT
        TV+++ LV F+Y+DL H+FD+QGID+TAYDE V+FRDPITK+D ISGY+ NIA L+  F P+  LHW K+TGPYEITTRWT VMKFI LPWKPELV TG 
Subjt:  TVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNIALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGT

Query:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------
        SIM +NP+T KFCSH+DLWDS++NNDYFSLE L DVFKQLR Y+TP+LE+PKYQILKRTANYEVR Y PF+VVE  G ++  S+GFN V           
Subjt:  SIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFVVVERNGHQI--SAGFNRVG----------

Query:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSG
                                          S      E+ ++++++EGG  A +KFSG PTED+ Q K  ELR SL KDGL    GC+LARYND G
Subjt:  ----------------------------------SCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSG

Query:  RTWSFVM
        RTW+F+M
Subjt:  RTWSFVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACAGCCCAAGTTTCCTTCCAAAACTTCCTTTCAATCCCAACCGTTGATTTTGGTGTCCGGCCAAGGAAATCCTGCGGACCGACCAGGGCCGCACAAAGCAGAAC
CGGAAGCCCAAATTGGAAGTCGTCTATTCGATCAACATTGGGAGATCAAAGCCGTCAGAAACCAACGGTGGACGTGGACCGACTGGTGGATTTCATGTACGACGATCTCC
GGCACGTATTCGACGAGCAGGGGATTGATCGGACGGCGTACGACGAAGAAGTTAGATTTCGAGACCCGATTACAAAGTATGACGGAATTAGTGGGTATATGCTGAATATT
GCCCTGTTGCGAGAATTCTTCAGGCCGGAGATCATCTTGCACTGGGTCAAAAAGACTGGGCCATATGAGATAACCACAAGATGGACTGCGGTGATGAAGTTTATCCTTCT
GCCATGGAAACCAGAGTTAGTGTTGACGGGAACTTCCATCATGGGTATCAATCCACAAACCGGCAAGTTCTGTAGCCATGTGGATCTTTGGGATTCACTGCAAAATAATG
ACTACTTTTCTCTAGAAGCCTTGTGGGATGTGTTTAAACAGTTACGGTTTTATGAGACTCCAGAATTGGAATCGCCCAAATATCAAATACTGAAAAGGACTGCAAATTAT
GAGGTGAGAAAATATGCGCCATTTGTTGTGGTTGAAAGAAATGGACACCAGATTTCTGCCGGATTCAATAGGGTTGGTAGTTGCTCAGATGCTAAACAGGAGGACACAAT
GAGCATAAGAGAGATGGAAGGAGGCATTGGTGCAGTGTTGAAATTCAGTGGAGATCCCACAGAGGATATGGCTCAGCAAAAGGCAAAGGAATTACGATGTAGTCTAAAAA
AGGATGGCCTTAACCCCATAAATGGCTGTTTGCTTGCTCGCTACAACGACTCTGGCCGAACATGGAGCTTTGTAATGAGAAATGAGGTGCTAATATGGCTGCAAGAATTC
TCAATTTAG
mRNA sequenceShow/hide mRNA sequence
TTTCAATTTTATTTTTCATCTTATTATCAGAGAGAGATTGTTATTGTAATGTGAAGAATTATTGGAGTGTGAAGAAATAGTTGCAAGCCTTCGTTCAACGTATCAATTTA
ATATTTTAGAGAGTTGTGGCATTTCCCCCAAATTCTGAGAGGTTAAGCTCCGGTTGCCAATGGCGACAGCCCAAGTTTCCTTCCAAAACTTCCTTTCAATCCCAACCGTT
GATTTTGGTGTCCGGCCAAGGAAATCCTGCGGACCGACCAGGGCCGCACAAAGCAGAACCGGAAGCCCAAATTGGAAGTCGTCTATTCGATCAACATTGGGAGATCAAAG
CCGTCAGAAACCAACGGTGGACGTGGACCGACTGGTGGATTTCATGTACGACGATCTCCGGCACGTATTCGACGAGCAGGGGATTGATCGGACGGCGTACGACGAAGAAG
TTAGATTTCGAGACCCGATTACAAAGTATGACGGAATTAGTGGGTATATGCTGAATATTGCCCTGTTGCGAGAATTCTTCAGGCCGGAGATCATCTTGCACTGGGTCAAA
AAGACTGGGCCATATGAGATAACCACAAGATGGACTGCGGTGATGAAGTTTATCCTTCTGCCATGGAAACCAGAGTTAGTGTTGACGGGAACTTCCATCATGGGTATCAA
TCCACAAACCGGCAAGTTCTGTAGCCATGTGGATCTTTGGGATTCACTGCAAAATAATGACTACTTTTCTCTAGAAGCCTTGTGGGATGTGTTTAAACAGTTACGGTTTT
ATGAGACTCCAGAATTGGAATCGCCCAAATATCAAATACTGAAAAGGACTGCAAATTATGAGGTGAGAAAATATGCGCCATTTGTTGTGGTTGAAAGAAATGGACACCAG
ATTTCTGCCGGATTCAATAGGGTTGGTAGTTGCTCAGATGCTAAACAGGAGGACACAATGAGCATAAGAGAGATGGAAGGAGGCATTGGTGCAGTGTTGAAATTCAGTGG
AGATCCCACAGAGGATATGGCTCAGCAAAAGGCAAAGGAATTACGATGTAGTCTAAAAAAGGATGGCCTTAACCCCATAAATGGCTGTTTGCTTGCTCGCTACAACGACT
CTGGCCGAACATGGAGCTTTGTAATGAGAAATGAGGTGCTAATATGGCTGCAAGAATTCTCAATTTAGTCCAACAAGTCCAACTTCATTTTATCTGCACCAATCAAAATA
AGAGAGACAACTATAATCTGATTGATAATTTTACAAATTTTTAAATCTATATTCAATTATTAATAACACACAATTACGTTGTAATTGACATTTTTACCAAGTTT
Protein sequenceShow/hide protein sequence
MATAQVSFQNFLSIPTVDFGVRPRKSCGPTRAAQSRTGSPNWKSSIRSTLGDQSRQKPTVDVDRLVDFMYDDLRHVFDEQGIDRTAYDEEVRFRDPITKYDGISGYMLNI
ALLREFFRPEIILHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGTSIMGINPQTGKFCSHVDLWDSLQNNDYFSLEALWDVFKQLRFYETPELESPKYQILKRTANY
EVRKYAPFVVVERNGHQISAGFNRVGSCSDAKQEDTMSIREMEGGIGAVLKFSGDPTEDMAQQKAKELRCSLKKDGLNPINGCLLARYNDSGRTWSFVMRNEVLIWLQEF
SI