; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0002298 (gene) of Chayote v1 genome

Gene IDSed0002298
OrganismSechium edule (Chayote v1)
DescriptionSOUL heme-binding family protein
Genome locationLG08:5057024..5064408
RNA-Seq ExpressionSed0002298
SyntenySed0002298
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
IPR032710 - NTF2-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587902.1 hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sororia]2.5e-13874.78Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        MATAQ+S QN LSI T     RPRK    T   QSRT G  N K     + ST+GD   +K TVDVDRLV+F+Y+DL H FDEQGIDRTAY+E+VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKYD I G+MLNIA+L++FFRPE ILHWVKKTGPYEITTRWTAVM F+ LPWKPELVLTGTSIM INP+TGKFC+ VD WDS+QNNDYFSLE LWDVFK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD
        Q RFYET ELESPKYQILKRTANYEVRKYAPF+VVE+NG  +SAGF+ V S SD K ++DTM++R+MEGG+GAVLKFSG+PTE+M Q+ AK+LR SLKKD
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD

Query:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
        GL PINGCLLARYNDS RTWSFVMRNEVLIWLEEFSI
Subjt:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

KAG7021789.1 hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-13874.78Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        MATAQ+S QN LSI T     RPRK    T   QSRT G  N K     + ST+GD   +K TVDVDRLV+F+Y+DL H FDEQGIDRTAY+E+VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKYD I G+MLNIA+L++FFRPE ILHWVKKTGPYEITTRWTAVM F+ LPWKPELVLTGTSIM INPQTGKFC+ VD WDS+QNNDYFSLE LWDVFK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD
        Q RFYET ELESPKYQILKRTANYEVRKYAPF+VVE+NG  +SAGF+ V S SD K ++DTM++R+MEGG+GAVLKFSG+PTE+M Q+ AK+LR SLKKD
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD

Query:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
        GL PINGCLLARYNDS RTWSFVMRNEVLIWL+EFSI
Subjt:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata]1.2e-13774.18Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        MATAQ+S QN LSI T     RPRK    T   QSRT G  N K     + ST+ D   +K TVDVDRLV+F+Y+DL H FDEQGIDRTAY+++VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKYD I G+MLNIA+L++FFRPE ILHWVKKTGPYEITTRWTA+M F+ LPWKPELVLTGTSIM INPQTGKFC+ VD WDS+QNNDYFS+E LWDVFK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD
        QFRFYET ELESPKYQILKRTANYEVRKYAPF+VVE+NG  +SAGF+ V S SD K ++DTM++R+MEGG+GAVLKFSG+PTE+M Q+ AK+LR SLKKD
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD

Query:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
        GLKPINGCLLARYN S RTWSFVMRNEVLIWLEEFSI
Subjt:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima]8.6e-13974.48Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        MATAQ+S QN LSI T     RPRK    T   QSRTA   N K     + ST+ D  H+K TVDVDRLV+F+Y+DL H FDEQGIDRTAY+E+VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKYD I G+MLNIA+L++FFRPE ILHWVKKTGPYEITTRWTAVM F+ LPWKPELVLTGTSIM INPQTGKFC+ VD WDS+QNNDYFS+E LWDVFK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD
        QFRFYET ELESPKYQILKRTANYEVRKYAPF+VVE+NG  +SAGF+ V SF D K ++DTM++R+MEGG+GAVLKFSG+PTE+M Q+ AK+LR SLKKD
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD

Query:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
        GLKPINGCLLARYNDS RTW FVMRNEV+IWL+EFSI
Subjt:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo]2.3e-13975.07Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        MATAQ+S QN LSI T     RPRK    T   QSRT G  N K     + ST+ D   +K TVDVDRLV+F+Y+DL H FDEQGIDRTAY+E+VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKYD I G+MLNIA+L++FFRPE I HWVKKTGPYEITTRWTAVM F+ LPWKPELVLTGTSIM INPQTGKFC+ VD WDS+QNNDYFSLE LWDVFK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD
        QFRFYET ELESPKYQILKRTANYEVRKYAPF+VVE+NG  +SAGF+ V SFSD K ++DTM++R+MEGG+GAVLKFSG+PTE+M Q+ AK+LR SLKKD
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD

Query:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
        GLKPINGCLLARYN+S RTWSFVMRNEVLIWLEEFSI
Subjt:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

TrEMBL top hitse value%identityAlignment
A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X16.5e-11659.17Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVG------QSRTAGQK-NPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEE
        MA  Q+SLQN LS  TAG  FRP K G  T  G      +SRT   K + +     V  ++ D    KS VDVDRLV+FLYEDL H FDEQGIDRTAY+E
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVG------QSRTAGQK-NPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEE

Query:  DVRFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLE
         VRFRDPITK+D I G+  NI++L++ FRPEF LHWVK+TGPYEITTRWT VM F+ LPWKPE + TG SIM INP+TGKFC+ VD WDS+QNNDYFSLE
Subjt:  DVRFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLE

Query:  GLWDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH----------------------------------
        GL DVFKQ RFY+T ELESPKY+ILKRTANYEVRKY PF+VVE +G  +  SAGF++VA +   K+                                  
Subjt:  GLWDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH----------------------------------

Query:  --------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFS
                +QDT+ LRK+EGG+ AVLKFSG PTE+MVQ+ AK+LR  L KDGLKP  GCLLARYND  RTWSF+MRNEVLIWLEEFS
Subjt:  --------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFS

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X22.0e-12569.23Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        M   Q+SLQN LSI T G  FRP+K G +TG  + R    +   R  + V S + D    KSTVDVDRLV+FLYEDL H FD QGID TAY+E VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKY+ I G+MLNIA+L++ FRP+F+LHWVKKTGPYEITTRWTAVM F+ LPWKPELVLTGTSIM I+P+TGKFC  VD WDSVQNN+YFSLEGLWD+FK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLK
        QFRFYET ELESP+YQILKRTANYEVRKYAPF+ VE     +  SA F+ VA F D   KQD ++LR ++GG+ AVLKFSG P+E MVQ+ AK+LR+SL 
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLK

Query:  KDGLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFS
        KDGLKPI GCLLARYND +RTWSFVMRNEVLIWLEEFS
Subjt:  KDGLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFS

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X19.4e-11558.03Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIR-----VGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDV
        MA  Q SLQN L++ T  + F  R P     + +SRT     P +P  R     V  ++ D +  KSTVDVD+LV+FLYEDL H FDEQGIDRTAY++ V
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIR-----VGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDV

Query:  RFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGL
        RFRDPITK+D I G++ NI++L++ FRPEF+LHWVKKTG YEITTRWT VM F+ LPWKP+LV TG SIM INP+TGKFC+ VD WDS+QNNDYFS+EGL
Subjt:  RFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGL

Query:  WDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH------------------------------------
         DVFKQ RFY+T ELESPKY+ILKRT NYEVRKYAPF+VVE +G  +  SAGF++VA +   K+                                    
Subjt:  WDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH------------------------------------

Query:  ------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
              +QDT+ LRK+EGG  AVLKFSG PTEE+VQ+ AK+LR SL KDGLKP NGCLLARYND  RTW+F+MRNEVLIWLEEFS+
Subjt:  ------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

A0A6J1EZQ2 uncharacterized protein LOC1114408396.0e-13874.18Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        MATAQ+S QN LSI T     RPRK    T   QSRT G  N K     + ST+ D   +K TVDVDRLV+F+Y+DL H FDEQGIDRTAY+++VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKYD I G+MLNIA+L++FFRPE ILHWVKKTGPYEITTRWTA+M F+ LPWKPELVLTGTSIM INPQTGKFC+ VD WDS+QNNDYFS+E LWDVFK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD
        QFRFYET ELESPKYQILKRTANYEVRKYAPF+VVE+NG  +SAGF+ V S SD K ++DTM++R+MEGG+GAVLKFSG+PTE+M Q+ AK+LR SLKKD
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD

Query:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
        GLKPINGCLLARYN S RTWSFVMRNEVLIWLEEFSI
Subjt:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

A0A6J1HKM5 uncharacterized protein LOC1114650224.2e-13974.48Show/hide
Query:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP
        MATAQ+S QN LSI T     RPRK    T   QSRTA   N K     + ST+ D  H+K TVDVDRLV+F+Y+DL H FDEQGIDRTAY+E+VRFRDP
Subjt:  MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDP

Query:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK
        ITKYD I G+MLNIA+L++FFRPE ILHWVKKTGPYEITTRWTAVM F+ LPWKPELVLTGTSIM INPQTGKFC+ VD WDS+QNNDYFS+E LWDVFK
Subjt:  ITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFK

Query:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD
        QFRFYET ELESPKYQILKRTANYEVRKYAPF+VVE+NG  +SAGF+ V SF D K ++DTM++R+MEGG+GAVLKFSG+PTE+M Q+ AK+LR SLKKD
Subjt:  QFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKD

Query:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI
        GLKPINGCLLARYNDS RTW FVMRNEV+IWL+EFSI
Subjt:  GLKPINGCLLARYNDSTRTWSFVMRNEVLIWLEEFSI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20140.1 SOUL heme-binding family protein3.2e-9954.83Show/hide
Query:  STVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTG
        STV+++ LV FLYEDL H FD+QGID+TAY+E V+FRDPITK+D I G++ NIA L+  F P+F LHW K+TGPYEITTRWT VM F+PLPWKPELV TG
Subjt:  STVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTG

Query:  TSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH--
         SIM +NP+T KFC+ +D WDS++NNDYFSLEGL DVFKQ R Y+T +LE+PKYQILKRTANYEVR Y PF+VVE  G  +  S+GF++VA +   K+  
Subjt:  TSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH--

Query:  -----------------------------------------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDS
                                                  ++ + L+K+EGG  A +KFSG PTE++VQ    +LR SL KDGL+   GC+LARYND 
Subjt:  -----------------------------------------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDS

Query:  TRTWSFVMRNEVLIWLEEFSI
         RTW+F+MRNEV+IWLE+FS+
Subjt:  TRTWSFVMRNEVLIWLEEFSI

AT5G20140.2 SOUL heme-binding family protein2.9e-9253.9Show/hide
Query:  STVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTG
        STV+++ LV FLYEDL H FD+QGID+TAY+E V+FRDPITK+D I G++ NIA L+  F P+F LHW K+TGPYEITTRWT VM F+PLPWKPELV TG
Subjt:  STVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDPITKYDDIYGFMLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTG

Query:  TSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH--
         SIM +NP+T KFC+ +D WDS++NNDYFSLEGL DVFKQ R Y+T +LE+PKYQILKRTANYEVR Y PF+VVE  G  +  S+GF++VA +   K+  
Subjt:  TSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFKQFRFYETSELESPKYQILKRTANYEVRKYAPFMVVEKNGGNV--SAGFSSVASFSDHKH--

Query:  -----------------------------------------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDS
                                                  ++ + L+K+EGG  A +KFSG PTE++VQ    +LR SL KDGL+   GC+LARYND 
Subjt:  -----------------------------------------KQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDS

Query:  TRTWSFVM
         RTW+F+M
Subjt:  TRTWSFVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACTGCCCAAATTTCCCTCCAAAACCTCCTCTCCATCCGAACCGCCGGTGTTTGTTTCCGGCCGAGGAAACCCGGCGAGCGGACCGGGGTCGGACAAAGCAGAAC
CGCGGGTCAGAAAAACCCAAAGAGGCCGGGTATTCGGGTTGGTTCAACAATAGGAGACGATGATCATGAGAAATCGACGGTGGACGTTGACCGATTGGTGGAGTTCTTGT
ACGAAGATCTCCACCATGCGTTTGATGAGCAGGGGATTGATCGGACGGCGTACGAGGAAGACGTGAGATTTAGAGACCCGATTACAAAATACGATGATATTTATGGGTTT
ATGCTGAATATTGCAATGTTGCAAAAATTCTTTAGGCCTGAGTTCATCTTGCATTGGGTCAAAAAGACTGGACCATATGAAATAACAACAAGATGGACGGCAGTGATGAA
TTTCATGCCTCTACCATGGAAACCTGAATTAGTTTTGACCGGAACTTCAATTATGAGTATCAATCCACAGACCGGCAAGTTTTGTACCCAAGTGGATCGTTGGGATTCAG
TACAAAATAATGACTATTTTTCCCTAGAAGGTTTGTGGGATGTATTTAAACAGTTTCGATTTTATGAGACGTCAGAATTGGAATCGCCCAAATATCAGATATTGAAAAGA
ACGGCAAATTATGAGGTTAGAAAATATGCACCATTTATGGTGGTTGAAAAGAATGGAGGCAATGTCTCTGCTGGATTTAGTAGTGTTGCTAGTTTCTCAGATCATAAACA
CAAACAGGACACCATGACTTTGAGAAAGATGGAAGGAGGAATGGGTGCAGTGTTGAAATTTAGTGGAAATCCCACAGAAGAAATGGTTCAAAAAATGGCAAAACAATTAC
GGTTTAGTCTAAAAAAAGATGGCCTTAAACCCATTAATGGCTGTTTGCTCGCTCGCTACAACGATTCCACCCGGACCTGGAGCTTTGTTATGAGAAACGAGGTCCTAATA
TGGCTTGAAGAATTCTCAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATTTGATTCGGATTTCAAAATTAAGTTTGAAGATATTTAACTTAATATAGAGATTGACATTGGCAAGCATTTCCTCCCAAATGGCAACTGCCCAAATTTCCCTCCAAAAC
CTCCTCTCCATCCGAACCGCCGGTGTTTGTTTCCGGCCGAGGAAACCCGGCGAGCGGACCGGGGTCGGACAAAGCAGAACCGCGGGTCAGAAAAACCCAAAGAGGCCGGG
TATTCGGGTTGGTTCAACAATAGGAGACGATGATCATGAGAAATCGACGGTGGACGTTGACCGATTGGTGGAGTTCTTGTACGAAGATCTCCACCATGCGTTTGATGAGC
AGGGGATTGATCGGACGGCGTACGAGGAAGACGTGAGATTTAGAGACCCGATTACAAAATACGATGATATTTATGGGTTTATGCTGAATATTGCAATGTTGCAAAAATTC
TTTAGGCCTGAGTTCATCTTGCATTGGGTCAAAAAGACTGGACCATATGAAATAACAACAAGATGGACGGCAGTGATGAATTTCATGCCTCTACCATGGAAACCTGAATT
AGTTTTGACCGGAACTTCAATTATGAGTATCAATCCACAGACCGGCAAGTTTTGTACCCAAGTGGATCGTTGGGATTCAGTACAAAATAATGACTATTTTTCCCTAGAAG
GTTTGTGGGATGTATTTAAACAGTTTCGATTTTATGAGACGTCAGAATTGGAATCGCCCAAATATCAGATATTGAAAAGAACGGCAAATTATGAGGTTAGAAAATATGCA
CCATTTATGGTGGTTGAAAAGAATGGAGGCAATGTCTCTGCTGGATTTAGTAGTGTTGCTAGTTTCTCAGATCATAAACACAAACAGGACACCATGACTTTGAGAAAGAT
GGAAGGAGGAATGGGTGCAGTGTTGAAATTTAGTGGAAATCCCACAGAAGAAATGGTTCAAAAAATGGCAAAACAATTACGGTTTAGTCTAAAAAAAGATGGCCTTAAAC
CCATTAATGGCTGTTTGCTCGCTCGCTACAACGATTCCACCCGGACCTGGAGCTTTGTTATGAGAAACGAGGTCCTAATATGGCTTGAAGAATTCTCAATTTAGCCGAAC
TGAGTCAAATATGAAATTATTCCAAGAAGCTGACATATTATTTCTAGCAAAGGTATATTATACCGATAATGATCATCAAAAGATTATAGGTTCAAATTTGACGTAAAATG
TTTGGTTCCAACGATTATGTAGTCAAATTCTTCACCCTTTTCCATTGATTTGTCGCATATTTGTTAAGATTTTGTGTATTTTGATTGGAGTGTTTTTGTTTTTATAAAAA
CAAGTTGTATAGATTTGTTATTTTTGGG
Protein sequenceShow/hide protein sequence
MATAQISLQNLLSIRTAGVCFRPRKPGERTGVGQSRTAGQKNPKRPGIRVGSTIGDDDHEKSTVDVDRLVEFLYEDLHHAFDEQGIDRTAYEEDVRFRDPITKYDDIYGF
MLNIAMLQKFFRPEFILHWVKKTGPYEITTRWTAVMNFMPLPWKPELVLTGTSIMSINPQTGKFCTQVDRWDSVQNNDYFSLEGLWDVFKQFRFYETSELESPKYQILKR
TANYEVRKYAPFMVVEKNGGNVSAGFSSVASFSDHKHKQDTMTLRKMEGGMGAVLKFSGNPTEEMVQKMAKQLRFSLKKDGLKPINGCLLARYNDSTRTWSFVMRNEVLI
WLEEFSI