CuGenDBv2

Lag0034377 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034377
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: SOUL heme-binding protein
Genome location: chr3:6803759..6807357
RNA-Seq Expression: Lag0034377
Synteny: Lag0034377
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains:
IPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358
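The genome location field uses a chrom:start..end notation. As a minimal sketch, the string can be split into its parts and the genomic span computed as follows. The function names are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("chr3:6803759..6807357")
gene_span = span_length(start, end)
print(chrom, start, end, gene_span)
```

Under that assumption the gene spans 3599 bp on chr3. A genomic span longer than the coding sequence is what one would expect if the gene model contains introns or untranslated regions.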


Homology
GenBank top hits (e value, % identity, alignment)
KAG6587902.1 hypothetical protein SDJN03_16467, partial [Cucurbita argyrosperma subsp. sororia] | e value: 3.3e-143 | % identity: 77.58
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD
        A AQVS QNFLSIPTV FG RP KS  PT  AQSR    NW   IRS L DQ+ +K TVDVDRLVDF+YDDLRHVFDEQGIDRTAYD++VRF+DPITKYD
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD

Query:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY
         I+GY+LNI LLREFF+PE  LHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTG SIMGI+P TGKF +HVDLWDS+QNNDYFSLE LWDVFKQLRFY
Subjt:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY

Query:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH
        ETPELESPKYQILKRTANYEVRKYAPF+V E N  ++SAGFNRV S     +++++S+R+MEGGI AVLKFSG+PTED+A+QKAK+L  +LK+DGL PI+
Subjt:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH

Query:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        GCLLARYN+S RT+SFVMRNEVLIWLEEFS
Subjt:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS

KAG7021789.1 hypothetical protein SDJN02_15516 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 5.7e-143 | % identity: 77.27
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD
        A AQVS QNFLSIPTV FG RP KS  PT  AQSR    NW   IRS L DQ+ +K TVDVDRLVDF+YDDLRHVFDEQGIDRTAYD++VRF+DPITKYD
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD

Query:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY
         I+GY+LNI LLREFF+PE  LHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTG SIMGI+P TGKF +HVDLWDS+QNNDYFSLE LWDVFKQLRFY
Subjt:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY

Query:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH
        ETPELESPKYQILKRTANYEVRKYAPF+V E N  ++SAGFNRV S     +++++S+R+MEGGI AVLKFSG+PTED+A+QKAK+L  +LK+DGL PI+
Subjt:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH

Query:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        GCLLARYN+S RT+SFVMRNEVLIWL+EFS
Subjt:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS

XP_022933414.1 uncharacterized protein LOC111440839 [Cucurbita moschata] | e value: 5.7e-143 | % identity: 77.27
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD
        A AQVS QNFLSIPTV  G RP KS  PT  AQSR    NW   IRS LADQ+ +K TVDVDRLVDF+YDDLRHVFDEQGIDRTAYDD+VRF+DPITKYD
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD

Query:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY
         I+GY+LNI LLREFF+PE  LHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTG SIMGI+P TGKF +HVDLWDS+QNNDYFS+E LWDVFKQ RFY
Subjt:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY

Query:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH
        ETPELESPKYQILKRTANYEVRKYAPF+V E N  ++SAGFNRV S     +++++S+R+MEGGI AVLKFSG+PTED+A+QKAK+L  +LK+DGLKPI+
Subjt:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH

Query:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        GCLLARYN+S RT+SFVMRNEVLIWLEEFS
Subjt:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS

XP_022965046.1 uncharacterized protein LOC111465022 [Cucurbita maxima] | e value: 1.0e-144 | % identity: 77.95
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSRKSN----WVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD
        A AQVS QNFLSIPTV FG RP KS  PT  AQSR ++    W IRS LADQ  +K TVDVDRLVDF+YDDLRHVFDEQGIDRTAYD++VRF+DPITKYD
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSRKSN----WVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD

Query:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY
         I+GY+LNI LLREFF+PE  LHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTG SIMGI+P TGKF +HVDLWDS+QNNDYFS+E LWDVFKQ RFY
Subjt:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY

Query:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQE-SISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPI
        ETPELESPKYQILKRTANYEVRKYAPF+V E N  ++SAGFNRV S FPD KQE ++S+R+MEGGI AVLKFSG+PTED+A+QKAK+L  +LK+DGLKPI
Subjt:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQE-SISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPI

Query:  HGCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        +GCLLARYN+S RT+ FVMRNEV+IWL+EFS
Subjt:  HGCLLARYNNSTRTFSFVMRNEVLIWLEEFS

XP_023531546.1 uncharacterized protein LOC111793749 [Cucurbita pepo subsp. pepo] | e value: 7.9e-145 | % identity: 77.88
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD
        A AQVS QNFLSIPTV FG RP KS  PT  AQSR    NW   IRS LADQ+ +K TVDVDRLVDF+YDDLRHVFDEQGIDRTAYD++VRF+DPITKYD
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD

Query:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY
         I+GY+LNI LLREFF+PE   HWVKKTGPYEITTRWTAVMKFILLPWKPELVLTG SIMGI+P TGKF +HVD+WDS+QNNDYFSLE LWDVFKQ RFY
Subjt:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY

Query:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH
        ETPELESPKYQILKRTANYEVRKYAPF+V E N  ++SAGFNRV SF    +++++S+R+MEGGI AVLKFSG+PTED+A+QKAK+L  +LK+DGLKPI+
Subjt:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH

Query:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        GCLLARYNNS RT+SFVMRNEVLIWLEEFS
Subjt:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X1 | e value: 2.9e-129 | % identity: 62.53
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPT--GL-------------AQSRKSNWVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQ
        A  Q+SLQNFLS PT GFGFRP KSG  T  GL               +R S W +R  L DQ+  K  VDVDRLVDFLY+DLRH+FDEQGIDRTAYD+ 
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPT--GL-------------AQSRKSNWVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQ

Query:  VRFQDPITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEG
        VRF+DPITK+D I+GY  NI+LLRE F+PEF LHWVK+TGPYEITTRWT VMKF+LLPWKPE + TG SIMGI+P+TGKF +HVDLWDS+QNNDYFSLEG
Subjt:  VRFQDPITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEG

Query:  LWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF----------------------------------------
        L DVFKQLRFY+TPELESPKY+ILKRTANYEVRKY PF+V ET+ D+L  SAGFN VA +                                        
Subjt:  LWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF----------------------------------------

Query:  ---FPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNSTRTFSFVMRNEVLIWLEEFSF
            PDP+Q++I LRK+EGGIAAVLKFSG PTED+ ++KAK+L   L +DGLKP  GCLLARYN+  RT+SF+MRNEVLIWLEEFSF
Subjt:  ---FPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNSTRTFSFVMRNEVLIWLEEFSF

A0A6J1CV62 uncharacterized protein LOC111014503 isoform X2 | e value: 1.3e-140 | % identity: 76.2
Query:  QVSLQNFLSIPTVGFGFRPSKSGRPTG----LAQSRKS--NWVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYDN
        QVSLQNFLSIPTVG GFRP KSGR TG    L +SR      V+RS+LAD++  K TVDVDRLVDFLY+DLRHVFD QGID TAYD+ VRF+DPITKY+ 
Subjt:  QVSLQNFLSIPTVGFGFRPSKSGRPTG----LAQSRKS--NWVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYDN

Query:  ITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYE
        I GY+LNI LLR+ F+P+F LHWVKKTGPYEITTRWTAVMKF+LLPWKPELVLTG SIM IDP+TGKF  HVDLWDSVQNN+YFSLEGLWD+FKQ RFYE
Subjt:  ITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYE

Query:  TPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPI
        TPELESP+YQILKRTANYEVRKYAPFI  ET ED+L  SA FNRVA  FPDPKQ++ISLR ++GGIAAVLKFSG P+E++ ++KAK+L Y+L +DGLKPI
Subjt:  TPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPI

Query:  HGCLLARYNNSTRTFSFVMRNEVLIWLEEFSF
         GCLLARYN+ +RT+SFVMRNEVLIWLEEFSF
Subjt:  HGCLLARYNNSTRTFSFVMRNEVLIWLEEFSF

A0A6J1EZQ2 uncharacterized protein LOC111440839 | e value: 2.7e-143 | % identity: 77.27
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD
        A AQVS QNFLSIPTV  G RP KS  PT  AQSR    NW   IRS LADQ+ +K TVDVDRLVDF+YDDLRHVFDEQGIDRTAYDD+VRF+DPITKYD
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSR--KSNW--VIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD

Query:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY
         I+GY+LNI LLREFF+PE  LHWVKKTGPYEITTRWTA+MKFILLPWKPELVLTG SIMGI+P TGKF +HVDLWDS+QNNDYFS+E LWDVFKQ RFY
Subjt:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY

Query:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH
        ETPELESPKYQILKRTANYEVRKYAPF+V E N  ++SAGFNRV S     +++++S+R+MEGGI AVLKFSG+PTED+A+QKAK+L  +LK+DGLKPI+
Subjt:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIH

Query:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        GCLLARYN+S RT+SFVMRNEVLIWLEEFS
Subjt:  GCLLARYNNSTRTFSFVMRNEVLIWLEEFS

A0A6J1HKM5 uncharacterized protein LOC111465022 | e value: 5.0e-145 | % identity: 77.95
Query:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSRKSN----WVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD
        A AQVS QNFLSIPTV FG RP KS  PT  AQSR ++    W IRS LADQ  +K TVDVDRLVDF+YDDLRHVFDEQGIDRTAYD++VRF+DPITKYD
Subjt:  AIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSRKSN----WVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYD

Query:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY
         I+GY+LNI LLREFF+PE  LHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTG SIMGI+P TGKF +HVDLWDS+QNNDYFS+E LWDVFKQ RFY
Subjt:  NITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFY

Query:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQE-SISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPI
        ETPELESPKYQILKRTANYEVRKYAPF+V E N  ++SAGFNRV S FPD KQE ++S+R+MEGGI AVLKFSG+PTED+A+QKAK+L  +LK+DGLKPI
Subjt:  ETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRLSAGFNRVASFFPDPKQE-SISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPI

Query:  HGCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        +GCLLARYN+S RT+ FVMRNEV+IWL+EFS
Subjt:  HGCLLARYNNSTRTFSFVMRNEVLIWLEEFS

A0A6J1KHA6 uncharacterized protein LOC111495248 isoform X1 | e value: 7.2e-128 | % identity: 63.42
Query:  AIAQVSLQN--FLSIPTVGFGFRPSKSGR-------PTGLAQSRKSNWVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDP
        A  + SLQN   LS P++GFGFRP  SGR             +R S WV+R  L DQN  K TVDVD+LVDFLY DL H+FDEQGIDRTAYDDQVRF+DP
Subjt:  AIAQVSLQN--FLSIPTVGFGFRPSKSGR-------PTGLAQSRKSNWVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDP

Query:  ITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFK
        ITK+D ITGYL NI+LLRE FKPEF LHWVKKTG YEITTRWT VMKF+LLPWKP+LV TG SIMGI+P+TGKF +HVDLWDS+QNNDYFS+EGL DVFK
Subjt:  ITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFK

Query:  QLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF-------------------------------------------FPD
        QLRFY+TPELESPKY+ILKRTANYEVRKYAPFIV ET+ D+L  SAGFN VA +                                            PD
Subjt:  QLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF-------------------------------------------FPD

Query:  PKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNSTRTFSFVMRNEVLIWLEEFS
        P+Q++I LRK+EGG AAVLKFSG PTE++ ++KAKQL  +L +DGLKP +GCLLARYN+  RT++F+MRNEVLIWLEEFS
Subjt:  PKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNSTRTFSFVMRNEVLIWLEEFS

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G20140.1 SOUL heme-binding family protein | e value: 2.3e-102 | % identity: 55.8
Query:  TVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGI
        TV+++ LV FLY+DL H+FD+QGID+TAYD++V+F+DPITK+D I+GYL NI  L+  F P+F LHW K+TGPYEITTRWT VMKFI LPWKPELV TG+
Subjt:  TVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGI

Query:  SIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF--------
        SIM ++P+T KF +H+DLWDS++NNDYFSLEGL DVFKQLR Y+TP+LE+PKYQILKRTANYEVR Y PFIV ET  D+L  S+GFN VA +        
Subjt:  SIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF--------

Query:  ------------------------------------FPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNST
                                             P P +E ++L+K+EGG AA +KFSG PTED+ + K  +L  +L +DGL+   GC+LARYN+  
Subjt:  ------------------------------------FPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNST

Query:  RTFSFVMRNEVLIWLEEFS
        RT++F+MRNEV+IWLE+FS
Subjt:  RTFSFVMRNEVLIWLEEFS

AT5G20140.2 SOUL heme-binding family protein | e value: 1.6e-95 | % identity: 54.72
Query:  TVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGI
        TV+++ LV FLY+DL H+FD+QGID+TAYD++V+F+DPITK+D I+GYL NI  L+  F P+F LHW K+TGPYEITTRWT VMKFI LPWKPELV TG+
Subjt:  TVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYDNITGYLLNITLLREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGI

Query:  SIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF--------
        SIM ++P+T KF +H+DLWDS++NNDYFSLEGL DVFKQLR Y+TP+LE+PKYQILKRTANYEVR Y PFIV ET  D+L  S+GFN VA +        
Subjt:  SIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYETPELESPKYQILKRTANYEVRKYAPFIVFETNEDRL--SAGFNRVASF--------

Query:  ------------------------------------FPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNST
                                             P P +E ++L+K+EGG AA +KFSG PTED+ + K  +L  +L +DGL+   GC+LARYN+  
Subjt:  ------------------------------------FPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNST

Query:  RTFSFVM
        RT++F+M
Subjt:  RTFSFVM
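In the alignments above, the unlabeled middle line follows the usual BLAST-style convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one alignment block. The function name is hypothetical, and the query and middle line are assumed to be the same length with the leading label and padding removed. The % identity values reported for each hit cover the whole alignment, so a single block will generally give a different figure.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    `query` is the aligned query segment (gaps as '-'); `midline` is the
    match line beneath it, already stripped of the label and padding.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter on the midline equal to the query residue is an identity.
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    length = len(query)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "percent_identity": 100 * identities / length,
    }


# Last block of the first GenBank hit (KAG6587902.1), copied from above.
stats = midline_stats(
    "GCLLARYNNSTRTFSFVMRNEVLIWLEEFS",
    "GCLLARYN+S RT+SFVMRNEVLIWLEEFS",
)
print(stats)
```

For this 30-residue block the sketch reports 27 identities and 29 positives.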


Sequences
CDS sequence
ATGGCCACTGCCATTGCCCAAGTTTCCCTCCAAAACTTCCTCTCAATCCCAACCGTTGGTTTTGGTTTCCGGCCGAGCAAATCCGGCCGACCAACAGGGCTCGCACAGAG
CAGAAAGTCAAACTGGGTCATTCGATCAAAATTGGCAGATCAAAACTCTAAGAAATTGACGGTGGACGTTGACCGATTGGTGGATTTCTTGTACGACGATCTCCGCCATG
TGTTCGACGAACAGGGGATCGATCGGACGGCGTACGACGACCAAGTGAGATTCCAAGATCCAATCACAAAATATGATAACATCACTGGTTATTTGCTGAATATTACCCTC
TTGCGAGAATTCTTCAAGCCTGAGTTCACATTGCATTGGGTCAAGAAGACTGGACCATATGAAATAACTACAAGATGGACGGCCGTGATGAAGTTCATCCTTCTTCCATG
GAAACCAGAATTAGTTTTGACTGGAATTTCCATTATGGGCATCGATCCAGACACGGGCAAGTTCCGTACCCACGTGGATCTTTGGGATTCAGTACAAAATAATGACTATT
TTTCTCTAGAAGGATTGTGGGATGTATTTAAACAGTTGAGATTTTATGAGACTCCAGAATTGGAATCACCCAAGTATCAGATATTGAAAAGGACTGCAAATTATGAGGTG
AGAAAATATGCACCATTTATAGTGTTTGAAACAAATGAGGACAGGCTTTCTGCTGGATTCAATAGGGTTGCTAGTTTCTTCCCAGATCCTAAACAGGAGTCAATCAGCTT
GAGAAAGATGGAAGGAGGGATTGCTGCAGTGTTGAAATTCAGTGGAAATCCCACAGAAGATTTGGCTGAACAAAAGGCAAAACAATTACTCTATACTCTCAAAAGGGATG
GTCTCAAACCCATCCATGGTTGTTTGCTTGCTCGATACAACAACTCCACCCGAACTTTCAGCTTTGTAATGAGAAATGAAGTACTCATATGGCTCGAAGAATTCTCATTT
TAG
mRNA sequence
ATGGCCACTGCCATTGCCCAAGTTTCCCTCCAAAACTTCCTCTCAATCCCAACCGTTGGTTTTGGTTTCCGGCCGAGCAAATCCGGCCGACCAACAGGGCTCGCACAGAG
CAGAAAGTCAAACTGGGTCATTCGATCAAAATTGGCAGATCAAAACTCTAAGAAATTGACGGTGGACGTTGACCGATTGGTGGATTTCTTGTACGACGATCTCCGCCATG
TGTTCGACGAACAGGGGATCGATCGGACGGCGTACGACGACCAAGTGAGATTCCAAGATCCAATCACAAAATATGATAACATCACTGGTTATTTGCTGAATATTACCCTC
TTGCGAGAATTCTTCAAGCCTGAGTTCACATTGCATTGGGTCAAGAAGACTGGACCATATGAAATAACTACAAGATGGACGGCCGTGATGAAGTTCATCCTTCTTCCATG
GAAACCAGAATTAGTTTTGACTGGAATTTCCATTATGGGCATCGATCCAGACACGGGCAAGTTCCGTACCCACGTGGATCTTTGGGATTCAGTACAAAATAATGACTATT
TTTCTCTAGAAGGATTGTGGGATGTATTTAAACAGTTGAGATTTTATGAGACTCCAGAATTGGAATCACCCAAGTATCAGATATTGAAAAGGACTGCAAATTATGAGGTG
AGAAAATATGCACCATTTATAGTGTTTGAAACAAATGAGGACAGGCTTTCTGCTGGATTCAATAGGGTTGCTAGTTTCTTCCCAGATCCTAAACAGGAGTCAATCAGCTT
GAGAAAGATGGAAGGAGGGATTGCTGCAGTGTTGAAATTCAGTGGAAATCCCACAGAAGATTTGGCTGAACAAAAGGCAAAACAATTACTCTATACTCTCAAAAGGGATG
GTCTCAAACCCATCCATGGTTGTTTGCTTGCTCGATACAACAACTCCACCCGAACTTTCAGCTTTGTAATGAGAAATGAAGTACTCATATGGCTCGAAGAATTCTCATTT
TAG
Protein sequence
MATAIAQVSLQNFLSIPTVGFGFRPSKSGRPTGLAQSRKSNWVIRSKLADQNSKKLTVDVDRLVDFLYDDLRHVFDEQGIDRTAYDDQVRFQDPITKYDNITGYLLNITL
LREFFKPEFTLHWVKKTGPYEITTRWTAVMKFILLPWKPELVLTGISIMGIDPDTGKFRTHVDLWDSVQNNDYFSLEGLWDVFKQLRFYETPELESPKYQILKRTANYEV
RKYAPFIVFETNEDRLSAGFNRVASFFPDPKQESISLRKMEGGIAAVLKFSGNPTEDLAEQKAKQLLYTLKRDGLKPIHGCLLARYNNSTRTFSFVMRNEVLIWLEEFSF
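The protein sequence should correspond to the CDS translated codon by codon. A minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1), is shown below. The function name is hypothetical, and only the first and last few codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG
# order and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Codons containing characters outside TCAG are translated as 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First nine and last seven codons of the CDS listed above.
cds_start = "ATGGCCACTGCCATTGCCCAAGTTTCC"
cds_end = "CTCGAAGAATTCTCATTTTAG"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MATAIAQVS and LEEFSF followed by a stop, matching the start and end of the protein sequence. Passing the full CDS, with its line breaks, to translate() would extend the same check to the whole sequence.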