; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012625 (gene) of Snake gourd v1 genome

Gene IDTan0012625
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionSOUL heme-binding protein
Genome locationLG09:65764772..65773250
RNA-Seq ExpressionTan0012625
SyntenyTan0012625
Gene Ontology termsGO:0016020 - membrane (cellular component)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily
IPR018790 - Protein of unknown function DUF2358


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4400066.1 hypothetical protein G4B88_021280 [Cannabis sativa]3.1e-23461.56Show/hide
Query:  NSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTM
        NSKW V+LSLV+QSP KSTV++ RLVDFLYEDLPHLFD+QGIDRTAYD++V+FRDPITKHD+I+GYL+NI+LL+ LFRPDFLLHWVKQTGPYEITTRWTM
Subjt:  NSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTM

Query:  VMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAG
        VMKF+LLPWKP+LVFTG S+MGINPETGKFCSH+D WDSI+N+DYFS+EGL +VFKQLR YKTP+L +PKY+ILK+T  YEVRKY PF+VVE S DKL+G
Subjt:  VMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAG

Query:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD
        S GFN V GYIFGKNSA EKIPMTTPVFT+ FDSEL  VSIQ+ LP +KD++SLP+P +DTI LRKVEGGIAAV+KFSG+PTE++V EK K LRS LIKD
Subjt:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD

Query:  NLKPSRGCLLARYNDPGRTWSFIMAQMAAPQFSLQ-NFHSVSTPTLGFGFR-----PPTSGRLIGLAPRLLRSRAITSKPHTR----NSKWVVRLSLVDQ
         LKP  GC LARYNDPGRTWSF+M  +  P    + N     T       R     P +  R    AP         S  + R    NS  +   SLV  
Subjt:  NLKPSRGCLLARYNDPGRTWSFIMAQMAAPQFSLQ-NFHSVSTPTLGFGFR-----PPTSGRLIGLAPRLLRSRAITSKPHTR----NSKWVVRLSLVDQ

Query:  SPPQSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRYRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQL
        + P+ST +V+ LVDFLYEDL H+FD+QGIDR  YD+ +R+ DPITK +++  YLFNISLL+ +FRP F LH+VKQTGP+EITTRWT+VM++++ PWKP++
Subjt:  SPPQSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRYRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQL

Query:  VFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPE-LESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGLPD
        V TG S MGINP+TGKFC+HVD WDSI+++++FS+EGL  V KQ+  +KT + L  PKY+ILKR   YEVRKY P + V  +               L D
Subjt:  VFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPE-LESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGLPD

Query:  PEQDTIGLRKVEGGIAAVLKFSGKPTEEIVEEKAKELRSSLIKDNLKPSKGCLLARYNDPGRTWSF
          Q T GLR  +GGIAA +KFSGK T++IV+EK + L S L+ D L+P  GCLL   +    +W+F
Subjt:  PEQDTIGLRKVEGGIAAVLKFSGKPTEEIVEEKAKELRSSLIKDNLKPSKGCLLARYNDPGRTWSF

KAG6587929.1 Heme-binding-like protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.5e-19290.13Show/hide
Query:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD
        MAA QFSLQN  +VSTP+L FGFRPP SGRLI       RSR + SKPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDLPHLFDEQGIDRTAYDD
Subjt:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD

Query:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE
        +VRFRDPITKHDTITGYLFNISLLRELFRP+FLLHWVK+TG YEITTRW+MVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQ++DYFSVE
Subjt:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRFYKTPELESPKYEILKRT  YEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM
        DL SLPDPEQDTIGLRKVEGG AAVLKFSG+PTEEIV+EKAKELRSSLIKD LKP  GCLLARYNDPGRTW+FIM
Subjt:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM

XP_022930662.1 uncharacterized protein LOC111437064 isoform X1 [Cucurbita moschata]6.9e-19490.93Show/hide
Query:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD
        MAA QFSLQN  +VSTP+LGFGFRPP SGRLI       RSR + SKPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDLPHLFDEQGIDRTAYDD
Subjt:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD

Query:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE
        QVRFRDPITKHDTITGYLFNISLLRELFRP+FLLHWVK+TG YEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQN+DYFSVE
Subjt:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRFYKTPELESPKYEILKRT  YEVRKYAPFIVVETSGDKLAGSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM
        DL SLPDPEQDTIGLRKVEGG AAVLKFSG+PTEEIV+EKAKELRSSLIKD LKP  GCLLARYNDPGRTW+FIM
Subjt:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM

XP_023000971.1 uncharacterized protein LOC111495248 isoform X1 [Cucurbita maxima]1.4e-19189.6Show/hide
Query:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD
        MAA +FSLQN  ++STP+LGFGFRPP SGRLI       RSR + SKPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLY+DLPHLFDEQGIDRTAYDD
Subjt:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD

Query:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE
        QVRFRDPITKHDTITGYLFNISLLRELF+P+FLLHWVK+TG YEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQN+DYFSVE
Subjt:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRFYKTPELESPKYEILKRT  YEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKI MTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM
        DL SLPDPEQDTIGLRKVEGG AAVLKFSG+PTEEIV+EKAK+LRSSLIKD LKP  GCLLARYNDPGRTW+FIM
Subjt:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM

XP_023530707.1 uncharacterized protein LOC111793169 isoform X1 [Cucurbita pepo subsp. pepo]1.0e-19290.13Show/hide
Query:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD
        MAA +FSLQN  ++STP+LGFGFRPP SGRLI       RSR + SKPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDLPHLFD+QGIDRTAYDD
Subjt:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD

Query:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE
        +VRFRDPITKHDTITGYLFNISLLRELFRP+FLLHWVK+TG YEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQN+DYFSVE
Subjt:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRFYKTPELESPKYEILKRT  YEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM
        DL SLPDPEQDTIGLRKVEGG AAVLKFSG+PTEEIV+EKAKELRSSLIKD LKP  GCLLARYNDPGRTW+FIM
Subjt:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM

TrEMBL top hitse value%identityAlignment
A0A6J1CUY2 uncharacterized protein LOC111014503 isoform X15.2e-18786.74Show/hide
Query:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRL--IGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAY
        MAA Q SLQNF  +STPT GFGFRP  SG L   GL PRLL+SR +  KP  RNSKW VRLSLVDQSPPKS VDVDRLVDFLYEDL HLFDEQGIDRTAY
Subjt:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRL--IGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAY

Query:  DDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFS
        D+ VRFRDPITKHDTI+GY FNISLLRELFRP+F LHWVKQTGPYEITTRWTMVMKFVLLPWKP+ +FTGNSIMGINPETGKFCSHVDLWDSIQN+DYFS
Subjt:  DDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFS

Query:  VEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPS
        +EGLLDVFKQLRFYKTPELESPKYEILKRT  YEVRKY PF+VVETSGDKL+GSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPS
Subjt:  VEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPS

Query:  EKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM
        +KD++SLPDPEQDTIGLRKVEGGIAAVLKFSG+PTE++V+EKAKELRS LIKD LKPS+GCLLARYNDPGRTWSFIM
Subjt:  EKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM

A0A6J1ER73 uncharacterized protein LOC111437064 isoform X13.3e-19490.93Show/hide
Query:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD
        MAA QFSLQN  +VSTP+LGFGFRPP SGRLI       RSR + SKPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLYEDLPHLFDEQGIDRTAYDD
Subjt:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD

Query:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE
        QVRFRDPITKHDTITGYLFNISLLRELFRP+FLLHWVK+TG YEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQN+DYFSVE
Subjt:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRFYKTPELESPKYEILKRT  YEVRKYAPFIVVETSGDKLAGSAGFN VAGYIFGKNSAKEKIPMTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM
        DL SLPDPEQDTIGLRKVEGG AAVLKFSG+PTEEIV+EKAKELRSSLIKD LKP  GCLLARYNDPGRTW+FIM
Subjt:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM

A0A6J1KHA6 uncharacterized protein LOC111495248 isoform X17.0e-19289.6Show/hide
Query:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD
        MAA +FSLQN  ++STP+LGFGFRPP SGRLI       RSR + SKPHTRNSKWVVRLSLVDQ+PPKSTVDVD+LVDFLY+DLPHLFDEQGIDRTAYDD
Subjt:  MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDD

Query:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE
        QVRFRDPITKHDTITGYLFNISLLRELF+P+FLLHWVK+TG YEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQN+DYFSVE
Subjt:  QVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVE

Query:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK
        GLLDVFKQLRFYKTPELESPKYEILKRT  YEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKI MTTPVFTQTFDSE PKVSIQIVLPSEK
Subjt:  GLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEK

Query:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM
        DL SLPDPEQDTIGLRKVEGG AAVLKFSG+PTEEIV+EKAK+LRSSLIKD LKP  GCLLARYNDPGRTW+FIM
Subjt:  DLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIM

A0A7J6HY64 Very-long-chain 3-oxoacyl-CoA synthase1.5e-23461.56Show/hide
Query:  NSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTM
        NSKW V+LSLV+QSP KSTV++ RLVDFLYEDLPHLFD+QGIDRTAYD++V+FRDPITKHD+I+GYL+NI+LL+ LFRPDFLLHWVKQTGPYEITTRWTM
Subjt:  NSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTM

Query:  VMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAG
        VMKF+LLPWKP+LVFTG S+MGINPETGKFCSH+D WDSI+N+DYFS+EGL +VFKQLR YKTP+L +PKY+ILK+T  YEVRKY PF+VVE S DKL+G
Subjt:  VMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAG

Query:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD
        S GFN V GYIFGKNSA EKIPMTTPVFT+ FDSEL  VSIQ+ LP +KD++SLP+P +DTI LRKVEGGIAAV+KFSG+PTE++V EK K LRS LIKD
Subjt:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD

Query:  NLKPSRGCLLARYNDPGRTWSFIMAQMAAPQFSLQ-NFHSVSTPTLGFGFR-----PPTSGRLIGLAPRLLRSRAITSKPHTR----NSKWVVRLSLVDQ
         LKP  GC LARYNDPGRTWSF+M  +  P    + N     T       R     P +  R    AP         S  + R    NS  +   SLV  
Subjt:  NLKPSRGCLLARYNDPGRTWSFIMAQMAAPQFSLQ-NFHSVSTPTLGFGFR-----PPTSGRLIGLAPRLLRSRAITSKPHTR----NSKWVVRLSLVDQ

Query:  SPPQSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRYRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQL
        + P+ST +V+ LVDFLYEDL H+FD+QGIDR  YD+ +R+ DPITK +++  YLFNISLL+ +FRP F LH+VKQTGP+EITTRWT+VM++++ PWKP++
Subjt:  SPPQSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRYRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQL

Query:  VFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPE-LESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGLPD
        V TG S MGINP+TGKFC+HVD WDSI+++++FS+EGL  V KQ+  +KT + L  PKY+ILKR   YEVRKY P + V  +               L D
Subjt:  VFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPE-LESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGLPD

Query:  PEQDTIGLRKVEGGIAAVLKFSGKPTEEIVEEKAKELRSSLIKDNLKPSKGCLLARYNDPGRTWSF
          Q T GLR  +GGIAA +KFSGK T++IV+EK + L S L+ D L+P  GCLL   +    +W+F
Subjt:  PEQDTIGLRKVEGGIAAVLKFSGKPTEEIVEEKAKELRSSLIKDNLKPSKGCLLARYNDPGRTWSF

A0A803NX00 Uncharacterized protein6.1e-22065.86Show/hide
Query:  NSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTM
        NSKW V+LSLV+QSP KSTV++ RLVDFLYEDLPHLFD+QGIDRTAYD++V+FRDPITKHD+I+GYL+NI+LL+ LFRPDFLLHWVKQTGPYEITTRWTM
Subjt:  NSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTM

Query:  VMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAG
        VMKF+LLPWKP+LVFTG S+MGINPETGKFCSH+D WDSI+N+DYFS+EGL +VFKQLR YKTP+L +PKYEILK+T  YEVRKY PF+VVE S DKL+G
Subjt:  VMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAG

Query:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD
        S GFN V GYIFGKNSA EKIPMTTPVFT+ FDSEL  VSIQ+ LP +KD++SLP+P +DTI LRKVEGGIAAV+KFSG+PTE++V EK K LRS LIKD
Subjt:  SAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD

Query:  NLKPSRGCLLARYNDPGRTWSFIM----AQMAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTR----NSKWVVRLSLVDQSP
         LKP  GCLLARYNDPGRTWSF+M     QMA  Q + Q+   ++T        P +  R    AP         S  + R    NS  +   SLV  + 
Subjt:  NLKPSRGCLLARYNDPGRTWSFIM----AQMAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTR----NSKWVVRLSLVDQSP

Query:  PQSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRYRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVF
        P+ST +V+ LVDFLYEDL H+FD+QGIDR  YD+ +R+ DPITK +++  YLFNISLL+ +FRP F LH+VKQTGP+EITTRWT+VM++++ PWKP++V 
Subjt:  PQSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRYRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVF

Query:  TGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPE-LESPKYEILKRTDKYEVRKYAPFIVV
        TG S MGINP+TGKFC+HVD WDSI+++++FS+EGL  V KQ+  +KT + L  PKY+ILKR   YEVRKY P + V
Subjt:  TGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPE-LESPKYEILKRTDKYEVRKYAPFIVV

SwissProt top hitse value%identityAlignment
Q9SR77 Heme-binding-like protein At3g10130, chloroplastic1.5e-1833.52Show/hide
Query:  FYKTPELESPKYEILKRTDKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDL
        F   P+LE+  + +L RTDKYE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTDKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDL

Query:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD---NLKPSRGCLLARYNDP
        +              +LP P+  ++ +++V   I AV+ FSG  T+E +E + +ELR +L  D    ++      +A+YN P
Subjt:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD---NLKPSRGCLLARYNDP

Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein6.2e-0729.71Show/hide
Query:  LESPKYEILKRTDKYEVRKYAPFIVVETS-----GDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDP-EQ
        +E P YE++   + YE+R+Y   + V T          A    F  +  YI GKN   +KI MT PV +Q   S+ P       +       + PDP   
Subjt:  LESPKYEILKRTDKYEVRKYAPFIVVETS-----GDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDP-EQ

Query:  DTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSL
        + + ++K      AV +FSG  +++ + E+A  L SSL
Subjt:  DTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSL

AT2G37970.1 SOUL heme-binding family protein2.1e-1534.21Show/hide
Query:  LESPKYEILKRTDKYEVRKYAPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------
        +E+PKY + K  D YE+R+Y P +  E + D    K     GF  +A YI  FGK  N   EKI MT PV T+            T +SE  +       
Subjt:  LESPKYEILKRTDKYEVRKYAPFIVVETSGD----KLAGSAGFNTVAGYI--FGK--NSAKEKIPMTTPVFTQ------------TFDSELPK-------

Query:  -----------VSIQIVLPS-EKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDP
                   V++Q +LPS  K  +  P P  + + +++  G    V+KFSG  +E +V EK K+L S L KD  K +   +LARYN P
Subjt:  -----------VSIQIVLPS-EKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDP

AT3G10130.1 SOUL heme-binding family protein1.1e-1933.52Show/hide
Query:  FYKTPELESPKYEILKRTDKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDL
        F   P+LE+  + +L RTDKYE+R+  P+ V       ET  D    S  FN +A Y+FGKN+ KEK+ MTTPV T+   S  E  +++  ++    KD 
Subjt:  FYKTPELESPKYEILKRTDKYEVRKYAPFIVV------ETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDS--ELPKVSIQIVLPSEKDL

Query:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD---NLKPSRGCLLARYNDP
        +              +LP P+  ++ +++V   I AV+ FSG  T+E +E + +ELR +L  D    ++      +A+YN P
Subjt:  D--------------SLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKD---NLKPSRGCLLARYNDP

AT5G20140.1 SOUL heme-binding family protein1.8e-13673.05Show/hide
Query:  STVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTG
        STV+++ LV FLYEDLPHLFD+QGID+TAYD++V+FRDPITKHDTI+GYLFNI+ L+ +F P F LHW KQTGPYEITTRWTMVMKF+ LPWKP+LVFTG
Subjt:  STVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+N+DYFS+EGL+DVFKQLR YKTP+LE+PKY+ILKRT  YEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KDL SLP P ++ + L+K+EGG AA +KFSG+PTE++V+ K  ELRSSL KD L+  +GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDP

Query:  GRTWSFIM
        GRTW+FIM
Subjt:  GRTWSFIM

AT5G20140.2 SOUL heme-binding family protein2.2e-13772.67Show/hide
Query:  STVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTG
        STV+++ LV FLYEDLPHLFD+QGID+TAYD++V+FRDPITKHDTI+GYLFNI+ L+ +F P F LHW KQTGPYEITTRWTMVMKF+ LPWKP+LVFTG
Subjt:  STVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTG

Query:  NSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSA
         SIM +NPET KFCSH+DLWDSI+N+DYFS+EGL+DVFKQLR YKTP+LE+PKY+ILKRT  YEVR Y PFIVVET GDKL+GS+GFN VAGYIFGKNS 
Subjt:  NSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSA

Query:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDP
         EKIPMTTPVFTQT D++L   VS+QIV+PS KDL SLP P ++ + L+K+EGG AA +KFSG+PTE++V+ K  ELRSSL KD L+  +GC+LARYNDP
Subjt:  KEKIPMTTPVFTQTFDSELPK-VSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSGEPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDP

Query:  GRTWSFIMAQM
        GRTW+FIM+Q+
Subjt:  GRTWSFIMAQM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCTCCTCAATTTTCCCTCCAAAACTTCCACTCAGTCTCAACCCCAACACTCGGTTTCGGTTTCCGGCCGCCGACTTCCGGCAGACTAATCGGCCTCGCTCCCCG
TCTACTTAGAAGCAGAGCTATAACTTCTAAACCCCATACCCGAAATTCAAAGTGGGTCGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGG
ACCGATTGGTGGATTTCTTATACGAAGATCTTCCCCATCTCTTCGATGAACAGGGGATTGATCGGACGGCGTATGACGATCAAGTGAGATTTCGGGACCCCATTACCAAG
CACGATACGATTACTGGGTATTTGTTCAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGATTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTAC
AAGATGGACTATGGTAATGAAGTTTGTCCTTCTACCATGGAAACCACAATTAGTTTTTACTGGAAATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTC
ATGTGGATCTCTGGGACTCAATACAGAATAGCGACTACTTTTCTGTAGAAGGCCTGTTGGATGTATTTAAGCAGCTTCGGTTTTATAAGACTCCAGAATTGGAATCACCC
AAGTATGAGATACTGAAAAGAACTGATAAGTATGAGGTGAGAAAATATGCACCATTTATAGTGGTTGAAACAAGTGGAGACAAACTCGCTGGGTCTGCTGGATTCAATAC
AGTTGCTGGGTATATATTTGGGAAGAACTCAGCAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATTACCCAAAGTCTCCATTCAAA
TAGTTCTTCCTTCAGAGAAAGATCTAGACAGTCTGCCAGATCCTGAACAAGACACAATCGGCTTGAGAAAGGTTGAAGGAGGAATTGCTGCAGTGTTAAAATTCAGTGGA
GAACCTACCGAAGAGATTGTGGAAGAGAAGGCCAAAGAACTGCGGTCTAGTCTAATAAAGGATAATCTCAAACCCAGTAGGGGTTGTTTGCTTGCTCGGTATAACGACCC
TGGACGAACATGGAGCTTTATAATGGCTCAAATGGCCGCTCCTCAATTTTCCCTCCAAAACTTCCACTCAGTCTCAACCCCAACACTCGGTTTCGGTTTCCGGCCGCCGA
CTTCCGGCAGACTAATCGGCCTCGCTCCCCGTCTACTTAGAAGCAGAGCTATAACTTCTAAACCCCATACCCGAAATTCAAAGTGGGTCGTTCGATTAAGCTTGGTAGAT
CAAAGCCCACCACAATCGACGGTCGATGTGGACCGATTGGTGGATTTCTTATACGAAGATCTTCCCCATCTCTTCGATGAACAGGGGATTGATCGGACGGCGTATGACGA
TCAAGTGAGATATCGGGACCCCATTACCAAGCACGATACGATTACTGGGTATTTGTTCAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGATTTCTTATTGCACTGGG
TTAAACAGACAGGACCATATGAAATAACTACAAGATGGACTATGGTAATGAAGTTTGTCCTTCTACCATGGAAACCACAATTAGTTTTTACTGGAAATTCCATCATGGGT
ATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGGATCTCTGGGACTCAATACAGAATAGCGACTACTTTTCTGTAGAAGGCCTGTTGGATGTATTTAAGCAGCTTCG
GTTTTATAAGACTCCAGAATTGGAATCACCCAAGTATGAGATACTGAAAAGAACTGATAAGTATGAGGTGAGAAAATATGCACCATTTATAGTGGTTGAAACAAGTGGAG
ACAAACTCGCTGGGTCTGCTGGATTCAATACAGTTGCTGGTCTGCCAGATCCTGAACAAGACACAATCGGCTTGAGAAAGGTTGAAGGAGGAATTGCTGCAGTGTTAAAA
TTCAGTGGAAAACCTACCGAAGAGATTGTGGAAGAGAAGGCCAAAGAACTGCGGTCTAGTCTCATAAAGGATAATCTCAAACCCAGTAAGGGTTGTTTGCTTGCTCGGTA
TAACGACCCTGGACGAACATGGAGCTTTATAATGAGAAATGAGGTGCTAATATGGCTTGAAGAATTCTCATTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGCTCCTCAATTTTCCCTCCAAAACTTCCACTCAGTCTCAACCCCAACACTCGGTTTCGGTTTCCGGCCGCCGACTTCCGGCAGACTAATCGGCCTCGCTCCCCG
TCTACTTAGAAGCAGAGCTATAACTTCTAAACCCCATACCCGAAATTCAAAGTGGGTCGTTCGATTAAGCTTGGTAGATCAAAGCCCACCAAAATCGACGGTCGATGTGG
ACCGATTGGTGGATTTCTTATACGAAGATCTTCCCCATCTCTTCGATGAACAGGGGATTGATCGGACGGCGTATGACGATCAAGTGAGATTTCGGGACCCCATTACCAAG
CACGATACGATTACTGGGTATTTGTTCAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGATTTCTTATTGCACTGGGTTAAACAGACAGGACCATATGAAATAACTAC
AAGATGGACTATGGTAATGAAGTTTGTCCTTCTACCATGGAAACCACAATTAGTTTTTACTGGAAATTCCATCATGGGTATCAATCCAGAGACGGGCAAGTTCTGTAGTC
ATGTGGATCTCTGGGACTCAATACAGAATAGCGACTACTTTTCTGTAGAAGGCCTGTTGGATGTATTTAAGCAGCTTCGGTTTTATAAGACTCCAGAATTGGAATCACCC
AAGTATGAGATACTGAAAAGAACTGATAAGTATGAGGTGAGAAAATATGCACCATTTATAGTGGTTGAAACAAGTGGAGACAAACTCGCTGGGTCTGCTGGATTCAATAC
AGTTGCTGGGTATATATTTGGGAAGAACTCAGCAAAGGAGAAGATACCCATGACCACTCCTGTATTCACCCAAACATTTGACTCTGAATTACCCAAAGTCTCCATTCAAA
TAGTTCTTCCTTCAGAGAAAGATCTAGACAGTCTGCCAGATCCTGAACAAGACACAATCGGCTTGAGAAAGGTTGAAGGAGGAATTGCTGCAGTGTTAAAATTCAGTGGA
GAACCTACCGAAGAGATTGTGGAAGAGAAGGCCAAAGAACTGCGGTCTAGTCTAATAAAGGATAATCTCAAACCCAGTAGGGGTTGTTTGCTTGCTCGGTATAACGACCC
TGGACGAACATGGAGCTTTATAATGGCTCAAATGGCCGCTCCTCAATTTTCCCTCCAAAACTTCCACTCAGTCTCAACCCCAACACTCGGTTTCGGTTTCCGGCCGCCGA
CTTCCGGCAGACTAATCGGCCTCGCTCCCCGTCTACTTAGAAGCAGAGCTATAACTTCTAAACCCCATACCCGAAATTCAAAGTGGGTCGTTCGATTAAGCTTGGTAGAT
CAAAGCCCACCACAATCGACGGTCGATGTGGACCGATTGGTGGATTTCTTATACGAAGATCTTCCCCATCTCTTCGATGAACAGGGGATTGATCGGACGGCGTATGACGA
TCAAGTGAGATATCGGGACCCCATTACCAAGCACGATACGATTACTGGGTATTTGTTCAATATTTCCCTCTTGCGAGAACTCTTCAGGCCTGATTTCTTATTGCACTGGG
TTAAACAGACAGGACCATATGAAATAACTACAAGATGGACTATGGTAATGAAGTTTGTCCTTCTACCATGGAAACCACAATTAGTTTTTACTGGAAATTCCATCATGGGT
ATCAATCCAGAGACGGGCAAGTTCTGTAGTCATGTGGATCTCTGGGACTCAATACAGAATAGCGACTACTTTTCTGTAGAAGGCCTGTTGGATGTATTTAAGCAGCTTCG
GTTTTATAAGACTCCAGAATTGGAATCACCCAAGTATGAGATACTGAAAAGAACTGATAAGTATGAGGTGAGAAAATATGCACCATTTATAGTGGTTGAAACAAGTGGAG
ACAAACTCGCTGGGTCTGCTGGATTCAATACAGTTGCTGGTCTGCCAGATCCTGAACAAGACACAATCGGCTTGAGAAAGGTTGAAGGAGGAATTGCTGCAGTGTTAAAA
TTCAGTGGAAAACCTACCGAAGAGATTGTGGAAGAGAAGGCCAAAGAACTGCGGTCTAGTCTCATAAAGGATAATCTCAAACCCAGTAAGGGTTGTTTGCTTGCTCGGTA
TAACGACCCTGGACGAACATGGAGCTTTATAATGAGAAATGAGGTGCTAATATGGCTTGAAGAATTCTCATTGGAGTAG
Protein sequenceShow/hide protein sequence
MAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVDQSPPKSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRFRDPITK
HDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMGINPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESP
KYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGYIFGKNSAKEKIPMTTPVFTQTFDSELPKVSIQIVLPSEKDLDSLPDPEQDTIGLRKVEGGIAAVLKFSG
EPTEEIVEEKAKELRSSLIKDNLKPSRGCLLARYNDPGRTWSFIMAQMAAPQFSLQNFHSVSTPTLGFGFRPPTSGRLIGLAPRLLRSRAITSKPHTRNSKWVVRLSLVD
QSPPQSTVDVDRLVDFLYEDLPHLFDEQGIDRTAYDDQVRYRDPITKHDTITGYLFNISLLRELFRPDFLLHWVKQTGPYEITTRWTMVMKFVLLPWKPQLVFTGNSIMG
INPETGKFCSHVDLWDSIQNSDYFSVEGLLDVFKQLRFYKTPELESPKYEILKRTDKYEVRKYAPFIVVETSGDKLAGSAGFNTVAGLPDPEQDTIGLRKVEGGIAAVLK
FSGKPTEEIVEEKAKELRSSLIKDNLKPSKGCLLARYNDPGRTWSFIMRNEVLIWLEEFSLE