; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014224 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014224
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSqualene monooxygenase
Genome locationChr02:8659063..8666533
RNA-Seq ExpressionHG10014224
SyntenyHG10014224
Gene Ontology termsGO:0016126 - sterol biosynthetic process (biological process)
GO:0005783 - endoplasmic reticulum (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004506 - squalene monooxygenase activity (molecular function)
GO:0071949 - FAD binding (molecular function)
InterPro domainsIPR002938 - FAD-binding domain
IPR013698 - Squalene epoxidase
IPR036188 - FAD/NAD(P)-binding domain superfamily
IPR040125 - Squalene monooxygenase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141303.1 squalene monooxygenase SE1 [Cucumis sativus]1.9e-24083.46Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MVDQCALGWILASVLGA+ALY LFGKKNC VSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGR+VHVIERDL+EPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVD+IDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTI+GVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQEMTAYAPLTIVCDGCFSNLRRSLCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLY+SFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKR+WIGARLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKTVAAYYRAPP+
Subjt:  GVRQMFFPKTVAAYYRAPPV

XP_016901349.1 PREDICTED: squalene monooxygenase-like [Cucumis melo]1.1e-24083.85Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MVDQCALGWILASVLGA+ALY LFGKKNC V NERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGR+VHVIERDL+EPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQEMTAYAPLTIVCDGCFSNLRRSLCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLY+SFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKR+WIGARLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKTVAAYYRAPPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

XP_022935797.1 squalene monooxygenase-like isoform X1 [Cucurbita moschata]3.4e-23781.92Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        M+DQCALGWILASV+GAAALYFLFGKKNC  SNERRRESLKNIATTNGECK SNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMR+KAA+LPNVRL QGTVTSLLE+NGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQE TAYAPLTIVCDGCFSNLRR+LCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAP+LCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLV HFFAVA+YGVGRLLIPFPSPKR+WIG RLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKT+AAYYR+PPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

XP_023536393.1 squalene monooxygenase-like [Cucurbita pepo subsp. pepo]8.9e-23882.12Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MVDQCALGWILASV+GAAALYFLFGKKNC  SNERRRESLKNIATTNGECK SNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMR+KAA+LPNVRL QGTVTSL+E+NGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQE TAYAPLTIVCDGCFSNLRR+LCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYL+LGGIFSNGPVSLLSGLNPRPLSLV HFFAVA+YGVGRLLIPFPSPKR+WIG RLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKT+AAYYRAPPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

XP_038900155.1 squalene monooxygenase SE1-like [Benincasa hispida]3.3e-24083.85Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MV+QCALGWILASVLGAAALYF F KKNC VSNERRRESLKNIA TNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLP VRLEQGTVTSLLEENGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQEMTAYAPLTIVCDGCFSNLRRSLCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGG+FSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKR+WIGARLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKTVAAYYRAPPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

TrEMBL top hitse value%identityAlignment
A0A0A0L0L0 Squalene monooxygenase9.3e-24183.46Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MVDQCALGWILASVLGA+ALY LFGKKNC VSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGR+VHVIERDL+EPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVD+IDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTI+GVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQEMTAYAPLTIVCDGCFSNLRRSLCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLY+SFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKR+WIGARLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKTVAAYYRAPP+
Subjt:  GVRQMFFPKTVAAYYRAPPV

A0A1S4DZE1 Squalene monooxygenase5.5e-24183.85Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MVDQCALGWILASVLGA+ALY LFGKKNC V NERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGR+VHVIERDL+EPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQEMTAYAPLTIVCDGCFSNLRRSLCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLY+SFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKR+WIGARLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKTVAAYYRAPPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

A0A5D3D983 Squalene monooxygenase5.5e-24183.85Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MVDQCALGWILASVLGA+ALY LFGKKNC V NERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGR+VHVIERDL+EPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQEMTAYAPLTIVCDGCFSNLRRSLCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLY+SFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKR+WIGARLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKTVAAYYRAPPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

A0A6J1CDH7 Squalene monooxygenase3.2e-23380.96Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        MVDQC+LGW LASVLG  A+Y LFGKKNC  SN RRR+SLKNIATTNG+CKSS+SDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAA+L NVRLEQGTVTSLLEENGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQEMTAYAPLTIVCDGCFSNLRRSLCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIP QLYD+F+AAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTL KYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKR+WIGA++ISGAS+IIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFP TVAAYYRAPPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

A0A6J1F6K9 Squalene monooxygenase1.6e-23781.92Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        M+DQCALGWILASV+GAAALYFLFGKKNC  SNERRRESLKNIATTNGECK SNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMR+KAA+LPNVRL QGTVTSLLE+NGTIKGVQYKNKS
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQE TAYAPLTIVCDGCFSNLRR+LCNPK                                                    KVPSISNGEMANYLKNVVA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAP+LCKYLEAFYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                           KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLV HFFAVA+YGVGRLLIPFPSPKR+WIG RLISGASAIIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPPV
        GVRQMFFPKT+AAYYR+PPV
Subjt:  GVRQMFFPKTVAAYYRAPPV

SwissProt top hitse value%identityAlignment
B7TWW5 Squalene monooxygenase SE21.1e-18569.67Show/hide
Query:  NGECKS---SNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPL
        NG C     + S  D+IIVGAGVAGSALAYTLAKDGRRVHVIERDLTE DRIVGELLQPGGYLKL ELGLEDCV++IDAQRV+GYAL+ DGK+TRLSYPL
Subjt:  NGECKS---SNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPL

Query:  EKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMT-AYAPLTIVCDGCFSNLRRSLCNPK-------------
        EKFH+DV+GRSFHNGRFIQRMREKAASLPNVR+EQGTVTSL+E+ GT+KGV+YK K+GQEM+ AYAPLTIVCDGCFSNLR SLCNPK             
Subjt:  EKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMT-AYAPLTIVCDGCFSNLRRSLCNPK-------------

Query:  ---------------------------------------KVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMG
                                               KVPSI+NGE+A+YLK  VAPQIPP+LY SFIAAIDKG I+TMPNRSMPADP+ TPGALL+G
Subjt:  ---------------------------------------KVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMG

Query:  DAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI------------------IKEMRQACFDYLSLGGIFSNGPVS
        DAFNMRHPLTGGGMTVALSDIV++RDLL+PLRDL+D+ TLCKYLE+FYTLRK     I                   +EMR ACFDYLSLGGI S GP++
Subjt:  DAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI------------------IKEMRQACFDYLSLGGIFSNGPVS

Query:  LLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAAYYRAPPV
        LLSGLNPRP+SL  HFFAVAIYGVGRLLIPFPSP+++W+GARLISGAS IIFPIIK+EGVRQMFFP TV AYYRAPP+
Subjt:  LLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAAYYRAPPV

O48651 Squalene monooxygenase SE14.2e-19066.28Show/hide
Query:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDG--DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGE
        ++DQ  LGWI A + G   L     K+  + S E   +       +NG     N  G  D+IIVGAGVAGSALAYTLA DGRRVHVIERDLTE DRIVGE
Subjt:  MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDG--DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGE

Query:  LLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKN
        LLQPGGYLKL ELGLEDCV++IDAQRV+GYAL+ DGK+TRLSYPLEKFHSDV+GRSFHNGRF+QRMREKAASLPNVR+EQGTVTSL+E+ G++KGVQYK 
Subjt:  LLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKN

Query:  KSGQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNV
        K GQE++A+APLTIVCDGCFSNLRRSLCNPK                                                    KVP ISNGE+ANYLK V
Subjt:  KSGQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNV

Query:  VAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLV
        VAPQ+P QLY+SFIAA+DKGNIRTMPNRSMPADP+PTPGALL+GDAFNMRHPLTGGGMTVALSDIV++RDLL+PLRDL+D+ TLCKYLE+FYTLRK    
Subjt:  VAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLV

Query:  VI------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIK
         I                   +EMR ACFDYLSLGGI S GP++LLSGLNPRP+SL LHFFAVAIYGVGRLLIPFPSPKR+W+GARLI GAS IIFPIIK
Subjt:  VI------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIK

Query:  AEGVRQMFFPKTVAAYYRAPPV
        +EG+RQMFFP  V AYYRAPP+
Subjt:  AEGVRQMFFPKTVAAYYRAPPV

O81000 Squalene epoxidase 2, mitochondrial1.3e-17061.67Show/hide
Query:  ALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGY
        AL   +AS+     LY L    N +  N     S  +  + N E +  +S  D+IIVGAGVAGSALA+TL K+GRRVHVIERD +E DRIVGELLQPGGY
Subjt:  ALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGY

Query:  LKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMT
        LKL ELGLEDCV  IDAQRV GY LFKDGK T+L+YPLE F SDV+GRSFHNGRF+QRMREKA +L NVRLEQGTVTSLLEE+GTIKGV+Y+ K G E  
Subjt:  LKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMT

Query:  AYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVAPQIPP
        ++APLTIVCDGCFSNLRRSLC PK                                                    K+P I+NGEMA YLK  VAPQ+P 
Subjt:  AYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVAPQIPP

Query:  QLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI-----
        ++ ++FI A++KGNIRTMPNRSMPADP PTPGALL+GDAFNMRHPLTGGGMTVAL+DIVVLRDLL+P+R+LND   L KY+E+FYTLRK     I     
Subjt:  QLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI-----

Query:  -------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQM
                       EMR+ACFDYLSLGG+FS+GPV+LLSGLNPRPLSLVLHFFAVAIY V RL++PFPS +  W+GAR+IS AS+IIFPIIKAEGVRQM
Subjt:  -------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQM

Query:  FFPKTVAAYYRAPP
        FFP+T+ A YRAPP
Subjt:  FFPKTVAAYYRAPP

Q8VYH2 Squalene epoxidase 32.0e-17662.24Show/hide
Query:  VDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECK-SSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        VD C L     + L A  L ++  +++  +          ++   NG     S +D DIIIVGAGVAG+ALA+TL K+GRRVHVIERDLTEPDRIVGELL
Subjt:  VDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECK-SSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKL ELGLEDCV DIDAQRV GYALFKDGK T+LSYPL++F SDV+GRSFHNGRF+QRMREKA+ LPNVR+EQGTVTSL+EENG IKGVQYK K 
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQE+ ++APLTIVCDGCFSNLRRSLC PK                                                    K+PS+++GEMA++LK +VA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQ+PPQ+ D+FI+A++KGNIRTMPNRSMPADP  TPGALL+GDAFNMRHPLTGGGMTVALSDIV+LRDLL PL DL +  +L KY+E+FYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                            EMR+ACFDYLSLGG+ S+GPV+LLSGLNPRP+SLVLHFFAVAI+GVGRLL+P PS KR+W+GARLIS AS IIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPP
        GVRQMFFP+T+ A YRAPP
Subjt:  GVRQMFFPKTVAAYYRAPP

Q9SM02 Squalene epoxidase 13.8e-18368.1Show/hide
Query:  RESLKNIATTNGECKSSNSDG----DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFK
        R   K ++T   +  S N  G    D+I+VGAGVAGSALAYTL KD RRVHVIERDL+EPDRIVGELLQPGGYLKL ELG+EDCV++IDAQRVYGYALFK
Subjt:  RESLKNIATTNGECKSSNSDG----DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFK

Query:  DGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSLCNP----
        +GK  RL+YPLEKFH DVSGRSFHNGRFIQRMREKAASLPNV+LEQGTV SLLEENGTIKGV+YKNK+G+E TA+A LTIVCDGCFSNLRRSLCNP    
Subjt:  DGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSLCNP----

Query:  ------------------------------------------------KKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADP
                                                        +KVPSI+NGEM NYLK VVAPQ+P ++YDSFIAA+DKGNI++MPNRSMPA P
Subjt:  ------------------------------------------------KKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADP

Query:  YPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI------------------IKEMRQACFDYLSL
        YPTPGALLMGDAFNMRHPLTGGGMTVAL+DIVVLR+LL+PLRDL+D  +LCKYLE+FYTLRK     I                    EMR+ACFDYL L
Subjt:  YPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI------------------IKEMRQACFDYLSL

Query:  GGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAA-YYRAPPV
        GG+ ++GPVSLLSGLNPRPL+LV HFFAVA+YGV RLLIPFPSPKRIW+GA+LISGAS IIFPIIKAEGVRQMFFP TV A YY+AP V
Subjt:  GGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAA-YYRAPPV

Arabidopsis top hitse value%identityAlignment
AT1G58440.1 FAD/NAD(P)-binding oxidoreductase family protein2.7e-18468.1Show/hide
Query:  RESLKNIATTNGECKSSNSDG----DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFK
        R   K ++T   +  S N  G    D+I+VGAGVAGSALAYTL KD RRVHVIERDL+EPDRIVGELLQPGGYLKL ELG+EDCV++IDAQRVYGYALFK
Subjt:  RESLKNIATTNGECKSSNSDG----DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFK

Query:  DGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSLCNP----
        +GK  RL+YPLEKFH DVSGRSFHNGRFIQRMREKAASLPNV+LEQGTV SLLEENGTIKGV+YKNK+G+E TA+A LTIVCDGCFSNLRRSLCNP    
Subjt:  DGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSLCNP----

Query:  ------------------------------------------------KKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADP
                                                        +KVPSI+NGEM NYLK VVAPQ+P ++YDSFIAA+DKGNI++MPNRSMPA P
Subjt:  ------------------------------------------------KKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADP

Query:  YPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI------------------IKEMRQACFDYLSL
        YPTPGALLMGDAFNMRHPLTGGGMTVAL+DIVVLR+LL+PLRDL+D  +LCKYLE+FYTLRK     I                    EMR+ACFDYL L
Subjt:  YPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI------------------IKEMRQACFDYLSL

Query:  GGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAA-YYRAPPV
        GG+ ++GPVSLLSGLNPRPL+LV HFFAVA+YGV RLLIPFPSPKRIW+GA+LISGAS IIFPIIKAEGVRQMFFP TV A YY+AP V
Subjt:  GGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAA-YYRAPPV

AT2G22830.1 squalene epoxidase 29.1e-17261.67Show/hide
Query:  ALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGY
        AL   +AS+     LY L    N +  N     S  +  + N E +  +S  D+IIVGAGVAGSALA+TL K+GRRVHVIERD +E DRIVGELLQPGGY
Subjt:  ALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGY

Query:  LKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMT
        LKL ELGLEDCV  IDAQRV GY LFKDGK T+L+YPLE F SDV+GRSFHNGRF+QRMREKA +L NVRLEQGTVTSLLEE+GTIKGV+Y+ K G E  
Subjt:  LKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMT

Query:  AYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVAPQIPP
        ++APLTIVCDGCFSNLRRSLC PK                                                    K+P I+NGEMA YLK  VAPQ+P 
Subjt:  AYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVAPQIPP

Query:  QLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI-----
        ++ ++FI A++KGNIRTMPNRSMPADP PTPGALL+GDAFNMRHPLTGGGMTVAL+DIVVLRDLL+P+R+LND   L KY+E+FYTLRK     I     
Subjt:  QLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI-----

Query:  -------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQM
                       EMR+ACFDYLSLGG+FS+GPV+LLSGLNPRPLSLVLHFFAVAIY V RL++PFPS +  W+GAR+IS AS+IIFPIIKAEGVRQM
Subjt:  -------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQM

Query:  FFPKTVAAYYRAPP
        FFP+T+ A YRAPP
Subjt:  FFPKTVAAYYRAPP

AT4G37760.1 squalene epoxidase 31.4e-17762.24Show/hide
Query:  VDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECK-SSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL
        VD C L     + L A  L ++  +++  +          ++   NG     S +D DIIIVGAGVAG+ALA+TL K+GRRVHVIERDLTEPDRIVGELL
Subjt:  VDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECK-SSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELL

Query:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS
        QPGGYLKL ELGLEDCV DIDAQRV GYALFKDGK T+LSYPL++F SDV+GRSFHNGRF+QRMREKA+ LPNVR+EQGTVTSL+EENG IKGVQYK K 
Subjt:  QPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKS

Query:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA
        GQE+ ++APLTIVCDGCFSNLRRSLC PK                                                    K+PS+++GEMA++LK +VA
Subjt:  GQEMTAYAPLTIVCDGCFSNLRRSLCNPK----------------------------------------------------KVPSISNGEMANYLKNVVA

Query:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI
        PQ+PPQ+ D+FI+A++KGNIRTMPNRSMPADP  TPGALL+GDAFNMRHPLTGGGMTVALSDIV+LRDLL PL DL +  +L KY+E+FYTLRK     I
Subjt:  PQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRKVSLVVI

Query:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE
                            EMR+ACFDYLSLGG+ S+GPV+LLSGLNPRP+SLVLHFFAVAI+GVGRLL+P PS KR+W+GARLIS AS IIFPIIKAE
Subjt:  ------------------IKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAE

Query:  GVRQMFFPKTVAAYYRAPP
        GVRQMFFP+T+ A YRAPP
Subjt:  GVRQMFFPKTVAAYYRAPP

AT5G24150.1 FAD/NAD(P)-binding oxidoreductase family protein2.3e-11948.6Show/hide
Query:  DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLE--KFHSDVSGRSF
        D+IIVGAGV GSALAY LAKDGRRVHVIERDL EP+RI+GE +QPGG L L++LGLEDC++ IDAQ+  G  ++KDGK+   S+P++   F  D S RSF
Subjt:  DIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLE--KFHSDVSGRSF

Query:  HNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSL------------------CN----------
        HNGRF+QR+R+KA+SLPNVRLE+GTV SL+EE G IKGV YKN +G+E TA APLT+VCDGC+SNLRRSL                  C           
Subjt:  HNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSL------------------CN----------

Query:  ------------------------PKKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKG-NIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGG
                                P  +PSISNGEMA ++KN +APQ+P +L   F+  ID+G +I+ MP + M A      G +L+GDAFNMRHP    
Subjt:  ------------------------PKKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKG-NIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGG

Query:  GMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK------------VSLVVII------KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSL
        GM V LSDI++LR LL+PL +L +A  + + +++FY +RK             S V++       + MRQ C+DYLS GG  ++G ++LL G+NPRP+SL
Subjt:  GMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK------------VSLVVII------KEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSL

Query:  VLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAAYYRA
        + H  A+ +  +G LL PFPSP RIW   RL   A  ++ P +KAEGV QM FP   AAY ++
Subjt:  VLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAAYYRA

AT5G24160.1 squalene monoxygenase 67.8e-11545.77Show/hide
Query:  RESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKD
        ++   ++A T  E +   +  D+IIVGAGV GSALAY LAKDGRRVHVIERD+ EP+R++GE +QPGG L L++LGL+DC++DIDAQ+  G A++KDGK+
Subjt:  RESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTELGLEDCVDDIDAQRVYGYALFKDGKD

Query:  TRLSYPLE--KFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSL---------
            +P++   F  + S RSFHNGRF+Q++R KA SL NVRLE+GTV SLLEE G +KGV YKNK G+E TA APLT+VCDGC+SNLRRSL         
Subjt:  TRLSYPLE--KFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSNLRRSL---------

Query:  ----------------------------------------CN----PKKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKG-NIRTMPNRSMPADP
                                                C     P+  PSI+NGEM+ ++KN + PQ+PP+L   F+  ID+G +I+ +P + M +  
Subjt:  ----------------------------------------CN----PKKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKG-NIRTMPNRSMPADP

Query:  YPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK------------VSLVVI-----IKE-MRQACFDYLSL
            G +++GDAFNMRHP+   GM V LSDI++LR LL+PL +L DA  + + + +FY +RK             S V+I      KE MRQ  +DYL  
Subjt:  YPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTLCKYLEAFYTLRK------------VSLVVI-----IKE-MRQACFDYLSL

Query:  GGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAAYYRA
        GG  ++G ++LL G+NPRPLSLV H  A+ +  +G+LL PFPSP RIW   +L   A  ++ P +KAEGV QM FP   AAY+++
Subjt:  GGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKTVAAYYRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGATCAGTGCGCGTTAGGCTGGATCTTGGCCTCAGTTCTCGGAGCCGCGGCTCTATATTTCTTGTTCGGCAAGAAAAACTGCGACGTTTCGAATGAACGGAGGCG
TGAGAGCCTAAAGAATATCGCCACCACGAATGGAGAATGCAAATCGAGTAACAGCGACGGCGACATTATCATTGTCGGAGCCGGAGTCGCCGGATCCGCTCTTGCCTATA
CTCTCGCCAAGGATGGTCGTCGGGTACATGTGATTGAAAGAGACTTGACAGAACCGGACAGAATTGTTGGTGAATTATTACAACCGGGAGGCTATCTGAAGTTAACAGAA
TTGGGACTTGAAGATTGTGTGGATGATATCGATGCCCAACGAGTGTATGGTTATGCTCTTTTTAAGGATGGAAAGGATACTAGACTCTCCTACCCCTTGGAAAAATTTCA
CTCGGACGTGTCTGGGAGGAGCTTTCACAATGGACGCTTCATTCAAAGGATGCGTGAGAAGGCCGCATCCCTGCCCAATGTGCGCTTGGAGCAGGGAACGGTCACTTCTC
TGCTCGAGGAAAATGGTACAATTAAAGGTGTGCAATACAAAAACAAGTCCGGACAAGAGATGACAGCTTATGCACCTTTGACTATTGTTTGTGATGGGTGCTTTTCAAAT
TTGCGACGCTCTCTTTGCAACCCTAAGAAAGTTCCTTCCATTTCAAACGGTGAAATGGCTAACTATTTGAAGAATGTTGTTGCTCCTCAGATTCCTCCTCAACTTTATGA
TTCTTTCATAGCTGCTATTGATAAGGGTAACATCAGAACAATGCCAAATAGAAGTATGCCAGCAGATCCATATCCTACCCCCGGAGCCCTATTGATGGGCGATGCATTCA
ATATGCGTCACCCTCTAACTGGCGGAGGAATGACCGTTGCATTGTCTGACATTGTTGTTCTTAGAGATCTTCTCAAGCCTCTGCGTGATCTGAATGATGCACCCACTCTC
TGCAAGTATCTTGAAGCATTCTACACGCTACGTAAGGTTAGCCTTGTGGTGATTATAAAGGAAATGCGTCAGGCTTGCTTCGATTATTTAAGCCTCGGTGGAATCTTCTC
GAATGGACCAGTTTCTTTACTCTCTGGGTTGAACCCTCGCCCCTTGAGCTTGGTTCTCCATTTCTTTGCGGTGGCTATATACGGTGTTGGTCGATTGCTGATCCCATTTC
CTTCTCCCAAACGCATCTGGATCGGTGCCAGATTGATTTCGGGTGCATCGGCCATTATCTTTCCCATTATCAAGGCTGAAGGAGTAAGGCAGATGTTTTTCCCCAAGACC
GTTGCAGCTTATTACAGAGCTCCACCCGTGGAGCTTTTGTGGCAAACCGTTTTGGTTCCGAAGATGGACGATGCTTGTGTTACTGCTTTTGCCTTGGCTCGCAATGCTGT
CTGCACTGCAACAATAAAGGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGATCAGTGCGCGTTAGGCTGGATCTTGGCCTCAGTTCTCGGAGCCGCGGCTCTATATTTCTTGTTCGGCAAGAAAAACTGCGACGTTTCGAATGAACGGAGGCG
TGAGAGCCTAAAGAATATCGCCACCACGAATGGAGAATGCAAATCGAGTAACAGCGACGGCGACATTATCATTGTCGGAGCCGGAGTCGCCGGATCCGCTCTTGCCTATA
CTCTCGCCAAGGATGGTCGTCGGGTACATGTGATTGAAAGAGACTTGACAGAACCGGACAGAATTGTTGGTGAATTATTACAACCGGGAGGCTATCTGAAGTTAACAGAA
TTGGGACTTGAAGATTGTGTGGATGATATCGATGCCCAACGAGTGTATGGTTATGCTCTTTTTAAGGATGGAAAGGATACTAGACTCTCCTACCCCTTGGAAAAATTTCA
CTCGGACGTGTCTGGGAGGAGCTTTCACAATGGACGCTTCATTCAAAGGATGCGTGAGAAGGCCGCATCCCTGCCCAATGTGCGCTTGGAGCAGGGAACGGTCACTTCTC
TGCTCGAGGAAAATGGTACAATTAAAGGTGTGCAATACAAAAACAAGTCCGGACAAGAGATGACAGCTTATGCACCTTTGACTATTGTTTGTGATGGGTGCTTTTCAAAT
TTGCGACGCTCTCTTTGCAACCCTAAGAAAGTTCCTTCCATTTCAAACGGTGAAATGGCTAACTATTTGAAGAATGTTGTTGCTCCTCAGATTCCTCCTCAACTTTATGA
TTCTTTCATAGCTGCTATTGATAAGGGTAACATCAGAACAATGCCAAATAGAAGTATGCCAGCAGATCCATATCCTACCCCCGGAGCCCTATTGATGGGCGATGCATTCA
ATATGCGTCACCCTCTAACTGGCGGAGGAATGACCGTTGCATTGTCTGACATTGTTGTTCTTAGAGATCTTCTCAAGCCTCTGCGTGATCTGAATGATGCACCCACTCTC
TGCAAGTATCTTGAAGCATTCTACACGCTACGTAAGGTTAGCCTTGTGGTGATTATAAAGGAAATGCGTCAGGCTTGCTTCGATTATTTAAGCCTCGGTGGAATCTTCTC
GAATGGACCAGTTTCTTTACTCTCTGGGTTGAACCCTCGCCCCTTGAGCTTGGTTCTCCATTTCTTTGCGGTGGCTATATACGGTGTTGGTCGATTGCTGATCCCATTTC
CTTCTCCCAAACGCATCTGGATCGGTGCCAGATTGATTTCGGGTGCATCGGCCATTATCTTTCCCATTATCAAGGCTGAAGGAGTAAGGCAGATGTTTTTCCCCAAGACC
GTTGCAGCTTATTACAGAGCTCCACCCGTGGAGCTTTTGTGGCAAACCGTTTTGGTTCCGAAGATGGACGATGCTTGTGTTACTGCTTTTGCCTTGGCTCGCAATGCTGT
CTGCACTGCAACAATAAAGGGTTGA
Protein sequenceShow/hide protein sequence
MVDQCALGWILASVLGAAALYFLFGKKNCDVSNERRRESLKNIATTNGECKSSNSDGDIIIVGAGVAGSALAYTLAKDGRRVHVIERDLTEPDRIVGELLQPGGYLKLTE
LGLEDCVDDIDAQRVYGYALFKDGKDTRLSYPLEKFHSDVSGRSFHNGRFIQRMREKAASLPNVRLEQGTVTSLLEENGTIKGVQYKNKSGQEMTAYAPLTIVCDGCFSN
LRRSLCNPKKVPSISNGEMANYLKNVVAPQIPPQLYDSFIAAIDKGNIRTMPNRSMPADPYPTPGALLMGDAFNMRHPLTGGGMTVALSDIVVLRDLLKPLRDLNDAPTL
CKYLEAFYTLRKVSLVVIIKEMRQACFDYLSLGGIFSNGPVSLLSGLNPRPLSLVLHFFAVAIYGVGRLLIPFPSPKRIWIGARLISGASAIIFPIIKAEGVRQMFFPKT
VAAYYRAPPVELLWQTVLVPKMDDACVTAFALARNAVCTATIKG