; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr029919 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr029919
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSWIM-type domain-containing protein
Genome locationtig00153554:1125654..1133679
RNA-Seq ExpressionSgr029919
SyntenySgr029919
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0004506 - squalene monooxygenase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0071949 - FAD binding (molecular function)
InterPro domainsIPR002938 - FAD-binding domain
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR013698 - Squalene epoxidase
IPR018289 - MULE transposase domain
IPR036188 - FAD/NAD(P)-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576892.1 hypothetical protein SDJN03_24466, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0092.11Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEE CPWRIRAVKLPNSPIFTIRSLEG HTCG+NAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLRNNINY+PKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGY LLP YCEQIRKANPGS+AEVFTTGS N FQRLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        I LGGIQLKSKYLGTLLSATSYDADGG+FPLAFGVVDAENEESWIWFLSELH ALEMN++N LHLTFLS+GQKGILDALRRKFPNSS+ALCMRYLSE IG
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLV+LLW+AAYATTT+AFKERMSEIEE+SPEAAKW+QQFPPSHWALVYFEGNRYGHLSSN+EEF+KWILEARELPIIQVIE+IHSKLMVEFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
        RARS+SWFSFL+PSAEKRI+EAI +ASSY+VLRSDEVEFEVLSAERS IVNIG+RCCSCR+WQLYGIPCSHAVAAIASCRKDVYAFTEKCF VSGYRE Y
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        AKGIYPIP +LE K  DETPMDDD QIVRPPKFRRPPGRPEKKRICVE+LNREKHTVHCSRCNQTGHYKTTCKAELMK IEQF
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

XP_022149173.1 uncharacterized protein LOC111017663 [Momordica charantia]0.0e+0093.14Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDL+RYFA+CAEE CPWRIRAVKLPNSP FTIRSLEG HTCGQNAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLR++I+YKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGYCLLP+YCEQIRKANPG++AEVFT GSDN F RLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESW+WFLSELH ALEMN+QNILHLTFLS+GQKGILDALRR+F NSSHA CMRYLSE +G
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLVNLLWKAAYATTTIAFKERM EIEEVSPEAAKWIQQFPPSHWALVYFEG RYGHLSSNIE+F+KWILEARELPI+QVIE+IHSKLM EFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
        RARS+SW+SFL+PSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIG+RCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        AKGIYPIPGKLE KKHDETPMDD+ QI+RPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAEL+KSIEQF
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

XP_023553246.1 uncharacterized protein LOC111810721 [Cucurbita pepo subsp. pepo]0.0e+0092.45Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEE CPWRIRAVKLPNSPIFTIRSLEG HTCG+NAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGY LLP YCEQIRKANPGS+AEVFTTGS N FQRLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        I LGGIQLKSKYLGTLLSATSYDADGG+FPLAFGVVDAENEESWIWFLSELH ALEMN++N L LTFLS+GQKGILDALRRKFPNSS+ALCMRYLSE IG
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLV+LLWKAAYATTTIAFKERMSEIEE+SPEA KWIQQFPPSHWALVYFEGNRYGHLSSN+EEF+KWILEARELPIIQVIE+IHSKLMVEFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
        RARS+SWFSFL+PSAEKRI+EAI +ASSY+VLRSDEVEFEVLSAERS IVNIG+RCCSCR+WQLYGIPCSHAVAAIASCRKDVYAFTEKCF VSGYRE Y
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        AKGIYPIP +LE K  DETPMDDD QIVRPPKFRRPPGRPEKKRICVE+LNREKHTVHCSRCNQTGHYKTTCKAELMK IEQF
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

XP_038906150.1 uncharacterized protein LOC120092030 isoform X1 [Benincasa hispida]0.0e+0086.31Show/hide
Query:  IDSVSLCSSIFASVKLRSASSPSISRSILESCFEVVWWYGSSVSFSSVAALKIAQTGATGSEIGLSHMEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFEL
        + S+SL       VKLRSA S SISR                               A+GS IGLS MEDHTFVVGQEFPDV+AFRNAIKEAAIAQHFEL
Subjt:  IDSVSLCSSIFASVKLRSASSPSISRSILESCFEVVWWYGSSVSFSSVAALKIAQTGATGSEIGLSHMEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFEL

Query:  RIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEERLRNNINYKPKDILHDIHKQYGITIPYKQAWRA
        RIIKSDL+RYFAKCA+E CPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEERLRNNINYKPKDILHDIHKQYGITIPYKQAWRA
Subjt:  RIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEERLRNNINYKPKDILHDIHKQYGITIPYKQAWRA

Query:  KERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPVIGLGGIQLKSKYLGTLLSATSYDADGGLFPLAF
        KERGLATI+GSSEEGY LLPSYCEQIRKANPGS AEVFTTGSDN FQRLFVSF+ASI+GFLNGCFPVI LGGIQLKSKYLGTLLSA SYDADGG+FPLAF
Subjt:  KERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPVIGLGGIQLKSKYLGTLLSATSYDADGGLFPLAF

Query:  GVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIGKEFKNSRLVNLLWKAAYATTTIAFKERMSEIEE
        GVVDAENEESW+WFLSELH ALEMN+ N LH TFLSDGQKGILDALRRKFPNSSHALCMRYLSE IGKEFKNSRLV+L+WKAAYATTTIAFKERMS+IEE
Subjt:  GVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIGKEFKNSRLVNLLWKAAYATTTIAFKERMSEIEE

Query:  VSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEERRARSTSWFSFLSPSAEKRIMEAINLASSYQVLR
        VSPEAAKWIQQFPPSHWALVYFEG RYGHLSSN++EF+KWILEARELPIIQVIE+IHSKLM EFEERRARSTSWFSFL+PSAEKRI+EAINLASSYQVLR
Subjt:  VSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEERRARSTSWFSFLSPSAEKRIMEAINLASSYQVLR

Query:  SDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETYAKGIYPIPGKLERKKHDETPMDDDVQIVRPPKF
        SDEVEFEVLSA+RS IV+IG+RCC CRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETYAK IYPIP KLE KK DETPM DD QIVRPPKF
Subjt:  SDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETYAKGIYPIPGKLERKKHDETPMDDDVQIVRPPKF

Query:  RRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        RRPPGRPEKKRICVEDLNREKHTVHCS+CNQTGHYKTTCKAELMKSIEQF
Subjt:  RRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

XP_038906151.1 uncharacterized protein LOC120092030 isoform X2 [Benincasa hispida]0.0e+0092.24Show/hide
Query:  ATGSEIGLSHMEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQAS
        A+GS IGLS MEDHTFVVGQEFPDV+AFRNAIKEAAIAQHFELRIIKSDL+RYFAKCA+E CPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQAS
Subjt:  ATGSEIGLSHMEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQAS

Query:  IDWIVSFIEERLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASI
        IDWIVSFIEERLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGY LLPSYCEQIRKANPGS AEVFTTGSDN FQRLFVSF+ASI
Subjt:  IDWIVSFIEERLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASI

Query:  YGFLNGCFPVIGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHAL
        +GFLNGCFPVI LGGIQLKSKYLGTLLSA SYDADGG+FPLAFGVVDAENEESW+WFLSELH ALEMN+ N LH TFLSDGQKGILDALRRKFPNSSHAL
Subjt:  YGFLNGCFPVIGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHAL

Query:  CMRYLSECIGKEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIH
        CMRYLSE IGKEFKNSRLV+L+WKAAYATTTIAFKERMS+IEEVSPEAAKWIQQFPPSHWALVYFEG RYGHLSSN++EF+KWILEARELPIIQVIE+IH
Subjt:  CMRYLSECIGKEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIH

Query:  SKLMVEFEERRARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKC
        SKLM EFEERRARSTSWFSFL+PSAEKRI+EAINLASSYQVLRSDEVEFEVLSA+RS IV+IG+RCC CRDWQLYGIPCSHAVAAIASCRKDVYAFTEKC
Subjt:  SKLMVEFEERRARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKC

Query:  FTVSGYRETYAKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        FTVSGYRETYAK IYPIP KLE KK DETPM DD QIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCS+CNQTGHYKTTCKAELMKSIEQF
Subjt:  FTVSGYRETYAKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

TrEMBL top hitse value%identityAlignment
A0A0A0KYF1 SWIM-type domain-containing protein0.0e+0091.08Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEE CPWRIRAVKLPNSPIFTIRSL+GTHTCGQNAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGY LLPSYCEQI+KANPGS+AEVFTTGSDN FQRLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        IGLGGI+LKSKYLGTLLSATSYDADGG+FP+AFGVVDAENEESW+WFLSELHNAL+MN+ N  HLTFLSDGQKGILDALRRKFPNSSHALCMRYLSE IG
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLV+L+WKAAYA TTIAFKERMS+IEE+SPEAAKWIQQFPP HWALVYFEG RYGHLSSN+EEF+KWIL+ARELPIIQVIE+IHSKLM EFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
        RARSTSWFSFL+PSAEKRI+EAI LASSYQVL+SDEVEFEVLSA+RS IVNIG+RCC CRDWQLYGIPCSHAVAAIASCRKDV+AF EKCFTVSGYRE Y
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
         K ++PIP KLE K+ D+TP+DDD QIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKA+LMKSIEQF
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

A0A5A7TM66 SWIM-type domain-containing protein0.0e+0090.72Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTF+VGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEE CPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGY LLPSYCEQI+KANPGS+AEVFTTGSDN FQRLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        IGLGGI+LKSKYLGTLLSATSYDADGG+FP+AFGVVDAENEESW+WFLSELHNAL MNS N  HLTFLSDGQKGILDALRRKFPNSSHALCMRYLSE IG
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLV+L+WKAAYATTTIAFKERMS+IEE+SPEAAKWIQQFPPSHWALVYFEG RYGHLSSN+EEF++W+L+ARELPIIQVIE+IH KLM EFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
        R+RSTSWFSFL+PSAEKRI+EAI LASSYQVL+SDEVEFEVLSA+RS IVNIG+RCC CRDWQL+GIPCSHAVAAIASCRKDVYAF EKCFTVSGYRE Y
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQ
         K ++PIP KL  +K D+TP+D+D+QIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQ
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQ

A0A6J1D640 uncharacterized protein LOC1110176630.0e+0093.14Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDL+RYFA+CAEE CPWRIRAVKLPNSP FTIRSLEG HTCGQNAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLR++I+YKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGYCLLP+YCEQIRKANPG++AEVFT GSDN F RLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESW+WFLSELH ALEMN+QNILHLTFLS+GQKGILDALRR+F NSSHA CMRYLSE +G
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLVNLLWKAAYATTTIAFKERM EIEEVSPEAAKWIQQFPPSHWALVYFEG RYGHLSSNIE+F+KWILEARELPI+QVIE+IHSKLM EFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
        RARS+SW+SFL+PSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIG+RCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        AKGIYPIPGKLE KKHDETPMDD+ QI+RPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAEL+KSIEQF
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

A0A6J1E5V3 uncharacterized protein LOC1114308600.0e+0092.11Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTFVVGQEFPDVKA RNAIKEAAIAQHFELRIIKSDLIRYFAKCAEE CPWRIRAVKLPNSPIFTIRSLEG HTCG+NAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGY LLP YCEQIRKANPGS+AEVFTTGS N FQRLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        I LGGIQLKSKYLGTLLSATSYDADGG+FPLAFGVVDAENEESWIWFLSELH ALEMN++N LHLTFLS+GQKGILDALRRKFPNSS+ALCMRYLSE IG
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLV+LLWKAAYATT IAFKERMSEIEE+SPEAAKW+QQFPPSHWALVYFEGNRYGHLSSN+EEF+KWILEARELPIIQVIE+IHSKLMVEFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
        RARS+SWFSFL+PSAEKRI+EAI +ASSYQVLRSDEVEFEVLSAERS IVNIG RCCSC +WQLYGIPCSHAVAAIASCRKDVYAFTEKCF VSGYRE Y
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        AKGIYPIP +LE K  DET MDDD QIVRPPKFRRPPGRPEKKRICVE+LNREKHTVHCSRCNQTGHYKTTCKAELMK IEQF
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

A0A6J1J6A0 uncharacterized protein LOC1114829750.0e+0091.77Show/hide
Query:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE
        MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEE CPWRIRAVKLPNSPIFTIRSLEG HTCG+NAQNGHHQASIDWIVSFIEE
Subjt:  MEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSPIFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEE

Query:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV
        RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATI+GSSEEGY LLP YCEQIRKANPGS+AEVFTTGS N FQRLFVSFYASIYGFLNGCFPV
Subjt:  RLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGSDNRFQRLFVSFYASIYGFLNGCFPV

Query:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG
        I LGGIQLKSKYLGTLLSATSYDADGG+FPLAFGVVDAENEESWIWFLSELH ALEMN+ N LHLTFLS+GQKGILDALRRKFPNSS+ALCMRYLSE IG
Subjt:  IGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPNSSHALCMRYLSECIG

Query:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER
        KEFKNSRLV+LLWKAAYATT IAFKERMSEIEE+SPEAAKW+QQFPPSHWALVYFEGNRYGHLSSN+EEF+KWILEARELPIIQVIE+IHSKLMVEFEER
Subjt:  KEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMVEFEER

Query:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY
         ARS+SWFSFL+PSAE+RI+EAI +ASSY+VLRSDEVEFEV+SAERS IVNIG+RCCSCR+WQLYGIPCSHAVAAIASCRKDVYAFTEKCF VSGYRE Y
Subjt:  RARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETY

Query:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF
        AKGIYPIP +LE K  DET MDDD QIVRPPKFRRPPGRPEKKRICVE+LNREKHTVHCSRCNQTGHYKTTCKAELMK IEQF
Subjt:  AKGIYPIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF

SwissProt top hitse value%identityAlignment
B7TWW5 Squalene monooxygenase SE24.7e-10948.1Show/hide
Query:  FVVLGLIASPLALFVLKKQSQRKQSQSFLQNQQ---------PKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMI
        F    L    L LF L+++ +R++ +    + +           S  NG       +    DV+IVGAGVAGSALAY L KDGRR+H+IE+DLT  +R++
Subjt:  FVVLGLIASPLALFVLKKQSQRKQSQSFLQNQQ---------PKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMI

Query:  GEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRY
        GE+L+PGGYLKL+ELGLEDCV +I+AQ+VFGY +Y DGK  RL Y L   +++V  RSFHNG F+QR+REK ASLPNV  +QGTV  L++QK TVKGVRY
Subjt:  GEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRY

Query:  KTKTGQEM-VAHADLTVICDGRYSNLRRFLCR-------------------------------------HKISNTQIRCFVDIPGHKAPSISNGEMPHYL
        KTK GQEM  A+A LT++CDG +SNLR  LC                                      +KIS+T+IRC VD+PG K PSI+NGE+ HYL
Subjt:  KTKTGQEM-VAHADLTVICDGRYSNLRRFLCR-------------------------------------HKISNTQIRCFVDIPGHKAPSISNGEMPHYL

Query:  KTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTL
        KT VAPQIPPE Y +F++ IDK  ++T+P     +MP  P+ T GA+LLGDA  +RHP +GGG+TV LSD+V++ DLLRPL  L D+S++  +L+SFYTL
Subjt:  KTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTL

Query:  RKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL
        RKP+A T++TLA     A Y  +     +A   +++  F +L LG +
Subjt:  RKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL

O48651 Squalene monooxygenase SE12.3e-10851.8Show/hide
Query:  GRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERS
        G  DV+IVGAGVAGSALAY L  DGRR+H+IE+DLT  +R++GE+L+PGGYLKL+ELGLEDCV +I+AQ+VFGY +Y DGK  RL Y L   +S+V  RS
Subjt:  GRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERS

Query:  FHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFLCR---------------------------
        FHNG FVQR+REK ASLPNV  +QGTV  L+++K +VKGV+YKTK GQE+ A A LT++CDG +SNLRR LC                            
Subjt:  FHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFLCR---------------------------

Query:  ----------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPF
                  +KIS+T+IRC VD+PG K P ISNGE+ +YLKT+VAPQ+P + Y++F++ +DK  +RT+P     +MP  P+PT GA+LLGDA  +RHP 
Subjt:  ----------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPF

Query:  SGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL
        +GGG+TV LSD+V++ DLLRPL  L D+S++  +L+SFYTLRKP+A T++TLA     A Y  +     +A   ++   F +L LG +
Subjt:  SGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL

O81000 Squalene epoxidase 2, mitochondrial9.6e-10246.24Show/hide
Query:  LALFVLKKQSQR-KQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLE
        L L++L++ S + K+++  + +Q      N      ++     DV+IVGAGVAGSALA+ LGK+GRR+H+IE+D +  +R++GE+L+PGGYLKL+ELGLE
Subjt:  LALFVLKKQSQR-KQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLE

Query:  DCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVIC
        DCV  I+AQ+V GYV++KDGK+ +L Y L + +S+V  RSFHNG FVQR+REK  +L NV  +QGTV  LL++  T+KGVRY+TK G E  + A LT++C
Subjt:  DCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVIC

Query:  DGRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMST
        DG +SNLRR LC+ K                                     IS++++RC VD+PG K P I+NGEM  YLKT VAPQ+P +  +AF++ 
Subjt:  DGRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMST

Query:  IDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAF
        ++K  +RT+P     +MP  P PT GA+LLGDA  +RHP +GGG+TV L+D+VVL DLLRP+ +L+D  ++S +++SFYTLRKP+A T++TLA     A 
Subjt:  IDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAF

Query:  YNHWSGLMREA--NVKETSFRFLCLG
        Y  +     EA   ++E  F +L LG
Subjt:  YNHWSGLMREA--NVKETSFRFLCLG

Q8VYH2 Squalene epoxidase 31.7e-10346.82Show/hide
Query:  LALFVLKKQSQRKQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLED
        L L+VL+++S+       ++N               KS   +D++IVGAGVAG+ALA+ LGK+GRR+H+IE+DLT  +R++GE+L+PGGYLKL+ELGLED
Subjt:  LALFVLKKQSQRKQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLED

Query:  CVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICD
        CV DI+AQ+V GY ++KDGK+ +L Y L+  +S+V  RSFHNG FVQR+REK + LPNV  +QGTV  L+++   +KGV+YKTK GQE+ + A LT++CD
Subjt:  CVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICD

Query:  GRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTI
        G +SNLRR LC+ K                                     IS++++RC VD+PG K PS+++GEM H+LKTMVAPQ+PP+  DAF+S +
Subjt:  GRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTI

Query:  DKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFY
        +K  +RT+P   M A P H   T GA+LLGDA  +RHP +GGG+TV LSD+V+L DLL PL  L +  S+S +++SFYTLRKP+A T++TLA     A Y
Subjt:  DKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFY

Query:  NHWSGLMREA--NVKETSFRFLCLG
          +     +A   ++   F +L LG
Subjt:  NHWSGLMREA--NVKETSFRFLCLG

Q9SM02 Squalene epoxidase 15.8e-10750.91Show/hide
Query:  DVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHN
        DV++VGAGVAGSALAY LGKD RR+H+IE+DL+  +R++GE+L+PGGYLKLLELG+EDCV +I+AQ+V+GY ++K+GK IRL Y L   + +V+ RSFHN
Subjt:  DVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHN

Query:  GLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFLCR------------------------------
        G F+QR+REK ASLPNV+ +QGTVL LL++  T+KGVRYK K G+E  A A LT++CDG +SNLRR LC                               
Subjt:  GLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFLCR------------------------------

Query:  -------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGG
               + IS+T++RC VD+PG K PSI+NGEM +YLKT+VAPQ+P E YD+F++ +DK  ++++P     +MP  P PT GA+L+GDA  +RHP +GG
Subjt:  -------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGG

Query:  GVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL
        G+TV L+D+VVL +LLRPL  L D +S+  +L+SFYTLRKP+A T++TLA     A Y  +     EA   ++E  F +L LG +
Subjt:  GVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL

Arabidopsis top hitse value%identityAlignment
AT1G58440.1 FAD/NAD(P)-binding oxidoreductase family protein4.1e-10850.91Show/hide
Query:  DVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHN
        DV++VGAGVAGSALAY LGKD RR+H+IE+DL+  +R++GE+L+PGGYLKLLELG+EDCV +I+AQ+V+GY ++K+GK IRL Y L   + +V+ RSFHN
Subjt:  DVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHN

Query:  GLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFLCR------------------------------
        G F+QR+REK ASLPNV+ +QGTVL LL++  T+KGVRYK K G+E  A A LT++CDG +SNLRR LC                               
Subjt:  GLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFLCR------------------------------

Query:  -------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGG
               + IS+T++RC VD+PG K PSI+NGEM +YLKT+VAPQ+P E YD+F++ +DK  ++++P     +MP  P PT GA+L+GDA  +RHP +GG
Subjt:  -------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGG

Query:  GVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL
        G+TV L+D+VVL +LLRPL  L D +S+  +L+SFYTLRKP+A T++TLA     A Y  +     EA   ++E  F +L LG +
Subjt:  GVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREA--NVKETSFRFLCLGDL

AT2G22830.1 squalene epoxidase 26.8e-10346.24Show/hide
Query:  LALFVLKKQSQR-KQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLE
        L L++L++ S + K+++  + +Q      N      ++     DV+IVGAGVAGSALA+ LGK+GRR+H+IE+D +  +R++GE+L+PGGYLKL+ELGLE
Subjt:  LALFVLKKQSQR-KQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLE

Query:  DCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVIC
        DCV  I+AQ+V GYV++KDGK+ +L Y L + +S+V  RSFHNG FVQR+REK  +L NV  +QGTV  LL++  T+KGVRY+TK G E  + A LT++C
Subjt:  DCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVIC

Query:  DGRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMST
        DG +SNLRR LC+ K                                     IS++++RC VD+PG K P I+NGEM  YLKT VAPQ+P +  +AF++ 
Subjt:  DGRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMST

Query:  IDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAF
        ++K  +RT+P     +MP  P PT GA+LLGDA  +RHP +GGG+TV L+D+VVL DLLRP+ +L+D  ++S +++SFYTLRKP+A T++TLA     A 
Subjt:  IDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAF

Query:  YNHWSGLMREA--NVKETSFRFLCLG
        Y  +     EA   ++E  F +L LG
Subjt:  YNHWSGLMREA--NVKETSFRFLCLG

AT4G37760.1 squalene epoxidase 31.2e-10446.82Show/hide
Query:  LALFVLKKQSQRKQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLED
        L L+VL+++S+       ++N               KS   +D++IVGAGVAG+ALA+ LGK+GRR+H+IE+DLT  +R++GE+L+PGGYLKL+ELGLED
Subjt:  LALFVLKKQSQRKQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLED

Query:  CVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICD
        CV DI+AQ+V GY ++KDGK+ +L Y L+  +S+V  RSFHNG FVQR+REK + LPNV  +QGTV  L+++   +KGV+YKTK GQE+ + A LT++CD
Subjt:  CVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICD

Query:  GRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTI
        G +SNLRR LC+ K                                     IS++++RC VD+PG K PS+++GEM H+LKTMVAPQ+PP+  DAF+S +
Subjt:  GRYSNLRRFLCRHK-------------------------------------ISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTI

Query:  DKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFY
        +K  +RT+P   M A P H   T GA+LLGDA  +RHP +GGG+TV LSD+V+L DLL PL  L +  S+S +++SFYTLRKP+A T++TLA     A Y
Subjt:  DKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFY

Query:  NHWSGLMREA--NVKETSFRFLCLG
          +     +A   ++   F +L LG
Subjt:  NHWSGLMREA--NVKETSFRFLCLG

AT5G24150.1 FAD/NAD(P)-binding oxidoreductase family protein4.9e-7741.38Show/hide
Query:  FLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYK
        ++ N+  K+     ++   +  G  DV+IVGAGV GSALAYAL KDGRR+H+IE+DL    R++GE ++PGG L L +LGLEDC+  I+AQ+  G  +YK
Subjt:  FLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYK

Query:  DGKYIRLPYRLNSSN--SNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRR-------
        DGK     + ++++N   + + RSFHNG FVQRLR+K +SLPNV  ++GTV  L+++K  +KGV YK   G+E  A A LTV+CDG YSNLRR       
Subjt:  DGKYIRLPYRLNSSN--SNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRR-------

Query:  ------------------------------FLCRHKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKAT-VRTIPYWKMF
                                      F   ++IS+T +RC  ++  +  PSISNGEM  ++K  +APQ+P +    F+  ID+   ++ +P  KM 
Subjt:  ------------------------------FLCRHKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKAT-VRTIPYWKMF

Query:  AMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTL
        A     +   G ILLGDA  +RHP    G+ V LSD+++L  LL+PL +L +A  +S  ++SFY +RKPM+ T++TL
Subjt:  AMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTL

AT5G24160.1 squalene monoxygenase 64.7e-8040.53Show/hide
Query:  FLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYK
        +L N + K+     ++   +  G  DV+IVGAGV GSALAYAL KDGRR+H+IE+D+    RM+GE ++PGG L L +LGL+DC+ DI+AQ+  G  +YK
Subjt:  FLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLELGLEDCVLDIEAQQVFGYVIYK

Query:  DGKYIRLPYRLNSSNSNV--TERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFL-----
        DGK    P+ ++++N +   + RSFHNG FVQ+LR K  SL NV  ++GTV  LL++K  VKGV YK K G+E  A A LTV+CDG YSNLRR L     
Subjt:  DGKYIRLPYRLNSSNSNV--TERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNLRRFL-----

Query:  --------------CR-------------------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDK-ATVRTIPYWKM
                      CR                   ++IS+T +RC  ++     PSI+NGEM  ++K  + PQ+PP+    F+  ID+ A ++ +P  +M
Subjt:  --------------CR-------------------HKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDK-ATVRTIPYWKM

Query:  FAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREAN--
         +     +   G I+LGDA  +RHP    G+ V LSD+++L  LL+PL +L DA+ VS+ + SFY +RKPM+ T++TL      AF     G   EA   
Subjt:  FAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVVLCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREAN--

Query:  VKETSFRFLCLGDLNLS
        +++  + +LC G    S
Subjt:  VKETSFRFLCLGDLNLS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTATCAGTTTGTGGTCTTGGGACTTATAGCCTCACCTTTGGCACTTTTTGTTCTGAAAAAGCAAAGCCAAAGGAAGCAAAGCCAAAGCTTTCTGCAAAACCAGCA
ACCAAAGAGCCCTGCAAATGGAAGTTCCATTTGCAGGAGCAAGAGCCATGGGAGAATTGATGTGGTCATTGTTGGTGCCGGTGTTGCTGGTTCTGCTCTTGCTTACGCTC
TTGGCAAGGATGGGCGTCGTATGCATATAATTGAGCAAGATTTGACCATGCACAACAGAATGATAGGGGAAGTGCTTAAGCCAGGAGGCTATCTGAAATTACTCGAGTTG
GGTCTTGAAGATTGTGTGTTGGATATTGAAGCTCAACAAGTGTTTGGATATGTCATCTACAAGGATGGGAAATATATTAGGCTGCCTTATCGCTTAAACAGCTCCAATTC
AAATGTGACAGAAAGAAGCTTCCACAATGGGCTCTTTGTACAAAGACTAAGGGAAAAGGTTGCATCTCTTCCCAATGTAGAATTCAAGCAAGGAACTGTCCTTGGCCTAC
TGAAGCAAAAAGATACTGTCAAAGGGGTGCGGTACAAGACGAAAACCGGTCAAGAAATGGTGGCACATGCTGATCTCACAGTCATATGCGACGGCCGCTACTCGAATCTA
CGACGGTTTCTCTGCAGGCATAAGATTAGTAACACCCAAATCCGCTGTTTTGTGGACATACCTGGCCATAAAGCTCCCTCCATTTCCAACGGTGAAATGCCCCATTACTT
GAAGACTATGGTGGCTCCTCAGATCCCACCAGAGTTTTACGATGCTTTCATGTCAACAATCGATAAAGCAACCGTGAGAACAATCCCTTATTGGAAAATGTTCGCCATGC
CTCCTCATCCCAATCCCACCGCCGGCGCCATTCTCCTCGGCGACGCTTTATACGTCCGGCATCCATTTTCCGGCGGCGGGGTGACCGTCACCTTATCAGACGTAGTCGTG
CTTTGCGATCTTCTCCGACCGCTTCACAGTCTCGACGACGCATCTTCAGTCTCCGACCATCTTCAATCCTTTTACACTCTTCGCAAGCCAATGGCTATTACGATGGATAC
ACTGGCCGCACTTGGTCAGTTGGCGTTCTATAATCATTGGAGTGGGCTGATGAGGGAAGCAAATGTGAAGGAAACTTCTTTCAGGTTCTTGTGCCTTGGAGACCTCAATC
TTTCGCCGGAATGCCGAGGATGTGCTCTCCCGCATGGCTGCGCCATTGACTCTGTTTCTCTCTGTTCCTCCATCTTCGCCTCTGTGAAGCTCCGATCTGCATCGTCCCCC
TCAATTTCAAGGTCGATCTTAGAGTCGTGTTTTGAAGTTGTTTGGTGGTACGGAAGTTCCGTTTCTTTTAGTTCTGTGGCCGCGTTGAAGATAGCTCAGACAGGAGCAAC
TGGTTCTGAGATTGGATTGTCTCACATGGAGGATCATACGTTTGTTGTAGGTCAAGAGTTTCCTGATGTCAAGGCATTTCGAAATGCAATTAAAGAAGCTGCTATCGCTC
AGCACTTTGAGTTACGAATTATTAAGAGTGACTTGATACGTTATTTTGCAAAATGTGCCGAAGAGAGTTGTCCATGGCGCATTCGTGCTGTTAAGCTTCCTAATTCTCCA
ATCTTCACAATTAGAAGCCTTGAAGGGACGCATACCTGTGGGCAAAATGCTCAAAATGGACACCATCAGGCCTCCATAGATTGGATTGTGAGTTTCATAGAAGAACGGCT
GCGGAACAACATTAATTACAAACCAAAGGACATATTACATGACATCCATAAGCAATATGGCATCACTATACCATATAAGCAGGCTTGGCGTGCAAAGGAACGAGGCCTTG
CCACAATTTATGGTTCTTCTGAAGAAGGTTATTGTCTGCTTCCATCCTACTGTGAACAAATAAGGAAAGCCAACCCTGGAAGTATTGCAGAAGTGTTCACGACTGGTTCA
GATAATCGCTTTCAGCGACTCTTTGTTTCGTTTTATGCATCCATATATGGGTTTCTAAATGGTTGCTTCCCTGTGATTGGTCTCGGTGGAATCCAGCTTAAAAGCAAATA
CCTAGGTACTTTACTATCAGCTACTTCTTATGATGCTGATGGTGGGCTATTTCCACTTGCTTTTGGTGTAGTTGATGCCGAAAATGAGGAAAGTTGGATTTGGTTTTTGT
CGGAGTTACACAACGCACTTGAGATGAACAGTCAGAATATACTGCACCTCACATTTTTATCAGATGGACAAAAAGGAATTTTAGATGCTTTGAGAAGGAAATTCCCAAAT
TCTTCACATGCATTATGCATGCGCTACTTAAGTGAATGCATTGGAAAAGAGTTCAAAAACTCGAGGCTTGTTAATCTTCTGTGGAAAGCGGCATATGCTACAACTACAAT
TGCTTTCAAAGAGAGAATGTCAGAAATAGAAGAGGTTTCACCTGAAGCTGCAAAGTGGATACAGCAATTTCCACCCTCTCATTGGGCATTAGTTTATTTTGAAGGAAATC
GGTATGGGCATCTCTCTTCAAATATAGAGGAATTCTCTAAATGGATTCTTGAGGCACGTGAGTTACCCATCATCCAGGTGATTGAGCAGATTCACAGTAAACTAATGGTT
GAGTTTGAAGAGCGGCGTGCTAGGAGTACATCATGGTTTTCTTTCCTGAGCCCATCAGCTGAAAAACGAATAATGGAAGCAATCAACCTTGCCTCTTCGTATCAAGTTCT
TCGATCAGATGAGGTAGAGTTTGAGGTTTTATCAGCTGAGAGATCAGATATAGTCAATATCGGAGAGCGATGTTGTTCCTGTCGTGATTGGCAGCTTTATGGAATACCTT
GCTCTCATGCTGTTGCAGCCATTGCCTCATGTAGAAAAGATGTTTATGCCTTTACAGAGAAGTGCTTTACGGTTTCTGGTTACCGGGAAACATATGCAAAAGGTATATAC
CCCATTCCAGGAAAACTCGAACGGAAAAAACACGATGAAACTCCTATGGATGATGATGTGCAAATTGTGCGACCACCTAAGTTTCGTCGACCTCCAGGACGACCTGAAAA
GAAGAGAATTTGTGTAGAGGACCTGAATCGTGAGAAACATACCGTGCATTGTAGTCGATGTAATCAAACAGGACACTACAAGACTACCTGCAAGGCAGAGCTCATGAAGA
GCATTGAACAGTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGATTTATCAGTTTGTGGTCTTGGGACTTATAGCCTCACCTTTGGCACTTTTTGTTCTGAAAAAGCAAAGCCAAAGGAAGCAAAGCCAAAGCTTTCTGCAAAACCAGCA
ACCAAAGAGCCCTGCAAATGGAAGTTCCATTTGCAGGAGCAAGAGCCATGGGAGAATTGATGTGGTCATTGTTGGTGCCGGTGTTGCTGGTTCTGCTCTTGCTTACGCTC
TTGGCAAGGATGGGCGTCGTATGCATATAATTGAGCAAGATTTGACCATGCACAACAGAATGATAGGGGAAGTGCTTAAGCCAGGAGGCTATCTGAAATTACTCGAGTTG
GGTCTTGAAGATTGTGTGTTGGATATTGAAGCTCAACAAGTGTTTGGATATGTCATCTACAAGGATGGGAAATATATTAGGCTGCCTTATCGCTTAAACAGCTCCAATTC
AAATGTGACAGAAAGAAGCTTCCACAATGGGCTCTTTGTACAAAGACTAAGGGAAAAGGTTGCATCTCTTCCCAATGTAGAATTCAAGCAAGGAACTGTCCTTGGCCTAC
TGAAGCAAAAAGATACTGTCAAAGGGGTGCGGTACAAGACGAAAACCGGTCAAGAAATGGTGGCACATGCTGATCTCACAGTCATATGCGACGGCCGCTACTCGAATCTA
CGACGGTTTCTCTGCAGGCATAAGATTAGTAACACCCAAATCCGCTGTTTTGTGGACATACCTGGCCATAAAGCTCCCTCCATTTCCAACGGTGAAATGCCCCATTACTT
GAAGACTATGGTGGCTCCTCAGATCCCACCAGAGTTTTACGATGCTTTCATGTCAACAATCGATAAAGCAACCGTGAGAACAATCCCTTATTGGAAAATGTTCGCCATGC
CTCCTCATCCCAATCCCACCGCCGGCGCCATTCTCCTCGGCGACGCTTTATACGTCCGGCATCCATTTTCCGGCGGCGGGGTGACCGTCACCTTATCAGACGTAGTCGTG
CTTTGCGATCTTCTCCGACCGCTTCACAGTCTCGACGACGCATCTTCAGTCTCCGACCATCTTCAATCCTTTTACACTCTTCGCAAGCCAATGGCTATTACGATGGATAC
ACTGGCCGCACTTGGTCAGTTGGCGTTCTATAATCATTGGAGTGGGCTGATGAGGGAAGCAAATGTGAAGGAAACTTCTTTCAGGTTCTTGTGCCTTGGAGACCTCAATC
TTTCGCCGGAATGCCGAGGATGTGCTCTCCCGCATGGCTGCGCCATTGACTCTGTTTCTCTCTGTTCCTCCATCTTCGCCTCTGTGAAGCTCCGATCTGCATCGTCCCCC
TCAATTTCAAGGTCGATCTTAGAGTCGTGTTTTGAAGTTGTTTGGTGGTACGGAAGTTCCGTTTCTTTTAGTTCTGTGGCCGCGTTGAAGATAGCTCAGACAGGAGCAAC
TGGTTCTGAGATTGGATTGTCTCACATGGAGGATCATACGTTTGTTGTAGGTCAAGAGTTTCCTGATGTCAAGGCATTTCGAAATGCAATTAAAGAAGCTGCTATCGCTC
AGCACTTTGAGTTACGAATTATTAAGAGTGACTTGATACGTTATTTTGCAAAATGTGCCGAAGAGAGTTGTCCATGGCGCATTCGTGCTGTTAAGCTTCCTAATTCTCCA
ATCTTCACAATTAGAAGCCTTGAAGGGACGCATACCTGTGGGCAAAATGCTCAAAATGGACACCATCAGGCCTCCATAGATTGGATTGTGAGTTTCATAGAAGAACGGCT
GCGGAACAACATTAATTACAAACCAAAGGACATATTACATGACATCCATAAGCAATATGGCATCACTATACCATATAAGCAGGCTTGGCGTGCAAAGGAACGAGGCCTTG
CCACAATTTATGGTTCTTCTGAAGAAGGTTATTGTCTGCTTCCATCCTACTGTGAACAAATAAGGAAAGCCAACCCTGGAAGTATTGCAGAAGTGTTCACGACTGGTTCA
GATAATCGCTTTCAGCGACTCTTTGTTTCGTTTTATGCATCCATATATGGGTTTCTAAATGGTTGCTTCCCTGTGATTGGTCTCGGTGGAATCCAGCTTAAAAGCAAATA
CCTAGGTACTTTACTATCAGCTACTTCTTATGATGCTGATGGTGGGCTATTTCCACTTGCTTTTGGTGTAGTTGATGCCGAAAATGAGGAAAGTTGGATTTGGTTTTTGT
CGGAGTTACACAACGCACTTGAGATGAACAGTCAGAATATACTGCACCTCACATTTTTATCAGATGGACAAAAAGGAATTTTAGATGCTTTGAGAAGGAAATTCCCAAAT
TCTTCACATGCATTATGCATGCGCTACTTAAGTGAATGCATTGGAAAAGAGTTCAAAAACTCGAGGCTTGTTAATCTTCTGTGGAAAGCGGCATATGCTACAACTACAAT
TGCTTTCAAAGAGAGAATGTCAGAAATAGAAGAGGTTTCACCTGAAGCTGCAAAGTGGATACAGCAATTTCCACCCTCTCATTGGGCATTAGTTTATTTTGAAGGAAATC
GGTATGGGCATCTCTCTTCAAATATAGAGGAATTCTCTAAATGGATTCTTGAGGCACGTGAGTTACCCATCATCCAGGTGATTGAGCAGATTCACAGTAAACTAATGGTT
GAGTTTGAAGAGCGGCGTGCTAGGAGTACATCATGGTTTTCTTTCCTGAGCCCATCAGCTGAAAAACGAATAATGGAAGCAATCAACCTTGCCTCTTCGTATCAAGTTCT
TCGATCAGATGAGGTAGAGTTTGAGGTTTTATCAGCTGAGAGATCAGATATAGTCAATATCGGAGAGCGATGTTGTTCCTGTCGTGATTGGCAGCTTTATGGAATACCTT
GCTCTCATGCTGTTGCAGCCATTGCCTCATGTAGAAAAGATGTTTATGCCTTTACAGAGAAGTGCTTTACGGTTTCTGGTTACCGGGAAACATATGCAAAAGGTATATAC
CCCATTCCAGGAAAACTCGAACGGAAAAAACACGATGAAACTCCTATGGATGATGATGTGCAAATTGTGCGACCACCTAAGTTTCGTCGACCTCCAGGACGACCTGAAAA
GAAGAGAATTTGTGTAGAGGACCTGAATCGTGAGAAACATACCGTGCATTGTAGTCGATGTAATCAAACAGGACACTACAAGACTACCTGCAAGGCAGAGCTCATGAAGA
GCATTGAACAGTTCTAG
Protein sequenceShow/hide protein sequence
MIYQFVVLGLIASPLALFVLKKQSQRKQSQSFLQNQQPKSPANGSSICRSKSHGRIDVVIVGAGVAGSALAYALGKDGRRMHIIEQDLTMHNRMIGEVLKPGGYLKLLEL
GLEDCVLDIEAQQVFGYVIYKDGKYIRLPYRLNSSNSNVTERSFHNGLFVQRLREKVASLPNVEFKQGTVLGLLKQKDTVKGVRYKTKTGQEMVAHADLTVICDGRYSNL
RRFLCRHKISNTQIRCFVDIPGHKAPSISNGEMPHYLKTMVAPQIPPEFYDAFMSTIDKATVRTIPYWKMFAMPPHPNPTAGAILLGDALYVRHPFSGGGVTVTLSDVVV
LCDLLRPLHSLDDASSVSDHLQSFYTLRKPMAITMDTLAALGQLAFYNHWSGLMREANVKETSFRFLCLGDLNLSPECRGCALPHGCAIDSVSLCSSIFASVKLRSASSP
SISRSILESCFEVVWWYGSSVSFSSVAALKIAQTGATGSEIGLSHMEDHTFVVGQEFPDVKAFRNAIKEAAIAQHFELRIIKSDLIRYFAKCAEESCPWRIRAVKLPNSP
IFTIRSLEGTHTCGQNAQNGHHQASIDWIVSFIEERLRNNINYKPKDILHDIHKQYGITIPYKQAWRAKERGLATIYGSSEEGYCLLPSYCEQIRKANPGSIAEVFTTGS
DNRFQRLFVSFYASIYGFLNGCFPVIGLGGIQLKSKYLGTLLSATSYDADGGLFPLAFGVVDAENEESWIWFLSELHNALEMNSQNILHLTFLSDGQKGILDALRRKFPN
SSHALCMRYLSECIGKEFKNSRLVNLLWKAAYATTTIAFKERMSEIEEVSPEAAKWIQQFPPSHWALVYFEGNRYGHLSSNIEEFSKWILEARELPIIQVIEQIHSKLMV
EFEERRARSTSWFSFLSPSAEKRIMEAINLASSYQVLRSDEVEFEVLSAERSDIVNIGERCCSCRDWQLYGIPCSHAVAAIASCRKDVYAFTEKCFTVSGYRETYAKGIY
PIPGKLERKKHDETPMDDDVQIVRPPKFRRPPGRPEKKRICVEDLNREKHTVHCSRCNQTGHYKTTCKAELMKSIEQF