CuGenDBv2

Sgr025547 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025547
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: cytochrome P450 71A1-like
Genome location: tig00007935:962746..967080
RNA-Seq Expression: Sgr025547
Synteny: Sgr025547
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004497 - monooxygenase activity (molecular function)
  GO:0005506 - iron ion binding (molecular function)
  GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
  GO:0020037 - heme binding (molecular function)
InterPro domains:
  IPR001128 - Cytochrome P450
  IPR002401 - Cytochrome P450, E-class, group I
  IPR036396 - Cytochrome P450 superfamily
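
The genome location and Gene Ontology entries in this record follow regular textual patterns, so they can be split into fields programmatically. The following is a minimal sketch, not part of the database itself: the names `parse_location` and `parse_go_term` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Location:
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'contig:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


def parse_go_term(line: str) -> tuple[str, str, str]:
    """Split 'GO:nnnnnnn - name (namespace)' into (id, name, namespace)."""
    match = re.fullmatch(r"(GO:\d{7}) - (.+) \(([^()]+)\)", line.strip())
    if match is None:
        raise ValueError(f"unrecognised GO term line: {line!r}")
    return match.group(1), match.group(2), match.group(3)


if __name__ == "__main__":
    loc = parse_location("tig00007935:962746..967080")
    print(loc.contig, loc.start, loc.end, loc.span)
    print(parse_go_term("GO:0004497 - monooxygenase activity (molecular function)"))
```

Under the inclusive-coordinate assumption the gene spans 4335 bp on the contig, which is longer than the coding sequence listed further down.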


Homology
GenBank top hits | e value | % identity | Alignment
KAA0053469.1 cytochrome P450 71A1-like [Cucumis melo var. makuwa] | 1.0e-04 | 77.78
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        + EDLNM+EVFGLSTPKKFPLD VAEPRL   LY I
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

XP_008460308.1 PREDICTED: cytochrome P450 71A1-like [Cucumis melo] | 1.0e-04 | 77.78
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        + EDLNM+EVFGLSTPKKFPLD VAEPRL   LY I
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

XP_008460308.1 PREDICTED: cytochrome P450 71A1-like [Cucumis melo] | 3.8e-153 | 94.46
Query:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY
        AW+ATLA+LLLSRR+RRRKLNLPPGPKPWPLIGNL+LIGSLPHQSIHQLS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKTQDL FVSRPKTAAGKYTTY
Subjt:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY

Query:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE
        NYS+ITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALL+EI KS G+VIK+KDYLSTVSLNVISRMVLGKKYTDESE+ IVSPDEFKKMLDE
Subjt:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE

Query:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGV+DYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
Subjt:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

XP_022152553.1 cytochrome P450 71A1-like [Momordica charantia] | 3.4e-08 | 88.89
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        +KEDLNMEE+FGLSTPKKFPL AVAEPRLPPHLYS+
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

XP_022152553.1 cytochrome P450 71A1-like [Momordica charantia] | 1.7e-156 | 93.33
Query:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR
        M+ PSWVSYAAAW+ATLALLLLSRRL RRKLNLPPGP+PWPLIGNLNLIGSLPHQSIHQLS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKT DL FVSR
Subjt:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIV
        PKTAAGKYTTYNYS+ITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLK+I KS G+VIKLKDYLSTVSLNVISRMVLG+KYTDESE+AIV
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIV

Query:  SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGV++YVAKDMVDVLLQ ADDP+LEVKLERHGVKAFTQ
Subjt:  SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

XP_038876425.1 trimethyltridecatetraene synthase-like [Benincasa hispida] | 2.0e-157 | 94.02
Query:  MEAPSWV-SYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVS
        ++ PSW+ SYAAAW+ATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIH+LS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKT DLTFVS
Subjt:  MEAPSWV-SYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVS

Query:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAI
        RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLK+I KS G+VIKLKDYLSTVSLNVISRMVLGKKYTDESE+AI
Subjt:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAI

Query:  VSPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFT
        VSPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGV++YVAKDMVDVLLQLADDP LEVKLERHGVKAFT
Subjt:  VSPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFT

Query:  Q
        Q
Subjt:  Q

XP_038876425.1 trimethyltridecatetraene synthase-like [Benincasa hispida] | 6.4e-07 | 86.11
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        +KEDLNM+EVFGLSTPKKFPLD VAEPRL P LYSI
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

XP_038876425.1 trimethyltridecatetraene synthase-like [Benincasa hispida] | 1.3e-156 | 92.33
Query:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR
        ME PSWVSYAAAWVAT+ALLLLSRRLRRRKLNLPPGPKPWP IGNLNLIGSLPHQSIHQLS+KYG IMHLRFGSFPVVVGSSV+MAKIFLKT DLTFVSR
Subjt:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIV
        PKTAAGKYTTYNYS+ITWSQYGPYWRQARKMCLMELFSA+RLDSYEYIR+EEM+ALL++I +S G+ I++KDYLSTVSLNVISRMVLGKKYTDESE+AIV
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIV

Query:  SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGY+KRMKALSKKFDRFLEHVLDEHNERRKG+KDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
Subjt:  SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

XP_038877760.1 trimethyltridecatetraene synthase-like [Benincasa hispida] | 8.9e-09 | 86.11
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        +KEDLN+EE+FGLSTPKKFPLDAVA+PRLPPHLYS+
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

XP_038877760.1 trimethyltridecatetraene synthase-like [Benincasa hispida] | 3.8e-153 | 94.46
Query:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY
        AW+ATLA+LLLSRR+RRRKLNLPPGPKPWPLIGNL+LIGSLPHQSIHQLS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKTQDL FVSRPKTAAGKYTTY
Subjt:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY

Query:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE
        NYS+ITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALL+EI KS G+VIK+KDYLSTVSLNVISRMVLGKKYTDESE+ IVSPDEFKKMLDE
Subjt:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE

Query:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGV+DYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
Subjt:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
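
The % identity values listed for the hits can be cross-checked against the alignments themselves. The sketch below assumes that identity is the number of alignment columns where the middle line shows a letter (an exact match), divided by the number of aligned columns; how the database actually computes the figure is not stated in this record, and `percent_identity` is a hypothetical helper name. The query line is used for the column count because trailing spaces on the middle line may have been lost.

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percentage of aligned columns whose middle-line character is a letter.

    In BLAST-style output the middle line repeats the residue for an exact
    match, shows '+' for a conservative substitution, and a space otherwise.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


if __name__ == "__main__":
    # First GenBank hit in this record (KAA0053469.1).
    query = "QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI"
    midline = "+ EDLNM+EVFGLSTPKKFPLD VAEPRL   LY I"
    print(percent_identity(midline, len(query)))  # 77.78
```

For this hit the middle line has 28 letters over 36 columns, which reproduces the listed 77.78.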

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CCA6 cytochrome P450 71A1-like | 4.9e-05 | 77.78
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        + EDLNM+EVFGLSTPKKFPLD VAEPRL   LY I
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

A0A1S3CCA6 cytochrome P450 71A1-like | 1.6e-152 | 90.91
Query:  PSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKT
        PSWVSYAAAW+A LA LLLSRRLRRR LNLPPGPKPWPLIGNL+LIGSLPHQSIH LS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKT DL FVSRPKT
Subjt:  PSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKT

Query:  AAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPD
        AAGKYTTYNYS+ITWSQYGPYWRQARKMCLMELFSA+RLDSYEYIR+EEMNALL+++ KS G+V+++KDYLSTVSLNVISRMVLGKKYTDESE+ IVSPD
Subjt:  AAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPD

Query:  EFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        EFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHN RR+GVKDYVAKDMVDVLLQLADDP LEVKLERHGVKAFTQ
Subjt:  EFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

A0A5D3D8H9 Cytochrome P450 71A1-like | 4.9e-05 | 77.78
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        + EDLNM+EVFGLSTPKKFPLD VAEPRL   LY I
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

A0A5D3D8H9 Cytochrome P450 71A1-like | 1.9e-153 | 94.46
Query:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY
        AW+ATLA+LLLSRR+RRRKLNLPPGPKPWPLIGNL+LIGSLPHQSIHQLS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKTQDL FVSRPKTAAGKYTTY
Subjt:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY

Query:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE
        NYS+ITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALL+EI KS G+VIK+KDYLSTVSLNVISRMVLGKKYTDESE+ IVSPDEFKKMLDE
Subjt:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE

Query:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGV+DYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
Subjt:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

A0A6J1DF57 cytochrome P450 71A1-like | 6.1e-157 | 92.33
Query:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR
        ME PSWVSYAAAWVAT+ALLLLSRRLRRRKLNLPPGPKPWP IGNLNLIGSLPHQSIHQLS+KYG IMHLRFGSFPVVVGSSV+MAKIFLKT DLTFVSR
Subjt:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIV
        PKTAAGKYTTYNYS+ITWSQYGPYWRQARKMCLMELFSA+RLDSYEYIR+EEM+ALL++I +S G+ I++KDYLSTVSLNVISRMVLGKKYTDESE+AIV
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIV

Query:  SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGY+KRMKALSKKFDRFLEHVLDEHNERRKG+KDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
Subjt:  SPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

A0A6J1DF57 cytochrome P450 71A1-like | 1.6e-08 | 88.89
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        +KEDLNMEE+FGLSTPKKFPL AVAEPRLPPHLYS+
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

A0A6J1DF57 cytochrome P450 71A1-like | 1.9e-153 | 94.46
Query:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY
        AW+ATLA+LLLSRR+RRRKLNLPPGPKPWPLIGNL+LIGSLPHQSIHQLS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKTQDL FVSRPKTAAGKYTTY
Subjt:  AWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTY

Query:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE
        NYS+ITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALL+EI KS G+VIK+KDYLSTVSLNVISRMVLGKKYTDESE+ IVSPDEFKKMLDE
Subjt:  NYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDE

Query:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGV+DYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
Subjt:  LFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

A0A6J1GXG6 cytochrome P450 71A1-like | 7.3e-09 | 88.89
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        +KEDLNM+EVFGLSTPKKFPLD VA+PRLPPHLYSI
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

A0A6J1IRR8 cytochrome P450 71A1-like | 9.6e-09 | 86.11
Query:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI
        +KEDLNM+E+FGLSTPKKFPLD VA+PRLPPHLYSI
Subjt:  QKEDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYSI

A0A6J1IRR8 cytochrome P450 71A1-like | 1.6e-152 | 90.91
Query:  PSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKT
        PSWVSYAAAW+A LAL  LSRRLRRR LNLPPGPKPWPLIGNL+LIGSLPHQSIH LS+KYGPIMHLRFGSFPVVVGSSV+MAKIFLKT DL FVSRPKT
Subjt:  PSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKT

Query:  AAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPD
        AAGKYTTYNYS+ITWSQYGPYWRQARKMCLMELFSA+RLDSYEYIR+EEMNALL+++ +S G VIK+KDYLSTVSLNVISRMVLGKKYTDESE+ IVSPD
Subjt:  AAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPD

Query:  EFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ
        EFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHN RR+GVKDYVA+DMVDVLLQLADDPDLEVKLERHGVKAFTQ
Subjt:  EFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQ

SwissProt top hits | e value | % identity | Alignment
A0A068Q5V6 Cytochrome P450 71AU50 | 9.4e-54 | 38.57
Query:  WV-ATLALLLLSRRL----RRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGK
        W+ AT+ LL L   L    + +K  LPPGP+ +P+ G+L+L+G  P++ +H+L++KYG IM++R G  P +V SS + A++FLKT DL F SRP     K
Subjt:  WV-ATLALLLLSRRL----RRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGK

Query:  YTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEI---CKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDE
        + ++   N+ +S+YG YWR  RKMC +EL S  +++S++ +R+EE++  ++ I     + G  + L D +S++S+++  RMVLGKKY DE  D       
Subjt:  YTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEI---CKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDE

Query:  FKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKA
        FK ++ E   L+   N+GD I +I  LDLQG+ KRMK+++K FD   E +++EH +   G +     D VDV++      + E ++ER  +KA
Subjt:  FKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKA

A0A1D6F9Y9 Trimethyltridecatetraene synthase | 8.8e-60 | 42.01
Query:  VSYAAAWVATLALLLLSR-----RLRRRKLNLPPGPKPWPLIGNLN-LIGSL-PHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVS
        +S A A    LA+ ++ R     R R + LNLPPGP+ WP+IG+L  L G+L PH+++  L+ ++GP+MHLR GS+  VV SS D A++ LK  DL F  
Subjt:  VSYAAAWVATLALLLLSR-----RLRRRKLNLPPGPKPWPLIGNLN-LIGSL-PHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVS

Query:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGK-VIKLKDYLSTVSLNVISRMVLGKKYTDESEDA
        RP+TAAG+  +Y Y  I  + YG YWR ARK+C  ELFSARR+DS+E +R +EM AL + + + +G+  + ++++++  +L  I RM +G+K++      
Subjt:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGK-VIKLKDYLSTVSLNVISRMVLGKKYTDESEDA

Query:  IVSP--DEFKKMLDELFLLSG-VLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNER-------RKGVKDYVA-KDMVDVLLQLADDPD--
          SP  + F++ LDE F ++G V N+G+ +PW+ +LDLQG V+RMK L  ++DRF E +LD+H+ +       R G  D  A  D+VDVLL+L ++ +  
Subjt:  IVSP--DEFKKMLDELFLLSG-VLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNER-------RKGVKDYVA-KDMVDVLLQLADDPD--

Query:  ----LEVKLERHGVKAFTQ
               +L R GVKAF Q
Subjt:  ----LEVKLERHGVKAFTQ

A0A1D6F9Y9 Trimethyltridecatetraene synthase | 9.5e-06 | 72.73
Query:  EDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYS
        ED++MEE+FGLST +K PL AVAEPRLP HLY+
Subjt:  EDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYS

A0A1D6HSP4 Dimethylnonatriene synthase | 9.1e-57 | 39.25
Query:  MEAPSWVSYAAAWVATLALLLLS-----RRLRRRKLNLPPGPKPWPLIGNLN-LIGSL-PHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQ
        ME  S +S A A  A + ++L S     R  R + L LPPGP+ WP++G+L  L G+L PH+++  L+ ++GP+MHLR GS+  VV SS D A++ L+T 
Subjt:  MEAPSWVSYAAAWVATLALLLLS-----RRLRRRKLNLPPGPKPWPLIGNLN-LIGSL-PHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQ

Query:  DLTFVSRPKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEI--CKSSGKVIKLKDYLSTVSLNVISRMVLGKKY
        D     RP TAAG+ T+Y Y  I  +  G YWR AR++C  ELFSARR++S++ +R +EM AL + +  C +  + + ++++++  ++  I RM +G+K+
Subjt:  DLTFVSRPKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEI--CKSSGKVIKLKDYLSTVSLNVISRMVLGKKY

Query:  TDESEDAIVSP--DEFKKMLDELFLLSG-VLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVK----DYVAKDMVDVLLQLAD--
        +        SP  + F++ LDE F  +G V N+G+ +PW+ +LD+QG+ ++MK L    D F E +L +H ERR+  +    ++VA D+VDVLLQL++  
Subjt:  TDESEDAIVSP--DEFKKMLDELFLLSG-VLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVK----DYVAKDMVDVLLQLAD--

Query:  ---DPDLEVKLERHGVKAFTQ
           + + E +L R GVKA  Q
Subjt:  ---DPDLEVKLERHGVKAFTQ

A0A1D6HSP4 Dimethylnonatriene synthase | 1.1e-04 | 72.73
Query:  EDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYS
        ED++MEE  GLST +K PL AVAEPRLP HLYS
Subjt:  EDLNMEEVFGLSTPKKFPLDAVAEPRLPPHLYS

Q50EK4 Cytochrome P450 750A1 | 5.0e-55 | 40.71
Query:  RRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNITWSQYGPYWRQ
        +R   LPPGP PWP+IGN + +    H+++  L++KYGPI+ LRFGS P VV SS + AK FLKT DL F SRP T+ GKY  YN+ +I +S YG +WR+
Subjt:  RRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNITWSQYGPYWRQ

Query:  ARKMCLMELFSARRLDSYEYIRKEEMNALLKEICK--SSGKV-IKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLSGVLNIGDS
         RK+C++EL +++R++S++++R+EE++A++  I +   SG++ + +   +ST   N++ R++  KK++D   D       F  ++ E+ +  G LNIGD 
Subjt:  ARKMCLMELFSARRLDSYEYIRKEEMNALLKEICK--SSGKV-IKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLSGVLNIGDS

Query:  IPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEH---NERRKGVKD--YVAKDMVDVLLQLADDPDLEVKLERHGVKAFT
        IP++D LDLQG  + +K  + +FD F E ++DEH   +  R G  D     KD++DVLL++A + +   K+ R  +KA T
Subjt:  IPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEH---NERRKGVKD--YVAKDMVDVLLQLADDPDLEVKLERHGVKAFT

Q9SBQ9 Flavonoid 3'-monooxygenase | 1.0e-55 | 39.79
Query:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR
        ME  S + Y   +   L  +L S   +R  L LPPGPKPWP+IGNL  +G  PHQS   ++Q YGP+M+L+ G   VVV +S  +A  FLKT D  F SR
Subjt:  MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKK-YTDESEDAI
        P  +  ++  YNY ++ ++ YGP WR  RK+C + LFS + LD + ++R++E+  L + +  +  K +KL   L+  + N ++R++LGK+ + D S D  
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKK-YTDESEDAI

Query:  VSPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADD
            EFK M+ E+ +++GV NIGD IP +++LD+QG   +MK L  +FD FL  +L+EH    KG      KD++  L+ L +D
Subjt:  VSPDEFKKMLDELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADD

Arabidopsis top hits | e value | % identity | Alignment
AT3G48280.1 cytochrome P450, family 71, subfamily A, polypeptide 25 | 3.1e-44 | 33.92
Query:  LALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNI
        + +L L ++L  +K   PP P   PLIGNL+ +G   H+S+  LS++YGP+M L  G  PV++ SS DMA+  LKT D  F +RP++   +   YN  ++
Subjt:  LALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNI

Query:  TWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLS
          + YG YWRQ + +C++ L S + + S+  +R+EE+  ++ +I KSS     +   L  ++ +VI R+ LG+KY  E+        +FKK+ D L  L 
Subjt:  TWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLS

Query:  GVLNIGDSIPWIDFLD-LQGYVKRMKALSKKFDRFLEHVLDEH--NERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFT
        G  +IG  +PW+ ++D ++G+  ++  + K  D F E V+ +H   +RR G       D++D LL++  +     ++ER  +KA T
Subjt:  GVLNIGDSIPWIDFLD-LQGYVKRMKALSKKFDRFLEHVLDEH--NERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFT

AT3G48310.1 cytochrome P450, family 71, subfamily A, polypeptide 22 | 3.7e-45 | 34.04
Query:  LALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNI
        + +L   ++ + +K N P  P   PLIGNL+ +G  PH+S+  LS +YGP+M LRFG  PV+V SS D+A+  LKT D  F SRP++   +   Y   ++
Subjt:  LALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNI

Query:  TWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLS
          + YG YWRQ + +C++ L + + + S+  +R+EE++ ++++I KSS   + L + L +++ +VISR+ LG+KY+DE+        +FK+++  L  L 
Subjt:  TWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLS

Query:  GVLNIGDSIPWIDFLD-LQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKA
        G   +G  +PW+ ++D + G   ++K      D FLE V+ +H +      D    D VDVLL++  +  +  +++R  +KA
Subjt:  GVLNIGDSIPWIDFLD-LQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKA

AT3G48320.1 cytochrome P450, family 71, subfamily A, polypeptide 21 | 1.2e-43 | 32.62
Query:  LALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNI
        + +L   ++ R +K N P  P   PLIGNL+ +G  PH+S+  LS +YGP+M L  G  PV+V SS D+A+  LKT D  F SRP++   +   Y+  ++
Subjt:  LALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSNI

Query:  TWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLS
         ++ YG YWRQ + +C++ L S + + S+  +R+EE++ ++++I KSS   + + + L +++ +VISR+ LG+KY+ E++         K+++  L +L 
Subjt:  TWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLS

Query:  GVLNIGDSIPWIDFLD-LQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKA
        G  ++G  +PW+ ++D + G   ++       D FLE V+ +H +      D    D VDVLL++  +  +  +++R  +KA
Subjt:  GVLNIGDSIPWIDFLD-LQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKA

AT5G06900.1 cytochrome P450, family 93, subfamily D, polypeptide 1 | 7.9e-48 | 36.14
Query:  TLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSN
        T+ +  ++ RLR R L LPP P   P+IG+++L+G + HQ++H+LS +YGP+M+L  GS P ++ SS +MA   LK+ +L F++RP      Y TY  ++
Subjt:  TLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTTYNYSN

Query:  ITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICK--SSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELF
           + YG +W+  +++C++ELFS+R LDS+  +R EE+  LL  + K   + + + L + L  ++ N+I+RM+  K  +D   D     +E  KM+ EL 
Subjt:  ITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICK--SSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELF

Query:  LLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAF
         L+G  N+ ++  ++  LDLQG  KR+K    K+D  +E +++EH   +K       ++M+DVLL + +D + E+KL R  +KAF
Subjt:  LLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAF

AT5G07990.1 Cytochrome P450 superfamily protein | 2.0e-51 | 38.75
Query:  VATLALLLL----SRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYT
        +AT+  L+L     RR R     LPPGP PWP+IGNL  +G+ PH+++  +   YGPI+HLR G   VVV +S  +A+ FLK  D  F SRP  +  K+ 
Subjt:  VATLALLLL----SRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYT

Query:  TYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKML
         YNY ++ ++ YG  WR  RK+  + LFSA+ L+ ++++R+EE+  L +E+ +   K + L   ++   +N + R ++G++      DA    DEF+ M+
Subjt:  TYNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKML

Query:  DELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQL
         E+  L+GV NIGD +P +D+LDLQG   +MK L K+FD FL  +L EH       +D    DM+  L+ L
Subjt:  DELFLLSGVLNIGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQL


Sequences
CDS sequence
ATGGAAGCTCCTTCATGGGTTTCTTATGCAGCTGCATGGGTCGCCACTCTCGCACTCCTCCTCCTCTCCCGCCGCCTCCGTCGTAGAAAGCTCAATCTGCCGCCGGGACC
TAAACCTTGGCCCTTGATCGGAAATCTCAACTTGATCGGCTCTCTGCCTCACCAGTCAATTCATCAACTTTCCCAAAAATATGGTCCCATCATGCATCTCCGTTTCGGCT
CCTTCCCCGTCGTCGTCGGATCCTCCGTCGACATGGCTAAGATTTTCCTCAAAACCCAGGATCTTACTTTCGTGTCCCGCCCCAAAACTGCCGCCGGAAAGTACACCACC
TACAACTATTCCAACATTACCTGGTCTCAATATGGCCCTTACTGGCGTCAAGCTCGAAAGATGTGTTTGATGGAGCTTTTCAGTGCCAGACGACTCGATTCTTATGAGTA
CATACGAAAGGAAGAAATGAATGCTTTGCTTAAAGAAATATGCAAGTCTTCTGGTAAAGTGATCAAGCTCAAAGATTACTTGTCTACTGTAAGTTTGAACGTGATAAGTC
GGATGGTGTTGGGAAAGAAGTACACGGACGAGTCGGAAGACGCCATTGTTAGTCCGGACGAGTTCAAGAAAATGTTGGACGAGCTGTTCTTGCTGAGTGGTGTGCTCAAC
ATTGGGGACTCGATACCATGGATAGATTTCTTGGATCTGCAGGGGTACGTGAAGAGGATGAAGGCACTGAGCAAGAAGTTCGATAGATTCCTTGAGCACGTATTGGATGA
ACATAATGAAAGGAGAAAGGGAGTTAAAGATTATGTGGCCAAAGATATGGTGGATGTGTTGTTGCAGTTGGCCGATGATCCTGATCTTGAAGTCAAACTTGAAAGGCATG
GAGTCAAGGCATTTACTCAGAAGGAAGATTTGAACATGGAAGAAGTGTTTGGTCTCTCAACTCCCAAGAAATTTCCACTTGATGCTGTGGCCGAGCCTCGACTCCCTCCT
CATCTTTACTCAATCATATGCTCCTGCTTCCATTACATCCTTCATCAAGCTTCTGTAGCATTCTTCATCTTCTTCACCATCGCTGTGAATTTCAGTTTCGCCCATCTCAG
TCTTATCTTCCTCAGCATCCTCCCCACGCCACCACCCGCCCCGGTCGCTTCCCCCCGAGTCTTGCCACCGGCATTCGTCGCCTCCGAAAGTAGATCCCCCGACATGTCGC
CCACCGCAACCCCGCGACGGGCTGGCCTTCTTCCCGGCCTCCATTTTCTGTTTCATAGCCTTCAGCCATCTGGTTTCTGTGCTAGTGAGAAAGAGAGAGATCAGAGAGAG
GCAACAATAACAGATAAGATAGAGATGCTGATGACTAAGAATTTGGATAGAACATGTGCAGTGCAGTTGCTGTTTGCATAA
mRNA sequence
ATGGAAGCTCCTTCATGGGTTTCTTATGCAGCTGCATGGGTCGCCACTCTCGCACTCCTCCTCCTCTCCCGCCGCCTCCGTCGTAGAAAGCTCAATCTGCCGCCGGGACC
TAAACCTTGGCCCTTGATCGGAAATCTCAACTTGATCGGCTCTCTGCCTCACCAGTCAATTCATCAACTTTCCCAAAAATATGGTCCCATCATGCATCTCCGTTTCGGCT
CCTTCCCCGTCGTCGTCGGATCCTCCGTCGACATGGCTAAGATTTTCCTCAAAACCCAGGATCTTACTTTCGTGTCCCGCCCCAAAACTGCCGCCGGAAAGTACACCACC
TACAACTATTCCAACATTACCTGGTCTCAATATGGCCCTTACTGGCGTCAAGCTCGAAAGATGTGTTTGATGGAGCTTTTCAGTGCCAGACGACTCGATTCTTATGAGTA
CATACGAAAGGAAGAAATGAATGCTTTGCTTAAAGAAATATGCAAGTCTTCTGGTAAAGTGATCAAGCTCAAAGATTACTTGTCTACTGTAAGTTTGAACGTGATAAGTC
GGATGGTGTTGGGAAAGAAGTACACGGACGAGTCGGAAGACGCCATTGTTAGTCCGGACGAGTTCAAGAAAATGTTGGACGAGCTGTTCTTGCTGAGTGGTGTGCTCAAC
ATTGGGGACTCGATACCATGGATAGATTTCTTGGATCTGCAGGGGTACGTGAAGAGGATGAAGGCACTGAGCAAGAAGTTCGATAGATTCCTTGAGCACGTATTGGATGA
ACATAATGAAAGGAGAAAGGGAGTTAAAGATTATGTGGCCAAAGATATGGTGGATGTGTTGTTGCAGTTGGCCGATGATCCTGATCTTGAAGTCAAACTTGAAAGGCATG
GAGTCAAGGCATTTACTCAGAAGGAAGATTTGAACATGGAAGAAGTGTTTGGTCTCTCAACTCCCAAGAAATTTCCACTTGATGCTGTGGCCGAGCCTCGACTCCCTCCT
CATCTTTACTCAATCATATGCTCCTGCTTCCATTACATCCTTCATCAAGCTTCTGTAGCATTCTTCATCTTCTTCACCATCGCTGTGAATTTCAGTTTCGCCCATCTCAG
TCTTATCTTCCTCAGCATCCTCCCCACGCCACCACCCGCCCCGGTCGCTTCCCCCCGAGTCTTGCCACCGGCATTCGTCGCCTCCGAAAGTAGATCCCCCGACATGTCGC
CCACCGCAACCCCGCGACGGGCTGGCCTTCTTCCCGGCCTCCATTTTCTGTTTCATAGCCTTCAGCCATCTGGTTTCTGTGCTAGTGAGAAAGAGAGAGATCAGAGAGAG
GCAACAATAACAGATAAGATAGAGATGCTGATGACTAAGAATTTGGATAGAACATGTGCAGTGCAGTTGCTGTTTGCATAA
Protein sequence
MEAPSWVSYAAAWVATLALLLLSRRLRRRKLNLPPGPKPWPLIGNLNLIGSLPHQSIHQLSQKYGPIMHLRFGSFPVVVGSSVDMAKIFLKTQDLTFVSRPKTAAGKYTT
YNYSNITWSQYGPYWRQARKMCLMELFSARRLDSYEYIRKEEMNALLKEICKSSGKVIKLKDYLSTVSLNVISRMVLGKKYTDESEDAIVSPDEFKKMLDELFLLSGVLN
IGDSIPWIDFLDLQGYVKRMKALSKKFDRFLEHVLDEHNERRKGVKDYVAKDMVDVLLQLADDPDLEVKLERHGVKAFTQKEDLNMEEVFGLSTPKKFPLDAVAEPRLPP
HLYSIICSCFHYILHQASVAFFIFFTIAVNFSFAHLSLIFLSILPTPPPAPVASPRVLPPAFVASESRSPDMSPTATPRRAGLLPGLHFLFHSLQPSGFCASEKERDQRE
ATITDKIEMLMTKNLDRTCAVQLLFA
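
The protein sequence should correspond to the translation of the CDS sequence listed above it. The following is a minimal sketch of that check, assuming the standard genetic code (the record does not state which translation table was used); `translate` is a hypothetical helper, and the two short fragments are taken from the start and end of the CDS in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence, reading frame 1, codon by codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    With to_stop=True, translation ends at the first stop codon.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGGAAGCTCCTTCATGGGTT"))  # MEAPSWV
    print(translate("TTGCTGTTTGCATAA"))        # LLFA
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, joined into one string, is a quick consistency check on the record.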