CuGenDBv2

Sgr025546 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025546
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: cytochrome P450 71A1-like
Genome location: tig00007935:943569..955730
RNA-Seq Expression: Sgr025546
Synteny: Sgr025546
Gene Ontology terms:
GO:0015969 - guanosine tetraphosphate metabolic process (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
GO:0020037 - heme binding (molecular function)
GO:0016301 - kinase activity (molecular function)
GO:0008728 - GTP diphosphokinase activity (molecular function)
GO:0005525 - GTP binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0005506 - iron ion binding (molecular function)
GO:0004497 - monooxygenase activity (molecular function)
GO:0003723 - RNA binding (molecular function)
InterPro domains:
IPR043519 - Nucleotidyltransferase superfamily
IPR036396 - Cytochrome P450 superfamily
IPR017972 - Cytochrome P450, conserved site
IPR006674 - HD domain
IPR003607 - HD/PDEase domain
IPR002401 - Cytochrome P450, E-class, group I
IPR001128 - Cytochrome P450
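
The genome location of this record is written in a `contig:start..end` form. As a minimal sketch of how such a string might be split into its parts, the Python below parses it and reports the span length. The names `GenomeLocation` and `parse_location` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Contig name (anything without a colon), then two integers separated by '..'.
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'tig00007935:943569..955730'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return GenomeLocation(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00007935:943569..955730")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for this gene is 12162 bp; a 0-based or half-open convention would shift that figure by one.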


Homology
GenBank top hits (e value, % identity, alignment)
KAA0053469.1 cytochrome P450 71A1-like [Cucumis melo var. makuwa] | e value: 9.3e-202 | % identity: 79.11
Query:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR
        ME  +     AWI   L +LLLS R+RRRKLNLPPGPKPWP IGNL+LIGSLPHQSIHQLSKKYGPIMHL FGSFPVVVGSSVEMAKIFLKT DL F  R
Subjt:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI
        PKTAAGKYTTYNYS+ITWSQYGPYWRQARKM L+ELF+A+RLDSYEYIRKEEM ALL+EI+KS G+ IK+K YLS +S+NVISRM LGKKY  DES++ I
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI

Query:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALT
        +SPDEFK M+DELFLLSGVLNIGD IPWIDFLDLQGYVKRMKA SKKFDRFLEH+LDEHNERRKGV+DYV KDMVDVLLQLADDP+LEVK+ERHGVKA T
Subjt:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALT

Query:  ---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNV
                    +     +SELLKKPEIFNKA EELDRVIGRERWVEEKDI+NLPYIDAIAKETMRLHPV PMLVPR+ REDCQ+AG+DI KGT VFVNV
Subjt:  ---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNV

Query:  WAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI
        W IGRDP VWENP +F PERF+GK IDVKGQ+FELLPFGSGRRMCPG ++
Subjt:  WAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI

KAA0058526.1 putative GTP diphosphokinase RSH2 [Cucumis melo var. makuwa] | e value: 8.1e-206 | % identity: 79.23
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF-------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW
        PPSTICSSPHPCQ+NSH S DLEFTSRSSSLASSTA+SSQKP+VGGLSSLF       SSSS+SISSGGDELGSFRHDKG+ELKE SSSFRYSP   F+ 
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF-------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW

Query:  QLWGR--------------------VGGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN
          + R                         P +   + +SGD SFHGRGSTNRLF+GF RNALGSCVD DSPRLEV SDGLDVGSSALF DELTFNMEDN
Subjt:  QLWGR--------------------VGGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN

Query:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA
        ITE NSES+AKDLLLSAQSKH+IFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDT+DDSFV+HDYILG FGA VA
Subjt:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA

Query:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH
        DLVEGVSKLSHLSKLAREHD A+RMVEADRLHTMFLAMADARAVL+KLADRLHNMMTLDALP IKR RFAKETMEIFVPL NRLGIY+WKEQLEN+CF H
Subjt:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH

Query:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLKYAFCLPPSISLSFSLYLCRLLFTESCVQKSDF
        LNLEQH+DLSSKL+GLYDEAII SA +KLERAL+DKG SYH VTGRHKSVYSIHRKMLKYAF      ++SFSL+LC+LLF+ +CVQ  DF
Subjt:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLKYAFCLPPSISLSFSLYLCRLLFTESCVQKSDF

XP_022152553.1 cytochrome P450 71A1-like [Momordica charantia] | e value: 1.2e-204 | % identity: 79.38
Query:  METP-WVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAY
        ME P WV  AAAW+   + LLLLS RLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLS+KYG IMHL FGSFPVVVGSSVEMAKIFLKTHDLTF  
Subjt:  METP-WVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAY

Query:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDS
        RPKTAAGKYTTYNYS+ITWSQYGPYWRQARKM L+ELF+AKRLDSYEYIR+EEM ALL++I++S G++I++K YLS +S+NVISRM LGKKY  DES+++
Subjt:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDS

Query:  IISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKAL
        I+SPDEFK M+DELFLLSGVLNIGD IPWIDFLDLQGY+KRMKA SKKFDRFLEH+LDEHNERRKG+KDYV KDMVDVLLQLADDP+LEVK+ERHGVKA 
Subjt:  IISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKAL

Query:  T---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVN
        T            +     +SELLKKPEIF KATEELDRVIG+ERWVEEKDI NLPYIDAIAKETMRLHPV PMLVPR+ REDC++AG+DIAK T + VN
Subjt:  T---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVN

Query:  VWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI
        VW IGRDP VWENPN+FNPERFIGK IDVKGQ+FELLPFGSGRRMCPG ++
Subjt:  VWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI

XP_022152608.1 probable GTP diphosphokinase RSH2, chloroplastic [Momordica charantia] | e value: 1.7e-203 | % identity: 84.31
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLFS-------SSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW
        PPSTICSSPHPC MNSH SYDLEFTSRSSSL SSTASSSQKP++GGLSSLFS       SSS+SISSGGDELGSFRHDKGEELKELSSSFRYSP S F+ 
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLFS-------SSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW

Query:  QLWGR------------------VGG--ENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN
          + R                  VG     P +   + +SGDASFHGRGS+NRLFNGFVRNALGSCVD DSPRLEV SD LDVGSSAL VDELTFNMEDN
Subjt:  QLWGR------------------VGG--ENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN

Query:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA
        I E NSES+AKDLLLSAQSKHKIFCDEFV+KAFFEAEKAHRGQMRASGDPY EHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILG FGAGVA
Subjt:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA

Query:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH
        DLVEGVSKLSHLSKLAREHD ADRMVEADRLHTMFLAMADARAVLIKLAD LHNMMTLDALP+IKR RFAKETMEIFVPL NRLGIYSWKEQLENLCF H
Subjt:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH

Query:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
        LNLEQHKDLSSKLMGLYDEAIIYSAI+KLERALEDKGISYHVVTGRHKSVYS+H KMLK
Subjt:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

XP_038876425.1 trimethyltridecatetraene synthase-like [Benincasa hispida] | e value: 5.3e-205 | % identity: 79.3
Query:  PPLPPITMETP-WV-CCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLK
        PPL  + ++ P W+   AAAWI   L LLLLS RLRRRKLNLPPGPKPWP IGNLNLIGSLPHQSIH+LSKKYGPIMHL FGSFPVVVGSSVEMAKIFLK
Subjt:  PPLPPITMETP-WV-CCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLK

Query:  THDLTFAYRPKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKY
        THDLTF  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKM L+ELF+A+RLDSYEYIRKEEM ALL++I+KS G+ IKLK YLS +S+NVISRM LGKKY
Subjt:  THDLTFAYRPKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKY

Query:  LLDESKDSIISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKV
          DES+++I+SPDEFK M+DELFLLSGVLNIGD IPWIDFLDLQGYVKRMKA SKKFDRFLEH+LDEHNERRKGV++YV KDMVDVLLQLADDP+LEVK+
Subjt:  LLDESKDSIISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKV

Query:  ERHGVKALT---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIA
        ERHGVKA T            +     +SELLKKPEI NKA EELDRVIG+ERWVEEKD+VNLPYIDAIAKETMRLHPV PMLVPR+ REDCQ+AG+DIA
Subjt:  ERHGVKALT---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIA

Query:  KGTGVFVNVWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI
        KGT V VNVW IGRDP VWENP +FNPERF+GK IDVKGQ+FELLPFGSGRRMCPG ++
Subjt:  KGTGVFVNVWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CCA6 cytochrome P450 71A1-like | e value: 1.0e-201 | % identity: 78.89
Query:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR
        ME  +     AWI   L +LLLS R+RRRKLNLPPGPKPWP IGNL+LIGSLPHQSIHQLSKKYGPIMHL FGSFPVVVGSSVEMAKIFLKT DL F  R
Subjt:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI
        PKTAAGKYTTYNYS+ITWSQYGPYWRQARKM L+ELF+A+RLDSYEYIRKEEM ALL+EI+KS G+ IK+K YLS +S+NVISRM LGKKY  DES++ I
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI

Query:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALT
        +SPDEFK M+DELFLLSGVLNIGD IPWIDFLDLQGYVKRMKA SKKFDRFLEH+LDEHNERRKGV+DYV KDMVDVLLQLADDP+LEVK+ERHGVKA T
Subjt:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALT

Query:  ---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNV
                    +     +SELLKKPEIFNKA EELDRVIG+ERWVEEKDI+NLPYIDAIAKETMRLHPV PMLVPR+ REDCQ+AG+DI KGT VFVNV
Subjt:  ---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNV

Query:  WAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI
        W IGRDP VWENP +F PERF+GK IDVKGQ+FELLPFGSGRRMCPG ++
Subjt:  WAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI

A0A5A7UWH8 GTP diphosphokinase | e value: 3.9e-206 | % identity: 79.23
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF-------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW
        PPSTICSSPHPCQ+NSH S DLEFTSRSSSLASSTA+SSQKP+VGGLSSLF       SSSS+SISSGGDELGSFRHDKG+ELKE SSSFRYSP   F+ 
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF-------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW

Query:  QLWGR--------------------VGGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN
          + R                         P +   + +SGD SFHGRGSTNRLF+GF RNALGSCVD DSPRLEV SDGLDVGSSALF DELTFNMEDN
Subjt:  QLWGR--------------------VGGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN

Query:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA
        ITE NSES+AKDLLLSAQSKH+IFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDT+DDSFV+HDYILG FGA VA
Subjt:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA

Query:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH
        DLVEGVSKLSHLSKLAREHD A+RMVEADRLHTMFLAMADARAVL+KLADRLHNMMTLDALP IKR RFAKETMEIFVPL NRLGIY+WKEQLEN+CF H
Subjt:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH

Query:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLKYAFCLPPSISLSFSLYLCRLLFTESCVQKSDF
        LNLEQH+DLSSKL+GLYDEAII SA +KLERAL+DKG SYH VTGRHKSVYSIHRKMLKYAF      ++SFSL+LC+LLF+ +CVQ  DF
Subjt:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLKYAFCLPPSISLSFSLYLCRLLFTESCVQKSDF

A0A5D3D8H9 Cytochrome P450 71A1-like | e value: 4.5e-202 | % identity: 79.11
Query:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR
        ME  +     AWI   L +LLLS R+RRRKLNLPPGPKPWP IGNL+LIGSLPHQSIHQLSKKYGPIMHL FGSFPVVVGSSVEMAKIFLKT DL F  R
Subjt:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI
        PKTAAGKYTTYNYS+ITWSQYGPYWRQARKM L+ELF+A+RLDSYEYIRKEEM ALL+EI+KS G+ IK+K YLS +S+NVISRM LGKKY  DES++ I
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI

Query:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALT
        +SPDEFK M+DELFLLSGVLNIGD IPWIDFLDLQGYVKRMKA SKKFDRFLEH+LDEHNERRKGV+DYV KDMVDVLLQLADDP+LEVK+ERHGVKA T
Subjt:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALT

Query:  ---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNV
                    +     +SELLKKPEIFNKA EELDRVIGRERWVEEKDI+NLPYIDAIAKETMRLHPV PMLVPR+ REDCQ+AG+DI KGT VFVNV
Subjt:  ---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNV

Query:  WAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI
        W IGRDP VWENP +F PERF+GK IDVKGQ+FELLPFGSGRRMCPG ++
Subjt:  WAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI

A0A6J1DF57 cytochrome P450 71A1-like | e value: 5.7e-205 | % identity: 79.38
Query:  METP-WVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAY
        ME P WV  AAAW+   + LLLLS RLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLS+KYG IMHL FGSFPVVVGSSVEMAKIFLKTHDLTF  
Subjt:  METP-WVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAY

Query:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDS
        RPKTAAGKYTTYNYS+ITWSQYGPYWRQARKM L+ELF+AKRLDSYEYIR+EEM ALL++I++S G++I++K YLS +S+NVISRM LGKKY  DES+++
Subjt:  RPKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDS

Query:  IISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKAL
        I+SPDEFK M+DELFLLSGVLNIGD IPWIDFLDLQGY+KRMKA SKKFDRFLEH+LDEHNERRKG+KDYV KDMVDVLLQLADDP+LEVK+ERHGVKA 
Subjt:  IISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKAL

Query:  T---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVN
        T            +     +SELLKKPEIF KATEELDRVIG+ERWVEEKDI NLPYIDAIAKETMRLHPV PMLVPR+ REDC++AG+DIAK T + VN
Subjt:  T---------LLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVN

Query:  VWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI
        VW IGRDP VWENPN+FNPERFIGK IDVKGQ+FELLPFGSGRRMCPG ++
Subjt:  VWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTI

A0A6J1DFB1 GTP diphosphokinase | e value: 8.2e-204 | % identity: 84.31
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLFS-------SSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW
        PPSTICSSPHPC MNSH SYDLEFTSRSSSL SSTASSSQKP++GGLSSLFS       SSS+SISSGGDELGSFRHDKGEELKELSSSFRYSP S F+ 
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLFS-------SSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPRSGFLW

Query:  QLWGR------------------VGG--ENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN
          + R                  VG     P +   + +SGDASFHGRGS+NRLFNGFVRNALGSCVD DSPRLEV SD LDVGSSAL VDELTFNMEDN
Subjt:  QLWGR------------------VGG--ENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDN

Query:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA
        I E NSES+AKDLLLSAQSKHKIFCDEFV+KAFFEAEKAHRGQMRASGDPY EHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILG FGAGVA
Subjt:  ITESNSESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVA

Query:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH
        DLVEGVSKLSHLSKLAREHD ADRMVEADRLHTMFLAMADARAVLIKLAD LHNMMTLDALP+IKR RFAKETMEIFVPL NRLGIYSWKEQLENLCF H
Subjt:  DLVEGVSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMH

Query:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
        LNLEQHKDLSSKLMGLYDEAIIYSAI+KLERALEDKGISYHVVTGRHKSVYS+H KMLK
Subjt:  LNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

SwissProt top hits (e value, % identity, alignment)
Q7XAP4 Probable GTP diphosphokinase RSH2, chloroplastic | e value: 4.0e-115 | % identity: 53.74
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTA------SSSQKPIVGGLSSLFSSSSSS--ISSGGDELGSFRHDKGEELKEL-----SSSFRY-S
        P   + +SP      S  S +LE +SR S+  ++ A      S   + I GGLS LFSS +++   ++  DELG+  HD+  E   +        + Y  
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTA------SSSQKPIVGGLSSLFSSSSSS--ISSGGDELGSFRHDKGEELKEL-----SSSFRY-S

Query:  PRSGFLWQ---LWGRVGGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSA---LFVDELTFNMEDNITES--N
        P S F W+       V   +        +S  AS+       RLF+ FVRNALGSCVD        P   L +G SA   +   EL F ++++++E+  +
Subjt:  PRSGFLWQ---LWGRVGGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSA---LFVDELTFNMEDNITES--N

Query:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG
         E +A+DLL  AQ++H+IF DE VVKAFFEAE+AHRGQ RASGDPYL+HCVETAV+LA +GAN+TVV+AGLLHDT+DDSF+ +D I   FGAGVADLVEG
Subjt:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG

Query:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ
        VSKLSHLSKLAR+++ A R VEADRLHTMFLAMADARAVLIKLADRLHNM T++ALP++K+ RFAKETMEIFVPL NRLGI SWK+QLEN+CF HLN E+
Subjt:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ

Query:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
        HK+LSSKL+  +DEA++ S + KL++ L D+GISYH ++GRHKS+YSI+ KM+K
Subjt:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

Q9LVJ3 Probable GTP diphosphokinase RSH2, chloroplastic | e value: 6.6e-126 | % identity: 60.13
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKEL--SSSFRYSP---
        PPS++CS+PH        S DL+ TSRSSS +SS ASS QKPIVGGLSSLF        SSSS S S+G DE  S R+D+ ++LK+L  SSSF YSP   
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKEL--SSSFRYSP---

Query:  -------RSGFLWQLWGRVGGE-NPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNME-DNITESN
                   +  L G V    +P +   + ++ D SF  R   + LFNGFVR ALGSCVD             + GS ++ VDELTF ME D I    
Subjt:  -------RSGFLWQLWGRVGGE-NPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNME-DNITESN

Query:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG
         + +A+DLL  AQ +HKIF DE V+KAF+EAEKAHRGQMRAS DPYL+HCVETA++LA +GANSTVV AGLLHDT+DDSF+S+DYIL  FGAGVADLVEG
Subjt:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG

Query:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ
        VSKLS LSKLARE++ A + VEADRLHTMFLAMADARAVLIKLADRLHNM TL AL  +K+ RFAKET+EIF PL NRLGI +WK QLENLCF HL   Q
Subjt:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ

Query:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
        H ++S+ L   +DEA+I SAI+KLE+AL+  GISYHV+ GRHKS+YSI+ KMLK
Subjt:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

Q9M5P5 Probable GTP diphosphokinase RSH3, chloroplastic | e value: 3.1e-123 | % identity: 58.28
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPR----
        P ST+CS+ H  Q+N+H S DL+  SRSSS +SST+S    P +GGLS LF        SSSSSS  S G+EL S RHD+ E+ + LS SF YSP     
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPR----

Query:  SGFLWQ--------LWGRV-GGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDNITESNS
        S +L +        L G +  G +P +   + ++ D     R  ++RLFNGFVR A+GSCVD D+              S L  ++L F M+D       
Subjt:  SGFLWQ--------LWGRV-GGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDNITESNS

Query:  ESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEGV
        + +A+DLL  AQ KHKIF DE V+KAF+EAEKAHRGQMRA+GDPYL+HCVETA++LA +GANSTVV AG+LHDTLDDSF+S+DYIL  FG+GVADLVEGV
Subjt:  ESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEGV

Query:  SKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQH
        S+   LSKLARE++ A + VEADRLHTMFLAMADARAVLIKLADRLHNMMTL ALP +KR RFAKET+EIF PL NRLGI SWK +LENLCF HL+ +QH
Subjt:  SKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQH

Query:  KDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
         ++S  L   +DEA+I SAI+KLE+AL+ +GISYHVV+GRHKS+YSI+ KMLK
Subjt:  KDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

Q9M5P6 Probable GTP diphosphokinase RSH2, chloroplastic | e value: 1.4e-123 | % identity: 59.47
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKEL--SSSFRYSP---
        PPS++CS+PH        S DL+ TSRSSS +SS ASS QKPIVGGLSSLF        SSSS S S+  DE  S R+D+ ++LK+L  SSSF YSP   
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKEL--SSSFRYSP---

Query:  -------RSGFLWQLWGRVGGE-NPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNME-DNITESN
                   +  L G V    +P +   + ++ D SF  R   +RLFNGFVR ALGSCVD             ++GS +  VDELTF ME D I    
Subjt:  -------RSGFLWQLWGRVGGE-NPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNME-DNITESN

Query:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG
         + +A+DLL  AQ +HKIF DE V+KAF+EAEKAHRGQMRAS DPYL+HCVETA++LA +GANSTVV AGLLHDT+DDSF+S+DYIL  FGAGVADLVEG
Subjt:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG

Query:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ
        VSKLS LSKLARE++ A + VEADRLH MFLAMADARAVLIKLADRLHNM TL AL  +K+ RFAKET+EIF PL N LGI +WK QLENLCF HL   Q
Subjt:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ

Query:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
        H ++S+ L   +DEA+I SAI+KL++AL+  GISYHV+ GRHKS+YSI+ KMLK
Subjt:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

Q9SYH1 Probable GTP diphosphokinase RSH3, chloroplastic | e value: 3.0e-126 | % identity: 58.94
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPR----
        P ST+CS+ H  Q+N+H S DL+  SRSSS +SST+S    P +GGLS LF        SSSSSS  S G+EL S RHD+ E+ + LS SF YSP     
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPR----

Query:  SGFLWQ--------LWGRV-GGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDNITESNS
        S +L +        L G +  G +P +   + ++ D     R  ++RLFNGFVR A+GSCVD D+              S L  ++L F M+D       
Subjt:  SGFLWQ--------LWGRV-GGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDNITESNS

Query:  ESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEGV
        + +A+DLL  AQ KHKIF DE V+KAF+EAEKAHRGQMRA+GDPYL+HCVETA++LA +GANSTVV AG+LHDTLDDSF+S+DYIL  FG+GVADLVEGV
Subjt:  ESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEGV

Query:  SKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQH
        SKLS LSKLARE++ A + VEADRLHTMFLAMADARAVLIKLADRLHNMMTL ALP +KR RFAKET+EIF PL NRLGI SWK +LENLCF HL+ +QH
Subjt:  SKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQH

Query:  KDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
         ++S  L   +DEA+I SAI+KLE+AL+ +GISYHVV+GRHKS+YSI+ KMLK
Subjt:  KDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

Arabidopsis top hits (e value, % identity, alignment)
AT1G13080.1 cytochrome P450, family 71, subfamily B, polypeptide 2 | e value: 2.4e-78 | % identity: 36.84
Query:  ITILLLLLLLSL----------RLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYRPK
        +TILL   L+SL          + +  K NLPP P   P IGNL+ +  LPH+  H+LS KYGP++ L  GS PVVV SS E A+  LKT+DL    RPK
Subjt:  ITILLLLLLLSL----------RLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYRPK

Query:  TAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKS--IKLKGYLSALSMNVISRMALGKKYLLDESKDSI
        T      +Y + +IT++ YG YWR+ RK+ ++ELF++K++ S+ YIR+EE+  +++++ +SA K   + L     +L+ ++I R+ALG+ +        +
Subjt:  TAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKS--IKLKGYLSALSMNVISRMALGKKYLLDESKDSI

Query:  ISPDEFKNMVDELFLLSGVLNIGDLIP-----WIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLAD--DPNLEVKVER
        I  D  + +V E     G     D  P     ++D+L  Q + K  K F K+ D F +H++D+H  + +G K+   +D+V ++L + D  + +   K+  
Subjt:  ISPDEFKNMVDELFLLSGVLNIGDLIP-----WIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLAD--DPNLEVKVER

Query:  HGVKALT----LLNIDTRV-----GVSELLKKPEIFNKATEELDRVIG-RERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAK
          +KA+     L  IDT        ++EL++ P +  KA E +   +G ++  + E+D+  + Y++ I KET RLHP +P +VPR      ++ G+DI  
Subjt:  HGVKALT----LLNIDTRV-----GVSELLKKPEIFNKATEELDRVIG-RERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAK

Query:  GTGVFVNVWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGL
         T + +NVW IGRDP  W +P +FNPERF    +D +GQ+F+LLPFGSGRR+CPG+
Subjt:  GTGVFVNVWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGL

AT1G54130.1 RELA/SPOT homolog 3 | e value: 2.1e-127 | % identity: 58.94
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPR----
        P ST+CS+ H  Q+N+H S DL+  SRSSS +SST+S    P +GGLS LF        SSSSSS  S G+EL S RHD+ E+ + LS SF YSP     
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKELSSSFRYSPR----

Query:  SGFLWQ--------LWGRV-GGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDNITESNS
        S +L +        L G +  G +P +   + ++ D     R  ++RLFNGFVR A+GSCVD D+              S L  ++L F M+D       
Subjt:  SGFLWQ--------LWGRV-GGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDNITESNS

Query:  ESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEGV
        + +A+DLL  AQ KHKIF DE V+KAF+EAEKAHRGQMRA+GDPYL+HCVETA++LA +GANSTVV AG+LHDTLDDSF+S+DYIL  FG+GVADLVEGV
Subjt:  ESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEGV

Query:  SKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQH
        SKLS LSKLARE++ A + VEADRLHTMFLAMADARAVLIKLADRLHNMMTL ALP +KR RFAKET+EIF PL NRLGI SWK +LENLCF HL+ +QH
Subjt:  SKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQH

Query:  KDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
         ++S  L   +DEA+I SAI+KLE+AL+ +GISYHVV+GRHKS+YSI+ KMLK
Subjt:  KDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

AT3G14050.1 RELA/SPOT homolog 2 | e value: 4.7e-127 | % identity: 60.13
Query:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKEL--SSSFRYSP---
        PPS++CS+PH        S DL+ TSRSSS +SS ASS QKPIVGGLSSLF        SSSS S S+G DE  S R+D+ ++LK+L  SSSF YSP   
Subjt:  PPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLF--------SSSSSSISSGGDELGSFRHDKGEELKEL--SSSFRYSP---

Query:  -------RSGFLWQLWGRVGGE-NPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNME-DNITESN
                   +  L G V    +P +   + ++ D SF  R   + LFNGFVR ALGSCVD             + GS ++ VDELTF ME D I    
Subjt:  -------RSGFLWQLWGRVGGE-NPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNME-DNITESN

Query:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG
         + +A+DLL  AQ +HKIF DE V+KAF+EAEKAHRGQMRAS DPYL+HCVETA++LA +GANSTVV AGLLHDT+DDSF+S+DYIL  FGAGVADLVEG
Subjt:  SESFAKDLLLSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEG

Query:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ
        VSKLS LSKLARE++ A + VEADRLHTMFLAMADARAVLIKLADRLHNM TL AL  +K+ RFAKET+EIF PL NRLGI +WK QLENLCF HL   Q
Subjt:  VSKLSHLSKLAREHDMADRMVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQ

Query:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK
        H ++S+ L   +DEA+I SAI+KLE+AL+  GISYHV+ GRHKS+YSI+ KMLK
Subjt:  HKDLSSKLMGLYDEAIIYSAIKKLERALEDKGISYHVVTGRHKSVYSIHRKMLK

AT5G06900.1 cytochrome P450, family 93, subfamily D, polypeptide 1 | e value: 3.3e-80 | % identity: 39.91
Query:  ILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYRPKTAAGKYTTYNYS
        I +L+  ++ RLR R L LPP P   P IG+++L+G + HQ++H+LS +YGP+M+L+ GS P ++ SS EMA   LK+++L F  RP      Y TY  +
Subjt:  ILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYRPKTAAGKYTTYNYS

Query:  NITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFK--SAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSIISPDEFKNMVDE
        +   + YG +W+  +++ +VELF+++ LDS+  +R EE+  LL  + K   A +S+ L   L  L+ N+I+RM   K   +    D     +E   MV E
Subjt:  NITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFK--SAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSIISPDEFKNMVDE

Query:  LFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALTLLNI-----DTR
        L  L+G  N+ +   ++  LDLQG  KR+K    K+D  +E I++EH   +K       ++M+DVLL + +D N E+K+ R  +KA  ++NI     DT 
Subjt:  LFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALTLLNI-----DTR

Query:  V-----GVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNVWAIGRDPVVWE
               ++EL+  PEI  KA +E+++V+G +R VEE D+ NL Y  A+ KETMRLHP  P+ V R   E+C VAG  I   T V VNVWAIGRD   WE
Subjt:  V-----GVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNVWAIGRDPVVWE

Query:  NPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPG
        +P +F PERF G   + K  + +++ FG+GRR CPG
Subjt:  NPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPG

AT5G07990.1 Cytochrome P450 superfamily protein | e value: 2.1e-90 | % identity: 39.61
Query:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR
        M T ++    A +  L+L +    R R     LPPGP PWP IGNL  +G+ PH+++  +   YGPI+HL  G   VVV +S  +A+ FLK HD  FA R
Subjt:  METPWVCCAAAWITILLLLLLLSLRLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYR

Query:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI
        P  +  K+  YNY ++ ++ YG  WR  RK+  V LF+AK L+ ++++R+EE+  L +E+ +   K + L   ++   +N + R  +G++       D+ 
Subjt:  PKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLVELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSI

Query:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQL--ADDPNLEVKVERHGVKA
           DEF++MV E+  L+GV NIGD +P +D+LDLQG   +MK   K+FD FL  IL EH       +D    DM+  L+ L   D       +    +KA
Subjt:  ISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAFSKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQL--ADDPNLEVKVERHGVKA

Query:  LTLLNI-----DTRV-----GVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVF
        L LLN+     DT        ++EL++ P+I  KA EELD V+GR+R V E DI  LPY+ A+ KE  RLHP  P+ +P +  E C++ G+ I KG+ + 
Subjt:  LTLLNI-----DTRV-----GVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKETMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVF

Query:  VNVWAIGRDPVVWENPNDFNPERFI----GKCIDVKGQNFELLPFGSGRRMCPGLTI
         N+WAI RDP  W +P  F PERF+       +DVKG +FEL+PFG+GRR+C GL++
Subjt:  VNVWAIGRDPVVWENPNDFNPERFI----GKCIDVKGQNFELLPFGSGRRMCPGLTI


Sequences
CDS sequence
ATGCCTTTCAAGTTTGACTTCAAGATCAGGATCATCGGCCAACTGCAACAACACATCCACCATATCTTTGGCCACATAATCTTTAACTCCCTTTCTCCTTTCATTATGTT
CATCCAATACGTGCTCAAGGAATCTATCGAACTTCTTGCTCAGTGCCTTCATCCTCTTCACGTACCCCTGCAGATCCAAGAAATCTATCCATGGTATCGAGTCCCCAATG
TTGAGCACACCACCCAGCAAGAACAGCTCGTCCAACATTTTCTTGAACTCGTCTGGACTAACAATGGCGTCTTCCGACTCGTCCGTGTACTTCTTTCCCAACACCATCCG
ACTTATCACGTTCAAACTTACGGTAGACAAAGAGCCGATCAAGTTGAGATTTCCGATCAAGGGCCAAGGTTTAGGTCCCGGCGGCAGATTGAGCTTTCTACGACGGAGGC
GGCGGGAGAGGAGGAGGAGGGCGAGAGTGGCGACCCATGCAGCTGCATAAGAAACCCATGGAGGAGCTTCCATTATAATCCCCAATGGCGGTGGCAAAACTTGACTGTAA
TTGATGTCAATTTCCAACCACCGCTGCCGCCGATAACAATGGAAACTCCTTGGGTTTGTTGCGCGGCTGCATGGATAACCATCCTCCTCCTCCTCCTCCTTCTCTCCCTT
CGTCTCCGGCGTCGTAAACTCAATCTACCGCCTGGACCTAAGCCCTGGCCCTTCATTGGAAACCTCAACCTGATCGGTTCTCTACCGCACCAGTCCATTCATCAACTCTC
CAAAAAATATGGCCCCATCATGCACCTCTATTTCGGCTCCTTCCCAGTCGTCGTCGGATCCTCCGTCGAGATGGCCAAAATCTTCCTCAAAACCCATGATCTTACTTTCG
CATACCGCCCCAAAACCGCCGCCGGAAAGTACACCACCTACAACTATTCCAACATTACGTGGTCCCAATACGGCCCTTACTGGCGTCAAGCTCGTAAAATGTTTCTCGTG
GAGCTTTTCAACGCCAAACGACTCGATTCTTATGAGTACATACGCAAGGAAGAAATGATTGCTTTGCTTCAAGAAATATTCAAGTCCGCCGGCAAATCGATCAAGCTCAA
AGGTTACTTGTCCGCATTGAGTATGAACGTGATAAGTCGGATGGCCTTGGGGAAGAAGTACTTGTTGGACGAGTCCAAAGACTCCATTATTAGTCCAGATGAGTTCAAAA
ACATGGTGGACGAGTTGTTCTTGCTAAGTGGTGTGCTTAACATAGGAGATTTGATACCATGGATAGATTTCTTGGATTTGCAGGGCTACGTGAAGAGGATGAAGGCATTC
AGCAAGAAATTCGATAGATTCCTTGAGCATATACTGGATGAACATAATGAAAGGAGAAAGGGAGTTAAGGATTATGTGCCCAAAGATATGGTGGATGTTTTGTTGCAGTT
GGCCGATGATCCTAATCTTGAAGTCAAAGTTGAAAGACATGGAGTCAAGGCACTTACTCTGCTCAACATTGACACTAGAGTGGGCGTGTCAGAGCTTTTGAAAAAGCCAG
AGATTTTCAACAAGGCAACGGAAGAGCTTGACAGAGTGATTGGAAGGGAAAGATGGGTGGAAGAGAAAGACATTGTAAATTTGCCTTACATTGATGCAATTGCAAAAGAG
ACGATGAGATTGCACCCTGTAGTACCAATGCTGGTGCCTAGATTGTGTAGGGAGGATTGCCAAGTTGCAGGCCACGACATAGCCAAAGGCACTGGAGTGTTTGTGAATGT
GTGGGCGATTGGGAGAGACCCTGTAGTGTGGGAAAATCCAAATGATTTTAATCCAGAAAGGTTCATTGGAAAATGCATCGACGTGAAAGGCCAGAACTTTGAGCTTTTGC
CTTTTGGATCGGGAAGGAGGATGTGCCCTGGCCTCACCATCTATAAAGAGGATCTGCTCCAACGACGACAGGGAAATGGGGCCATATTTTTTGGAGAGTTATTGATGAAT
GGAGAGTGGTGGGGCAGAGAAGCGATCAAGTTGAGATTTCCGATCAAGGGCCAAGGCTTTAGGTCCCGGCGGCAGACTGAGCTTTATATGACGGAGGCGACGGGAGAAGA
GGAGGAGGGCGAGAGTGGCGACCCATTATGTGGAAACAGCAGTAGCACCTTGATGATTCTTTTGGAGCTGGGGTTGCTGAGATTTGGTTCAAGGGGTAAAAATACTGCAA
TAGCTGCTATACTTTTAACGCCAAGCAATATTGAGAACACAGAGATGCCTAGAAAATTACAGAAGGAACTCTCGCTTCGTTCTGGTTCTCACTTTCTCGTCCAAGCATTG
TTTGTTTTTACTGGGAAATTGTGTTGGTTTCTTGAGACAATTGGTGGGGCTTCGAATAAAAAAGGCCATGGCTGTGCCGACCATAGCTCTGTATACGAGCCACCGAGCAC
TATCTGCTCCTCACCGCACCCTTGCCAGATGAATTCTCATGGATCATATGATTTAGAATTTACTTCTCGGTCCTCGTCGTTGGCGTCTTCGACGGCCTCTTCGTCCCAGA
AACCAATCGTCGGTGGGTTGTCAAGTCTGTTCTCATCGTCATCGTCGAGCATTTCAAGTGGTGGAGATGAATTAGGTTCTTTTAGGCACGATAAAGGGGAGGAACTTAAA
GAATTGAGTTCCTCGTTTCGTTATTCCCCAAGGTCCGGTTTCCTGTGGCAGTTGTGGGGTCGGGTCGGCGGCGAGAACCCCTCCATTGTGGACTGCAAGGGAAAGAGTGG
GGATGCTAGTTTTCATGGTCGAGGAAGTACCAACAGGCTGTTCAATGGTTTTGTGAGAAATGCACTGGGATCATGTGTAGACTCTGATTCCCCAAGATTGGAGGTGCCTA
GTGATGGTTTGGATGTGGGTTCATCGGCGTTGTTTGTTGATGAATTGACTTTCAACATGGAGGACAATATTACAGAAAGTAATTCCGAATCATTTGCGAAGGATTTGCTC
CTAAGTGCACAATCGAAGCACAAAATCTTTTGCGATGAGTTTGTGGTCAAGGCTTTTTTTGAGGCCGAGAAAGCACATAGAGGACAGATGCGTGCAAGTGGCGATCCATA
CTTGGAACATTGTGTGGAAACAGCAGTGATGCTTGCACTTGTTGGTGCTAATTCCACGGTTGTTGCTGCAGGGCTCTTGCACGACACACTTGATGATTCTTTTGTGAGCC
ATGACTACATATTGGGGAAATTTGGAGCTGGGGTTGCTGATTTAGTTGAAGGGGTGTCTAAGCTAAGTCATTTAAGCAAGCTTGCTCGTGAACATGATATGGCTGATAGA
ATGGTTGAGGCAGATCGTTTGCACACCATGTTCCTTGCTATGGCTGATGCAAGGGCCGTCCTCATTAAATTAGCAGACCGATTGCACAATATGATGACTTTGGATGCATT
GCCTATGATCAAGCGGCTAAGGTTTGCGAAGGAGACTATGGAGATTTTTGTTCCTCTGGTGAATCGCCTAGGAATCTACAGTTGGAAGGAGCAGCTAGAAAACCTGTGTT
TTATGCATCTTAACTTGGAACAGCACAAAGATTTGTCCTCCAAGCTTATGGGTTTATATGATGAAGCAATTATATATTCTGCAATTAAAAAATTAGAGCGAGCTCTTGAG
GATAAAGGAATCTCTTATCATGTTGTAACTGGGCGGCACAAAAGTGTCTACAGTATACACCGCAAAATGTTGAAGTATGCTTTCTGCCTTCCACCTTCCATCTCTCTCTC
GTTCTCTCTTTATCTCTGCAGACTACTATTTACTGAATCTTGTGTTCAGAAGAGTGATTTTAGTTAA
mRNA sequence
ATGCCTTTCAAGTTTGACTTCAAGATCAGGATCATCGGCCAACTGCAACAACACATCCACCATATCTTTGGCCACATAATCTTTAACTCCCTTTCTCCTTTCATTATGTT
CATCCAATACGTGCTCAAGGAATCTATCGAACTTCTTGCTCAGTGCCTTCATCCTCTTCACGTACCCCTGCAGATCCAAGAAATCTATCCATGGTATCGAGTCCCCAATG
TTGAGCACACCACCCAGCAAGAACAGCTCGTCCAACATTTTCTTGAACTCGTCTGGACTAACAATGGCGTCTTCCGACTCGTCCGTGTACTTCTTTCCCAACACCATCCG
ACTTATCACGTTCAAACTTACGGTAGACAAAGAGCCGATCAAGTTGAGATTTCCGATCAAGGGCCAAGGTTTAGGTCCCGGCGGCAGATTGAGCTTTCTACGACGGAGGC
GGCGGGAGAGGAGGAGGAGGGCGAGAGTGGCGACCCATGCAGCTGCATAAGAAACCCATGGAGGAGCTTCCATTATAATCCCCAATGGCGGTGGCAAAACTTGACTGTAA
TTGATGTCAATTTCCAACCACCGCTGCCGCCGATAACAATGGAAACTCCTTGGGTTTGTTGCGCGGCTGCATGGATAACCATCCTCCTCCTCCTCCTCCTTCTCTCCCTT
CGTCTCCGGCGTCGTAAACTCAATCTACCGCCTGGACCTAAGCCCTGGCCCTTCATTGGAAACCTCAACCTGATCGGTTCTCTACCGCACCAGTCCATTCATCAACTCTC
CAAAAAATATGGCCCCATCATGCACCTCTATTTCGGCTCCTTCCCAGTCGTCGTCGGATCCTCCGTCGAGATGGCCAAAATCTTCCTCAAAACCCATGATCTTACTTTCG
CATACCGCCCCAAAACCGCCGCCGGAAAGTACACCACCTACAACTATTCCAACATTACGTGGTCCCAATACGGCCCTTACTGGCGTCAAGCTCGTAAAATGTTTCTCGTG
GAGCTTTTCAACGCCAAACGACTCGATTCTTATGAGTACATACGCAAGGAAGAAATGATTGCTTTGCTTCAAGAAATATTCAAGTCCGCCGGCAAATCGATCAAGCTCAA
AGGTTACTTGTCCGCATTGAGTATGAACGTGATAAGTCGGATGGCCTTGGGGAAGAAGTACTTGTTGGACGAGTCCAAAGACTCCATTATTAGTCCAGATGAGTTCAAAA
ACATGGTGGACGAGTTGTTCTTGCTAAGTGGTGTGCTTAACATAGGAGATTTGATACCATGGATAGATTTCTTGGATTTGCAGGGCTACGTGAAGAGGATGAAGGCATTC
AGCAAGAAATTCGATAGATTCCTTGAGCATATACTGGATGAACATAATGAAAGGAGAAAGGGAGTTAAGGATTATGTGCCCAAAGATATGGTGGATGTTTTGTTGCAGTT
GGCCGATGATCCTAATCTTGAAGTCAAAGTTGAAAGACATGGAGTCAAGGCACTTACTCTGCTCAACATTGACACTAGAGTGGGCGTGTCAGAGCTTTTGAAAAAGCCAG
AGATTTTCAACAAGGCAACGGAAGAGCTTGACAGAGTGATTGGAAGGGAAAGATGGGTGGAAGAGAAAGACATTGTAAATTTGCCTTACATTGATGCAATTGCAAAAGAG
ACGATGAGATTGCACCCTGTAGTACCAATGCTGGTGCCTAGATTGTGTAGGGAGGATTGCCAAGTTGCAGGCCACGACATAGCCAAAGGCACTGGAGTGTTTGTGAATGT
GTGGGCGATTGGGAGAGACCCTGTAGTGTGGGAAAATCCAAATGATTTTAATCCAGAAAGGTTCATTGGAAAATGCATCGACGTGAAAGGCCAGAACTTTGAGCTTTTGC
CTTTTGGATCGGGAAGGAGGATGTGCCCTGGCCTCACCATCTATAAAGAGGATCTGCTCCAACGACGACAGGGAAATGGGGCCATATTTTTTGGAGAGTTATTGATGAAT
GGAGAGTGGTGGGGCAGAGAAGCGATCAAGTTGAGATTTCCGATCAAGGGCCAAGGCTTTAGGTCCCGGCGGCAGACTGAGCTTTATATGACGGAGGCGACGGGAGAAGA
GGAGGAGGGCGAGAGTGGCGACCCATTATGTGGAAACAGCAGTAGCACCTTGATGATTCTTTTGGAGCTGGGGTTGCTGAGATTTGGTTCAAGGGGTAAAAATACTGCAA
TAGCTGCTATACTTTTAACGCCAAGCAATATTGAGAACACAGAGATGCCTAGAAAATTACAGAAGGAACTCTCGCTTCGTTCTGGTTCTCACTTTCTCGTCCAAGCATTG
TTTGTTTTTACTGGGAAATTGTGTTGGTTTCTTGAGACAATTGGTGGGGCTTCGAATAAAAAAGGCCATGGCTGTGCCGACCATAGCTCTGTATACGAGCCACCGAGCAC
TATCTGCTCCTCACCGCACCCTTGCCAGATGAATTCTCATGGATCATATGATTTAGAATTTACTTCTCGGTCCTCGTCGTTGGCGTCTTCGACGGCCTCTTCGTCCCAGA
AACCAATCGTCGGTGGGTTGTCAAGTCTGTTCTCATCGTCATCGTCGAGCATTTCAAGTGGTGGAGATGAATTAGGTTCTTTTAGGCACGATAAAGGGGAGGAACTTAAA
GAATTGAGTTCCTCGTTTCGTTATTCCCCAAGGTCCGGTTTCCTGTGGCAGTTGTGGGGTCGGGTCGGCGGCGAGAACCCCTCCATTGTGGACTGCAAGGGAAAGAGTGG
GGATGCTAGTTTTCATGGTCGAGGAAGTACCAACAGGCTGTTCAATGGTTTTGTGAGAAATGCACTGGGATCATGTGTAGACTCTGATTCCCCAAGATTGGAGGTGCCTA
GTGATGGTTTGGATGTGGGTTCATCGGCGTTGTTTGTTGATGAATTGACTTTCAACATGGAGGACAATATTACAGAAAGTAATTCCGAATCATTTGCGAAGGATTTGCTC
CTAAGTGCACAATCGAAGCACAAAATCTTTTGCGATGAGTTTGTGGTCAAGGCTTTTTTTGAGGCCGAGAAAGCACATAGAGGACAGATGCGTGCAAGTGGCGATCCATA
CTTGGAACATTGTGTGGAAACAGCAGTGATGCTTGCACTTGTTGGTGCTAATTCCACGGTTGTTGCTGCAGGGCTCTTGCACGACACACTTGATGATTCTTTTGTGAGCC
ATGACTACATATTGGGGAAATTTGGAGCTGGGGTTGCTGATTTAGTTGAAGGGGTGTCTAAGCTAAGTCATTTAAGCAAGCTTGCTCGTGAACATGATATGGCTGATAGA
ATGGTTGAGGCAGATCGTTTGCACACCATGTTCCTTGCTATGGCTGATGCAAGGGCCGTCCTCATTAAATTAGCAGACCGATTGCACAATATGATGACTTTGGATGCATT
GCCTATGATCAAGCGGCTAAGGTTTGCGAAGGAGACTATGGAGATTTTTGTTCCTCTGGTGAATCGCCTAGGAATCTACAGTTGGAAGGAGCAGCTAGAAAACCTGTGTT
TTATGCATCTTAACTTGGAACAGCACAAAGATTTGTCCTCCAAGCTTATGGGTTTATATGATGAAGCAATTATATATTCTGCAATTAAAAAATTAGAGCGAGCTCTTGAG
GATAAAGGAATCTCTTATCATGTTGTAACTGGGCGGCACAAAAGTGTCTACAGTATACACCGCAAAATGTTGAAGTATGCTTTCTGCCTTCCACCTTCCATCTCTCTCTC
GTTCTCTCTTTATCTCTGCAGACTACTATTTACTGAATCTTGTGTTCAGAAGAGTGATTTTAGTTAA
Protein sequence
MPFKFDFKIRIIGQLQQHIHHIFGHIIFNSLSPFIMFIQYVLKESIELLAQCLHPLHVPLQIQEIYPWYRVPNVEHTTQQEQLVQHFLELVWTNNGVFRLVRVLLSQHHP
TYHVQTYGRQRADQVEISDQGPRFRSRRQIELSTTEAAGEEEEGESGDPCSCIRNPWRSFHYNPQWRWQNLTVIDVNFQPPLPPITMETPWVCCAAAWITILLLLLLLSL
RLRRRKLNLPPGPKPWPFIGNLNLIGSLPHQSIHQLSKKYGPIMHLYFGSFPVVVGSSVEMAKIFLKTHDLTFAYRPKTAAGKYTTYNYSNITWSQYGPYWRQARKMFLV
ELFNAKRLDSYEYIRKEEMIALLQEIFKSAGKSIKLKGYLSALSMNVISRMALGKKYLLDESKDSIISPDEFKNMVDELFLLSGVLNIGDLIPWIDFLDLQGYVKRMKAF
SKKFDRFLEHILDEHNERRKGVKDYVPKDMVDVLLQLADDPNLEVKVERHGVKALTLLNIDTRVGVSELLKKPEIFNKATEELDRVIGRERWVEEKDIVNLPYIDAIAKE
TMRLHPVVPMLVPRLCREDCQVAGHDIAKGTGVFVNVWAIGRDPVVWENPNDFNPERFIGKCIDVKGQNFELLPFGSGRRMCPGLTIYKEDLLQRRQGNGAIFFGELLMN
GEWWGREAIKLRFPIKGQGFRSRRQTELYMTEATGEEEEGESGDPLCGNSSSTLMILLELGLLRFGSRGKNTAIAAILLTPSNIENTEMPRKLQKELSLRSGSHFLVQAL
FVFTGKLCWFLETIGGASNKKGHGCADHSSVYEPPSTICSSPHPCQMNSHGSYDLEFTSRSSSLASSTASSSQKPIVGGLSSLFSSSSSSISSGGDELGSFRHDKGEELK
ELSSSFRYSPRSGFLWQLWGRVGGENPSIVDCKGKSGDASFHGRGSTNRLFNGFVRNALGSCVDSDSPRLEVPSDGLDVGSSALFVDELTFNMEDNITESNSESFAKDLL
LSAQSKHKIFCDEFVVKAFFEAEKAHRGQMRASGDPYLEHCVETAVMLALVGANSTVVAAGLLHDTLDDSFVSHDYILGKFGAGVADLVEGVSKLSHLSKLAREHDMADR
MVEADRLHTMFLAMADARAVLIKLADRLHNMMTLDALPMIKRLRFAKETMEIFVPLVNRLGIYSWKEQLENLCFMHLNLEQHKDLSSKLMGLYDEAIIYSAIKKLERALE
DKGISYHVVTGRHKSVYSIHRKMLKYAFCLPPSISLSFSLYLCRLLFTESCVQKSDFS