; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027961 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027961
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptioncarotenoid 9,10(9',10')-cleavage dioxygenase 1-like
Genome locationtig00153056:1952796..1970609
RNA-Seq ExpressionSgr027961
SyntenySgr027961
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0016702 - oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR001680 - WD40 repeat
IPR004294 - Carotenoid oxygenase
IPR015943 - WD40/YVTN repeat-like-containing domain superfamily
IPR036322 - WD40-repeat-containing domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031724.1 Protein JINGUBANG, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0065.85Show/hide
Query:  MLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLE
        MLHA+YF K+  G W+L YNNRYVQT++F+ EK V+NRPSFLPAVEGD+LAV++AF LN LRFG+V KD+SNTNV+EHSG+FYS+AEN MPQE DI +L+
Subjt:  MLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDITTLE

Query:  TLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGP-----
        +LG WDV  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+HEIGVT+R+NVILDYPVT+DLNRLIRGG      
Subjt:  TLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGP-----

Query:  --------------------------LIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSK
                                  LIK++KEGYA+IGVMPRYGDADSIQWF+VKPNCTMHLFNCFE + EV+VWGCRA+DS IPGPEKGLNKFEWFS+
Subjt:  --------------------------LIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSK

Query:  RFKPLPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQS
        RFKP+  TD  E NT    EDGSL SRAY+WR+NLKTGE RER LTG QFSMDFPFINS FTGLKNK+GYAQVLDS ASSNS      GLAKLHFEEPQS
Subjt:  RFKPLPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQS

Query:  FEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTLLLLPSIPHNKR--ASVTFMA
         E SL K  EE+  IKVEYHM ENNSFC+GASFV RE S +EDDGW+IAHVHNEITNTSQ+ +  A   S +          +P   H     +   F  
Subjt:  FEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTLLLLPSIPHNKR--ASVTFMA

Query:  SQANKSLLSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISK
          A+ +L  ++    +    +    Q +     +VM +Q G         SK +K IPPPSPESPW LSPL+TPSP LL+HCIASLHRGEGNIYSIAIS 
Subjt:  SQANKSLLSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISK

Query:  GVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--VFAS------------------EKKLIS-------VVLSNGNRPPPQR-FISCMAYYH
        GVVFTGS++ R+RAWKQPDCM+RGYLKAAAGGV AI+A G  +F S                   KKL S       ++ S        +  ISCMAYYH
Subjt:  GVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--VFAS------------------EKKLIS-------VVLSNGNRPPPQR-FISCMAYYH

Query:  AQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYD-TTFLYSGS
        AQDLLYT SHDKT+KAWR+SDRKCVDSF+AHED VNSI VNQNDGCLFTCSSDGTVKIWR +YRENSHTLTMTLKFQTSPVNAV L SH D  TFLYSGS
Subjt:  AQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYD-TTFLYSGS

Query:  SDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKITTSFLVYSASLDQT
        SDGTINFWEKE++SYRYNH GFLQGHRF VLCL VVERL+ SGSEDTTVRIWRKEEG  YHECLAVLDGHRGPVRCLA SLE+E +  SFLVYS SLDQ+
Subjt:  SDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKITTSFLVYSASLDQT

Query:  FKVWRVKVLPEEKRCLDYGEGGESKMKKLGELYEMSPVLSPSW
        FKVWRVKV+  E+       GG+ KM K   LYEMSPVLSPSW
Subjt:  FKVWRVKVLPEEKRCLDYGEGGESKMKKLGELYEMSPVLSPSW

XP_022139412.1 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Momordica charantia]3.8e-20678.59Show/hide
Query:  MFGRSSYIWVEGEGMLHALYFNKD--SRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYS
        +FGRS+Y WVEGEGMLHALYFNKD      WNL YNNRYVQT+TF+LEK V+N P FLPA+EGDSLAVLSAFFLN+LRFG V KD+SNTNV+EHSGKFYS
Subjt:  MFGRSSYIWVEGEGMLHALYFNKD--SRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYS

Query:  LAENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVT
        +A+N +PQ+ DI +L++LG WDVNGAWNRPFTSHPKKAP TGELV++G T  KPFM    +SE    MVHKVDVKLSR S SHEIGVT+RYNVILDYPVT
Subjt:  LAENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVT

Query:  VDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNT
        +D NRLI GG LIKYDKEGYARIGVMPRYGD DSIQWFQVKPNCTMHLFN FE  DEV+VWGCRASDSVIPGPEKG NKFEWFS RFKPLP  D +  ++
Subjt:  VDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNT

Query:  -DSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEP-QSFEFSLPKKCEEQR
          SS+EDGSLFSRAYEWRLNLKTGEARER L+GTQFSMDFPFINSRFTGL+NKFGYAQ+LDSFASSNSGMFKFGGLAKLHFEEP QS  FSLPKK   + 
Subjt:  -DSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEP-QSFEFSLPKKCEEQR

Query:  IIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
         IKVEYHMFEN+SFCSGASFV RE   EEDDGWIIAH+HNEITNTSQ+ +  A
Subjt:  IIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

XP_022940198.1 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Cucurbita moschata]1.4e-20076.17Show/hide
Query:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA
        M GRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQT++F+ EK V+NRPSFLPAVEGD+LAV++AF LN LRFG+V KD+SNTNV+EHSG+FYS+A
Subjt:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA

Query:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD
        EN MPQE DI +L++LG WDV  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+HEIGVT+R+NVILDYPVT+D
Subjt:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD

Query:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS
        LNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKPNCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD      + 
Subjt:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS

Query:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV
        + EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKN++GYAQVLDS ASSNSGMFKFGGLAKLHFEEPQS E SL K  EE+  IKV
Subjt:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV

Query:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
        EYHM ENNSFC+GASFV RE S +EDDGWIIAHVHNEITNTSQ+ +  A
Subjt:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

XP_022980961.1 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like isoform X1 [Cucurbita maxima]4.8e-20176.84Show/hide
Query:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA
        MFGRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQTE+FR EK V+NRPSFLPAVEGD++A+++AF LN LRFG+V KD+SNTNV+EHSGKFYS+A
Subjt:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA

Query:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD
        EN MPQ+ DI +L++LG WDV  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+HEIGVT+R+NVILDYPVT+D
Subjt:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD

Query:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS
        LNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKPNCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD  E NT  
Subjt:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS

Query:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV
          EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKNK+GYAQVLDS ASSNSG+FKFGGLAKLHFEEPQS E SL K  EE+  IKV
Subjt:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV

Query:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
        EYHM ENNSFC+GASFV RE S +EDDGWIIAHVHNEIT TSQ+ +  A
Subjt:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

XP_023523500.1 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like [Cucurbita pepo subsp. pepo]4.8e-20176.61Show/hide
Query:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA
        MFGRSS+IWVEGEGMLHA+YF K   G W+LLYNNRYVQTE+FR EK V+NRPSFLPAVEGD++AV++AF LN LRFG+V KD+SNTNV+EHSG+FYS+A
Subjt:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA

Query:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD
        EN MPQE DI +L++LG WDV  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+HEIG+T+R+NVILDYPVT+D
Subjt:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD

Query:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS
        LNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKPNCTMH+FNCFE + EV+VWGCRA+DS IPGPEKGLNKFEWFS+RFKP+  TD  E NT  
Subjt:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS

Query:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV
          EDGSL SRAY+WRLNLKTGE RER LTG QFSMDFPFINS FTGLKNK+GYAQVLDS ASSNSG+FKFGGLAKLHFEEPQS E SL K  EE+  IKV
Subjt:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV

Query:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
        EYHM ENNSFC+GASFV RE S +EDDGW+IAHVHNEITNTSQ+ +  A
Subjt:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

TrEMBL top hitse value%identityAlignment
A0A6J1CDW2 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like1.8e-20678.59Show/hide
Query:  MFGRSSYIWVEGEGMLHALYFNKD--SRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYS
        +FGRS+Y WVEGEGMLHALYFNKD      WNL YNNRYVQT+TF+LEK V+N P FLPA+EGDSLAVLSAFFLN+LRFG V KD+SNTNV+EHSGKFYS
Subjt:  MFGRSSYIWVEGEGMLHALYFNKD--SRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYS

Query:  LAENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVT
        +A+N +PQ+ DI +L++LG WDVNGAWNRPFTSHPKKAP TGELV++G T  KPFM    +SE    MVHKVDVKLSR S SHEIGVT+RYNVILDYPVT
Subjt:  LAENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVT

Query:  VDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNT
        +D NRLI GG LIKYDKEGYARIGVMPRYGD DSIQWFQVKPNCTMHLFN FE  DEV+VWGCRASDSVIPGPEKG NKFEWFS RFKPLP  D +  ++
Subjt:  VDLNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNT

Query:  -DSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEP-QSFEFSLPKKCEEQR
          SS+EDGSLFSRAYEWRLNLKTGEARER L+GTQFSMDFPFINSRFTGL+NKFGYAQ+LDSFASSNSGMFKFGGLAKLHFEEP QS  FSLPKK   + 
Subjt:  -DSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEP-QSFEFSLPKKCEEQR

Query:  IIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
         IKVEYHMFEN+SFCSGASFV RE   EEDDGWIIAH+HNEITNTSQ+ +  A
Subjt:  IIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

A0A6J1FNM1 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like6.8e-20176.17Show/hide
Query:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA
        M GRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQT++F+ EK V+NRPSFLPAVEGD+LAV++AF LN LRFG+V KD+SNTNV+EHSG+FYS+A
Subjt:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA

Query:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD
        EN MPQE DI +L++LG WDV  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+HEIGVT+R+NVILDYPVT+D
Subjt:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD

Query:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS
        LNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKPNCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD      + 
Subjt:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS

Query:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV
        + EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKN++GYAQVLDS ASSNSGMFKFGGLAKLHFEEPQS E SL K  EE+  IKV
Subjt:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV

Query:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
        EYHM ENNSFC+GASFV RE S +EDDGWIIAHVHNEITNTSQ+ +  A
Subjt:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

A0A6J1IV34 carotenoid 9,10(9',10')-cleavage dioxygenase 1-like isoform X12.3e-20176.84Show/hide
Query:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA
        MFGRSS+IWVEGEGMLHA+YF K+  G W+L YNNRYVQTE+FR EK V+NRPSFLPAVEGD++A+++AF LN LRFG+V KD+SNTNV+EHSGKFYS+A
Subjt:  MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLA

Query:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD
        EN MPQ+ DI +L++LG WDV  AWNRPFTSHPKKAP+TGELV++G   TKP+M    +SE   RMVHKVDVKLSRSSL+HEIGVT+R+NVILDYPVT+D
Subjt:  ENFMPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFM----VSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVD

Query:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS
        LNRLIRGG LIK++KEGYA+IGVMPRYGDADSIQWF+VKPNCTMHLFNCFE ++EV+VWGCRA+DS+IPGPEKGLNKFEWFS+RFKP+  TD  E NT  
Subjt:  LNRLIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDS

Query:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV
          EDGSL S AYEWRLNLKTGEARER LTG QFSMDFPFINS FTGLKNK+GYAQVLDS ASSNSG+FKFGGLAKLHFEEPQS E SL K  EE+  IKV
Subjt:  SIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV

Query:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
        EYHM ENNSFC+GASFV RE S +EDDGWIIAHVHNEIT TSQ+ +  A
Subjt:  EYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

A0A7J6FQE6 WD_REPEATS_REGION domain-containing protein1.5e-18744.06Show/hide
Query:  MPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPF----MVSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNR
        MPQE DI TL+TLG+W ++  WNRPFTSHPK+   +GELV +G   TKP+    ++S    +++H+ D+KL R  + H+IG+T+R               
Subjt:  MPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPF----MVSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNR

Query:  LIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFED-KDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDSSI
              LIKYDK+ YARIGVMPRYGDADSI+WF+++PNCTMH FNCFED  DE++VWGCRA DS+I                     ++  E+ +T ++ 
Subjt:  LIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFED-KDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDSSI

Query:  EDGSLFSRAYEWRLNLKTGEARERSLT-GTQFSMDFPFINSRFTGLKNKFGYAQVLDSF-ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV
         + S     YEWRLN+K G  +E+ LT  +++SMD+P IN  + GLKNKFGYAQV+D   A++++ + K+ G+AKLHFEE ++   S     E + ++K+
Subjt:  EDGSLFSRAYEWRLNLKTGEARERSLT-GTQFSMDFPFINSRFTGLKNKFGYAQVLDSF-ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV

Query:  EYHMFENNSFCSGASFVSREGSP--EEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLR
        EYHMFE N+FCSGA+FVS +     +EDDGWII  VHNE TN SQ+ +                                                    
Subjt:  EYHMFENNSFCSGASFVSREGSP--EEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLR

Query:  TPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQ
                 + S+                                                                         F+ S+++RIRAW+ 
Subjt:  TPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQ

Query:  PDCMERGYLKAAAGGVKAISASG--VFASEKKL------ISVVLSN------GNRPPPQR---------------------FISCMAYYHAQDLLYTGSH
        PDC+ERG LK+ +G V+AI A G  +F S K L       +VV SN       + P  +R                      +SC+AYYHA+ +LYTGSH
Subjt:  PDCMERGYLKAAAGGVKAISASG--VFASEKKL------ISVVLSN------GNRPPPQR---------------------FISCMAYYHAQDLLYTGSH

Query:  DKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYD---------TTFLYSGSSD
        D+T+KAWRVS RKCVDSFVAHEDNVN I VNQ DGC+FTCSSDG+VKIWRR+YRENSHTLTM L+FQ SPVNA+AL S            T FLYSGSSD
Subjt:  DKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYD---------TTFLYSGSSD

Query:  GTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKITTS-FLVYSASLDQTF
        GTINFWEKE++SYR+NH GFLQGHRFAVLC+V VE+++FSGSEDTT+RIWR+EEG+  HECLAVLDGHRGPVRCLAA LE E +    FL+YSASLD+TF
Subjt:  GTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKITTS-FLVYSASLDQTF

Query:  KVWRVKVLPEEK---------RCLDYGEGG------------ESKMKKLGELYE----MSPVLSPSW
        KVWRVK+LPEE+         R +  G GG            +S+       YE     SPVLSPSW
Subjt:  KVWRVKVLPEEK---------RCLDYGEGG------------ESKMKKLGELYE----MSPVLSPSW

A0A803NFI9 Uncharacterized protein5.5e-21951.94Show/hide
Query:  MPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPF----MVSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNR
        MPQE DI TL+TL  W ++  WNRPFTSHPK+   +GELV +G   TKP+    ++S    +++H+ D+KL R  + H+IG+T+RYN+I+D P+T+D+ R
Subjt:  MPQEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPF----MVSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNR

Query:  LIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFED-KDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDSSI
        L+RGGPLIKYDK+ YARIGVMPRYGDA SI+WF+V+PNCTMH FNCFED  DE++VWGCRA DS+I                     ++  E+ +T ++ 
Subjt:  LIRGGPLIKYDKEGYARIGVMPRYGDADSIQWFQVKPNCTMHLFNCFED-KDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDSSI

Query:  EDGSLFSRAYEWRLNLKTGEARERSLT-GTQFSMDFPFINSRFTGLKNKFGYAQVLDSF-ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV
         + S     YEWRLN+K G  +E+ LT  +++SMD+P IN  + GLKNKFGYAQV+D   A++++ + K+ G+AKLHFEE ++   S     E + ++K+
Subjt:  EDGSLFSRAYEWRLNLKTGEARERSLT-GTQFSMDFPFINSRFTGLKNKFGYAQVLDSF-ASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKV

Query:  EYHMFENNSFCSGASFVSREGSP--EEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLR
        EYHMFE N+FCSGA+FVS +     +EDDGWII  VHNE TN SQ+ +  +   S +          +P   H    S+   +S                
Subjt:  EYHMFENNSFCSGASFVSREGSP--EEDDGWIIAHVHNEITNTSQLAMATAMYTSLQLFLNYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLR

Query:  TPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQ
             SP Q  ++  P                      LIPPPSPESPWT SPLQTPSP LLYHCIASLHR +G IYS+A+S GVVFTGS+++RIRAW+ 
Subjt:  TPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQ

Query:  PDCMERGYLKAAAGGVKAISASG--VFASEKKL------ISVVLSN------GNRPPPQR---------------------FISCMAYYHAQDLLYTGSH
        PDC+ERG LK+ +G V+AI A G  +F S K L       +VV SN       + P  +R                      +SC+AYYHA+ +LYTGSH
Subjt:  PDCMERGYLKAAAGGVKAISASG--VFASEKKL------ISVVLSN------GNRPPPQR---------------------FISCMAYYHAQDLLYTGSH

Query:  DKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYD----------TTFLYSGSS
        D+T+KAWRVS RKCVDSFVAHEDNVN I VNQ DGC+FTCSSDG+VKIWRR+YRENSHTLTM L+FQ SPVNA+AL S             T FLYSGSS
Subjt:  DKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYD----------TTFLYSGSS

Query:  DGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVR
        DGTINFWEKE++SYR+NH GFLQGHRFAVLC+V VE+++FSGSEDTT+RIWR+EEG+  HECLAVLDGHRGPVR
Subjt:  DGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVR

SwissProt top hitse value%identityAlignment
O48716 Protein JINGUBANG1.7e-5236.96Show/hide
Query:  LIPPPSPESPWTLSPLQTPSPPLLYH-CIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG-------------V
        ++ P +  +P+T +   T    L  +  I SL R EG+IYS+A +K +++TGS++  IR WK  +  E    K  +G VKAI  SG             V
Subjt:  LIPPPSPESPWTLSPLQTPSPPLLYH-CIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG-------------V

Query:  FASEKKLISVVLSNGNRP----------PPQRF-----------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQ
        +    K  S+   +G  P           P+ +                 +SC++    Q LLY+ S D+TIK WR++D KC++S  AH+D VNS+ V+ 
Subjt:  FASEKKLISVVLSNGNRP----------PPQRF-----------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQ

Query:  NDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIF
         +  +F+ S+DGTVK W+R    +   HTL  TL  Q S V A+A+    +   +Y GSSDG +NFWE+EK   + N+ G L+GH+ AVLCL V   L+F
Subjt:  NDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIF

Query:  SGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE---VEKITTSFLVYSASLDQTFKVWRV
        SGS D T+ +W+++     H CL+VL GH GPV+CLA   +    E+    ++VYS SLD++ KVW V
Subjt:  SGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE---VEKITTSFLVYSASLDQTFKVWRV

O65572 Carotenoid 9,10(9',10')-cleavage dioxygenase 11.3e-3628.42Show/hide
Query:  SSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFF----LNLLRFGKVYKDI--------SNTNVLEH
        + Y W +G+GM+H +   KD + T    Y +RYV+T   + E+       F  A +   +  L  FF    +N+ +     K +        +NT ++ H
Subjt:  SSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFF----LNLLRFGKVYKDI--------SNTNVLEH

Query:  SGKFYSLAENFMPQEFDIT---TLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLR---MVHKVDVKLSRSSLSHEIGVTQRYN
         GK  +L E   P    +     L+TLG  D +      FT+HPK  P TGE+   G + T P++      +   M   V + +S   + H+  +T+ Y 
Subjt:  SGKFYSLAENFMPQEFDIT---TLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLR---MVHKVDVKLSRSSLSHEIGVTQRYN

Query:  VILDYPVTVDLNRLIRGGPLI-KYDKEGYARIGVMPRYG-DADSIQWFQVKPNC-TMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKP
        + +D P+      +++   +I  +D    AR GV+PRY  D   I+WF++ PNC   H  N +E++DEV++  CR  +                      
Subjt:  VILDYPVTVDLNRLIRGGPLI-KYDKEGYARIGVMPRYG-DADSIQWFQVKPNC-TMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKP

Query:  LPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFS
         P  D         +E  +  +  YE R N+KTG A ++ L+ +  ++DFP IN  +TG K ++ Y  +LDS A   +G+ KF     LH E        
Subjt:  LPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFS

Query:  LPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAMYTSLQ
          +  E    IK  Y + E   + S A +V RE + EEDDG++I  VH+E T  S + +  A   S +
Subjt:  LPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAMYTSLQ

Q69NX5 9-cis-epoxycarotenoid dioxygenase NCED4, chloroplastic5.4e-3025.43Show/hide
Query:  RSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAV---EGDSLAVLSAFFLNLLRFGKVYKD----ISNTNVLEHSGKF
        R+ +   +G+GMLHA+       G     Y  R+ +T   R E+ +  RP F  A+    G S       F +    G +       ++N  ++ H G+ 
Subjt:  RSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAV---EGDSLAVLSAFFLNLLRFGKVYKD----ISNTNVLEHSGKF

Query:  YSLAENFMPQEFDIT---TLETLGDWDVNGAWNRPFT--SHPKKAPETGELVVLG-ATATKPFMVSESFL---RMVHKVDVKLSRSSLSHEIGVTQRYNV
         +++E+ +P    +T    LET+G +D +G  +   T  +HPK  P TGEL  L     +KP++    F    R    VD+ +   ++ H+  VT+ Y V
Subjt:  YSLAENFMPQEFDIT---TLETLGDWDVNGAWNRPFT--SHPKKAPETGELVVLG-ATATKPFMVSESFL---RMVHKVDVKLSRSSLSHEIGVTQRYNV

Query:  ILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMP-RYGDADSIQWFQVKPNCTMHLFNCFED--KDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPL
        + D  +   L  ++RGG  + YD+E  +R GV+P R  DA  ++W +V      HL+N +ED    E++V G     S +  P+   N+           
Subjt:  ILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVMP-RYGDADSIQWFQVKPNCTMHLFNCFED--KDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPL

Query:  PATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSL---TGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFE
        P+   EE +  S +          E RL+ +TG +R R +      Q +++   +N +  G K ++ Y  + + +        +  G AK+  E   + +
Subjt:  PATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSL---TGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFE

Query:  FSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA
        F                 ++    +     FV R G+  EDDG ++  VH+E   TS+L +  A
Subjt:  FSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATA

Q94IR2 Carotenoid 9,10(9',10')-cleavage dioxygenase 11.4e-3829.37Show/hide
Query:  SSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDIS------NTNVLEHSGKFYS
        + Y W +G+GM+H L   KD + T    Y +R+V+T   + E+    R  F+   +   L  L    +++LR      D+S      NT ++ H GK  +
Subjt:  SSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDIS------NTNVLEHSGKFYS

Query:  LAENFMP---QEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLR---MVHKVDVKLSRSSLSHEIGVTQRYNVILDYP
        L+E   P   + F+   L+TLG  D +      FT+HPK  P TGE+   G   T P++      +   M   V + +S   + H+  +T+ Y V +D P
Subjt:  LAENFMP---QEFDITTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLR---MVHKVDVKLSRSSLSHEIGVTQRYNVILDYP

Query:  VTVDLNRLIRGGPLI-KYDKEGYARIGVMPRYG-DADSIQWFQVKPNC-TMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDH
        +      +++   LI  +D    AR GV+PRY  D   I+WF++ PNC   H  N +E++DEV++  CR  +                       P  D+
Subjt:  VTVDLNRLIRGGPLI-KYDKEGYARIGVMPRYG-DADSIQWFQVKPNC-TMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDH

Query:  EEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCE
                +E+ S  +  YE R N+KTGEA ++ L+ +  ++DFP +N  +TG K ++ Y   LDS A   +G+ KF     LH E     E     K E
Subjt:  EEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCE

Query:  EQRIIKVEYHMFENNSFCSGASFVSREG--SPEEDDGWIIAHVHNE
            ++  Y +     F S A ++ R      EEDDG+++  VH+E
Subjt:  EQRIIKVEYHMFENNSFCSGASFVSREG--SPEEDDGWIIAHVHNE

Q9LRR7 9-cis-epoxycarotenoid dioxygenase NCED3, chloroplastic2.6e-3226.54Show/hide
Query:  EGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAV-EGDSLAVLSAFFLNLLRFGKVYKD------ISNTNVLEHSGKFYSLAENF
        +G+GM+HA+ F   S       Y  R+ QT  F  E+ +  RP F  A+ E      ++   L   R      D      ++N  ++  +G+  +++E+ 
Subjt:  EGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAV-EGDSLAVLSAFFLNLLRFGKVYKD------ISNTNVLEHSGKFYSLAENF

Query:  MPQEFDIT---TLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLG-ATATKPFMVSESFLRMVHK---VDVKLSRSSLSHEIGVTQRYNVILDYPVTVD
        +P +  IT    L+T+G +D +G       +HPK  PE+GEL  L     +KP++    F     K   V+++L + ++ H+  +T+ + V+ D  V   
Subjt:  MPQEFDIT---TLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLG-ATATKPFMVSESFLRMVHK---VDVKLSRSSLSHEIGVTQRYNVILDYPVTVD

Query:  LNRLIRGGPLIKYDKEGYARIGVMPRYG-DADSIQWFQVKPNCTMHLFNCFE--DKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHN
        L  +IRGG  + YDK   AR G++ +Y  D+ +I+W         HL+N +E  + DEV+V G     S +  P+   N+                    
Subjt:  LNRLIRGGPLIKYDKEGYARIGVMPRYG-DADSIQWFQVKPNCTMHLFNCFE--DKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHN

Query:  TDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGT---QFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEE
              D +L S   E RLNLKTGE+  R +      Q +++   +N    G K KF Y  + + +        K  G AK+     +            
Subjt:  TDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGT---QFSMDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEE

Query:  QRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAM
             V+ H++ +N +     F+  EG  EED+G+I+  VH+E T  S+L +  A+
Subjt:  QRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQLAMATAM

Arabidopsis top hitse value%identityAlignment
AT1G49450.1 Transducin/WD40 repeat-like superfamily protein4.4e-5134.55Show/hide
Query:  ESPWTLSPLQTPSPPLLYH-------------CIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISAS----------
        +SPW  +       P +Y               I ++ R EG++YS+A S  ++FTGS++  IR WK  D  +    K+ +G VKAI  +          
Subjt:  ESPWTLSPLQTPSPPLLYH-------------CIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISAS----------

Query:  ----GVFASEKKLISVVLSNGNRPPPQRF---------------------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVN
             V+   KK        G+ P  + F                           +SC++      LLY+GS DKT+K WR+SD KC++S  AH+D VN
Subjt:  ----GVFASEKKLISVVLSNGNRPPPQRF---------------------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVN

Query:  SIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVV
        ++ V+  D  +FT S+DGT+K+W+R    +E  H L   L  Q + V A+A+  +     +Y GSSDGT+NFWE++K      H+G + GHR AVLCL  
Subjt:  SIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVV

Query:  VERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEV-----------EKITTSFLVYSASLDQTFKVWRV
           L+ SG  D  + +W K  G   H CL+VL  H GPV+CLAA  E            EK    ++VYS SLD + KVWRV
Subjt:  VERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEV-----------EKITTSFLVYSASLDQTFKVWRV

AT2G26490.1 Transducin/WD40 repeat-like superfamily protein1.2e-5336.96Show/hide
Query:  LIPPPSPESPWTLSPLQTPSPPLLYH-CIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG-------------V
        ++ P +  +P+T +   T    L  +  I SL R EG+IYS+A +K +++TGS++  IR WK  +  E    K  +G VKAI  SG             V
Subjt:  LIPPPSPESPWTLSPLQTPSPPLLYH-CIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG-------------V

Query:  FASEKKLISVVLSNGNRP----------PPQRF-----------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQ
        +    K  S+   +G  P           P+ +                 +SC++    Q LLY+ S D+TIK WR++D KC++S  AH+D VNS+ V+ 
Subjt:  FASEKKLISVVLSNGNRP----------PPQRF-----------------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQ

Query:  NDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIF
         +  +F+ S+DGTVK W+R    +   HTL  TL  Q S V A+A+    +   +Y GSSDG +NFWE+EK   + N+ G L+GH+ AVLCL V   L+F
Subjt:  NDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIF

Query:  SGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE---VEKITTSFLVYSASLDQTFKVWRV
        SGS D T+ +W+++     H CL+VL GH GPV+CLA   +    E+    ++VYS SLD++ KVW V
Subjt:  SGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE---VEKITTSFLVYSASLDQTFKVWRV

AT3G18950.1 Transducin/WD40 repeat-like superfamily protein9.8e-5134.37Show/hide
Query:  AQVSSETTPQVMSDQQGVAP---SSSPHHSKVLKLIPPPSPESPWTLSPLQTPSP----PLLYH-----------CIASLHRGEGNIYSIAISKGVVFTG
        +Q ++ TT    S  Q ++P   + SP+++      P  SP SPW     QT SP    P +Y             I ++ R +G++YS+A S  ++FTG
Subjt:  AQVSSETTPQVMSDQQGVAP---SSSPHHSKVLKLIPPPSPESPWTLSPLQTPSP----PLLYH-----------CIASLHRGEGNIYSIAISKGVVFTG

Query:  SETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--------------VFASEKKLISVVLSNGNRPPPQRF---------------------------I
        S++  IR WK  D  +    K+ +G VKAI  +G              V+   K+        G+ P  + F                           +
Subjt:  SETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--------------VFASEKKLISVVLSNGNRPPPQRF---------------------------I

Query:  SCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYD
        SC++      LLY+GS DKT+K WR+SD KC++S  AH+D +N++A   +D  LFT S+DGT+K+W+R    +   H L   L  Q + V A+A+  +  
Subjt:  SCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVNAVALCSHYD

Query:  TTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE-----VEKIT
           +Y GSSDGT+NFWE +K     +H G L+GHR AVLCL     L+ SG  D  + +WR+  G   H CL+VL  H GPV+CL A  +      EK  
Subjt:  TTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLE-----VEKIT

Query:  TSFLVYSASLDQTFKVWRV
          ++VYS SLD++ KVWRV
Subjt:  TSFLVYSASLDQTFKVWRV

AT3G50390.1 Transducin/WD40 repeat-like superfamily protein1.4e-5736.08Show/hide
Query:  SSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLK
        SS  +    S Q   +   SP+HS  +K+    + E           SP +L   + SL R EG+IYS+A S  +++TGS++  IR WK  + +E    K
Subjt:  SSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLK

Query:  AAAGGVKAISASG-------------VFASEKKLISVVLSNGNRP-----------PPQRF-------------------ISCMAYYHAQDLLYTGSHDK
        + +G VKAI  +G             V+ +  K  +V    G  P           P   F                   ISC+A    + LLY+GS DK
Subjt:  AAAGGVKAISASG-------------VFASEKKLISVVLSNGNRP-----------PPQRF-------------------ISCMAYYHAQDLLYTGSHDK

Query:  TIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRR--VYRENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKE
        T K WRVSD +CV+S  AHED VN++ V+  DG +FT S+DGTVK+WRR    ++  H  + TL  Q   V A+A+      T +Y GSSDGT+NFWE+E
Subjt:  TIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRR--VYRENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKE

Query:  KLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKIT--TSFLVYSASLDQTFKVWRVK--
               + G L+GH+ AVLCLV    L+FSGS D  +R+WR+ EG   H CL+VL GH GPV+CLA   + E ++    ++VYS SLD++ K+WRV   
Subjt:  KLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKIT--TSFLVYSASLDQTFKVWRVK--

Query:  ----VLPEEKRCLDYGEGGESKMKKLGELYEMSPVLSPSWAATSQFTPE
            V  E K    +G GG     +L         ++PS++A  + +P+
Subjt:  ----VLPEEKRCLDYGEGGESKMKKLGELYEMSPVLSPSWAATSQFTPE

AT4G34380.1 Transducin/WD40 repeat-like superfamily protein1.8e-4932.1Show/hide
Query:  LSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPL-----------LYHCIASLHRGEGNIYSI
        +S  D +++  P    P  ++S  T Q  S       SS P   +           SP  +SP    SPP                I S+ R EG+IYS+
Subjt:  LSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQTPSPPL-----------LYHCIASLHRGEGNIYSI

Query:  AISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--VFASEKKLISVVLSNGNRPP---------------------PQRF---------
        A S  +++TGS++  IR WK  +  E    K+++G +KAI   G  +F   +     +     R P                     P+ F         
Subjt:  AISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASG--VFASEKKLISVVLSNGNRPP---------------------PQRF---------

Query:  --------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVN
                +S ++      LLY+ S D TIK WR++D KC++S  AH+D +NS+ ++  D  +FT S+DGTVK+W+R    +   HTL   L  Q + V 
Subjt:  --------ISCMAYYHAQDLLYTGSHDKTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVY--RENSHTLTMTLKFQTSPVN

Query:  AVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLA-----
        A+A+ S   ++ +Y GSSDG +N+WE+ K S+     G L+GH+ AVLCL +   L+ SGS D  + +WR++     H+CL+VL GH GPV+CLA     
Subjt:  AVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGFLQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLA-----

Query:  -----ASLEVEKITTSFLVYSASLDQTFKVWRV
             A   V +    +++YS SLD++ KVWRV
Subjt:  -----ASLEVEKITTSFLVYSASLDQTFKVWRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGGAAGATCGAGTTACATATGGGTGGAAGGAGAAGGGATGCTTCACGCCTTGTATTTCAATAAAGACAGCCGAGGCACATGGAATCTCCTCTACAACAATAGATA
TGTCCAAACTGAAACATTTCGACTCGAAAAACTTGTCAGAAACCGACCATCTTTCCTTCCTGCTGTGGAGGGCGATTCTCTCGCTGTTCTCTCTGCCTTTTTCCTCAACT
TGCTAAGATTCGGCAAAGTCTACAAAGACATCAGCAACACCAACGTGTTGGAGCACTCGGGGAAGTTTTACTCACTCGCAGAAAATTTTATGCCCCAAGAGTTCGACATC
ACGACGCTGGAAACTTTGGGCGATTGGGATGTCAATGGTGCTTGGAACAGACCTTTTACAAGCCATCCAAAGAAAGCTCCGGAAACCGGTGAGTTGGTGGTCTTGGGTGC
CACTGCAACCAAACCCTTCATGGTATCGGAATCATTTCTGCGAATGGTTCATAAAGTCGACGTCAAACTCAGTAGAAGTAGCCTCAGCCATGAGATCGGAGTCACACAGA
GGTACAATGTGATATTGGATTACCCCGTAACTGTCGACTTGAACAGACTTATCAGAGGCGGACCATTAATAAAATACGATAAGGAAGGGTACGCCAGAATCGGAGTAATG
CCTCGTTACGGAGATGCCGATTCAATTCAATGGTTTCAGGTGAAACCCAATTGCACGATGCATCTTTTCAACTGCTTTGAGGACAAGGATGAGGTTATGGTGTGGGGATG
TAGAGCTTCTGATTCAGTCATACCTGGACCTGAAAAGGGACTCAACAAATTCGAGTGGTTCTCTAAGAGATTTAAGCCATTACCTGCAACTGATCATGAAGAACACAACA
CTGATTCCTCCATTGAAGATGGGTCGTTGTTTTCTCGTGCTTACGAGTGGAGGCTGAACCTCAAAACTGGAGAGGCCCGGGAGAGATCTCTCACCGGAACTCAATTTTCC
ATGGATTTTCCCTTCATAAATTCACGCTTCACTGGTCTTAAAAATAAATTTGGATACGCACAGGTTCTTGACTCCTTCGCTAGTTCTAACTCAGGCATGTTTAAATTTGG
GGGCCTGGCGAAGCTACATTTTGAAGAGCCTCAAAGTTTTGAATTTTCGTTGCCAAAAAAGTGTGAAGAACAGCGTATTATAAAAGTGGAATACCATATGTTTGAGAACA
ACTCCTTTTGCAGCGGAGCCTCCTTTGTGTCCAGAGAAGGCAGTCCCGAAGAAGATGATGGTTGGATAATCGCTCATGTTCACAATGAGATCACCAACACGTCTCAGCTA
GCCATGGCGACAGCCATGTACACCTCGCTTCAACTGTTTCTTAATTACAACACCCTCCTGCTCCTTCCTTCAATTCCCCATAACAAAAGAGCCTCCGTCACCTTTATGGC
CTCCCAAGCCAATAAGTCTTTACTCAGCTTCATGGACGATGAGAAACTGAGGACTCCACAAATTCAATCCCCCGCCCAAGTTTCCTCAGAAACAACCCCACAAGTTATGT
CTGATCAACAGGGCGTGGCGCCGTCGTCCAGCCCACATCACTCTAAAGTGCTCAAGTTAATCCCTCCGCCGAGTCCGGAATCGCCATGGACGCTGTCTCCTCTCCAGACC
CCTTCGCCTCCTCTGCTTTACCACTGCATAGCGTCGCTTCATCGCGGCGAAGGGAATATCTACTCCATTGCTATTTCCAAGGGAGTGGTGTTTACTGGCTCCGAAACCAG
CCGCATTCGTGCATGGAAGCAACCGGACTGCATGGAACGTGGGTATCTTAAGGCCGCCGCCGGCGGAGTCAAGGCCATTTCAGCTTCCGGTGTGTTCGCTTCCGAAAAGA
AGCTCATTTCTGTTGTTCTCTCGAACGGAAACCGCCCACCTCCACAAAGATTCATATCTTGCATGGCTTATTACCACGCACAGGACCTCCTCTACACCGGCTCCCACGAT
AAAACCATCAAAGCCTGGCGAGTTTCCGACCGCAAATGCGTCGACTCCTTCGTCGCCCATGAAGATAACGTCAACTCCATAGCGGTCAACCAAAACGACGGCTGCCTTTT
CACTTGCTCTTCAGACGGAACCGTCAAAATCTGGAGGAGGGTTTACCGGGAAAACTCACACACTCTCACCATGACGCTCAAATTCCAAACTTCTCCGGTAAACGCCGTCG
CTTTATGCTCCCACTACGACACCACCTTTCTGTACTCCGGTTCCTCCGATGGGACCATAAACTTTTGGGAGAAGGAGAAGCTGTCTTACAGATATAACCACAGGGGGTTC
TTACAAGGCCACCGATTTGCGGTTCTGTGTCTGGTAGTGGTGGAGAGGCTGATATTTAGTGGATCGGAGGACACGACGGTGAGGATATGGAGGAAGGAAGAGGGAAGCTA
TTACCATGAATGCTTGGCGGTGCTGGACGGCCACAGAGGGCCGGTGAGATGCTTGGCGGCGAGCCTGGAAGTGGAAAAGATTACGACGAGTTTTCTGGTTTACAGTGCTA
GTCTGGACCAAACGTTTAAAGTATGGAGAGTGAAGGTATTGCCTGAAGAGAAAAGGTGCTTGGATTATGGGGAGGGTGGGGAGTCCAAGATGAAGAAGCTGGGGGAGCTC
TATGAGATGAGCCCTGTGCTGTCTCCTTCCTGGGCTGCAACGTCACAGTTCACACCCGAGATTACACCTGAAACAGAACATGGTTCATCAATGTTGATGTTGTGCAAGCA
ACTGCAGAGAAGATTTGTTCTGTATGATGTTTCCCGCGACACAAGTGATGCAATAAATGCTGCTACTGAAGCAGCAACACCTGCCACAGATACAGCTGCATTGGAAAGAG
CCTTGGAGAGCTCTTTGGCCGAAAGGCTCCACGATCTGCCAAGAAACTCCATGGACTCGATTGGAGTTTCAGATTTTGCTGTCGGAAAAGAGATTTTATACCCAAAACAA
CATCAACGCCGACACCCATTAGCCCATACACCATTAGAGACGCACCCAATGCAATCTCAGCTTTCCCTTCTTCATAATACACAGAGAGAGCTGAGAAAGCATCATGCACG
TCGCCAACAAATACTGAAAAGAGAGCAAGGGAAGACGATAGTAAAACACAACCAAGACGACGAAGATGATAACAAGAAGAATGAGAGCGATGAAGGAGCCGTGAAGAAGC
GCAAGAACAATCAGATCCGAACTGGGAACCATCGGTTCAGTTCCTGCGATCTTTTGGAAGCCAGTTTGGTAATGGATTGCAAAGCAAGGCAAATCCTTTGTATATCCTTG
TTTGGCACAAAAGAGAGCGAGAGGGATGATTTCAAATTACAGTCTAAGCATCTATCCCTAACAAAGGGAAGGAAATGCAGCTGGGTCCGTCTGTACAATATCAAATGCCC
TTCAAAAGAAATGGAGATCAATATTGAATTGTACTACAACAAGACTTACTTCCAAAAGCTTTGCCATTTTGGCAAAGTAGAAGAATGCATTAGCTACATCAGTACCAAAA
AATATTTAAAGAGCACGCACGGATCTGACAGGGTAGAATTATCTCTTATTACTATCTTGAGAATTGAGGACACTATACAAAAAGTAAAAGTACCTGATCTCCCTCATTTT
CTCGAAGAGATCAACCAAATAGATGTTATTCTCATCTGCAAAGTAAACAATTCCATCAAGGCGGTGGGTTTCAATTACTCCTCAATACATCAGCTGTTTCATCGGATTGG
GAGGACATTTCCACAACAATCCATAGAAGAGGTGGTTGGACCAACTTTAGCGTGTGCGCCAAACGATCAAGATAATAGGATTGGAGTGGATGAGCAGATGTCGGAGTCAC
AATTATCAGTAGCTTTTGAGGCTCTAATTCTTGAGCGATTAGATGGTTATCCAGATTATCATAAGAAACATTATACATTGGCACTTCTGCATGGGTTGAGTTACTCTTCA
TGTCTTCACCCTCCAAGGGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGGAAGATCGAGTTACATATGGGTGGAAGGAGAAGGGATGCTTCACGCCTTGTATTTCAATAAAGACAGCCGAGGCACATGGAATCTCCTCTACAACAATAGATA
TGTCCAAACTGAAACATTTCGACTCGAAAAACTTGTCAGAAACCGACCATCTTTCCTTCCTGCTGTGGAGGGCGATTCTCTCGCTGTTCTCTCTGCCTTTTTCCTCAACT
TGCTAAGATTCGGCAAAGTCTACAAAGACATCAGCAACACCAACGTGTTGGAGCACTCGGGGAAGTTTTACTCACTCGCAGAAAATTTTATGCCCCAAGAGTTCGACATC
ACGACGCTGGAAACTTTGGGCGATTGGGATGTCAATGGTGCTTGGAACAGACCTTTTACAAGCCATCCAAAGAAAGCTCCGGAAACCGGTGAGTTGGTGGTCTTGGGTGC
CACTGCAACCAAACCCTTCATGGTATCGGAATCATTTCTGCGAATGGTTCATAAAGTCGACGTCAAACTCAGTAGAAGTAGCCTCAGCCATGAGATCGGAGTCACACAGA
GGTACAATGTGATATTGGATTACCCCGTAACTGTCGACTTGAACAGACTTATCAGAGGCGGACCATTAATAAAATACGATAAGGAAGGGTACGCCAGAATCGGAGTAATG
CCTCGTTACGGAGATGCCGATTCAATTCAATGGTTTCAGGTGAAACCCAATTGCACGATGCATCTTTTCAACTGCTTTGAGGACAAGGATGAGGTTATGGTGTGGGGATG
TAGAGCTTCTGATTCAGTCATACCTGGACCTGAAAAGGGACTCAACAAATTCGAGTGGTTCTCTAAGAGATTTAAGCCATTACCTGCAACTGATCATGAAGAACACAACA
CTGATTCCTCCATTGAAGATGGGTCGTTGTTTTCTCGTGCTTACGAGTGGAGGCTGAACCTCAAAACTGGAGAGGCCCGGGAGAGATCTCTCACCGGAACTCAATTTTCC
ATGGATTTTCCCTTCATAAATTCACGCTTCACTGGTCTTAAAAATAAATTTGGATACGCACAGGTTCTTGACTCCTTCGCTAGTTCTAACTCAGGCATGTTTAAATTTGG
GGGCCTGGCGAAGCTACATTTTGAAGAGCCTCAAAGTTTTGAATTTTCGTTGCCAAAAAAGTGTGAAGAACAGCGTATTATAAAAGTGGAATACCATATGTTTGAGAACA
ACTCCTTTTGCAGCGGAGCCTCCTTTGTGTCCAGAGAAGGCAGTCCCGAAGAAGATGATGGTTGGATAATCGCTCATGTTCACAATGAGATCACCAACACGTCTCAGCTA
GCCATGGCGACAGCCATGTACACCTCGCTTCAACTGTTTCTTAATTACAACACCCTCCTGCTCCTTCCTTCAATTCCCCATAACAAAAGAGCCTCCGTCACCTTTATGGC
CTCCCAAGCCAATAAGTCTTTACTCAGCTTCATGGACGATGAGAAACTGAGGACTCCACAAATTCAATCCCCCGCCCAAGTTTCCTCAGAAACAACCCCACAAGTTATGT
CTGATCAACAGGGCGTGGCGCCGTCGTCCAGCCCACATCACTCTAAAGTGCTCAAGTTAATCCCTCCGCCGAGTCCGGAATCGCCATGGACGCTGTCTCCTCTCCAGACC
CCTTCGCCTCCTCTGCTTTACCACTGCATAGCGTCGCTTCATCGCGGCGAAGGGAATATCTACTCCATTGCTATTTCCAAGGGAGTGGTGTTTACTGGCTCCGAAACCAG
CCGCATTCGTGCATGGAAGCAACCGGACTGCATGGAACGTGGGTATCTTAAGGCCGCCGCCGGCGGAGTCAAGGCCATTTCAGCTTCCGGTGTGTTCGCTTCCGAAAAGA
AGCTCATTTCTGTTGTTCTCTCGAACGGAAACCGCCCACCTCCACAAAGATTCATATCTTGCATGGCTTATTACCACGCACAGGACCTCCTCTACACCGGCTCCCACGAT
AAAACCATCAAAGCCTGGCGAGTTTCCGACCGCAAATGCGTCGACTCCTTCGTCGCCCATGAAGATAACGTCAACTCCATAGCGGTCAACCAAAACGACGGCTGCCTTTT
CACTTGCTCTTCAGACGGAACCGTCAAAATCTGGAGGAGGGTTTACCGGGAAAACTCACACACTCTCACCATGACGCTCAAATTCCAAACTTCTCCGGTAAACGCCGTCG
CTTTATGCTCCCACTACGACACCACCTTTCTGTACTCCGGTTCCTCCGATGGGACCATAAACTTTTGGGAGAAGGAGAAGCTGTCTTACAGATATAACCACAGGGGGTTC
TTACAAGGCCACCGATTTGCGGTTCTGTGTCTGGTAGTGGTGGAGAGGCTGATATTTAGTGGATCGGAGGACACGACGGTGAGGATATGGAGGAAGGAAGAGGGAAGCTA
TTACCATGAATGCTTGGCGGTGCTGGACGGCCACAGAGGGCCGGTGAGATGCTTGGCGGCGAGCCTGGAAGTGGAAAAGATTACGACGAGTTTTCTGGTTTACAGTGCTA
GTCTGGACCAAACGTTTAAAGTATGGAGAGTGAAGGTATTGCCTGAAGAGAAAAGGTGCTTGGATTATGGGGAGGGTGGGGAGTCCAAGATGAAGAAGCTGGGGGAGCTC
TATGAGATGAGCCCTGTGCTGTCTCCTTCCTGGGCTGCAACGTCACAGTTCACACCCGAGATTACACCTGAAACAGAACATGGTTCATCAATGTTGATGTTGTGCAAGCA
ACTGCAGAGAAGATTTGTTCTGTATGATGTTTCCCGCGACACAAGTGATGCAATAAATGCTGCTACTGAAGCAGCAACACCTGCCACAGATACAGCTGCATTGGAAAGAG
CCTTGGAGAGCTCTTTGGCCGAAAGGCTCCACGATCTGCCAAGAAACTCCATGGACTCGATTGGAGTTTCAGATTTTGCTGTCGGAAAAGAGATTTTATACCCAAAACAA
CATCAACGCCGACACCCATTAGCCCATACACCATTAGAGACGCACCCAATGCAATCTCAGCTTTCCCTTCTTCATAATACACAGAGAGAGCTGAGAAAGCATCATGCACG
TCGCCAACAAATACTGAAAAGAGAGCAAGGGAAGACGATAGTAAAACACAACCAAGACGACGAAGATGATAACAAGAAGAATGAGAGCGATGAAGGAGCCGTGAAGAAGC
GCAAGAACAATCAGATCCGAACTGGGAACCATCGGTTCAGTTCCTGCGATCTTTTGGAAGCCAGTTTGGTAATGGATTGCAAAGCAAGGCAAATCCTTTGTATATCCTTG
TTTGGCACAAAAGAGAGCGAGAGGGATGATTTCAAATTACAGTCTAAGCATCTATCCCTAACAAAGGGAAGGAAATGCAGCTGGGTCCGTCTGTACAATATCAAATGCCC
TTCAAAAGAAATGGAGATCAATATTGAATTGTACTACAACAAGACTTACTTCCAAAAGCTTTGCCATTTTGGCAAAGTAGAAGAATGCATTAGCTACATCAGTACCAAAA
AATATTTAAAGAGCACGCACGGATCTGACAGGGTAGAATTATCTCTTATTACTATCTTGAGAATTGAGGACACTATACAAAAAGTAAAAGTACCTGATCTCCCTCATTTT
CTCGAAGAGATCAACCAAATAGATGTTATTCTCATCTGCAAAGTAAACAATTCCATCAAGGCGGTGGGTTTCAATTACTCCTCAATACATCAGCTGTTTCATCGGATTGG
GAGGACATTTCCACAACAATCCATAGAAGAGGTGGTTGGACCAACTTTAGCGTGTGCGCCAAACGATCAAGATAATAGGATTGGAGTGGATGAGCAGATGTCGGAGTCAC
AATTATCAGTAGCTTTTGAGGCTCTAATTCTTGAGCGATTAGATGGTTATCCAGATTATCATAAGAAACATTATACATTGGCACTTCTGCATGGGTTGAGTTACTCTTCA
TGTCTTCACCCTCCAAGGGAATAG
Protein sequenceShow/hide protein sequence
MFGRSSYIWVEGEGMLHALYFNKDSRGTWNLLYNNRYVQTETFRLEKLVRNRPSFLPAVEGDSLAVLSAFFLNLLRFGKVYKDISNTNVLEHSGKFYSLAENFMPQEFDI
TTLETLGDWDVNGAWNRPFTSHPKKAPETGELVVLGATATKPFMVSESFLRMVHKVDVKLSRSSLSHEIGVTQRYNVILDYPVTVDLNRLIRGGPLIKYDKEGYARIGVM
PRYGDADSIQWFQVKPNCTMHLFNCFEDKDEVMVWGCRASDSVIPGPEKGLNKFEWFSKRFKPLPATDHEEHNTDSSIEDGSLFSRAYEWRLNLKTGEARERSLTGTQFS
MDFPFINSRFTGLKNKFGYAQVLDSFASSNSGMFKFGGLAKLHFEEPQSFEFSLPKKCEEQRIIKVEYHMFENNSFCSGASFVSREGSPEEDDGWIIAHVHNEITNTSQL
AMATAMYTSLQLFLNYNTLLLLPSIPHNKRASVTFMASQANKSLLSFMDDEKLRTPQIQSPAQVSSETTPQVMSDQQGVAPSSSPHHSKVLKLIPPPSPESPWTLSPLQT
PSPPLLYHCIASLHRGEGNIYSIAISKGVVFTGSETSRIRAWKQPDCMERGYLKAAAGGVKAISASGVFASEKKLISVVLSNGNRPPPQRFISCMAYYHAQDLLYTGSHD
KTIKAWRVSDRKCVDSFVAHEDNVNSIAVNQNDGCLFTCSSDGTVKIWRRVYRENSHTLTMTLKFQTSPVNAVALCSHYDTTFLYSGSSDGTINFWEKEKLSYRYNHRGF
LQGHRFAVLCLVVVERLIFSGSEDTTVRIWRKEEGSYYHECLAVLDGHRGPVRCLAASLEVEKITTSFLVYSASLDQTFKVWRVKVLPEEKRCLDYGEGGESKMKKLGEL
YEMSPVLSPSWAATSQFTPEITPETEHGSSMLMLCKQLQRRFVLYDVSRDTSDAINAATEAATPATDTAALERALESSLAERLHDLPRNSMDSIGVSDFAVGKEILYPKQ
HQRRHPLAHTPLETHPMQSQLSLLHNTQRELRKHHARRQQILKREQGKTIVKHNQDDEDDNKKNESDEGAVKKRKNNQIRTGNHRFSSCDLLEASLVMDCKARQILCISL
FGTKESERDDFKLQSKHLSLTKGRKCSWVRLYNIKCPSKEMEINIELYYNKTYFQKLCHFGKVEECISYISTKKYLKSTHGSDRVELSLITILRIEDTIQKVKVPDLPHF
LEEINQIDVILICKVNNSIKAVGFNYSSIHQLFHRIGRTFPQQSIEEVVGPTLACAPNDQDNRIGVDEQMSESQLSVAFEALILERLDGYPDYHKKHYTLALLHGLSYSS
CLHPPRE