; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014279 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014279
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCytochrome P450, putative
Genome locationtig00000233:61029..61903
RNA-Seq ExpressionSgr014279
SyntenySgr014279
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4265806.1 unnamed protein product [Prunus armeniaca]3.7e-1830.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
         S WR                  +     SEL+M  +    ++ ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

CAB4296396.1 unnamed protein product [Prunus armeniaca]3.7e-1830.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
         S WR                  +     SEL+M  +    ++ ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

XP_007227272.2 cytochrome P450 82C4 [Prunus persica]2.4e-1730.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
           WR +      EL  N R E    +               + ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

XP_016646999.1 PREDICTED: cytochrome P450 82C4-like [Prunus mume]9.2e-1730.48Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
           WR +      EL  N R E    +               + ++ ++R  +  LN                        A C++G EARRC K I +F
Subjt:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

XP_021820904.1 cytochrome P450 82C4-like isoform X1 [Prunus avium]1.4e-1730.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
           WR +      EL  N R E    +               + ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

TrEMBL top hitse value%identityAlignment
A0A251RCK3 Uncharacterized protein1.2e-1730.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
           WR +      EL  N R E    +               + ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

A0A6J5TNZ2 Uncharacterized protein1.8e-1830.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
         S WR                  +     SEL+M  +    ++ ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

A0A6J5W4K7 Uncharacterized protein1.8e-1830.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
         S WR                  +     SEL+M  +    ++ ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRS-----------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

A0A6P5SX50 cytochrome P450 82C4-like isoform X16.9e-1830.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
           WR +      EL  N R E    +               + ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

M5XSG2 Uncharacterized protein (Fragment)1.2e-1730.82Show/hide
Query:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY
        +VG KSR++ EPA AWP+IGHLH LGGGD+ LYRTLG MAD+Y                                          KH+          PY
Subjt:  LVGKKSRDSSEPACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPY

Query:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF
           WR +      EL  N R E    +               + ++ ++R  +  LN                        A C+DG EARRC K I +F
Subjt:  HSGWRSIWGGGASEL--NMRWECESCI---------------FYEHRTTRLCLWNLN-----------------------DAACEDG-EARRCHKPISRF

Query:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
         HLI          IF+V+        LDLQGHEK MK TA +  L+ I+    G     RQ+RV+   +G  + D+DFID+M  +     L
Subjt:  VHLIVFLCGLRCTSIFMVTAC------LDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

SwissProt top hitse value%identityAlignment
I3V6B7 (13S,14R)-13-O-acetyl-1-hydroxy-N-methylcanadine 8-hydroxylase CYP82X14.2e-0455Show/hide
Query:  KKSRDSSEPACAWPVIGHLHLGGGDR-PLYRTLGVMADQY
        KK + +   A AWP+IGHL L   D+ PLYR LG MAD+Y
Subjt:  KKSRDSSEPACAWPVIGHLHLGGGDR-PLYRTLGVMADQY

O49394 Xanthotoxin 5-hydroxylase CYP82C24.9e-0568.75Show/hide
Query:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY
        P+ AWP+IGHLH L G ++ LYRTLG MADQY
Subjt:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY

O49396 Cytochrome P450 82C33.8e-0568.75Show/hide
Query:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY
        P+ AWP+IGHLH LGG ++ LYRTLG MAD Y
Subjt:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY

Q9SZ46 Xanthotoxin 5-hydroxylase CYP82C42.5e-0925.99Show/hide
Query:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPYHSGWRS-----
        P+ AWP+IGHLH LGG ++ LYRTLG MAD Y                                          KH+          PY + WR      
Subjt:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPYHSGWRS-----

Query:  ------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNL-------------------------NDAACED-GEARRCHKPISRFVHLIVFLCG
                    +     SE+ M  +    +++++  T+  + +L                            + ED  EA +C K I++F HLI     
Subjt:  ------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNL-------------------------NDAACED-GEARRCHKPISRFVHLIVFLCG

Query:  LRCTSIFMVTACLDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
           +  F   +  DLQGHEK MK T  EL    +IL R       RQQR      G  + D DFID+M  ++ +  L
Subjt:  LRCTSIFMVTACLDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

Arabidopsis top hitse value%identityAlignment
AT4G31940.1 cytochrome P450, family 82, subfamily C, polypeptide 41.8e-1025.99Show/hide
Query:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPYHSGWRS-----
        P+ AWP+IGHLH LGG ++ LYRTLG MAD Y                                          KH+          PY + WR      
Subjt:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY-----------------------------------------CKHL--------HLPYHSGWRS-----

Query:  ------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNL-------------------------NDAACED-GEARRCHKPISRFVHLIVFLCG
                    +     SE+ M  +    +++++  T+  + +L                            + ED  EA +C K I++F HLI     
Subjt:  ------------IWGGGASELNMRWECESCIFYEHRTTRLCLWNL-------------------------NDAACED-GEARRCHKPISRFVHLIVFLCG

Query:  LRCTSIFMVTACLDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL
           +  F   +  DLQGHEK MK T  EL    +IL R       RQQR      G  + D DFID+M  ++ +  L
Subjt:  LRCTSIFMVTACLDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASL

AT4G31950.1 cytochrome P450, family 82, subfamily C, polypeptide 32.7e-0668.75Show/hide
Query:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY
        P+ AWP+IGHLH LGG ++ LYRTLG MAD Y
Subjt:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY

AT4G31970.1 cytochrome P450, family 82, subfamily C, polypeptide 23.5e-0668.75Show/hide
Query:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY
        P+ AWP+IGHLH L G ++ LYRTLG MADQY
Subjt:  PACAWPVIGHLH-LGGGDRPLYRTLGVMADQY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGCTGTTTTTCCTTTACTTCTCGTCTGTTACACTTCGTCTGGATGGACAAGCGGTGTTCTGGTGGGGAAGAAGAGTAGGGACTCTTCTGAACCTGCTTGTGCATG
GCCAGTTATTGGCCATCTTCACCTTGGTGGCGGCGATCGGCCTCTGTACCGCACTCTTGGAGTAATGGCTGACCAGTACTGCAAGCATTTACACCTGCCCTACCACAGTG
GCTGGCGAAGCATATGGGGTGGGGGGGCTTCTGAACTTAACATGAGATGGGAATGCGAGAGCTGTATCTTCTACGAACACAGAACTACTCGCCTGTGCTTATGGAACTTA
AACGATGCTGCTTGTGAAGATGGAGAGGCAAGACGATGCCATAAACCCATCAGTCGATTTGTCCATTTGATAGTCTTTTTATGTGGTCTCCGATGCACTTCCATTTTTAT
GGTAACTGCCTGTCTGGATTTGCAGGGGCACGAGAAAGTAATGAAGACAACAGCCATAGAGCTACATTTGAATGTGATAATACTTGCAAGGGCTGGAATGAGGAGAGGGC
CTCGCCAACAGAGAGTTGTTGTTCTTCATGAGGGTAATGCTGATGGTGATCAAGACTTCATTGACATGATGTTTTTCATTTCAAGAAGGGCATCGCTCAAATTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTGCTGTTTTTCCTTTACTTCTCGTCTGTTACACTTCGTCTGGATGGACAAGCGGTGTTCTGGTGGGGAAGAAGAGTAGGGACTCTTCTGAACCTGCTTGTGCATG
GCCAGTTATTGGCCATCTTCACCTTGGTGGCGGCGATCGGCCTCTGTACCGCACTCTTGGAGTAATGGCTGACCAGTACTGCAAGCATTTACACCTGCCCTACCACAGTG
GCTGGCGAAGCATATGGGGTGGGGGGGCTTCTGAACTTAACATGAGATGGGAATGCGAGAGCTGTATCTTCTACGAACACAGAACTACTCGCCTGTGCTTATGGAACTTA
AACGATGCTGCTTGTGAAGATGGAGAGGCAAGACGATGCCATAAACCCATCAGTCGATTTGTCCATTTGATAGTCTTTTTATGTGGTCTCCGATGCACTTCCATTTTTAT
GGTAACTGCCTGTCTGGATTTGCAGGGGCACGAGAAAGTAATGAAGACAACAGCCATAGAGCTACATTTGAATGTGATAATACTTGCAAGGGCTGGAATGAGGAGAGGGC
CTCGCCAACAGAGAGTTGTTGTTCTTCATGAGGGTAATGCTGATGGTGATCAAGACTTCATTGACATGATGTTTTTCATTTCAAGAAGGGCATCGCTCAAATTTTAA
Protein sequenceShow/hide protein sequence
MTAVFPLLLVCYTSSGWTSGVLVGKKSRDSSEPACAWPVIGHLHLGGGDRPLYRTLGVMADQYCKHLHLPYHSGWRSIWGGGASELNMRWECESCIFYEHRTTRLCLWNL
NDAACEDGEARRCHKPISRFVHLIVFLCGLRCTSIFMVTACLDLQGHEKVMKTTAIELHLNVIILARAGMRRGPRQQRVVVLHEGNADGDQDFIDMMFFISRRASLKF