; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr023048 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr023048
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentafunctional AROM polypeptide
Genome locationtig00000729:2303455..2304267
RNA-Seq ExpressionSgr023048
SyntenySgr023048
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GER30062.1 pentafunctional AROM polypeptide [Striga asiatica]2.8e-1433.51Show/hide
Query:  HQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVA
        HQL QL R LRHQ    L+ +   +QP+ ++ LL  HP A  HHLLH  R  RR AV ++  + + +     H V  H PHH   L+   P+      + 
Subjt:  HQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVA

Query:  EEDQVRH---------------------------KEIGLQFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH
        EEDQ+RH                           KE+GL+     GDPAV  G AV ++   GS+    E   V  + G  E +AP H
Subjt:  EEDQVRH---------------------------KEIGLQFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH

GER49615.1 hypothetical protein STAS_26876 [Striga asiatica]1.3e-1635.65Show/hide
Query:  SYLYEISEQTTRHQPQR---RRPEKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-F
        +++ +I ++  RH   R   R+P     HQL Q LR    QN   L      +Q +L +LLLHTH GA DHH LH     RR AV ++ A+ + +  +  
Subjt:  SYLYEISEQTTRHQPQR---RRPEKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-F

Query:  PHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVRHKEI---------------GLQFDRV------------LGDPAVDCGAVFEVCNGGSKPTFDEAV
            G H  HH  GLK   PL P R  VAEE++VR  ++               G+  D V             G PAVD GAV EV  GG +    E  
Subjt:  PHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVRHKEI---------------GLQFDRV------------LGDPAVDCGAVFEVCNGGSKPTFDEAV

Query:  AVAPVLGLCETEAPGH
         VA V G CE EA  H
Subjt:  AVAPVLGLCETEAPGH

KAE7998056.1 hypothetical protein FH972_002634 [Carpinus fangiana]9.6e-1533.68Show/hide
Query:  EKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-FPHRVGLHDPHHTLGLKPFRPLRP
        E+    +L  LLR  R QN  GL+     ++P++ +LLLH HP A  HH  H     RRHAV +++A+ +        H +GL   H   GL+   PL P
Subjt:  EKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-FPHRVGLHDPHHTLGLKPFRPLRP

Query:  SRERVAEEDQVRH---------------------------KEIGLQFDRVLGDPAVDCGAVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH
         R  V+ EDQVRH                            E+G      LGDPAV   AV EV  GG +    E + V P+    E EA  H
Subjt:  SRERVAEEDQVRH---------------------------KEIGLQFDRVLGDPAVDCGAVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH

PON61538.1 hypothetical protein PanWU01x14_144540, partial [Parasponia andersonii]3.7e-1440.71Show/hide
Query:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL
        +T HH LH   +  R  VP+K A+ + +Q + P RVGLHDP H L L+    L   R R  EED+VR                            +E+GL
Subjt:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL

Query:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL
           DRVLGDPAV  G  V EVC GGS+P F E  +V PVL
Subjt:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL

POO03971.1 hypothetical protein TorRG33x02_002750, partial [Trema orientale]9.6e-1541.43Show/hide
Query:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL
        +T HH LH   +  R  VP+K A+ + + T+ P RVGLHDP H L L+    L   R R  EED+VR                            +E+GL
Subjt:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL

Query:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL
           DRVLGDPAV  G  V EVC GGS+P F E  +VAPVL
Subjt:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL

TrEMBL top hitse value%identityAlignment
A0A2P5CKH3 Uncharacterized protein (Fragment)1.8e-1440.71Show/hide
Query:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL
        +T HH LH   +  R  VP+K A+ + +Q + P RVGLHDP H L L+    L   R R  EED+VR                            +E+GL
Subjt:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL

Query:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL
           DRVLGDPAV  G  V EVC GGS+P F E  +V PVL
Subjt:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL

A0A2P5G1S8 Uncharacterized protein (Fragment)4.7e-1541.43Show/hide
Query:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL
        +T HH LH   +  R  VP+K A+ + + T+ P RVGLHDP H L L+    L   R R  EED+VR                            +E+GL
Subjt:  ATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVR---------------------------HKEIGL

Query:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL
           DRVLGDPAV  G  V EVC GGS+P F E  +VAPVL
Subjt:  -QFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVL

A0A5A7PBK3 Pentafunctional AROM polypeptide1.4e-1433.51Show/hide
Query:  HQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVA
        HQL QL R LRHQ    L+ +   +QP+ ++ LL  HP A  HHLLH  R  RR AV ++  + + +     H V  H PHH   L+   P+      + 
Subjt:  HQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHRVGLHDPHHTLGLKPFRPLRPSRERVA

Query:  EEDQVRH---------------------------KEIGLQFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH
        EEDQ+RH                           KE+GL+     GDPAV  G AV ++   GS+    E   V  + G  E +AP H
Subjt:  EEDQVRH---------------------------KEIGLQFDRVLGDPAVDCG-AVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH

A0A5A7QWC4 Uncharacterized protein6.5e-1735.65Show/hide
Query:  SYLYEISEQTTRHQPQR---RRPEKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-F
        +++ +I ++  RH   R   R+P     HQL Q LR    QN   L      +Q +L +LLLHTH GA DHH LH     RR AV ++ A+ + +  +  
Subjt:  SYLYEISEQTTRHQPQR---RRPEKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-F

Query:  PHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVRHKEI---------------GLQFDRV------------LGDPAVDCGAVFEVCNGGSKPTFDEAV
            G H  HH  GLK   PL P R  VAEE++VR  ++               G+  D V             G PAVD GAV EV  GG +    E  
Subjt:  PHRVGLHDPHHTLGLKPFRPLRPSRERVAEEDQVRHKEI---------------GLQFDRV------------LGDPAVDCGAVFEVCNGGSKPTFDEAV

Query:  AVAPVLGLCETEAPGH
         VA V G CE EA  H
Subjt:  AVAPVLGLCETEAPGH

A0A5N6QFH0 Uncharacterized protein4.7e-1533.68Show/hide
Query:  EKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-FPHRVGLHDPHHTLGLKPFRPLRP
        E+    +L  LLR  R QN  GL+     ++P++ +LLLH HP A  HH  H     RRHAV +++A+ +        H +GL   H   GL+   PL P
Subjt:  EKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTE-FPHRVGLHDPHHTLGLKPFRPLRP

Query:  SRERVAEEDQVRH---------------------------KEIGLQFDRVLGDPAVDCGAVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH
         R  V+ EDQVRH                            E+G      LGDPAV   AV EV  GG +    E + V P+    E EA  H
Subjt:  SRERVAEEDQVRH---------------------------KEIGLQFDRVLGDPAVDCGAVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGGAAGAATCTCAAATGGAGCTCTTGAGTCTTACCTCTATGAAATCAGTGAGCAAACAACAAGGCATCAGCCGCAGCGGCGGCGGCCGGAAAAAGGAGTAACCCA
CCAACTCCTTCAGCTTCTCCGGCTTCTGCGGCACCAAAATCGCCGTGGCCTCCTCCAAATCAGAGACTCTCAACAACCCATCCTTGAAATCCTTCTCCTCCACACACACC
CCGGCGCAACAGATCACCACCTTCTCCACCATTGCAGGGTACATCGCCGCCACGCTGTACCCAACAAATCCGCCATAACTGAGTCCAACCAAACTGAGTTTCCTCACCGA
GTTGGCCTCCATGACCCGCATCACACACTCGGCTTGAAACCATTCCGTCCTCTCCGGCCGAGTCGTGAAAGAGTCGCCGAAGAAGACCAAGTCCGGCACAAGGAGATTGG
GCTTCAATTCGATCGGGTTCTTGGGGACCCAGCAGTGGACTGTGGTGCCGTCTTTGAGGTCTGTAATGGTGGATCGAAGCCCACATTTGATGAAGCTGTAGCGGTGGCAC
CAGTTCTTGGTCTCTGTGAAACTGAAGCACCTGGTCATTTGAGGAACATGAAAGAATCAGAATCAATGGAGAGTCTGTTTTCTGGACCCAGAAAGAGGCCGAGCCAGCGA
GTCAATGGCCGTGGAGTTTTTGAGGGAGAAGATGAAGATGAATCAAGGCTTCAGACAGAACGAAGACTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGGAAGAATCTCAAATGGAGCTCTTGAGTCTTACCTCTATGAAATCAGTGAGCAAACAACAAGGCATCAGCCGCAGCGGCGGCGGCCGGAAAAAGGAGTAACCCA
CCAACTCCTTCAGCTTCTCCGGCTTCTGCGGCACCAAAATCGCCGTGGCCTCCTCCAAATCAGAGACTCTCAACAACCCATCCTTGAAATCCTTCTCCTCCACACACACC
CCGGCGCAACAGATCACCACCTTCTCCACCATTGCAGGGTACATCGCCGCCACGCTGTACCCAACAAATCCGCCATAACTGAGTCCAACCAAACTGAGTTTCCTCACCGA
GTTGGCCTCCATGACCCGCATCACACACTCGGCTTGAAACCATTCCGTCCTCTCCGGCCGAGTCGTGAAAGAGTCGCCGAAGAAGACCAAGTCCGGCACAAGGAGATTGG
GCTTCAATTCGATCGGGTTCTTGGGGACCCAGCAGTGGACTGTGGTGCCGTCTTTGAGGTCTGTAATGGTGGATCGAAGCCCACATTTGATGAAGCTGTAGCGGTGGCAC
CAGTTCTTGGTCTCTGTGAAACTGAAGCACCTGGTCATTTGAGGAACATGAAAGAATCAGAATCAATGGAGAGTCTGTTTTCTGGACCCAGAAAGAGGCCGAGCCAGCGA
GTCAATGGCCGTGGAGTTTTTGAGGGAGAAGATGAAGATGAATCAAGGCTTCAGACAGAACGAAGACTGTGA
Protein sequenceShow/hide protein sequence
MLGRISNGALESYLYEISEQTTRHQPQRRRPEKGVTHQLLQLLRLLRHQNRRGLLQIRDSQQPILEILLLHTHPGATDHHLLHHCRVHRRHAVPNKSAITESNQTEFPHR
VGLHDPHHTLGLKPFRPLRPSRERVAEEDQVRHKEIGLQFDRVLGDPAVDCGAVFEVCNGGSKPTFDEAVAVAPVLGLCETEAPGHLRNMKESESMESLFSGPRKRPSQR
VNGRGVFEGEDEDESRLQTERRL