CuGenDBv2

Sgr022502 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022502
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Peroxidase CA
Genome location: tig00154740:209988..212806
RNA-Seq Expression: Sgr022502
Synteny: Sgr022502
Gene Ontology terms:
GO:0015886 - heme transport (biological process)
GO:0017004 - cytochrome complex assembly (biological process)
GO:0098869 - cellular oxidant detoxification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004601 - peroxidase activity (molecular function)
GO:0015232 - heme transporter activity (molecular function)
InterPro domains: NA
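
The genome location field above follows a contig:start..end pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

contig, start, end = parse_location("tig00154740:209988..212806")
print(contig, start, end, span_length(start, end))
```

Under that coordinate assumption, the span is longer than the 1,014-nucleotide CDS listed in this record.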


Homology
GenBank top hits (E-value, % identity, alignment)
CAA3033717.1 hypothetical protein TSUD_199210 [Olea europaea subsp. europaea] (E-value: 2.1e-28; identity: 62.99%)
Query:  VALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTSDQPAKGCEVGP---------------------EWGGWFRFDKGRIPGRRSSTVGSRRFRFG
        +ALTDTIY+KVRSSFFPAAIR               LTSDQPAK CE  P                     E   WFRFDKGR+PGRRSSTVGSRR RFG
Subjt:  VALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTSDQPAKGCEVGP---------------------EWGGWFRFDKGRIPGRRSSTVGSRRFRFG

Query:  SRARTRNRRCTILAATYVDLDLTDSSN
        SRARTRNRRCTILAATYV LDLTDSSN
Subjt:  SRARTRNRRCTILAATYVDLDLTDSSN

GAU19240.1 hypothetical protein TSUD_199210 [Trifolium subterraneum] (E-value: 2.7e-23; identity: 60.15%)
Query:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTS----DQPAK--------GC----EVGPEW-GGWFRFDKGRIPGRRSSTVGSRRF
        RSTGDT+VALTDTIY+KVRSSFF   +  R     K  E EELL+     DQ  +         C     V  ++ GGWFRFDKG +PGRRSSTVGSRR 
Subjt:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTS----DQPAK--------GC----EVGPEW-GGWFRFDKGRIPGRRSSTVGSRRF

Query:  RFGSRARTRNR---RCTILAATYVDLDLTDSSN
        RFGSRARTRNR   RCTILAA YV+LDL DSSN
Subjt:  RFGSRARTRNR---RCTILAATYVDLDLTDSSN

GAU51027.1 hypothetical protein TSUD_411680 [Trifolium subterraneum] (E-value: 1.4e-19; identity: 58.62%)
Query:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTSDQPAKGCEVGPEWGGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNR---RCT
        RSTGDT+VALTDTIY+KVR  +           ++ G  G     S    + C      GGWFRFDKG +PGRRSSTVGSRR RFGSRARTRNR   RCT
Subjt:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTSDQPAKGCEVGPEWGGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNR---RCT

Query:  ILAATYVDLDLTDSSN
        ILA  YV+LDL DSSN
Subjt:  ILAATYVDLDLTDSSN

GEX73349.1 hypothetical protein [Tanacetum cinerariifolium] (E-value: 4.0e-19; identity: 50.35%)
Query:  VALTDTIYDKVRSSFFPAAIRFRLKALKKGREG-----------------------EELLTSDQPAKGCEVGPEWGGWFRF-------------------
        +ALTDTIY+KV SS FPAAIRF LK  +KGR                         +  LTS QPAKGCE GPE      +                   
Subjt:  VALTDTIYDKVRSSFFPAAIRFRLKALKKGREG-----------------------EELLTSDQPAKGCEVGPEWGGWFRF-------------------

Query:  ---DKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATY
           DK R+PGRRSSTVGSRR RFGSRARTRNRRCTILAATY
Subjt:  ---DKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATY

QCD84921.1 hypothetical protein DEO72_LG2g5279 [Vigna unguiculata] (E-value: 1.9e-16; identity: 88.89%)
Query:  GGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN
        GGWFRFDKGR+PGRRSSTVGSRR RFGSRARTR RRCTILAA YV+LDL DSSN
Subjt:  GGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN

TrEMBL top hits (E-value, % identity, alignment)
A0A2Z6M2W7 Uncharacterized protein (E-value: 1.3e-23; identity: 60.15%)
Query:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTS----DQPAK--------GC----EVGPEW-GGWFRFDKGRIPGRRSSTVGSRRF
        RSTGDT+VALTDTIY+KVRSSFF   +  R     K  E EELL+     DQ  +         C     V  ++ GGWFRFDKG +PGRRSSTVGSRR 
Subjt:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTS----DQPAK--------GC----EVGPEW-GGWFRFDKGRIPGRRSSTVGSRRF

Query:  RFGSRARTRNR---RCTILAATYVDLDLTDSSN
        RFGSRARTRNR   RCTILAA YV+LDL DSSN
Subjt:  RFGSRARTRNR---RCTILAATYVDLDLTDSSN

A0A2Z6PV29 Uncharacterized protein (E-value: 6.7e-20; identity: 58.62%)
Query:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTSDQPAKGCEVGPEWGGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNR---RCT
        RSTGDT+VALTDTIY+KVR  +           ++ G  G     S    + C      GGWFRFDKG +PGRRSSTVGSRR RFGSRARTRNR   RCT
Subjt:  RSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTSDQPAKGCEVGPEWGGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNR---RCT

Query:  ILAATYVDLDLTDSSN
        ILA  YV+LDL DSSN
Subjt:  ILAATYVDLDLTDSSN

A0A4D6LXD6 Uncharacterized protein (E-value: 9.0e-17; identity: 88.89%)
Query:  GGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN
        GGWFRFDKGR+PGRRSSTVGSRR RFGSRARTR RRCTILAA YV+LDL DSSN
Subjt:  GGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN

M1ABT5 Uncharacterized protein (E-value: 8.4e-15; identity: 88.46%)
Query:  WFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN
        WFRFDKGR+PGRRSSTVGSRR RFGSRART NRRCTILAATY  LDLTDS N
Subjt:  WFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN

R4IUG0 Uncharacterized protein (E-value: 1.5e-16; identity: 88.89%)
Query:  GGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN
        GGWFRFDKG +PGRRSSTVGSRR RFGSRARTRNRRCTILAA YV+LDL DSSN
Subjt:  GGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDLDLTDSSN

SwissProt top hits (E-value, % identity, alignment)
No hits found

Arabidopsis top hits (E-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAAAAGAGAGTCTTTGTCCTGCTATTCCCGGCAATCGCTCTTTCGAGGGTTAAAAGAGAGGATCGGTGCCCGCTCTTTCCCAATCCGAACCAGCGGAATAGGCCTCT
TCCGGGTCGGAAGGCTTCCTTCAGTGAGTGCGCCTGTTCAACTTCAACTCTTGAAGATGCCGAATCGGATTATTCTTCTTCCGATTCGGCATCTTCAGAAGCAGCTGCCA
CTAGAGAAACAGAAGTCTCAGCTAAAGCCGAATCCGAAGACGCATCGACAGACGCATACGCAAACACCGGGGAATCCGCTTCATATGAAGGAACAGCACCGGTGCCGGTC
ATGCTTCAAAGTGGAACAAAACAGATTGGTTCGAGTTTCTATGAGTCGACAGGAGAAGCACCGCTCAAAGGCGGCTCATTGAGCCGAAACTACGTCCCCATCGGCTTTAG
GAAGGGCCTTAGCCTTAGCTTCAAGATCTCAGTTTGCACCAACCAGGATTTCCGTTCTCACCAGGAGCCGATTGCTCTTAAGGAGAAACCGATGGAACAAGATAGTCGTT
CATTTTTCTCTCCCGATCAGGAAAGCCTTTCTCGATATGAGAAAGATGCGCTCAAGCACCCAACCTATACAAGGGGCTTGGGCTCGAAAGCCGGCCTTACTCAACAAGCA
CCTTTATTAGTAAGGTCAACGGGGGATACTAAAGTGGCTCTCACTGACACCATCTACGATAAGGTGAGGTCGTCTTTCTTTCCAGCCGCCATACGGTTCCGCCTTAAAGC
CTTAAAGAAGGGAAGGGAAGGGGAGGAGCTACTGACTTCTGACCAACCCGCGAAGGGTTGTGAAGTTGGACCAGAGTGGGGTGGTTGGTTCCGATTCGACAAGGGTCGAA
TACCTGGTCGAAGGTCCAGTACAGTAGGTAGCAGGCGTTTTCGTTTTGGCTCCCGAGCGAGAACGAGAAATAGGCGATGCACCATCCTTGCTGCTACGTACGTTGACCTT
GACTTGACAGATTCATCCAATTGA
mRNA sequence
ATGAAAAAGAGAGTCTTTGTCCTGCTATTCCCGGCAATCGCTCTTTCGAGGGTTAAAAGAGAGGATCGGTGCCCGCTCTTTCCCAATCCGAACCAGCGGAATAGGCCTCT
TCCGGGTCGGAAGGCTTCCTTCAGTGAGTGCGCCTGTTCAACTTCAACTCTTGAAGATGCCGAATCGGATTATTCTTCTTCCGATTCGGCATCTTCAGAAGCAGCTGCCA
CTAGAGAAACAGAAGTCTCAGCTAAAGCCGAATCCGAAGACGCATCGACAGACGCATACGCAAACACCGGGGAATCCGCTTCATATGAAGGAACAGCACCGGTGCCGGTC
ATGCTTCAAAGTGGAACAAAACAGATTGGTTCGAGTTTCTATGAGTCGACAGGAGAAGCACCGCTCAAAGGCGGCTCATTGAGCCGAAACTACGTCCCCATCGGCTTTAG
GAAGGGCCTTAGCCTTAGCTTCAAGATCTCAGTTTGCACCAACCAGGATTTCCGTTCTCACCAGGAGCCGATTGCTCTTAAGGAGAAACCGATGGAACAAGATAGTCGTT
CATTTTTCTCTCCCGATCAGGAAAGCCTTTCTCGATATGAGAAAGATGCGCTCAAGCACCCAACCTATACAAGGGGCTTGGGCTCGAAAGCCGGCCTTACTCAACAAGCA
CCTTTATTAGTAAGGTCAACGGGGGATACTAAAGTGGCTCTCACTGACACCATCTACGATAAGGTGAGGTCGTCTTTCTTTCCAGCCGCCATACGGTTCCGCCTTAAAGC
CTTAAAGAAGGGAAGGGAAGGGGAGGAGCTACTGACTTCTGACCAACCCGCGAAGGGTTGTGAAGTTGGACCAGAGTGGGGTGGTTGGTTCCGATTCGACAAGGGTCGAA
TACCTGGTCGAAGGTCCAGTACAGTAGGTAGCAGGCGTTTTCGTTTTGGCTCCCGAGCGAGAACGAGAAATAGGCGATGCACCATCCTTGCTGCTACGTACGTTGACCTT
GACTTGACAGATTCATCCAATTGA
Protein sequence
MKKRVFVLLFPAIALSRVKREDRCPLFPNPNQRNRPLPGRKASFSECACSTSTLEDAESDYSSSDSASSEAAATRETEVSAKAESEDASTDAYANTGESASYEGTAPVPV
MLQSGTKQIGSSFYESTGEAPLKGGSLSRNYVPIGFRKGLSLSFKISVCTNQDFRSHQEPIALKEKPMEQDSRSFFSPDQESLSRYEKDALKHPTYTRGLGSKAGLTQQA
PLLVRSTGDTKVALTDTIYDKVRSSFFPAAIRFRLKALKKGREGEELLTSDQPAKGCEVGPEWGGWFRFDKGRIPGRRSSTVGSRRFRFGSRARTRNRRCTILAATYVDL
DLTDSSN
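
The CDS and protein sequences in this record can be cross-checked by translating the CDS. The snippet below is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1). The record does not say which table was used, and the names `CODON_TABLE` and `translate` are introduced here for illustration only.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1, an assumption here),
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First five codons and the final line of the CDS listed in this record.
print(translate("ATGAAAAAGAGAGTC"))
print(translate("GACTTGACAGATTCATCCAATTGA"))
```

Passing the full CDS (line breaks included) to `translate` and comparing the result with the protein sequence, after joining its lines, gives a quick consistency check for the record.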