CuGenDBv2

Sgr024735 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024735
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00002486:2346903..2347581
RNA-Seq Expression: Sgr024735
Synteny: Sgr024735
Gene Ontology terms:
  GO:0016125 - sterol metabolic process (biological process)
  GO:0019287 - isopentenyl diphosphate biosynthetic process, mevalonate pathway (biological process)
  GO:0019288 - isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway (biological process)
  GO:0048364 - root development (biological process)
  GO:0050790 - regulation of catalytic activity (biological process)
  GO:0005739 - mitochondrion (cellular component)
  GO:0003729 - mRNA binding (molecular function)
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
  GO:0034046 - poly(G) binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_008438671.1 PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis melo] (e value: 3.3e-35; % identity: 82.29)
Query:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        + VS+RS+LLGRAAHAQILKTL+TP P+FLYNHLVNMYAKLDH N A+L+LELAP RSVVTWTALIAGSVQNG F SALLHFS+MLS+ VRPNDFT
Subjt:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

XP_022137756.1 pentatricopeptide repeat-containing protein At4g14850 [Momordica charantia] (e value: 4.6e-37; % identity: 81.31)
Query:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER
        SLA L      + VS RS+LLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHP+ AELVL LAP RSVVTWTALIAGSVQNGHF+SALL+FS+MLS+ 
Subjt:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER

Query:  VRPNDFT
        VRPNDFT
Subjt:  VRPNDFT

XP_022956070.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata] (e value: 5.6e-35; % identity: 77.57)
Query:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER
        SLA L  F     +SIRS+LLGR AHAQILKTL+TP P+FLYNHLVNMYAKLD  N AEL+LELAP RSVVTWT+LIAGSVQNG FASALLHFS+MLS+ 
Subjt:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER

Query:  VRPNDFT
        VRPNDFT
Subjt:  VRPNDFT

XP_031738596.1 pentatricopeptide repeat-containing protein At4g14850 [Cucumis sativus] (e value: 3.3e-35; % identity: 82.29)
Query:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        + VS+RS+LLGRAAHAQILKTL+TP P+FLYNHLVNMYAKLDH N A+L+LELAP RSVVTWTALIAGSVQNG F SALLHFS+MLS+ VRPNDFT
Subjt:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

XP_038881355.1 pentatricopeptide repeat-containing protein At4g14850 [Benincasa hispida] (e value: 1.1e-35; % identity: 78.5)
Query:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER
        SLA L      + VS+RS+LLGRAAHAQILKTL+TPLP+FLYNHLVNMYAK DH N A+L+LELAP RSVVTWTALIAGSVQNG FASALLHFS+MLS+ 
Subjt:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER

Query:  VRPNDFT
        VRPNDFT
Subjt:  VRPNDFT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L4T8 Uncharacterized protein (e value: 1.6e-35; % identity: 82.29)
Query:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        + VS+RS+LLGRAAHAQILKTL+TP P+FLYNHLVNMYAKLDH N A+L+LELAP RSVVTWTALIAGSVQNG F SALLHFS+MLS+ VRPNDFT
Subjt:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

A0A1S3AXN0 pentatricopeptide repeat-containing protein At4g14850 (e value: 1.6e-35; % identity: 82.29)
Query:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        + VS+RS+LLGRAAHAQILKTL+TP P+FLYNHLVNMYAKLDH N A+L+LELAP RSVVTWTALIAGSVQNG F SALLHFS+MLS+ VRPNDFT
Subjt:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

A0A2N9H1X5 DYW_deaminase domain-containing protein (e value: 1.2e-35; % identity: 75)
Query:  VSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLALSK
        VS RS+LLGRAAHAQ+LKTL+TP PSFL NHLVNMY+KLD PN A+LVL L PSR VVTWTALIAGSVQNGHFASALLHFSNML ER+RPNDFT     K
Subjt:  VSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLALSK

Query:  PPLAFAWP
           +   P
Subjt:  PPLAFAWP

A0A5A7U206 Pentatricopeptide repeat-containing protein (e value: 1.6e-35; % identity: 82.29)
Query:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        + VS+RS+LLGRAAHAQILKTL+TP P+FLYNHLVNMYAKLDH N A+L+LELAP RSVVTWTALIAGSVQNG F SALLHFS+MLS+ VRPNDFT
Subjt:  IGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

A0A6J1C7M0 pentatricopeptide repeat-containing protein At4g14850 (e value: 2.2e-37; % identity: 81.31)
Query:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER
        SLA L      + VS RS+LLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHP+ AELVL LAP RSVVTWTALIAGSVQNGHF+SALL+FS+MLS+ 
Subjt:  SLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSER

Query:  VRPNDFT
        VRPNDFT
Subjt:  VRPNDFT

SwissProt top hits (e value, % identity, alignment)
Q0WNP3 Pentatricopeptide repeat-containing protein At4g18520, chloroplastic (e value: 1.2e-08; % identity: 39.78)
Query:  SIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        S+ + LLG+  HAQI+K        ++ + LV +Y K      A  VL+  PSR VV+WTA+I+G    GH + AL     M+ E V PN FT
Subjt:  SIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

Q0WSH6 Pentatricopeptide repeat-containing protein At4g14850 (e value: 7.1e-25; % identity: 58.51)
Query:  VSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        +S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDHP  A LVL L P+R+VV+WT+LI+G  QNGHF++AL+ F  M  E V PNDFT
Subjt:  VSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

Q9FMA1 Pentatricopeptide repeat-containing protein At5g56310 (e value: 2.7e-08; % identity: 45.07)
Query:  LYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLAL
        L N +++MYAK  +   A  V E    R+VVTWT +IAG   +GH A AL  F+ M+   VRPND T +A+
Subjt:  LYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLAL

Q9LT48 Pentatricopeptide repeat-containing protein At3g20730 (e value: 1.1e-09; % identity: 37.62)
Query:  RIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLA
        +I  +I S  +GR  H   LK+ Q      L N L++MYAK      A L  E    + V +WT+LIAG  ++G+F  A+  ++ M  ER++PND T L+
Subjt:  RIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLA

Query:  L
        L
Subjt:  L

Q9SX45 Pentatricopeptide repeat-containing protein At1g50270 (e value: 8.4e-10; % identity: 35.48)
Query:  FVNLQMIMTFAGAYHAVSLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQN
        FV ++     A     VS+ K A         +     GR+ H   L+T +     F+ + LV+MY K    + A+ V +  PSR+VVTWTALIAG VQ+
Subjt:  FVNLQMIMTFAGAYHAVSLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQN

Query:  GHFASALLHFSNMLSERVRPNDFT
          F   +L F  ML   V PN+ T
Subjt:  GHFASALLHFSNMLSERVRPNDFT

Arabidopsis top hits (e value, % identity, alignment)
AT1G50270.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.0e-11; % identity: 35.48)
Query:  FVNLQMIMTFAGAYHAVSLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQN
        FV ++     A     VS+ K A         +     GR+ H   L+T +     F+ + LV+MY K    + A+ V +  PSR+VVTWTALIAG VQ+
Subjt:  FVNLQMIMTFAGAYHAVSLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQN

Query:  GHFASALLHFSNMLSERVRPNDFT
          F   +L F  ML   V PN+ T
Subjt:  GHFASALLHFSNMLSERVRPNDFT

AT3G20730.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 7.8e-11; % identity: 37.62)
Query:  RIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLA
        +I  +I S  +GR  H   LK+ Q      L N L++MYAK      A L  E    + V +WT+LIAG  ++G+F  A+  ++ M  ER++PND T L+
Subjt:  RIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLA

Query:  L
        L
Subjt:  L

AT4G14850.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 5.1e-26; % identity: 58.51)
Query:  VSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        +S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDHP  A LVL L P+R+VV+WT+LI+G  QNGHF++AL+ F  M  E V PNDFT
Subjt:  VSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.7e-10; % identity: 39.78)
Query:  SIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT
        S+ + LLG+  HAQI+K        ++ + LV +Y K      A  VL+  PSR VV+WTA+I+G    GH + AL     M+ E V PN FT
Subjt:  SIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFT

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.9e-09; % identity: 45.07)
Query:  LYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLAL
        L N +++MYAK  +   A  V E    R+VVTWT +IAG   +GH A AL  F+ M+   VRPND T +A+
Subjt:  LYNHLVNMYAKLDHPNWAELVLELAPSRSVVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLAL


Sequences
CDS sequence
ATGTGTTTGGATGGAAAGAGAGAGAGAGAGAGACATTCGGGGTACGTGGAGAGACGACGAAAGCTCCGCGCCTTTGTAAATCTTCAAATGATCATGACCTTCGCAGGAGC
CTATCATGCCGTTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCGTATCGATTCGTTCTACGCTTCTCGGCCGGGCTGCCCACGCACAGATTCTCAAAACCC
TGCAAACCCCTCTTCCTTCCTTCCTCTACAACCACCTTGTCAACATGTACGCCAAACTCGACCATCCTAACTGGGCCGAACTCGTCCTCGAACTCGCCCCATCCCGCTCC
GTCGTCACTTGGACCGCCCTCATTGCCGGCTCTGTCCAAAATGGCCATTTTGCTTCTGCTCTACTTCACTTCTCCAACATGCTCAGCGAGCGTGTTCGCCCCAATGACTT
CACTTCCCTTGCGCTCTCAAAGCCTCCACTTGCCTTCGCATGGCCATGA
mRNA sequence
ATGTGTTTGGATGGAAAGAGAGAGAGAGAGAGACATTCGGGGTACGTGGAGAGACGACGAAAGCTCCGCGCCTTTGTAAATCTTCAAATGATCATGACCTTCGCAGGAGC
CTATCATGCCGTTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCGTATCGATTCGTTCTACGCTTCTCGGCCGGGCTGCCCACGCACAGATTCTCAAAACCC
TGCAAACCCCTCTTCCTTCCTTCCTCTACAACCACCTTGTCAACATGTACGCCAAACTCGACCATCCTAACTGGGCCGAACTCGTCCTCGAACTCGCCCCATCCCGCTCC
GTCGTCACTTGGACCGCCCTCATTGCCGGCTCTGTCCAAAATGGCCATTTTGCTTCTGCTCTACTTCACTTCTCCAACATGCTCAGCGAGCGTGTTCGCCCCAATGACTT
CACTTCCCTTGCGCTCTCAAAGCCTCCACTTGCCTTCGCATGGCCATGA
Protein sequence
MCLDGKRERERHSGYVERRRKLRAFVNLQMIMTFAGAYHAVSLAKLARFTGRIGVSIRSTLLGRAAHAQILKTLQTPLPSFLYNHLVNMYAKLDHPNWAELVLELAPSRS
VVTWTALIAGSVQNGHFASALLHFSNMLSERVRPNDFTSLALSKPPLAFAWP
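
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The function name `translate` and the variable names are hypothetical and are not part of CuGenDB. Only the first and last few codons of the CDS are used here; the full sequence can be pasted in the same way, and line breaks are ignored.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (including line breaks) is removed first, so a wrapped
    sequence can be pasted as-is. Trailing bases that do not fill a
    complete codon are ignored.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 bases and last 12 bases of the Sgr024735 CDS shown above.
cds_start = "ATGTGTTTGGATGGAAAGAGAGAGAGAGAGAGA"
cds_end = "GCATGGCCATGA"

print(translate(cds_start))  # beginning of the protein sequence
print(translate(cds_end))    # end of the protein sequence, before the stop codon
```

The two fragments are chosen on codon boundaries (the CDS is 489 bases long, so its last 12 bases are in frame), which is why they can be translated independently.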