; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020531 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020531
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153535:139651..140052
RNA-Seq ExpressionSgr020531
SyntenySgr020531
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586124.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.3e-6689.47Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGF+PD+TTFNIRA+AFS+MDLLWD+HLS+EHMKH+KIEPDLVTYGC+VDAYVDRRLGRNL+F LSKMNPDQPP+SLTDPFVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQ+QK WTYRELISLYLKKQ+RRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

KAG7020945.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.3e-6689.47Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGF+PD+TTFNIRA+AFS+MDLLWD+HLS+EHMKH+KIEPDLVTYGC+VDAYVDRRLGRNL+F LSKMNPDQPP+SLTDPFVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQ+QK WTYRELISLYLKKQ+RRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

XP_022149915.1 pentatricopeptide repeat-containing protein At3g42630 isoform X1 [Momordica charantia]1.0e-6691.73Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGFHPDLTTFNIRALAFS+MDLLWD+HLS+EHM+H+KIEPDLVTYGC+VDAYVDRRLGRNLDFALSKMNPDQ PVSLT+ FVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQRQK WTYRELISLYLK+QYRRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

XP_022149916.1 pentatricopeptide repeat-containing protein At3g42630 isoform X2 [Momordica charantia]1.0e-6691.73Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGFHPDLTTFNIRALAFS+MDLLWD+HLS+EHM+H+KIEPDLVTYGC+VDAYVDRRLGRNLDFALSKMNPDQ PVSLT+ FVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQRQK WTYRELISLYLK+QYRRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

XP_022937749.1 pentatricopeptide repeat-containing protein At3g42630 [Cucurbita moschata]1.3e-6689.47Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGF+PD+TTFNIRA+AFS+MDLLWD+HLS+EHMKH+KIEPDLVTYGC+VDAYVDRRLGRNL+F LSKMNPDQPP+SLTDPFVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQ+QK WTYRELISLYLKKQ+RRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

TrEMBL top hitse value%identityAlignment
A0A0A0LMJ8 Uncharacterized protein2.3e-6487.97Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MV+AGF+PDLTTFNIRALAFS+MDLLWD+HLS+EHMKHM IEPDLVTYGC+VDAYVDRRLGRNL+F LSKMNPDQPPVSLTD FVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQF++QK WTYRELISLYLKK +RR+QVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

A0A1S3BP30 pentatricopeptide repeat-containing protein At3g42630-like3.0e-6488.72Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGF+PDLTTFNIRALAFS+MDLLWD+HLS+EHMKHM IEPDLVTYGC+VDAYVDRRLGRNL+F LSKMNP QPPVSLTD FVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQF++QK WTYRELISLYLKKQ+RR+QVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

A0A6J1D726 pentatricopeptide repeat-containing protein At3g42630 isoform X14.9e-6791.73Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGFHPDLTTFNIRALAFS+MDLLWD+HLS+EHM+H+KIEPDLVTYGC+VDAYVDRRLGRNLDFALSKMNPDQ PVSLT+ FVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQRQK WTYRELISLYLK+QYRRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

A0A6J1D9V6 pentatricopeptide repeat-containing protein At3g42630 isoform X24.9e-6791.73Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGFHPDLTTFNIRALAFS+MDLLWD+HLS+EHM+H+KIEPDLVTYGC+VDAYVDRRLGRNLDFALSKMNPDQ PVSLT+ FVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQRQK WTYRELISLYLK+QYRRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

A0A6J1FC40 pentatricopeptide repeat-containing protein At3g426306.4e-6789.47Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        MVEAGF+PD+TTFNIRA+AFS+MDLLWD+HLS+EHMKH+KIEPDLVTYGC+VDAYVDRRLGRNL+F LSKMNPDQPP+SLTDPFVFEALGKGDFHMSSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
        FMQFQ+QK WTYRELISLYLKKQ+RRDQVFWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

SwissProt top hitse value%identityAlignment
Q9M2A1 Pentatricopeptide repeat-containing protein At3g426306.2e-5165.41Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        M++AGF PDLTTFNIRALAFS+M L WD+HL++EHM+ + I PDLVT+GC+VDAY+D+RL RNL+F  ++MN D  P+ LTDP  FE LGKGDFH+SSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
         ++F  +K WTYR+LI +YLKK+ RRDQ+FWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY

Arabidopsis top hitse value%identityAlignment
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein4.5e-0436.36Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAY
        MVE G  P+L TFNI   +  +   L +    +E MK+  + PD VT+G ++D +
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAY

AT2G26790.1 Pentatricopeptide repeat (PPR) superfamily protein1.2e-0437.5Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYV
        MVE G  PDL T+ I    + +++ L       E MK   I+PD+VTY  ++D Y+
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYV

AT3G22670.1 Pentatricopeptide repeat (PPR) superfamily protein4.5e-0430.39Show/hide
Query:  PDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQ-PPVSLTDPFVFEALGKGDFHMSSEAFMQFQR
        PD  TFNI    F K     D    ++ MK  +  PD+VTY   V+AY      R ++  L +M  +   P  +T   V  +LGK      +EA   +++
Subjt:  PDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQ-PPVSLTDPFVFEALGKGDFHMSSEAFMQFQR

Query:  QK
         K
Subjt:  QK

AT3G42630.1 Pentatricopeptide repeat (PPR) superfamily protein4.4e-5265.41Show/hide
Query:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA
        M++AGF PDLTTFNIRALAFS+M L WD+HL++EHM+ + I PDLVT+GC+VDAY+D+RL RNL+F  ++MN D  P+ LTDP  FE LGKGDFH+SSEA
Subjt:  MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEA

Query:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY
         ++F  +K WTYR+LI +YLKK+ RRDQ+FWNY
Subjt:  FMQFQRQKTWTYRELISLYLKKQYRRDQVFWNY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGAAGCTGGGTTTCATCCTGATCTTACCACATTTAATATTAGAGCGCTAGCATTTTCAAAAATGGATTTGTTATGGGATATTCATCTCAGCATTGAACATATGAA
ACACATGAAGATTGAACCCGATCTCGTGACCTACGGTTGTATTGTTGATGCATATGTAGATAGAAGACTTGGAAGAAATTTGGATTTCGCTTTGAGCAAAATGAATCCAG
ATCAACCTCCAGTATCATTAACAGATCCGTTTGTTTTCGAGGCATTGGGAAAAGGAGACTTCCACATGAGCTCCGAGGCGTTTATGCAGTTCCAGAGGCAGAAGACATGG
ACTTACAGAGAGTTAATATCATTGTATCTGAAAAAGCAATACAGGAGAGATCAAGTCTTCTGGAATTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGAAGCTGGGTTTCATCCTGATCTTACCACATTTAATATTAGAGCGCTAGCATTTTCAAAAATGGATTTGTTATGGGATATTCATCTCAGCATTGAACATATGAA
ACACATGAAGATTGAACCCGATCTCGTGACCTACGGTTGTATTGTTGATGCATATGTAGATAGAAGACTTGGAAGAAATTTGGATTTCGCTTTGAGCAAAATGAATCCAG
ATCAACCTCCAGTATCATTAACAGATCCGTTTGTTTTCGAGGCATTGGGAAAAGGAGACTTCCACATGAGCTCCGAGGCGTTTATGCAGTTCCAGAGGCAGAAGACATGG
ACTTACAGAGAGTTAATATCATTGTATCTGAAAAAGCAATACAGGAGAGATCAAGTCTTCTGGAATTACTAA
Protein sequenceShow/hide protein sequence
MVEAGFHPDLTTFNIRALAFSKMDLLWDIHLSIEHMKHMKIEPDLVTYGCIVDAYVDRRLGRNLDFALSKMNPDQPPVSLTDPFVFEALGKGDFHMSSEAFMQFQRQKTW
TYRELISLYLKKQYRRDQVFWNY