; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020789 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020789
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153572:1136810..1137817
RNA-Seq ExpressionSgr020789
SyntenySgr020789
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579716.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.9e-2437.5Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+IFY R SSF  K S  ST QLSTVSS+DL+YDHLQKNNGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           LFTEKG+MDKAME   EMDS+D+ P+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               VAYSVLLNGA+RHG +EK+MELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

KAG7017159.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-2437.5Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+IFY R SSF  K S  ST QLSTVSS+DL+YDHLQKNNGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           LFTEKG+MDKAME   EMDS+D+ P+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               VAYSVLLNGA+RHG +EK+MELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

XP_022928928.1 pentatricopeptide repeat-containing protein At5g47360 [Cucurbita moschata]8.6e-2437.5Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+IFY R SSF  K S  ST QLSTVSS+DL+YDHLQK NGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           LFTEKGEMDKAME   EMDS+D+ P+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               VAYSVLLNGA+RHG +EK+MELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

XP_022969835.1 pentatricopeptide repeat-containing protein At5g47360 [Cucurbita maxima]1.7e-2437.88Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+IFY R SSF  K S  ST QLSTVSS+DL+YDHLQKNNGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           LFTEKGEMDKAME   EMDS+D+ P+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               VAYSVLLNGA+RHG +EK+MELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

XP_023551479.1 pentatricopeptide repeat-containing protein At5g47360 [Cucurbita pepo subsp. pepo]3.0e-2437.12Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVEPWLL-------------------------------------------------
        MALF+IFY R SSF  K S  ST QLSTVSS+DL+YDHLQKNNGNVE  L                                                  
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVEPWLL-------------------------------------------------

Query:  ------------------------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                                                                                LFTEKGEMDKAME   EMDS+D+ P+ I
Subjt:  ------------------------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               VAYSVLLNGA+RHG +EK+MELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

TrEMBL top hitse value%identityAlignment
A0A1S3B4L9 pentatricopeptide repeat-containing protein At5g473601.5e-1834.09Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+I Y RSSS  L  S  ST  LST+SSSDL+YDHL+KNNGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           L TEKGEMDKAME   EMDS+D+HP+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYI+M+KGFCD GR EDAY               V YSVL+NGA R  IM+K+ME+L E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

A0A5A7SSI7 Pentatricopeptide repeat-containing protein1.5e-1834.09Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+I Y RSSS  L  S  ST  LST+SSSDL+YDHL+KNNGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           L TEKGEMDKAME   EMDS+D+HP+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYI+M+KGFCD GR EDAY               V YSVL+NGA R  IM+K+ME+L E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

A0A6J1DNK5 pentatricopeptide repeat-containing protein At5g473609.3e-2437.88Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF IF FRS SF LK SK S   LSTVSS+DL+YDHLQKNNGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKA---MEEMDSIDLHPDTI
                 P LL                                                           LF EKGEMDKA   MEEMDSID+HP+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKA---MEEMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               +AYS+LLNGA+R GI EKIMELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

A0A6J1EQI2 pentatricopeptide repeat-containing protein At5g473604.2e-2437.5Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+IFY R SSF  K S  ST QLSTVSS+DL+YDHLQK NGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           LFTEKGEMDKAME   EMDS+D+ P+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               VAYSVLLNGA+RHG +EK+MELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

A0A6J1I125 pentatricopeptide repeat-containing protein At5g473608.4e-2537.88Show/hide
Query:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------
        MALF+IFY R SSF  K S  ST QLSTVSS+DL+YDHLQKNNGNVE                                                     
Subjt:  MALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVE-----------------------------------------------------

Query:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI
                 P LL                                                           LFTEKGEMDKAME   EMDS+D+ P+ I
Subjt:  ---------PWLL-----------------------------------------------------------LFTEKGEMDKAME---EMDSIDLHPDTI

Query:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        TYIAMLKGFCD GRLEDAY               VAYSVLLNGA+RHG +EK+MELL E+EK G
Subjt:  TYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

SwissProt top hitse value%identityAlignment
Q9LVD3 Pentatricopeptide repeat-containing protein At5g57250, mitochondrial2.7e-0430.49Show/hide
Query:  LLFTEKGEMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY--------------VAYSVLLNGATRHGIMEKIMELL
        LL    GE D     M  +DL PDT TY  M+KG+C  G++E+A               V Y+ +++   + G+++   E+L
Subjt:  LLFTEKGEMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY--------------VAYSVLLNGATRHGIMEKIMELL

Q9LVS3 Pentatricopeptide repeat-containing protein At5g473601.3e-0938.89Show/hide
Query:  LFTEKGEM---DKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEK
        LF +KG++   D  ++EMD + L+PD ITY +M+ G+C+AG+++DA+               V YS +L G  + G ME+ +ELL E+EK
Subjt:  LFTEKGEM---DKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEK

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.6e-0430.12Show/hide
Query:  EMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        E  K   EM    L PD++T+  ++ G+C AG ++DA+               V Y+ L++G  + G ++   ELL E+ K+G
Subjt:  EMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein1.6e-0430.12Show/hide
Query:  EMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG
        E  K   EM    L PD++T+  ++ G+C AG ++DA+               V Y+ L++G  + G ++   ELL E+ K+G
Subjt:  EMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEKMG

AT1G63070.1 pentatricopeptide (PPR) repeat-containing protein1.3e-0429.7Show/hide
Query:  NNGNVEPWLLLFTEKGEMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMEL-LEVEKMGK
        NNGNVE  L++F          E M   D+  D +TY  M++  C AG++ED +               V Y+ +++G  R G+ E+   L +E+++ G 
Subjt:  NNGNVEPWLLLFTEKGEMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMEL-LEVEKMGK

Query:  L
        L
Subjt:  L

AT5G47360.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.9e-1138.89Show/hide
Query:  LFTEKGEM---DKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEK
        LF +KG++   D  ++EMD + L+PD ITY +M+ G+C+AG+++DA+               V YS +L G  + G ME+ +ELL E+EK
Subjt:  LFTEKGEM---DKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY---------------VAYSVLLNGATRHGIMEKIMELL-EVEK

AT5G57250.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-0530.49Show/hide
Query:  LLFTEKGEMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY--------------VAYSVLLNGATRHGIMEKIMELL
        LL    GE D     M  +DL PDT TY  M+KG+C  G++E+A               V Y+ +++   + G+++   E+L
Subjt:  LLFTEKGEMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAY--------------VAYSVLLNGATRHGIMEKIMELL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTCTGCCTTCGTCGAAATCTATCAACAATCCAAGTGACGGGCAGCCAACCCAAATAGAACCGAGTAGGTCTTGGCTAACCAAACCGATCAGTTTTATTATTGACCT
GCAACTCTTCTTCACCCTCCAGGCGACTCTTGCTACAGTGTTTTGTCGCTCTGAAGTTACCATATCAATGGCTCTCTTTCAAATCTTTTACTTCCGCTCATCCTCATTTC
ATCTCAAGAGCTCCAAGTTTTCTACAGCACAACTAAGTACAGTCTCTTCTTCCGATTTATACTACGATCATTTGCAGAAAAACAATGGTAATGTGGAGCCCTGGCTACTG
TTATTTACTGAAAAGGGTGAGATGGATAAGGCGATGGAAGAGATGGATTCAATTGATCTTCATCCTGACACGATCACTTATATTGCCATGCTCAAAGGATTTTGTGACGC
CGGCCGTTTGGAGGATGCTTATGTGGCTTACTCAGTGCTACTTAATGGCGCCACTCGGCATGGGATTATGGAAAAGATAATGGAATTGTTGGAGGTGGAAAAAATGGGGA
AGTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTCTGCCTTCGTCGAAATCTATCAACAATCCAAGTGACGGGCAGCCAACCCAAATAGAACCGAGTAGGTCTTGGCTAACCAAACCGATCAGTTTTATTATTGACCT
GCAACTCTTCTTCACCCTCCAGGCGACTCTTGCTACAGTGTTTTGTCGCTCTGAAGTTACCATATCAATGGCTCTCTTTCAAATCTTTTACTTCCGCTCATCCTCATTTC
ATCTCAAGAGCTCCAAGTTTTCTACAGCACAACTAAGTACAGTCTCTTCTTCCGATTTATACTACGATCATTTGCAGAAAAACAATGGTAATGTGGAGCCCTGGCTACTG
TTATTTACTGAAAAGGGTGAGATGGATAAGGCGATGGAAGAGATGGATTCAATTGATCTTCATCCTGACACGATCACTTATATTGCCATGCTCAAAGGATTTTGTGACGC
CGGCCGTTTGGAGGATGCTTATGTGGCTTACTCAGTGCTACTTAATGGCGCCACTCGGCATGGGATTATGGAAAAGATAATGGAATTGTTGGAGGTGGAAAAAATGGGGA
AGTTGTAG
Protein sequenceShow/hide protein sequence
MPLPSSKSINNPSDGQPTQIEPSRSWLTKPISFIIDLQLFFTLQATLATVFCRSEVTISMALFQIFYFRSSSFHLKSSKFSTAQLSTVSSSDLYYDHLQKNNGNVEPWLL
LFTEKGEMDKAMEEMDSIDLHPDTITYIAMLKGFCDAGRLEDAYVAYSVLLNGATRHGIMEKIMELLEVEKMGKL