CuGenDBv2

Sgr029598 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029598
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153403:2380845..2385768
RNA-Seq Expression: Sgr029598
Synteny: Sgr029598
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
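The genome location field follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split and the gene span computed as follows. The function name parse_location is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(loc):
    """Split a 'contig:start..end' string into (contig, start, end).

    Assumption: coordinates are 1-based and inclusive; the record
    itself does not say which convention it uses.
    """
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

contig, start, end = parse_location("tig00153403:2380845..2385768")
span = end - start + 1  # inclusive length under the assumption above
print(contig, start, end, span)
```

Under a 0-based, half-open convention the span would be one base shorter, so the convention should be confirmed before the value is used downstream.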


Homology
GenBank top hits | e value | % identity | Alignment
XP_008461323.1 PREDICTED: pentatricopeptide repeat-containing protein At3g53170 isoform X1 [Cucumis melo] | 4.7e-100 | 51.07
Query:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE
        DA+IF   S S T TV SM S+S T   ISSSHSRSLKRS  QSS  LQRDPKKGLSRILRKDAAI+AIE+KANSKKYNNLWPKAVLEALDEAI+ENLWE
Subjt:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE

Query:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC
        TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLF+IM SEGLKPSIDVYTALVSAY                  VSDCKPDVHTYSILIDCC
Subjt:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC

Query:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--
        T+L RFDLLK++LADMSYLGI                                                   +  Q  +++ ++YD+         +W  
Subjt:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--

Query:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP
                                                           E +                    NAYGKSG++EK+DS+LRQIENSDVVP
Subjt:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP

Query:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT
        DTPLFNCLIN YGQAGD                            AL AQGMTEAAQRLENKL AT
Subjt:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT

XP_008461329.1 PREDICTED: pentatricopeptide repeat-containing protein At3g53170 isoform X2 [Cucumis melo] | 4.7e-100 | 51.07
Query:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE
        DA+IF   S S T TV SM S+S T   ISSSHSRSLKRS  QSS  LQRDPKKGLSRILRKDAAI+AIE+KANSKKYNNLWPKAVLEALDEAI+ENLWE
Subjt:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE

Query:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC
        TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLF+IM SEGLKPSIDVYTALVSAY                  VSDCKPDVHTYSILIDCC
Subjt:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC

Query:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--
        T+L RFDLLK++LADMSYLGI                                                   +  Q  +++ ++YD+         +W  
Subjt:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--

Query:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP
                                                           E +                    NAYGKSG++EK+DS+LRQIENSDVVP
Subjt:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP

Query:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT
        DTPLFNCLIN YGQAGD                            AL AQGMTEAAQRLENKL AT
Subjt:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT

XP_022153245.1 pentatricopeptide repeat-containing protein At3g53170 [Momordica charantia] | 3.6e-100 | 51.16
Query:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSS-GGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD
        MEP   LRDASI  S      C V SMAS+ PTRS ISSS + SLKRS  Q+S  GLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD
Subjt:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSS-GGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD

Query:  EAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVH
        EAI++NLWET+LKIFGLLRQQ+WYEPRC+TYTKLLM+LGKCRQPEQASLLFQI+LSEGLKPSIDVYTALVSAY                  VSD KP++H
Subjt:  EAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVH

Query:  TYSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY-------
        TYSILIDCCTKL RFDLLK IL DMSYLGIA                                                  +  Q  +++  Y       
Subjt:  TYSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY-------

Query:  --------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQ
                            YD+++                                  E +                    NAYGKSGNIEKV+S+LRQ
Subjt:  --------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQ

Query:  IENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR
        IENSDVVPDTPLFNCLINVYGQAGD                            ALNAQGMTEAAQRLENKL ATR
Subjt:  IENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR

XP_022960242.1 pentatricopeptide repeat-containing protein At3g53170 [Cucurbita moschata] | 5.8e-98 | 49.79
Query:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDE
        MEP   L +A IF   S  PT TV SMAS++PT   ISSS SR L R+  QSS GLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWP+AVLEALDE
Subjt:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDE

Query:  AIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHT
        AI+ENLWET LKIFGLLRQQ WYEPRC+TYTKLLM+LGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY                  +SDCKPDVHT
Subjt:  AIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHT

Query:  YSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY--------
        YSILIDCCT+  R DLLKDILADMSYLGI                                                   +  Q  +++  Y        
Subjt:  YSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY--------

Query:  -------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQI
                           YD+++                                  E +                    NAYGK+GNIEKVDS+LRQI
Subjt:  -------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQI

Query:  ENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR
        ENSDVV DTPLFNCLINVYGQAGD                            AL   GMTEAAQRLENKL   R
Subjt:  ENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR

XP_038898944.1 pentatricopeptide repeat-containing protein At3g53170 isoform X2 [Benincasa hispida] | 6.2e-100 | 51.79
Query:  MEPRL--RDASIFHSSSISPTCTVGSMASV-SPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD
        MEP L  ++A IF  S  S T TV SMAS+ S T S ISSSHSRSLKRS GQSS  LQRDPKKGLSRILRKDAAI+AIERKANSKKYNNLWPKAVLEALD
Subjt:  MEPRL--RDASIFHSSSISPTCTVGSMASV-SPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD

Query:  EAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVH
        EAI+ENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLL KCRQPEQASLLFQIM SEGLKPSIDVYTALVSAY                  VSDCKPDV 
Subjt:  EAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVH

Query:  TYSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY-------
        TYSILIDCCT+  RFDLLK+I ADMSYLGI                                                   +  Q  +++  Y       
Subjt:  TYSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY-------

Query:  --------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQ
                            YD+++                                  E +                    NAYGKSGNIEKVDS+LRQ
Subjt:  --------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQ

Query:  IENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR
        IENSDVVPDTPLFNCLINVYGQAGD                            AL AQGMTEAAQRLENKL   R
Subjt:  IENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR

TrEMBL top hits | e value | % identity | Alignment
A0A0A0K9Q1 Uncharacterized protein | 1.1e-97 | 50.75
Query:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE
        DA++F   S S T  V SMASVS T   ISSSHSRSLKR+  QSS  LQRDPKKGLSRILR+DAAI+AIERKANSKKYNNLWPKAVLEALDEAI+ENLWE
Subjt:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE

Query:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC
        TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKC+QPEQASLLF+IM SEGLKPSIDVYTALVSAY                  +SDCKPDVHTYSILIDCC
Subjt:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC

Query:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY----------------
        T+L RFDLLK ILADMS LGI                                                   +  Q  +++  Y                
Subjt:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY----------------

Query:  -----------YDQL---------------------IWESW--------------------------------NAYGKSGNIEKVDSVLRQIENSDVVPD
                   YD++                     I +S+                                NAYGKSG++EKVDS+LRQIENSDVVPD
Subjt:  -----------YDQL---------------------IWESW--------------------------------NAYGKSGNIEKVDSVLRQIENSDVVPD

Query:  TPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT
        TPLFNCLINVYGQAG+                            AL AQGMTE AQRLENKL AT
Subjt:  TPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT

A0A1S3CEG4 pentatricopeptide repeat-containing protein At3g53170 isoform X1 | 2.3e-100 | 51.07
Query:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE
        DA+IF   S S T TV SM S+S T   ISSSHSRSLKRS  QSS  LQRDPKKGLSRILRKDAAI+AIE+KANSKKYNNLWPKAVLEALDEAI+ENLWE
Subjt:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE

Query:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC
        TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLF+IM SEGLKPSIDVYTALVSAY                  VSDCKPDVHTYSILIDCC
Subjt:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC

Query:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--
        T+L RFDLLK++LADMSYLGI                                                   +  Q  +++ ++YD+         +W  
Subjt:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--

Query:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP
                                                           E +                    NAYGKSG++EK+DS+LRQIENSDVVP
Subjt:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP

Query:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT
        DTPLFNCLIN YGQAGD                            AL AQGMTEAAQRLENKL AT
Subjt:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT

A0A1S3CFP9 pentatricopeptide repeat-containing protein At3g53170 isoform X2 | 2.3e-100 | 51.07
Query:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE
        DA+IF   S S T TV SM S+S T   ISSSHSRSLKRS  QSS  LQRDPKKGLSRILRKDAAI+AIE+KANSKKYNNLWPKAVLEALDEAI+ENLWE
Subjt:  DASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWE

Query:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC
        TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLF+IM SEGLKPSIDVYTALVSAY                  VSDCKPDVHTYSILIDCC
Subjt:  TALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCC

Query:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--
        T+L RFDLLK++LADMSYLGI                                                   +  Q  +++ ++YD+         +W  
Subjt:  TKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQYYDQL--------IW--

Query:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP
                                                           E +                    NAYGKSG++EK+DS+LRQIENSDVVP
Subjt:  ---------------------------------------------------ESW--------------------NAYGKSGNIEKVDSVLRQIENSDVVP

Query:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT
        DTPLFNCLIN YGQAGD                            AL AQGMTEAAQRLENKL AT
Subjt:  DTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFAT

A0A6J1DG98 pentatricopeptide repeat-containing protein At3g53170 | 1.8e-100 | 51.16
Query:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSS-GGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD
        MEP   LRDASI  S      C V SMAS+ PTRS ISSS + SLKRS  Q+S  GLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD
Subjt:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSS-GGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALD

Query:  EAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVH
        EAI++NLWET+LKIFGLLRQQ+WYEPRC+TYTKLLM+LGKCRQPEQASLLFQI+LSEGLKPSIDVYTALVSAY                  VSD KP++H
Subjt:  EAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVH

Query:  TYSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY-------
        TYSILIDCCTKL RFDLLK IL DMSYLGIA                                                  +  Q  +++  Y       
Subjt:  TYSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY-------

Query:  --------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQ
                            YD+++                                  E +                    NAYGKSGNIEKV+S+LRQ
Subjt:  --------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQ

Query:  IENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR
        IENSDVVPDTPLFNCLINVYGQAGD                            ALNAQGMTEAAQRLENKL ATR
Subjt:  IENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR

A0A6J1HAD9 pentatricopeptide repeat-containing protein At3g53170 | 2.8e-98 | 49.79
Query:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDE
        MEP   L +A IF   S  PT TV SMAS++PT   ISSS SR L R+  QSS GLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWP+AVLEALDE
Subjt:  MEP--RLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDE

Query:  AIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHT
        AI+ENLWET LKIFGLLRQQ WYEPRC+TYTKLLM+LGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY                  +SDCKPDVHT
Subjt:  AIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHT

Query:  YSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY--------
        YSILIDCCT+  R DLLKDILADMSYLGI                                                   +  Q  +++  Y        
Subjt:  YSILIDCCTKLHRFDLLKDILADMSYLGIA-------------------------------------------------SDGNQARRLDIQY--------

Query:  -------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQI
                           YD+++                                  E +                    NAYGK+GNIEKVDS+LRQI
Subjt:  -------------------YDQLI---------------------------------WESW--------------------NAYGKSGNIEKVDSVLRQI

Query:  ENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR
        ENSDVV DTPLFNCLINVYGQAGD                            AL   GMTEAAQRLENKL   R
Subjt:  ENSDVVPDTPLFNCLINVYGQAGD----------------------------ALNAQGMTEAAQRLENKLFATR

SwissProt top hits | e value | % identity | Alignment
A7LN87 Pentatricopeptide repeat-containing protein PPR5, chloroplastic | 8.6e-12 | 24.75
Query:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC
        +E    + W   L +F  +++Q+WY      Y+KL+ ++G+  Q   A  LF  M + G KP   VY +L+ A+                      +  C
Subjt:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC

Query:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ
        +P + TY+IL+    +      +  +  D+    ++         D+  Y+ ++    +AYGK+G I++++SVL +++++   PD   FN LI+ YG+
Subjt:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic | 5.0e-36 | 35.29
Query:  KKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKP
        +K +S ILR++A    IE+K  SKK   L P+ VLE+L E I    WE+A+++F LLR+Q WY+P    Y KL+++LGKC+QPE+A  LFQ M++EG   
Subjt:  KKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKP

Query:  SIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIAS---------DGNQARRLDIQYYDQLIW--
        + +VYTALVSAY                    +C+PDVHTYSILI    ++  FD ++D+L+DM   GI           D     ++ ++    LI   
Subjt:  SIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIAS---------DGNQARRLDIQYYDQLIW--

Query:  -------ESW------NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQR
               +SW       A+G +G IE +++   + ++S + P+   FN L++ YG++G+      + E  Q+
Subjt:  -------ESW------NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQR

Q9SCP4 Pentatricopeptide repeat-containing protein At3g53170 | 2.3e-65 | 53.97
Query:  HSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKC
        + R+ K + G  S   Q DPKK LSRILR DAA++ IERKANS+KY  LWPKAVLEALDEAI+EN W++ALKIF LLR+Q WYEPRC+TYTKL  +LG C
Subjt:  HSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKC

Query:  RQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLD
        +QP+QASLLF++MLSEGLKP+IDVYT+L+S Y                  VSDCKPDV T+++LI CC KL RFDL+K I+ +MSYLG+           
Subjt:  RQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLD

Query:  IQYYDQLIWESWNAYGKSGNIEKVDSVLR-QIENSDVVPDTPLFNCLINVYG
           Y+ +I    + YGK+G  E+++SVL   IE+ D +PD    N +I  YG
Subjt:  IQYYDQLIWESWNAYGKSGNIEKVDSVLR-QIENSDVVPDTPLFNCLINVYG

Q9SQU6 Pentatricopeptide repeat-containing protein At3g06430, chloroplastic | 8.0e-26 | 29.5
Query:  IRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAYV
        I+ +++K + +   N W   V E L + I +  W  AL++F +LR+Q +Y+P+  TY KLL+LLGK  QP +A  LF  ML EGL+P++++YTAL++AY 
Subjt:  IRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAYV

Query:  ------------------SDCKPDVHTYSILIDCCTKLHRFDLLKDILADM-----------SYLGIASDGNQAR--RLDIQYYDQLIW-----ESW---
                            C+PDV TYS L+  C    +FDL+  +  +M             + ++  G   R  +++    D L+      + W   
Subjt:  ------------------SDCKPDVHTYSILIDCCTKLHRFDLLKDILADM-----------SYLGIASDGNQAR--RLDIQYYDQLIW-----ESW---

Query:  ---NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQRLE
           + +G  G I+ ++S   +  N  + P+T  FN LI  YG+         + E  ++LE
Subjt:  ---NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQRLE

Q9SV96 Pentatricopeptide repeat-containing protein At4g39620, chloroplastic | 3.7e-15 | 25.76
Query:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC
        +E  + + W   L++F  +++Q+WY P    Y+KL+ ++GK  Q   A  LF  M + G +P   VY AL++A+                      +  C
Subjt:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC

Query:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ
        +P+V TY+IL+    +  + D +  +  D+    ++         D+  ++ ++    +AYGK+G I+++++VL ++ +++  PD   FN LI+ YG+
Subjt:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ

Arabidopsis top hits | e value | % identity | Alignment
AT3G06430.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 5.7e-27 | 29.5
Query:  IRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAYV
        I+ +++K + +   N W   V E L + I +  W  AL++F +LR+Q +Y+P+  TY KLL+LLGK  QP +A  LF  ML EGL+P++++YTAL++AY 
Subjt:  IRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAYV

Query:  ------------------SDCKPDVHTYSILIDCCTKLHRFDLLKDILADM-----------SYLGIASDGNQAR--RLDIQYYDQLIW-----ESW---
                            C+PDV TYS L+  C    +FDL+  +  +M             + ++  G   R  +++    D L+      + W   
Subjt:  ------------------SDCKPDVHTYSILIDCCTKLHRFDLLKDILADM-----------SYLGIASDGNQAR--RLDIQYYDQLIW-----ESW---

Query:  ---NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQRLE
           + +G  G I+ ++S   +  N  + P+T  FN LI  YG+         + E  ++LE
Subjt:  ---NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQRLE

AT3G53170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 3.3e-67 | 52.65
Query:  SVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQ
        S++PT       + R+ K + G  S   Q DPKK LSRILR DAA++ IERKANS+KY  LWPKAVLEALDEAI+EN W++ALKIF LLR+Q WYEPRC+
Subjt:  SVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQ

Query:  TYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLG
        TYTKL  +LG C+QP+QASLLF++MLSEGLKP+IDVYT+L+S Y                  VSDCKPDV T+++LI CC KL RFDL+K I+ +MSYLG
Subjt:  TYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLG

Query:  IASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLR-QIENSDVVPDTPLFNCLINVYG
        +              Y+ +I    + YGK+G  E+++SVL   IE+ D +PD    N +I  YG
Subjt:  IASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLR-QIENSDVVPDTPLFNCLINVYG

AT4G39620.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.7e-16 | 25.76
Query:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC
        +E  + + W   L++F  +++Q+WY P    Y+KL+ ++GK  Q   A  LF  M + G +P   VY AL++A+                      +  C
Subjt:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC

Query:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ
        +P+V TY+IL+    +  + D +  +  D+    ++         D+  ++ ++    +AYGK+G I+++++VL ++ +++  PD   FN LI+ YG+
Subjt:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ

AT4G39620.2 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.7e-16 | 25.76
Query:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC
        +E  + + W   L++F  +++Q+WY P    Y+KL+ ++GK  Q   A  LF  M + G +P   VY AL++A+                      +  C
Subjt:  DEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAY----------------------VSDC

Query:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ
        +P+V TY+IL+    +  + D +  +  D+    ++         D+  ++ ++    +AYGK+G I+++++VL ++ +++  PD   FN LI+ YG+
Subjt:  KPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLDIQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQ

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein | 3.6e-37 | 35.29
Query:  KKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKP
        +K +S ILR++A    IE+K  SKK   L P+ VLE+L E I    WE+A+++F LLR+Q WY+P    Y KL+++LGKC+QPE+A  LFQ M++EG   
Subjt:  KKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALKIFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKP

Query:  SIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIAS---------DGNQARRLDIQYYDQLIW--
        + +VYTALVSAY                    +C+PDVHTYSILI    ++  FD ++D+L+DM   GI           D     ++ ++    LI   
Subjt:  SIDVYTALVSAY------------------VSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIAS---------DGNQARRLDIQYYDQLIW--

Query:  -------ESW------NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQR
               +SW       A+G +G IE +++   + ++S + P+   FN L++ YG++G+      + E  Q+
Subjt:  -------ESW------NAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQR


Sequences
CDS sequence
ATGGAGCCTCGCCTTCGCGACGCCTCAATCTTCCACTCGTCGTCGATTTCCCCAACCTGCACAGTCGGTTCAATGGCTTCCGTCAGCCCCACGCGTTCGGTTATCTCTTC
TTCCCACTCACGGTCACTTAAGCGGAGTCCCGGCCAGTCCTCCGGCGGCCTACAGAGAGATCCGAAGAAGGGTCTGTCTCGAATTCTGAGGAAAGACGCTGCTATTAGAG
CCATAGAGAGGAAAGCGAACTCGAAGAAGTACAATAATCTGTGGCCCAAAGCTGTTTTGGAGGCTTTGGATGAGGCTATTGAGGAGAATCTCTGGGAGACTGCTCTTAAG
ATTTTTGGATTACTTCGCCAGCAACAATGGTATGAACCAAGATGCCAAACATACACAAAATTGTTGATGCTGCTGGGTAAGTGCAGGCAACCTGAGCAAGCAAGCTTGCT
GTTTCAGATTATGTTGTCTGAGGGGTTGAAACCCTCTATAGATGTTTACACTGCTCTTGTTAGTGCTTATGTTTCTGACTGCAAGCCAGATGTACATACATATTCAATTC
TCATTGATTGTTGCACAAAACTCCATCGTTTTGATCTTCTGAAGGACATACTTGCTGATATGTCATATCTGGGGATTGCATCTGATGGGAATCAAGCCAGACGTTTGGAC
ATACAATACTATGATCAGCTCATATGGGAAAGCTGGAATGCTTATGGTAAATCTGGTAATATTGAGAAAGTTGATTCGGTTTTGAGGCAAATTGAAAATTCTGATGTGGT
ACCAGATACCCCCCTTTTTAACTGCCTTATCAATGTGTATGGCCAGGCTGGTGATGCCTTAAATGCTCAAGGCATGACAGAGGCTGCTCAAAGATTGGAGAATAAGTTGT
TTGCCACCAGGATGACAAGTGCGGTGGGCACTGAGAGAAAACCTGGCTCAAAGATCGCAAATCTCGCAAAAGTTGGGTTTTGTGATGGGACAATGAATATCTATCACTGC
AAGGAAAGATTCTTGATGGTCTTGCGTCTGAATGATCCATATGTTAACATGAAAACTGTGTTCTTGAAGAGTGCGCGCCCCCCAACACCCCCCCAAAAGCTTAGTGGGAC
TACTATTCCTTCTCAAGGCAATCAACCGTTAGCAACAAAACATTCATCTCTGCTCAACTCAAGGTACTCTTCAGACTGTTACCGTGGCCCGTCACCCCCATTGCATAATC
TTCTGGTATCTTTCACTGTGACTGTTATACTGTACCCGGTCGCCCGTTGCATAATCTTCTGTTGTGCTCCTCTAAAACTTGTCCATAGACATCCTTGCAACTGCATTCCT
ACTTTACAGCTGCCTATGGAAGATGGAGGGGTCTAG
mRNA sequence
ATGGAGCCTCGCCTTCGCGACGCCTCAATCTTCCACTCGTCGTCGATTTCCCCAACCTGCACAGTCGGTTCAATGGCTTCCGTCAGCCCCACGCGTTCGGTTATCTCTTC
TTCCCACTCACGGTCACTTAAGCGGAGTCCCGGCCAGTCCTCCGGCGGCCTACAGAGAGATCCGAAGAAGGGTCTGTCTCGAATTCTGAGGAAAGACGCTGCTATTAGAG
CCATAGAGAGGAAAGCGAACTCGAAGAAGTACAATAATCTGTGGCCCAAAGCTGTTTTGGAGGCTTTGGATGAGGCTATTGAGGAGAATCTCTGGGAGACTGCTCTTAAG
ATTTTTGGATTACTTCGCCAGCAACAATGGTATGAACCAAGATGCCAAACATACACAAAATTGTTGATGCTGCTGGGTAAGTGCAGGCAACCTGAGCAAGCAAGCTTGCT
GTTTCAGATTATGTTGTCTGAGGGGTTGAAACCCTCTATAGATGTTTACACTGCTCTTGTTAGTGCTTATGTTTCTGACTGCAAGCCAGATGTACATACATATTCAATTC
TCATTGATTGTTGCACAAAACTCCATCGTTTTGATCTTCTGAAGGACATACTTGCTGATATGTCATATCTGGGGATTGCATCTGATGGGAATCAAGCCAGACGTTTGGAC
ATACAATACTATGATCAGCTCATATGGGAAAGCTGGAATGCTTATGGTAAATCTGGTAATATTGAGAAAGTTGATTCGGTTTTGAGGCAAATTGAAAATTCTGATGTGGT
ACCAGATACCCCCCTTTTTAACTGCCTTATCAATGTGTATGGCCAGGCTGGTGATGCCTTAAATGCTCAAGGCATGACAGAGGCTGCTCAAAGATTGGAGAATAAGTTGT
TTGCCACCAGGATGACAAGTGCGGTGGGCACTGAGAGAAAACCTGGCTCAAAGATCGCAAATCTCGCAAAAGTTGGGTTTTGTGATGGGACAATGAATATCTATCACTGC
AAGGAAAGATTCTTGATGGTCTTGCGTCTGAATGATCCATATGTTAACATGAAAACTGTGTTCTTGAAGAGTGCGCGCCCCCCAACACCCCCCCAAAAGCTTAGTGGGAC
TACTATTCCTTCTCAAGGCAATCAACCGTTAGCAACAAAACATTCATCTCTGCTCAACTCAAGGTACTCTTCAGACTGTTACCGTGGCCCGTCACCCCCATTGCATAATC
TTCTGGTATCTTTCACTGTGACTGTTATACTGTACCCGGTCGCCCGTTGCATAATCTTCTGTTGTGCTCCTCTAAAACTTGTCCATAGACATCCTTGCAACTGCATTCCT
ACTTTACAGCTGCCTATGGAAGATGGAGGGGTCTAG
Protein sequence
MEPRLRDASIFHSSSISPTCTVGSMASVSPTRSVISSSHSRSLKRSPGQSSGGLQRDPKKGLSRILRKDAAIRAIERKANSKKYNNLWPKAVLEALDEAIEENLWETALK
IFGLLRQQQWYEPRCQTYTKLLMLLGKCRQPEQASLLFQIMLSEGLKPSIDVYTALVSAYVSDCKPDVHTYSILIDCCTKLHRFDLLKDILADMSYLGIASDGNQARRLD
IQYYDQLIWESWNAYGKSGNIEKVDSVLRQIENSDVVPDTPLFNCLINVYGQAGDALNAQGMTEAAQRLENKLFATRMTSAVGTERKPGSKIANLAKVGFCDGTMNIYHC
KERFLMVLRLNDPYVNMKTVFLKSARPPTPPQKLSGTTIPSQGNQPLATKHSSLLNSRYSSDCYRGPSPPLHNLLVSFTVTVILYPVARCIIFCCAPLKLVHRHPCNCIP
TLQLPMEDGGV
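
The protein sequence above corresponds to the CDS read codon by codon. As a rough sketch, assuming the standard nuclear genetic code applies (the record does not name a translation table), a short helper can check that relationship on fragments of the CDS. The names CODON_TABLE and translate are hypothetical illustrations, not database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)

# First 24 bases and last 12 bases of the CDS listed in this record.
print(translate("ATGGAGCCTCGCCTTCGCGACGCC"))
print(translate("GGAGGGGTCTAG"))
```

Running the helper over the full CDS and comparing the result with the listed protein sequence is a quick consistency check on a downloaded record.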