; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g045410 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g045410
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCsor_Chr18:781381..783989
RNA-Seq ExpressionCsor.00g045410
SyntenyCsor.00g045410
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573032.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]7.52e-212100Show/hide
Query:  MWNFHWLLSQSILRCSLTVRHLSLINFHFSQIFTISVFAVDFFSDDWRLSSDAGKCRVKPKDLVLGNTSRQIWCKPNEYIYAIVISLLGCEGLLNCSEIF
        MWNFHWLLSQSILRCSLTVRHLSLINFHFSQIFTISVFAVDFFSDDWRLSSDAGKCRVKPKDLVLGNTSRQIWCKPNEYIYAIVISLLGCEGLLNCSEIF
Subjt:  MWNFHWLLSQSILRCSLTVRHLSLINFHFSQIFTISVFAVDFFSDDWRLSSDAGKCRVKPKDLVLGNTSRQIWCKPNEYIYAIVISLLGCEGLLNCSEIF

Query:  DEMSSQGVIRSVFSYTALINAYGRNGAARGLGDEAEMFIVLTFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAE
        DEMSSQGVIRSVFSYTALINAYGRNGAARGLGDEAEMFIVLTFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAE
Subjt:  DEMSSQGVIRSVFSYTALINAYGRNGAARGLGDEAEMFIVLTFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAE

Query:  PDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIVEGFVDESKEQFHEIKAS
        PDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIVEGFVDESKEQFHEIKAS
Subjt:  PDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIVEGFVDESKEQFHEIKAS

XP_011651334.1 pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 [Cucumis sativus]2.67e-8838.74Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        FFSDDWRLSSD GK R K KDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA
                             AARGLGDEAEM                  +IV TF                                    KLGS+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFL+MKESSAEPDATTYNILIRVFGEGGY+KE V LFHD +DE IDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI  RMRE GISRN  SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQF EIKAS
Subjt:  F----------------------------------------VDESKEQFHEIKAS

XP_022137367.1 pentatricopeptide repeat-containing protein At1g74850, chloroplastic [Momordica charantia]2.73e-9138.92Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        F SDDW  SS+ GKCR KPKDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA
                             AARGLGDEAEM                  +IV TF                                    KLGS+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGY+KE V LFHD ++EKIDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI LRMRESGISRN+ SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQFHEIKA+
Subjt:  F----------------------------------------VDESKEQFHEIKAS

XP_038894203.1 pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X1 [Benincasa hispida]6.36e-9239.28Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        FFSDDW LSSD  KCR KPKDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTFEKLG-----------------------------------SVKEA
                             AARGLGDEAEM                  +IV TF KLG                                   S+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTFEKLG-----------------------------------SVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGY+KE V LFHD ++EKIDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI LRMRE GISRN  SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQFHEIKAS
Subjt:  F----------------------------------------VDESKEQFHEIKAS

XP_038894204.1 pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X2 [Benincasa hispida]6.25e-9239.28Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        FFSDDW LSSD  KCR KPKDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTFEKLG-----------------------------------SVKEA
                             AARGLGDEAEM                  +IV TF KLG                                   S+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTFEKLG-----------------------------------SVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGY+KE V LFHD ++EKIDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI LRMRE GISRN  SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQFHEIKAS
Subjt:  F----------------------------------------VDESKEQFHEIKAS

TrEMBL top hitse value%identityAlignment
A0A0A0LW62 Smr domain-containing protein1.29e-8838.74Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        FFSDDWRLSSD GK R K KDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA
                             AARGLGDEAEM                  +IV TF                                    KLGS+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFL+MKESSAEPDATTYNILIRVFGEGGY+KE V LFHD +DE IDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI  RMRE GISRN  SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQF EIKAS
Subjt:  F----------------------------------------VDESKEQFHEIKAS

A0A1S3AUM4 pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X13.60e-8838.56Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        FFSDDWRLSSD GK R K KDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA
                             AARGLGDEAEM                  +IV TF                                    KLGS+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFL+MKESSAEPDATTYNILIRVFGEGGY+KE V LFHD ++E IDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI  RMRE GISRN  SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQF EIKAS
Subjt:  F----------------------------------------VDESKEQFHEIKAS

A0A1S3AVM1 pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X23.54e-8838.56Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        FFSDDWRLSSD GK R K KDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA
                             AARGLGDEAEM                  +IV TF                                    KLGS+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFL+MKESSAEPDATTYNILIRVFGEGGY+KE V LFHD ++E IDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI  RMRE GISRN  SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQF EIKAS
Subjt:  F----------------------------------------VDESKEQFHEIKAS

A0A6J1C827 pentatricopeptide repeat-containing protein At1g74850, chloroplastic1.32e-9138.92Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        F SDDW  SS+ GKCR KPKDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA
                             AARGLGDEAEM                  +IV TF                                    KLGS+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTF-----------------------------------EKLGSVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGY+KE V LFHD ++EKIDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG
                                                                            GGLY EFEAI LRMRESGISRN+ SFSGI+EG
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEG

Query:  F----------------------------------------VDESKEQFHEIKAS
        +                                        VDESKEQFHEIKA+
Subjt:  F----------------------------------------VDESKEQFHEIKAS

A0A6J1KJF1 pentatricopeptide repeat-containing protein At1g74850, chloroplastic isoform X23.87e-8838.56Show/hide
Query:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW
        FFSD+W L SD GKCR KPKDLVLGN S                                                                    RQIW
Subjt:  FFSDDWRLSSDAGKCRVKPKDLVLGNTS--------------------------------------------------------------------RQIW

Query:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------
        CKPNE+IY I+ISLLG EGLL  CSEIFDEM+SQGVIRSVFSYTALINAYGRNG                                              
Subjt:  CKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG----------------------------------------------

Query:  ---------------------AARGLGDEAEM------------------FIVLTFEKLG-----------------------------------SVKEA
                             AAR LGDEAEM                  +IV TF KLG                                   S+KEA
Subjt:  ---------------------AARGLGDEAEM------------------FIVLTFEKLG-----------------------------------SVKEA

Query:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------
        MDVFKQMQAAGCVPN+STYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGY+KE V LFHD ++EKIDPN+ETY+G         
Subjt:  MDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG---------

Query:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVE-
                                                                            GGLY E EAI  RM ESGISRN  SFSGI+E 
Subjt:  --------------------------------------------------------------------GGLYMEFEAIWLRMRESGISRNLNSFSGIVE-

Query:  ---------------------------------------GFVDESKEQFHEIKAS
                                               G VDESKEQFHEIKAS
Subjt:  ---------------------------------------GFVDESKEQFHEIKAS

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic6.8e-1827.48Show/hide
Query:  KPNEYIY-AIVISLLGCEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNG------------AARGLGDEAEMF--IVLTFEKLGSVKEAMDVFKQM
        KP    Y A++   +    L N  ++ DEMS  GV     +Y+ L++AY R G             A G+   + +F  I+  F   G  ++A  V ++M
Subjt:  KPNEYIY-AIVISLLGCEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNG------------AARGLGDEAEMF--IVLTFEKLGSVKEAMDVFKQM

Query:  QAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYK------GGGLYME-FEA
        QA+G  P+   Y+++++ +GK+       + F +M+E   EPD  T+N LI    +GG +  A  LF +  +    P   TY       G   + E  EA
Subjt:  QAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYK------GGGLYME-FEA

Query:  IWLRMRESGISRNLNSFSGIVE
        +   M+E G+  N+ +++ +V+
Subjt:  IWLRMRESGISRNLNSFSGIVE

Q9FKC3 Pentatricopeptide repeat-containing protein At5g48730, chloroplastic8.0e-1926.96Show/hide
Query:  QIWCKPNEYIYAIVISLLG-CEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNG---AARGLGDEAE------------MFIVLTFEKLGSVKEAMD
        Q+W KPN  IY  +I +LG C+      E+F EM ++G + +   YTAL++AY R+G   AA  L +  +              ++ +F ++ +  +  D
Subjt:  QIWCKPNEYIYAIVISLLG-CEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNG---AARGLGDEAE------------MFIVLTFEKLGSVKEAMD

Query:  VFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEM-KESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIET-------YKGGGL
        +   M+  G  PN+ TY+ L++ YGK   + ++    ++M  E   +PD+ T N  +R FG  G  +     +  F    I+PNI T       Y   G 
Subjt:  VFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEM-KESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIET-------YKGGGL

Query:  YMEFEAIWLRMRESGISRNLNSFSGIVEGF
        Y +  A+   M++   S  + +++ +++ F
Subjt:  YMEFEAIWLRMRESGISRNLNSFSGIVEGF

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028601.5e-2027.41Show/hide
Query:  EIFDEMSSQGVIRSVFSYTALINAYGRNG------------AARGLGDEAEMFIVLT--FEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR
        ++ +EM   G   S+ +Y +LI+AY R+G            A +G   +   +  L   FE+ G V+ AM +F++M+ AGC PN  T++  + +YG  G+
Subjt:  EIFDEMSSQGVIRSVFSYTALINAYGRNG------------AARGLGDEAEMFIVLT--FEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR

Query:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHD-----FLDEKIDPN--IETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIV
        + ++ ++F E+      PD  T+N L+ VFG+ G   E   +F +     F+ E+   N  I  Y   G + +   ++ RM ++G++ +L++++ ++
Subjt:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHD-----FLDEKIDPN--IETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIV

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic7.1e-6042.09Show/hide
Query:  RQIWCKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG------------------------------------------
        RQIWCKPNE+IY I+ISLLG EGLL+ C E+FDEM SQGV RSVFSYTALINAYGRNG                                          
Subjt:  RQIWCKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG------------------------------------------

Query:  -------------------------AARGLGDEAEM------------------FIVLTFEKL-----------------------------------GS
                                 A RGLGDEAEM                   +V TF KL                                   GS
Subjt:  -------------------------AARGLGDEAEM------------------FIVLTFEKL-----------------------------------GS

Query:  VKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-----
        +KEAM VF QMQAAGC PN++TYS+LLNL+G+ GRYDDVR+LFLEMK S+ +PDA TYNILI VFGEGGY+KE V LFHD ++E I+P++ETY+G     
Subjt:  VKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-----

Query:  --GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF
          GGL+ +   I   M  + I  +  +++G++E F
Subjt:  --GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic7.2e-2028.5Show/hide
Query:  EIFDEMSSQGVIRSVFSYTALINAYGRNG----AARGLGDEAEMFIVL----------TFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR
        EI  +M  + ++ +V SY+ +I+ + + G    A    G+   + I L           + K+G  +EA+D+ ++M + G   +  TY+ LL  YGK G+
Subjt:  EIFDEMSSQGVIRSVFSYTALINAYGRNG----AARGLGDEAEMFIVL----------TFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR

Query:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF
        YD+V+++F EMK     P+  TY+ LI  + +GG YKEA+ +F +F    +  ++  Y          GL     ++   M + GIS N+ +++ I++ F
Subjt:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF

Arabidopsis top hitse value%identityAlignment
AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-1825.86Show/hide
Query:  RQIWCKPNEYIYAIVISLLG-CEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNGAARGLGDEAEMFIVLTFEKLGSVKEAMDVFKQMQAAGCVPNS
        RQ   K + + Y  ++  LG  +     +++ DEM   G   +  +Y  LI++YGR                        +KEAM+VF QMQ AGC P+ 
Subjt:  RQIWCKPNEYIYAIVISLLG-CEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNGAARGLGDEAEMFIVLTFEKLGSVKEAMDVFKQMQAAGCVPNS

Query:  STYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYK-------GGGLYMEFEAIWLRMRESG
         TY  L++++ K G  D   +++  M+E+   PD  TY+++I   G+ G+   A  LF + + +   PN+ T+            Y     ++  M+ +G
Subjt:  STYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYK-------GGGLYMEFEAIWLRMRESG

Query:  ISRNLNSFSGIVE-----GFVDESKEQFHEIK
           +  ++S ++E     GF++E++  F E++
Subjt:  ISRNLNSFSGIVE-----GFVDESKEQFHEIK

AT1G74850.1 plastid transcriptionally active 25.1e-6142.09Show/hide
Query:  RQIWCKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG------------------------------------------
        RQIWCKPNE+IY I+ISLLG EGLL+ C E+FDEM SQGV RSVFSYTALINAYGRNG                                          
Subjt:  RQIWCKPNEYIYAIVISLLGCEGLLN-CSEIFDEMSSQGVIRSVFSYTALINAYGRNG------------------------------------------

Query:  -------------------------AARGLGDEAEM------------------FIVLTFEKL-----------------------------------GS
                                 A RGLGDEAEM                   +V TF KL                                   GS
Subjt:  -------------------------AARGLGDEAEM------------------FIVLTFEKL-----------------------------------GS

Query:  VKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-----
        +KEAM VF QMQAAGC PN++TYS+LLNL+G+ GRYDDVR+LFLEMK S+ +PDA TYNILI VFGEGGY+KE V LFHD ++E I+P++ETY+G     
Subjt:  VKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-----

Query:  --GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF
          GGL+ +   I   M  + I  +  +++G++E F
Subjt:  --GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF

AT2G31400.1 genomes uncoupled 15.1e-2128.5Show/hide
Query:  EIFDEMSSQGVIRSVFSYTALINAYGRNG----AARGLGDEAEMFIVL----------TFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR
        EI  +M  + ++ +V SY+ +I+ + + G    A    G+   + I L           + K+G  +EA+D+ ++M + G   +  TY+ LL  YGK G+
Subjt:  EIFDEMSSQGVIRSVFSYTALINAYGRNG----AARGLGDEAEMFIVL----------TFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR

Query:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF
        YD+V+++F EMK     P+  TY+ LI  + +GG YKEA+ +F +F    +  ++  Y          GL     ++   M + GIS N+ +++ I++ F
Subjt:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIETYKG-------GGLYMEFEAIWLRMRESGISRNLNSFSGIVEGF

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-2127.41Show/hide
Query:  EIFDEMSSQGVIRSVFSYTALINAYGRNG------------AARGLGDEAEMFIVLT--FEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR
        ++ +EM   G   S+ +Y +LI+AY R+G            A +G   +   +  L   FE+ G V+ AM +F++M+ AGC PN  T++  + +YG  G+
Subjt:  EIFDEMSSQGVIRSVFSYTALINAYGRNG------------AARGLGDEAEMFIVLT--FEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGR

Query:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHD-----FLDEKIDPN--IETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIV
        + ++ ++F E+      PD  T+N L+ VFG+ G   E   +F +     F+ E+   N  I  Y   G + +   ++ RM ++G++ +L++++ ++
Subjt:  YDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYKEAVALFHD-----FLDEKIDPN--IETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIV

AT5G48730.1 Pentatricopeptide repeat (PPR) superfamily protein5.7e-2026.96Show/hide
Query:  QIWCKPNEYIYAIVISLLG-CEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNG---AARGLGDEAE------------MFIVLTFEKLGSVKEAMD
        Q+W KPN  IY  +I +LG C+      E+F EM ++G + +   YTAL++AY R+G   AA  L +  +              ++ +F ++ +  +  D
Subjt:  QIWCKPNEYIYAIVISLLG-CEGLLNCSEIFDEMSSQGVIRSVFSYTALINAYGRNG---AARGLGDEAE------------MFIVLTFEKLGSVKEAMD

Query:  VFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEM-KESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIET-------YKGGGL
        +   M+  G  PN+ TY+ L++ YGK   + ++    ++M  E   +PD+ T N  +R FG  G  +     +  F    I+PNI T       Y   G 
Subjt:  VFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEM-KESSAEPDATTYNILIRVFGEGGYYKEAVALFHDFLDEKIDPNIET-------YKGGGL

Query:  YMEFEAIWLRMRESGISRNLNSFSGIVEGF
        Y +  A+   M++   S  + +++ +++ F
Subjt:  YMEFEAIWLRMRESGISRNLNSFSGIVEGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGAATTTTCACTGGTTGTTATCTCAATCTATTCTTCGTTGTTCGTTAACTGTTCGTCATCTTAGTCTCATCAACTTCCATTTCTCTCAAATATTTACAATTTCAGT
GTTCGCCGTAGATTTTTTCTCAGATGATTGGAGATTGTCCTCCGATGCGGGGAAGTGCCGGGTGAAGCCGAAGGATCTTGTTCTTGGGAATACGTCGCGTCAGATATGGT
GCAAGCCGAACGAGTACATCTATGCCATCGTGATCAGCTTGCTCGGCTGCGAAGGATTGCTAAACTGTAGCGAGATATTCGATGAAATGTCGAGCCAGGGCGTGATACGT
AGCGTGTTTTCTTATACCGCTTTGATAAATGCCTACGGGCGCAATGGTGCTGCCCGCGGTTTAGGTGATGAGGCTGAGATGTTTATTGTTCTAACATTTGAAAAATTAGG
GTCCGTGAAGGAGGCAATGGATGTGTTTAAGCAGATGCAAGCAGCAGGATGCGTGCCAAATTCGTCCACTTACAGCATTCTGTTGAATCTATATGGGAAGCATGGGAGGT
ATGATGATGTTCGAGAGCTTTTCCTTGAAATGAAAGAGAGTAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGAGAGGGTGGATACTACAAG
GAGGCTGTAGCTTTGTTTCATGACTTTCTGGATGAAAAGATTGACCCAAATATCGAAACATACAAGGGAGGTGGACTCTACATGGAGTTTGAAGCAATCTGGTTGAGAAT
GAGAGAATCCGGCATTTCAAGGAATCTGAATTCATTTAGTGGTATTGTTGAAGGTTTTGTAGACGAGAGCAAGGAGCAGTTTCATGAGATTAAAGCTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGGAATTTTCACTGGTTGTTATCTCAATCTATTCTTCGTTGTTCGTTAACTGTTCGTCATCTTAGTCTCATCAACTTCCATTTCTCTCAAATATTTACAATTTCAGT
GTTCGCCGTAGATTTTTTCTCAGATGATTGGAGATTGTCCTCCGATGCGGGGAAGTGCCGGGTGAAGCCGAAGGATCTTGTTCTTGGGAATACGTCGCGTCAGATATGGT
GCAAGCCGAACGAGTACATCTATGCCATCGTGATCAGCTTGCTCGGCTGCGAAGGATTGCTAAACTGTAGCGAGATATTCGATGAAATGTCGAGCCAGGGCGTGATACGT
AGCGTGTTTTCTTATACCGCTTTGATAAATGCCTACGGGCGCAATGGTGCTGCCCGCGGTTTAGGTGATGAGGCTGAGATGTTTATTGTTCTAACATTTGAAAAATTAGG
GTCCGTGAAGGAGGCAATGGATGTGTTTAAGCAGATGCAAGCAGCAGGATGCGTGCCAAATTCGTCCACTTACAGCATTCTGTTGAATCTATATGGGAAGCATGGGAGGT
ATGATGATGTTCGAGAGCTTTTCCTTGAAATGAAAGAGAGTAGTGCTGAGCCAGATGCAACTACTTACAACATTCTCATACGAGTATTTGGAGAGGGTGGATACTACAAG
GAGGCTGTAGCTTTGTTTCATGACTTTCTGGATGAAAAGATTGACCCAAATATCGAAACATACAAGGGAGGTGGACTCTACATGGAGTTTGAAGCAATCTGGTTGAGAAT
GAGAGAATCCGGCATTTCAAGGAATCTGAATTCATTTAGTGGTATTGTTGAAGGTTTTGTAGACGAGAGCAAGGAGCAGTTTCATGAGATTAAAGCTTCATGA
Protein sequenceShow/hide protein sequence
MWNFHWLLSQSILRCSLTVRHLSLINFHFSQIFTISVFAVDFFSDDWRLSSDAGKCRVKPKDLVLGNTSRQIWCKPNEYIYAIVISLLGCEGLLNCSEIFDEMSSQGVIR
SVFSYTALINAYGRNGAARGLGDEAEMFIVLTFEKLGSVKEAMDVFKQMQAAGCVPNSSTYSILLNLYGKHGRYDDVRELFLEMKESSAEPDATTYNILIRVFGEGGYYK
EAVALFHDFLDEKIDPNIETYKGGGLYMEFEAIWLRMRESGISRNLNSFSGIVEGFVDESKEQFHEIKAS