CuGenDBv2

Carg02026 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg02026
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: Carg_Chr04:7963999..7965021
RNA-Seq Expression: Carg02026
Synteny: Carg02026
Gene Ontology terms:
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR032867 - DYW domain
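The genome location field can be split into its sequence name and coordinates programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the page itself does not state this). The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

# Location string copied from the "Genome location" field above.
LOCATION = "Carg_Chr04:7963999..7965021"

def parse_location(text):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    """Length of a closed interval; assumes 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end = parse_location(LOCATION)
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1023 positions on Carg_Chr04.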


Homology
GenBank top hits (e value, % identity, alignment)
CBI40653.3 unnamed protein product, partial [Vitis vinifera] (e value: 3.4e-45; % identity: 36.41)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSF----VRHGYAIRAGFAKLRT
        MTGVVN+  KCR +++AYK+FDRMP+RDL+ W  I++G++QNG  + ALE          RPD IT V++LPAVADV S       HGY++RAGF     
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSF----VRHGYAIRAGFAKLRT

Query:  VVSWNSMIVGYVQSGE-------------PEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFV----DKLNLGCDAAVTR------------YGVR
        V +  +++  Y + G               E+      ++ +++NV ++ ALHA ADLGD ++ +FV    D+L LG D +V              + + 
Subjt:  VVSWNSMIVGYVQSGE-------------PEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFV----DKLNLGCDAAVTR------------YGVR

Query:  DSTISRKWIKAFDGSLQCHG-------------------------------------------------RPSRSKI-------RDEIKVADYVPDTNSIQ
        D      W    DG    HG                                                  P   KI        + IK A Y+PDTNS+ 
Subjt:  DSTISRKWIKAFDGSLQCHG-------------------------------------------------RPSRSKI-------RDEIKVADYVPDTNSIQ

Query:  DVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH--------------------RFHHFKNETYCNC
        DVED V+EQLLNSHSEKLAI F LLNTSP TTI  RKNLRVCGD H                    RFHHFK+ T C+C
Subjt:  DVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH--------------------RFHHFKNETYCNC

GAU16441.1 hypothetical protein TSUD_117900 [Trifolium subterraneum] (e value: 7.8e-42; % identity: 40.43)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKAL---------EPRPDLITFVTVLPAVADVRSF----VRHGYAIRAGF-----
        M GV+N   KC +IDDAYK+F+RM ++DL+ W +++AG++QNG   +AL         + +PD +  V+VLPAVAD++ F      HGYA+R GF     
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKAL---------EPRPDLITFVTVLPAVADVRSF----VRHGYAIRAGF-----

Query:  -----------AKLRTVVSWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFD
                    K +T V+WN+MI+GY Q+G    A+                +L       D K + F         A +T      + +S  W     
Subjt:  -----------AKLRTVVSWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFD

Query:  GSLQCHGRPSRSKIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH
          L+  G        D+I+ A YVPDT+SI DVE+DV+EQLL+SHSE+LAI FGLLNTSP TTI  RKNLRVCGD H
Subjt:  GSLQCHGRPSRSKIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH

KAG6601091.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 6.6e-49; % identity: 98.98)
Query:  RTVVSWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFDGSLQCHGRPSRS
        RTVVSWNSMI+GYVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFDGSLQCHGRPSRS
Subjt:  RTVVSWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFDGSLQCHGRPSRS

KAG7031895.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.4e-154; % identity: 100)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVVSWNSMIVGYVQ
        MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVVSWNSMIVGYVQ
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVVSWNSMIVGYVQ

Query:  SGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNS
        SGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNS
Subjt:  SGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNS

Query:  IQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIHRFHHFKNETYCNCNTCIDNV
        IQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIHRFHHFKNETYCNCNTCIDNV
Subjt:  IQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIHRFHHFKNETYCNCNTCIDNV

XP_022967033.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic isoform X2 [Cucurbita maxima] (e value: 2.7e-42; % identity: 29.15)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--
        MTGVVN+  KCRQI DA+K+FDRMP RDL+ W  I+ GFSQNG A KALE          RPD IT VTVLPA AD+ S +     HGYAIRAGF+KL  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA----GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLGCDA
                                   +TVVSWNSM+ GYVQSGEPEMAIA       +  + +NV ++EALHA ADLGD +      KFVDKLNLG D 
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA----GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLGCDA

Query:  AVTR-------------------------------------------------------YGVRDS----------------TISRKWI------------
        ++                                                          G++                  T   KWI            
Subjt:  AVTR-------------------------------------------------------YGVRDS----------------TISRKWI------------

Query:  ---------------------KAFD---------------------------------------------------------------------------
                             K FD                                                                           
Subjt:  ---------------------KAFD---------------------------------------------------------------------------

Query:  ------------------------------------------GSLQCH-----GRPSRSK----------------------------------------
                                                  G+ + H     G  S +K                                        
Subjt:  ------------------------------------------GSLQCH-----GRPSRSK----------------------------------------

Query:  -------------------------------------IRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDI
                                             + DEIK A YVPDTNSI DVEDDVQEQLLNSHSEKLAI F LLNTSP+TTI  RKNLRVCGD+
Subjt:  -------------------------------------IRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDI

Query:  HRFHHFKNETYCNC
        HRFHHFKN T C+C
Subjt:  HRFHHFKNETYCNC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KPP6 DYW_deaminase domain-containing protein (e value: 1.3e-18; % identity: 85.25)
Query:  EIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH
        EIK A YVPDTN I DVEDDVQEQLLNSHSEKLAI FGLLNTSP TTI  RKNLRVCGD H
Subjt:  EIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH

A0A1S3BEJ2 pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 2.6e-19; % identity: 86.89)
Query:  EIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH
        EIK A YVPDTNSI DVEDDVQEQLLNSHSEKLAI FGLLNTSP TTI  RKNLRVCGD H
Subjt:  EIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH

A0A1S3BEJ2 pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 7.9e-40; % identity: 52.2)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--
        MTGVVN+  KCRQIDDAYK+FDRMP+RDL+ W  I+AGFSQNG A+KALE          RPD IT VTVLPA ADV   +     HGYAIRAGFAKL  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA-------GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLG
                                   +TVVSWNSM+ GYVQ+GEPE AIA        G D    + V ++EALHA ADLGD +R     KFVD+LNLG
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA-------GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLG

Query:  CDAAV
         D +V
Subjt:  CDAAV

A0A5D3CX30 Pentatricopeptide repeat-containing protein (e value: 1.2e-40; % identity: 52.68)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--
        MTGVVN+  KCRQIDDAYK+FDRMP+RDL+ W  I+AGFSQNG A+KALE          RPD IT VTVLPA ADV S +     HGYAIRAGFAKL  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA-------GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLG
                                   +TVVSWNSM+ GYVQ+GEPE AIA        G D    ++V ++EALHA ADLGD +R     KFVD+LNLG
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA-------GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLG

Query:  CDAAV
         D +V
Subjt:  CDAAV

A0A5D3CX30 Pentatricopeptide repeat-containing protein (e value: 2.6e-19; % identity: 86.89)
Query:  EIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH
        EIK A YVPDTNSI DVEDDVQEQLLNSHSEKLAI FGLLNTSP TTI  RKNLRVCGD H
Subjt:  EIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH

A0A5D3CX30 Pentatricopeptide repeat-containing protein (e value: 1.2e-40; % identity: 51.98)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--
        MTGVVN+  KCRQIDDAYK+FDRMP RDL+ W  I+ GFSQNG A KALE          RPD IT VTVLPA AD+ S +     HGYAIRAGF+KL  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA----GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLGCDA
                                   +TVVSWNSM+ GYVQSGEPEMAIA       +  + +NV ++EALHA ADLGD +      KFVDKLNLG D 
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA----GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLGCDA

Query:  AV
        ++
Subjt:  AV

A0A6J1G036 pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 2.2e-18; % identity: 78.46)
Query:  KIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH
        ++ DEIK A YVPDT+SI DVED VQEQLLNSHSEKLAI F LLNTSP+TTI  RKNLRVCGD H
Subjt:  KIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH

A0A6J1G036 pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 1.2e-40; % identity: 52.68)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--
        MTGVVN+  KCRQIDDAYK+FDRMP+RDL+ W  I+AGFSQNG A+KALE          RPD IT VTVLPA ADV S +     HGYAIRAGFAKL  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA-------GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLG
                                   +TVVSWNSM+ GYVQ+GEPE AIA        G D    ++V ++EALHA ADLGD +R     KFVD+LNLG
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA-------GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLG

Query:  CDAAV
         D +V
Subjt:  CDAAV

A0A6J1HTZ0 pentatricopeptide repeat-containing protein At1g11290, chloroplastic isoform X2 (e value: 1.3e-42; % identity: 29.15)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--
        MTGVVN+  KCRQI DA+K+FDRMP RDL+ W  I+ GFSQNG A KALE          RPD IT VTVLPA AD+ S +     HGYAIRAGF+KL  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV----RHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA----GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLGCDA
                                   +TVVSWNSM+ GYVQSGEPEMAIA       +  + +NV ++EALHA ADLGD +      KFVDKLNLG D 
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIA----GGRDRDKLSNVILVEALHAYADLGDRKR----EKFVDKLNLGCDA

Query:  AVTR-------------------------------------------------------YGVRDS----------------TISRKWI------------
        ++                                                          G++                  T   KWI            
Subjt:  AVTR-------------------------------------------------------YGVRDS----------------TISRKWI------------

Query:  ---------------------KAFD---------------------------------------------------------------------------
                             K FD                                                                           
Subjt:  ---------------------KAFD---------------------------------------------------------------------------

Query:  ------------------------------------------GSLQCH-----GRPSRSK----------------------------------------
                                                  G+ + H     G  S +K                                        
Subjt:  ------------------------------------------GSLQCH-----GRPSRSK----------------------------------------

Query:  -------------------------------------IRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDI
                                             + DEIK A YVPDTNSI DVEDDVQEQLLNSHSEKLAI F LLNTSP+TTI  RKNLRVCGD+
Subjt:  -------------------------------------IRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDI

Query:  HRFHHFKNETYCNC
        HRFHHFKN T C+C
Subjt:  HRFHHFKNETYCNC

SwissProt top hits (e value, % identity, alignment)
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 2.7e-29; % identity: 41.29)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEP---------RPDLITFVTVLPAVADVRSF----VRHGYAIRAGFAKL--
        MTG+ N+  KCRQ+++A KVFDRMP+RDL+ W  I+AG+SQNG AR ALE          +P  IT V+VLPAV+ +R        HGYA+R+GF  L  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEP---------RPDLITFVTVLPAVADVRSF----VRHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIAGGR----DRDKLSNVILVEALHAYADLGDRKREKFVDKLN--LGCDAAV
                                   R VVSWNSMI  YVQ+  P+ A+   +    +  K ++V ++ ALHA ADLGD +R +F+ KL+  LG D  V
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIAGGR----DRDKLSNVILVEALHAYADLGDRKREKFVDKLN--LGCDAAV

Query:  T
        +
Subjt:  T

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 2.6e-16; % identity: 58.33)
Query:  KWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH
        K I AF   L CH           IK A YVPDTN +  VE+DV+EQLL++HSEKLAI+FGLLNT+  TTI  RKNLRVC D H
Subjt:  KWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial (e value: 1.8e-20; % identity: 28.07)
Query:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV--------------------
        T ++++  KC +++ A K FDRM  +++  W A++AG+  +G A KALE          RP+ ITFV+VL A +     V                    
Subjt:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV--------------------

Query:  RHGYAI----RAGF----------AKLR-TVVSWNSMIVG-----YVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAV
         +G  +    RAGF           K++   + W+S++        V+  E  +A     D       +L+   H YAD G   R K V+++ +      
Subjt:  RHGYAI----RAGF----------AKLR-TVVSWNSMIVG-----YVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAV

Query:  TRYGVRDSTISRKWIKAFDGSLQC-----HGRPSRSKIRD-------EIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRK
         R  V+    S   +   +G +          P R KI +       ++  A YV +T+S+  DV+++ +E  L  HSEKLAI FG++NT P +T+   K
Subjt:  TRYGVRDSTISRKWIKAFDGSLQC-----HGRPSRSKIRD-------EIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRK

Query:  NLRVCGDIH--------------------RFHHFKNETYCNC
        NLRVC D H                    RFHHFK +  C+C
Subjt:  NLRVCGDIH--------------------RFHHFKNETYCNC

Q9M4P3 Pentatricopeptide repeat-containing protein At4g16835, mitochondrial (e value: 5.1e-20; % identity: 25.66)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKAL---------EPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVV--
        +T ++++  KC ++ DA+K+F+ M ++D++ W A+++G++Q+G+A KAL         + RPD ITFV VL A         H   +  G A   ++V  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKAL---------EPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVV--

Query:  --------SWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEAL------HAYADLGDRKREK-----------FVDKLNLGC------DAAVTRYGVR
                 +  M+    ++G+ E A+   R      +  +   L      H   +L +   EK           +V   N+        D A  R  ++
Subjt:  --------SWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEAL------HAYADLGDRKREK-----------FVDKLNLGC------DAAVTRYGVR

Query:  DSTISR----KWIKA------FDGSLQCHG-----RPSRSKIRDEIKVADYVPDTN-SIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLR
        +S + +     WI+       F  S + H           ++  ++K+A Y P+   ++ +VE++ +E+LL  HSEKLA+ FG +     + I   KNLR
Subjt:  DSTISR----KWIKA------FDGSLQCHG-----RPSRSKIRDEIKVADYVPDTN-SIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLR

Query:  VCGDIH--------------------RFHHFKNETYCNC
        +CGD H                    RFHHFK+ + C+C
Subjt:  VCGDIH--------------------RFHHFKNETYCNC

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 (e value: 1.9e-19; % identity: 27.81)
Query:  VVNLCDKCRQIDDAYKVFDRMP-QRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAV--ADVRSFVRHGYAIRAGFAKLRTVVSW
        ++ +  K   I  A + FD +  +RD + W +++   +Q+G A +ALE          RPD IT+V V  A   A + +  R  + +     K+   +S 
Subjt:  VVNLCDKCRQIDDAYKVFDRMP-QRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAV--ADVRSFVRHGYAIRAGFAKLRTVVSW

Query:  NSMIV------GYVQSG---------EPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREK--FVDKLNLGC---------------DAAVTRYGVRD
         + +V      G +Q           EP++   G        +++    +H   DLG    E+   ++  N G                +AA  R  ++D
Subjt:  NSMIV------GYVQSG---------EPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREK--FVDKLNLGC---------------DAAVTRYGVRD

Query:  STISRK----WI----KAFDGSLQCHGRPSRS-------KIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRV
          + ++    WI    K     ++    P ++       KI DEIK   YVPDT S+  D+E++V+EQ+L  HSEKLAI FGL++T   TT+   KNLRV
Subjt:  STISRK----WI----KAFDGSLQCHGRPSRS-------KIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRV

Query:  CGDIH--------------------RFHHFKNETYCNC
        C D H                    RFHHFK + +C+C
Subjt:  CGDIH--------------------RFHHFKNETYCNC

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic (e value: 1.9e-19; % identity: 27.6)
Query:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE----------PRPDLITFVTVLPAVA-------DVRSFVRHG--YAI----
        TG++++  KC  +++A  VF+  P++D++ W A++AG++ +G ++ AL            +P  ITF+  L A A        +R F   G  Y I    
Subjt:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE----------PRPDLITFVTVLPAVA-------DVRSFVRHG--YAI----

Query:  -----------RAGFAK--LRTV---------VSWNSMIVGYVQSGEPEMAIAGGRDRDKL------SNVILVEALHAYADLGDRKREKFVDKLNLGCDA
                   RAG  K    T+         V W+S++      G+    + G    + L      ++ I V   + YA +GD +    V  L +    
Subjt:  -----------RAGFAK--LRTV---------VSWNSMIVGYVQSGEPEMAIAGGRDRDKL------SNVILVEALHAYADLGDRKREKFVDKLNLGCDA

Query:  AVTRYGVRDSTISRKWIKAFDGSLQCHGRPSR-----SKIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVC
         V   G+    I  K +  F    + H +         KI + IK   YVP+TN++ QD+E+  +EQ L  HSE+LAI +GL++T P + +   KNLRVC
Subjt:  AVTRYGVRDSTISRKWIKAFDGSLQCHGRPSR-----SKIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVC

Query:  GDIH--------------------RFHHFKNETYCNC
         D H                    RFHHF + + C+C
Subjt:  GDIH--------------------RFHHFKNETYCNC

Arabidopsis top hits (e value, % identity, alignment)
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.9e-30; % identity: 41.29)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEP---------RPDLITFVTVLPAVADVRSF----VRHGYAIRAGFAKL--
        MTG+ N+  KCRQ+++A KVFDRMP+RDL+ W  I+AG+SQNG AR ALE          +P  IT V+VLPAV+ +R        HGYA+R+GF  L  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEP---------RPDLITFVTVLPAVADVRSF----VRHGYAIRAGFAKL--

Query:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIAGGR----DRDKLSNVILVEALHAYADLGDRKREKFVDKLN--LGCDAAV
                                   R VVSWNSMI  YVQ+  P+ A+   +    +  K ++V ++ ALHA ADLGD +R +F+ KL+  LG D  V
Subjt:  ---------------------------RTVVSWNSMIVGYVQSGEPEMAIAGGR----DRDKLSNVILVEALHAYADLGDRKREKFVDKLN--LGCDAAV

Query:  T
        +
Subjt:  T

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.9e-17; % identity: 58.33)
Query:  KWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH
        K I AF   L CH           IK A YVPDTN +  VE+DV+EQLL++HSEKLAI+FGLLNT+  TTI  RKNLRVC D H
Subjt:  KWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein (e value: 1.4e-20; % identity: 27.81)
Query:  VVNLCDKCRQIDDAYKVFDRMP-QRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAV--ADVRSFVRHGYAIRAGFAKLRTVVSW
        ++ +  K   I  A + FD +  +RD + W +++   +Q+G A +ALE          RPD IT+V V  A   A + +  R  + +     K+   +S 
Subjt:  VVNLCDKCRQIDDAYKVFDRMP-QRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAV--ADVRSFVRHGYAIRAGFAKLRTVVSW

Query:  NSMIV------GYVQSG---------EPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREK--FVDKLNLGC---------------DAAVTRYGVRD
         + +V      G +Q           EP++   G        +++    +H   DLG    E+   ++  N G                +AA  R  ++D
Subjt:  NSMIV------GYVQSG---------EPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREK--FVDKLNLGC---------------DAAVTRYGVRD

Query:  STISRK----WI----KAFDGSLQCHGRPSRS-------KIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRV
          + ++    WI    K     ++    P ++       KI DEIK   YVPDT S+  D+E++V+EQ+L  HSEKLAI FGL++T   TT+   KNLRV
Subjt:  STISRK----WI----KAFDGSLQCHGRPSRS-------KIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRV

Query:  CGDIH--------------------RFHHFKNETYCNC
        C D H                    RFHHFK + +C+C
Subjt:  CGDIH--------------------RFHHFKNETYCNC

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.2e-21; % identity: 28.07)
Query:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV--------------------
        T ++++  KC +++ A K FDRM  +++  W A++AG+  +G A KALE          RP+ ITFV+VL A +     V                    
Subjt:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE---------PRPDLITFVTVLPAVADVRSFV--------------------

Query:  RHGYAI----RAGF----------AKLR-TVVSWNSMIVG-----YVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAV
         +G  +    RAGF           K++   + W+S++        V+  E  +A     D       +L+   H YAD G   R K V+++ +      
Subjt:  RHGYAI----RAGF----------AKLR-TVVSWNSMIVG-----YVQSGEPEMAIAGGRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAV

Query:  TRYGVRDSTISRKWIKAFDGSLQC-----HGRPSRSKIRD-------EIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRK
         R  V+    S   +   +G +          P R KI +       ++  A YV +T+S+  DV+++ +E  L  HSEKLAI FG++NT P +T+   K
Subjt:  TRYGVRDSTISRKWIKAFDGSLQC-----HGRPSRSKIRD-------EIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRK

Query:  NLRVCGDIH--------------------RFHHFKNETYCNC
        NLRVC D H                    RFHHFK +  C+C
Subjt:  NLRVCGDIH--------------------RFHHFKNETYCNC

AT4G16835.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.6e-21; % identity: 25.66)
Query:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKAL---------EPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVV--
        +T ++++  KC ++ DA+K+F+ M ++D++ W A+++G++Q+G+A KAL         + RPD ITFV VL A         H   +  G A   ++V  
Subjt:  MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKAL---------EPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVV--

Query:  --------SWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEAL------HAYADLGDRKREK-----------FVDKLNLGC------DAAVTRYGVR
                 +  M+    ++G+ E A+   R      +  +   L      H   +L +   EK           +V   N+        D A  R  ++
Subjt:  --------SWNSMIVGYVQSGEPEMAIAGGRDRDKLSNVILVEAL------HAYADLGDRKREK-----------FVDKLNLGC------DAAVTRYGVR

Query:  DSTISR----KWIKA------FDGSLQCHG-----RPSRSKIRDEIKVADYVPDTN-SIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLR
        +S + +     WI+       F  S + H           ++  ++K+A Y P+   ++ +VE++ +E+LL  HSEKLA+ FG +     + I   KNLR
Subjt:  DSTISR----KWIKA------FDGSLQCHG-----RPSRSKIRDEIKVADYVPDTN-SIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLR

Query:  VCGDIH--------------------RFHHFKNETYCNC
        +CGD H                    RFHHFK+ + C+C
Subjt:  VCGDIH--------------------RFHHFKNETYCNC

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.4e-20; % identity: 27.6)
Query:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE----------PRPDLITFVTVLPAVA-------DVRSFVRHG--YAI----
        TG++++  KC  +++A  VF+  P++D++ W A++AG++ +G ++ AL            +P  ITF+  L A A        +R F   G  Y I    
Subjt:  TGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALE----------PRPDLITFVTVLPAVA-------DVRSFVRHG--YAI----

Query:  -----------RAGFAK--LRTV---------VSWNSMIVGYVQSGEPEMAIAGGRDRDKL------SNVILVEALHAYADLGDRKREKFVDKLNLGCDA
                   RAG  K    T+         V W+S++      G+    + G    + L      ++ I V   + YA +GD +    V  L +    
Subjt:  -----------RAGFAK--LRTV---------VSWNSMIVGYVQSGEPEMAIAGGRDRDKL------SNVILVEALHAYADLGDRKREKFVDKLNLGCDA

Query:  AVTRYGVRDSTISRKWIKAFDGSLQCHGRPSR-----SKIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVC
         V   G+    I  K +  F    + H +         KI + IK   YVP+TN++ QD+E+  +EQ L  HSE+LAI +GL++T P + +   KNLRVC
Subjt:  AVTRYGVRDSTISRKWIKAFDGSLQCHGRPSR-----SKIRDEIKVADYVPDTNSI-QDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVC

Query:  GDIH--------------------RFHHFKNETYCNC
         D H                    RFHHF + + C+C
Subjt:  GDIH--------------------RFHHFKNETYCNC
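
The % identity values in the hit lists can be cross-checked against the middle row of each alignment. The sketch below assumes the usual BLAST-style convention (a letter in the middle row marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and divides by the number of aligned columns. The function name `percent_identity` is a hypothetical helper, and the database may use a different denominator for gapped or multi-segment hits, so treat this as an illustration rather than the site's own calculation.

```python
def percent_identity(midline, aligned_length=None):
    """Percent of alignment columns whose middle-row character is a letter.

    aligned_length can be passed explicitly in case trailing spaces were
    stripped from the middle row when it was copied.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)

# Query row and middle row of the A0A0A0KPP6 hit in the TrEMBL list above.
query   = "EIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKLAITFGLLNTSPDTTILFRKNLRVCGDIH"
midline = "EIK A YVPDTN I DVEDDVQEQLLNSHSEKLAI FGLLNTSP TTI  RKNLRVCGD H"

print(percent_identity(midline, len(query)))
```

For this ungapped 61-column alignment, 52 columns are identical, which gives the 85.25 reported for A0A0A0KPP6.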


Sequences
CDS sequence
ATGACTGGCGTTGTGAATTTGTGCGATAAATGCAGGCAGATTGATGATGCGTACAAGGTGTTCGACAGAATGCCTCAGAGAGATTTGTTGTTTTGGGGTGCGATTATGGC
TGGGTTTTCTCAAAATGGCTCTGCAAGGAAGGCACTGGAGCCAAGGCCTGATTTGATTACATTTGTTACTGTTTTGCCTGCTGTTGCTGATGTTCGTTCATTTGTTCGTC
ACGGGTATGCCATTAGAGCTGGGTTTGCAAAGCTTAGGACTGTTGTGTCATGGAACTCCATGATTGTTGGATATGTGCAAAGTGGTGAACCAGAGATGGCTATTGCAGGA
GGAAGGGATAGAGATAAGCTAAGCAATGTAATACTTGTGGAAGCTTTGCATGCCTATGCCGATTTGGGCGATCGCAAGAGGGAGAAGTTTGTCGATAAGTTAAATCTTGG
TTGTGATGCCGCAGTCACTCGGTATGGCGTGAGGGACTCTACTATTTCAAGAAAATGGATTAAAGCCTTTGATGGATCACTACAGTGCCATGGTCGACCTTCTCGATCGA
AAATTAGAGATGAAATCAAGGTAGCTGACTATGTGCCAGATACTAACTCGATTCAAGATGTAGAAGATGATGTGCAGGAGCAGCTCCTCAATAGCCATAGTGAGAAGCTC
GCCATTACTTTTGGTCTTTTAAATACCAGTCCTGATACTACGATACTCTTTCGAAAGAACCTACGAGTGTGTGGAGATATACATAGATTTCATCATTTCAAAAATGAAAC
TTATTGTAACTGTAACACCTGCATTGATAATGTATAG
mRNA sequence
ATGACTGGCGTTGTGAATTTGTGCGATAAATGCAGGCAGATTGATGATGCGTACAAGGTGTTCGACAGAATGCCTCAGAGAGATTTGTTGTTTTGGGGTGCGATTATGGC
TGGGTTTTCTCAAAATGGCTCTGCAAGGAAGGCACTGGAGCCAAGGCCTGATTTGATTACATTTGTTACTGTTTTGCCTGCTGTTGCTGATGTTCGTTCATTTGTTCGTC
ACGGGTATGCCATTAGAGCTGGGTTTGCAAAGCTTAGGACTGTTGTGTCATGGAACTCCATGATTGTTGGATATGTGCAAAGTGGTGAACCAGAGATGGCTATTGCAGGA
GGAAGGGATAGAGATAAGCTAAGCAATGTAATACTTGTGGAAGCTTTGCATGCCTATGCCGATTTGGGCGATCGCAAGAGGGAGAAGTTTGTCGATAAGTTAAATCTTGG
TTGTGATGCCGCAGTCACTCGGTATGGCGTGAGGGACTCTACTATTTCAAGAAAATGGATTAAAGCCTTTGATGGATCACTACAGTGCCATGGTCGACCTTCTCGATCGA
AAATTAGAGATGAAATCAAGGTAGCTGACTATGTGCCAGATACTAACTCGATTCAAGATGTAGAAGATGATGTGCAGGAGCAGCTCCTCAATAGCCATAGTGAGAAGCTC
GCCATTACTTTTGGTCTTTTAAATACCAGTCCTGATACTACGATACTCTTTCGAAAGAACCTACGAGTGTGTGGAGATATACATAGATTTCATCATTTCAAAAATGAAAC
TTATTGTAACTGTAACACCTGCATTGATAATGTATAG
Protein sequence
MTGVVNLCDKCRQIDDAYKVFDRMPQRDLLFWGAIMAGFSQNGSARKALEPRPDLITFVTVLPAVADVRSFVRHGYAIRAGFAKLRTVVSWNSMIVGYVQSGEPEMAIAG
GRDRDKLSNVILVEALHAYADLGDRKREKFVDKLNLGCDAAVTRYGVRDSTISRKWIKAFDGSLQCHGRPSRSKIRDEIKVADYVPDTNSIQDVEDDVQEQLLNSHSEKL
AITFGLLNTSPDTTILFRKNLRVCGDIHRFHHFKNETYCNCNTCIDNV
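
The relationship between the CDS and the protein sequence can be checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the page does not state which table was used. The `translate` function is a hypothetical helper written for illustration. Only the first 33 and last 24 nucleotides of the CDS are used here, both of which fall on codon boundaries.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Start and end of the CDS sequence listed above.
cds_start = "ATGACTGGCGTTGTGAATTTGTGCGATAAATGC"
cds_end = "AACACCTGCATTGATAATGTATAG"

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MTGVVNLCDKC and NTCIDNV, matching the first and last residues of the protein sequence; passing the full CDS to `translate` would allow the whole protein to be compared the same way.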