; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024736 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024736
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00002486:2347659..2352326
RNA-Seq ExpressionSgr024736
SyntenySgr024736
Gene Ontology termsGO:0016125 - sterol metabolic process (biological process)
GO:0019287 - isopentenyl diphosphate biosynthetic process, mevalonate pathway (biological process)
GO:0019288 - isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway (biological process)
GO:0048364 - root development (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0034046 - poly(G) binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR007011 - Late embryogenesis abundant protein, SMP subgroup domain
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582300.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]2.2e-22888.29Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL +DAYK+FVEMPHRNLETWNAYISNSVLHGRP+DSAIAFIELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVS+SNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKAC LF RARKEDIKPTDFMVSSVLCA AGLS IELGRSVQALAVKACV+ NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSID+AE+AF EMPERNLVSWNALLGGYAHQG+AD+AVALL++MAS  G+A S VSLVC LSACSRAGD+K GMQIFESMKARY +E GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDL GR GMVECA+DFI+ MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM A+T RWEEVTV+RNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITVNS+I IFQA+D S+EKDS +QDML KLRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

KAG7018711.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-22988.96Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL +DAYK+FVEMPHRNLETWNAYISNSVLHGRP+DSAIAFIELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVS+SNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKAC LF RARKEDIKPTDFMVSSVLCA AGLS IELGRSVQALAVKACVE NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSID+AE+AF EMPERNLVSWNALLGGYAHQG+AD+AVALLEEMASA G+A + VSLVC LSACSRAGD+K GMQIFESMKARY +E GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDL GR GMVECA+DFI+ MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM A+T RWEEVTV+RNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITVNS+I+IFQA+D S+EKD  IQDML KLRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

XP_022956070.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata]2.2e-22888.29Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL +DAYK+FVEMPHRNLETWNAYISNSVLHGRP+DSAIAFIELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVS+SNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKAC LF RARKEDIKPTDFMVSSVLCA AGLS IELGRSVQALAVKACV+ NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSID+AE+AF EMPERNLVSWNALLGGYAHQG+AD+AVALL++MAS  G+A S VSLVC LSACSRAGD+K GMQIFESMKARY +E GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDL GR GMVECA+DFI+ MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM A+T RWEEVTV+RNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITVNS+I IFQA+D S+EKDS +QDML KLRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

XP_022979420.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita maxima]6.3e-22888.74Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL +DAYK+FVEMPHRNLETWNAYISNSVLHGRP+DSAIAFIELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVS+SNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKAC LF RARKE IKPTDFMVSSVLCA AGLS IELGRSVQALAVKACVE NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSIDEAERAF EMPERNLVSWN+LLGGYAHQG AD+AVALLEEMASA G+A S VSLVC LSACSRAGD+K GMQIFESMKARY +E GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDL GR GMVECA+DFI+ MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM A+T RWEEVTV+RNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITVN +I IFQA+D S+EKDS +QDML  LRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

XP_023526347.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita pepo subsp. pepo]2.8e-22888.51Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL +DAYK+FVEMPHRNLETWNAYISNSVLHGRP+DSAIAFIELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVS+SNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKAC LF RARKEDIKPTDFMVSSVLCA AGLS IELGRSVQALAVKACVE NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSID+AE+AF EMPERNLVSWNALLGGYAHQG+AD+AVALLEEMASA G+A + VSLVC LSACSRAGD+K G+QIFESMKARY +E GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDL GR GMVECA+DFI+ MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM A+T RWEEVTV+RNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITVNS+I+IFQA+D S+EKD  IQDML  LRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

TrEMBL top hitse value%identityAlignment
A0A0A0L4T8 Uncharacterized protein1.1e-22587.44Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLG   DAYKVF EMPHRNLETWNAYISNSVLHGRP+DS IAFIELLR GG PDSITFCAFLNACSDKLGL PGCQLHGFIIRSG GQNVSVSNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV+CSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKA  LF RARKEDI+PTDFMVSSVLCACAGLS IE GRSVQALAVKACVE NIFV SAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMAS-AGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSID AE+AF  MPERNLVSWNALLGGYAHQGHA++AVALLEEM S AG+  S VSL+C LSACSRAGD+K GM+IFESMK RY VE GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMAS-AGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDLLGR GMVECA+DFIK MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFA+TGRWEEVTVVRNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIATP
        SWITV+S+I +FQA+D SHEKD  IQD+L KLRKEMQ+A GCIA P
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIATP

A0A5A7U206 Pentatricopeptide repeat-containing protein1.9e-22286.32Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLG   DAYK+F EMP RNLETWNAYI+NSVLHGRP+DSAIAFIELLR G  PDSITFCAFLNACSDKLGL PGCQLHGF+IRSG GQNVSVSNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV+CSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKA  LF RARKEDI+PTDFMVSSVLCACAGLS IE GRSVQALAVKACVE NIFV SAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMAS-AGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSID A +AF  MPERNLVSWNALLGGYAHQGHA++AVALLEEM S AG+  S VSL+C LSACSRAGD+K GM+IFESMK RY VE GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMAS-AGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDLLGR GMVECA+DFIK MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFA+TGRWEE TVVRNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIATP
        SWITV+S+I IFQA+D SHEKD  IQ+ML KLRKEMQ+A GCIA P
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIATP

A0A6J1C7M0 pentatricopeptide repeat-containing protein At4g148505.2e-22889.19Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL EDA KVFVEMPHRNLETWNAYISNSV HGRP+DS IAF+ELLRAGG+PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSG  QNVSVSNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV+CS MVFDRMGERN+VSWSSLIAAY+QNNEEEKAC LF +ARKEDIKP DFMVSSVLCACAGLSGIELGRSVQALAVKACVE NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMAS-AGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSIDEAERAFKEMP++NLVSWN LLGGYAHQGHAD+AVALLEEM S AGMA S VSLVC LSACSRAGD+K GMQIFESMKARY+VE GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMAS-AGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        A LVDLLGR GMVECA+DFIKNMPF PTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFA+TGRWEEVT VRNEM+EVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITV+S+I IFQA+D SHEKDS IQD+L KLRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

A0A6J1GWT9 pentatricopeptide repeat-containing protein At4g148501.1e-22888.29Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL +DAYK+FVEMPHRNLETWNAYISNSVLHGRP+DSAIAFIELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVS+SNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKAC LF RARKEDIKPTDFMVSSVLCA AGLS IELGRSVQALAVKACV+ NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSID+AE+AF EMPERNLVSWNALLGGYAHQG+AD+AVALL++MAS  G+A S VSLVC LSACSRAGD+K GMQIFESMKARY +E GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDL GR GMVECA+DFI+ MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM A+T RWEEVTV+RNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITVNS+I IFQA+D S+EKDS +QDML KLRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

A0A6J1IQR6 pentatricopeptide repeat-containing protein At4g148503.1e-22888.74Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MYSKLGL +DAYK+FVEMPHRNLETWNAYISNSVLHGRP+DSAIAFIELLRAGG PDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVS+SNGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKAC LF RARKE IKPTDFMVSSVLCA AGLS IELGRSVQALAVKACVE NIFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY
        VDMYGKCGSIDEAERAF EMPERNLVSWN+LLGGYAHQG AD+AVALLEEMASA G+A S VSLVC LSACSRAGD+K GMQIFESMKARY +E GPEHY
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASA-GMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHY

Query:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF
        ACLVDL GR GMVECA+DFI+ MPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNM A+T RWEEVTV+RNEMKEVGIKKGAGF
Subjt:  ACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGF

Query:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA
        SWITVN +I IFQA+D S+EKDS +QDML  LRKEMQEA G IA
Subjt:  SWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPGCIA

SwissProt top hitse value%identityAlignment
Q0WSH6 Pentatricopeptide repeat-containing protein At4g148502.2e-16262.73Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MY K  L++DA K+F E+P RNLETWNA+ISNSV  GRP+++  AFIE  R  G+P+SITFCAFLNACSD L L  G QLHG ++RSG   +VSV NGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKC +++ SE++F  MG +N+VSW SL+AAYVQN+E+EKA  L+ R+RK+ ++ +DFM+SSVL ACAG++G+ELGRS+ A AVKACVE  IFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD--VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEH
        VDMYGKCG I+++E+AF EMPE+NLV+ N+L+GGYAHQG  D A+AL EEMA  G   +   ++ V +LSACSRAG ++ GM+IF+SM++ Y +E G EH
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD--VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEH

Query:  YACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAG
        Y+C+VD+LGR GMVE A++FIK MP  PTIS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FA+ GRW E   VR E+K VGIKKGAG
Subjt:  YACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAG

Query:  FSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA
        +SWITV +++  FQA+D SH  +  IQ  LAKLR EM+ A
Subjt:  FSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA

Q9FIB2 Putative pentatricopeptide repeat-containing protein At5g099502.9e-9039.73Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRP-KDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGL
        +Y++ G   +  K+F  MP  +  +WN+ I       R   ++ + F+   RAG   + ITF + L+A S     E G Q+HG  +++      +  N L
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRP-KDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGL

Query:  IDFYGKCGEVKCSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGS
        I  YGKCGE+   E +F RM E R++V+W+S+I+ Y+ N    KA +L +   +   +   FM ++VL A A ++ +E G  V A +V+AC+E ++ VGS
Subjt:  IDFYGKCGEVKCSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGS

Query:  ALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD-VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPE
        ALVDMY KCG +D A R F  MP RN  SWN+++ GYA  G  + A+ L E M   G    D V+ V VLSACS AG ++ G + FESM   Y +    E
Subjt:  ALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD-VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPE

Query:  HYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKK
        H++C+ D+LGR G ++   DFI+ MP  P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+A+ GRWE++   R +MK+  +KK
Subjt:  HYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKK

Query:  GAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA
         AG+SW+T+   + +F A D SH     I   L +L ++M++A
Subjt:  GAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial2.1e-8839.59Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MY+K G  E+A KVF +MP R+  TW   IS    H RP D+ + F ++LR G +P+  T  + + A + +     G QLHGF ++ G   NV V + L+
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        D Y + G +  +++VFD +  RN VSW++LIA + + +  EKA  LF    ++  +P+ F  +S+  AC+    +E G+ V A  +K+  +   F G+ L
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYA
        +DMY K GSI +A + F  + +R++VSWN+LL  YA  G    AV   EEM   G+  +++S + VL+ACS +G +  G   +E MK +  +     HY 
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYA

Query:  CLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFS
         +VDLLGR G +  A  FI+ MP  PT +IW ALL ACRMH   ELG  AAE +FELDP D G HV+L N++AS GRW +   VR +MKE G+KK    S
Subjt:  CLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFS

Query:  WITVNSKIQIFQARDSSHEKDSAI----QDMLAKLRK
        W+ + + I +F A D  H +   I    +++LAK+++
Subjt:  WITVNSKIQIFQARDSSHEKDSAI----QDMLAKLRK

Q9LZ19 Pentatricopeptide repeat-containing protein At5g04780, mitochondrial2.5e-8636.47Show/hide
Query:  YSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLID
        YSK G  E A +VF  M  R+L +WN  I     +    ++   F+E+   G      T  + L+AC          +LH   +++    N+ V   L+D
Subjt:  YSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLID

Query:  FYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSALV
         Y KCG +K +  VF+ M +++SV+WSS++A YVQN   E+A  L+ RA++  ++   F +SSV+CAC+ L+ +  G+ + A+  K+    N+FV S+ V
Subjt:  FYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSALV

Query:  DMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYAC
        DMY KCGS+ E+   F E+ E+NL  WN ++ G+A        + L E+M   GM  ++V+   +LS C   G ++ G + F+ M+  Y +     HY+C
Subjt:  DMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYAC

Query:  LVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSW
        +VD+LGR G++  A++ IK++PF PT SIWG+LL +CR++   EL ++AAEKLFEL+P+++GNHV+LSN++A+  +WEE+   R  +++  +KK  G SW
Subjt:  LVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSW

Query:  ITVNSKIQIFQARDSSH----EKDSAIQDMLAKLRK
        I +  K+  F   +S H    E  S + +++ K RK
Subjt:  ITVNSKIQIFQARDSSH----EKDSAIQDMLAKLRK

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136002.7e-8836.08Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQN-VSVSNGL
        MYSK G   DA +VF EM  RN+ +WN+ I+    +G   ++   F  +L +   PD +T  + ++AC+    ++ G ++HG ++++   +N + +SN  
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQN-VSVSNGL

Query:  IDFYGKCGEVKCSEMVFD-------------------------------RMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCA
        +D Y KC  +K +  +FD                               +M ERN VSW++LIA Y QN E E+A +LF   ++E + PT +  +++L A
Subjt:  IDFYGKCGEVKCSEMVFD-------------------------------RMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCA

Query:  CAGLSGIELGRSVQALAVK------ACVEGNIFVGSALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVS
        CA L+ + LG       +K      +  E +IFVG++L+DMY KCG ++E    F++M ER+ VSWNA++ G+A  G+ + A+ L  EM  +G     ++
Subjt:  CAGLSGIELGRSVQALAVK------ACVEGNIFVGSALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVS

Query:  LVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDS
        ++ VLSAC  AG ++ G   F SM   + V    +HY C+VDLLGR G +E A   I+ MP  P   IWG+LL AC++H    LGK  AEKL E++P +S
Subjt:  LVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDS

Query:  GNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQ
        G +V+LSNM+A  G+WE+V  VR  M++ G+ K  G SWI +     +F  +D SH +   I  +L  L  EM+
Subjt:  GNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQ

Arabidopsis top hitse value%identityAlignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-8936.08Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQN-VSVSNGL
        MYSK G   DA +VF EM  RN+ +WN+ I+    +G   ++   F  +L +   PD +T  + ++AC+    ++ G ++HG ++++   +N + +SN  
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQN-VSVSNGL

Query:  IDFYGKCGEVKCSEMVFD-------------------------------RMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCA
        +D Y KC  +K +  +FD                               +M ERN VSW++LIA Y QN E E+A +LF   ++E + PT +  +++L A
Subjt:  IDFYGKCGEVKCSEMVFD-------------------------------RMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCA

Query:  CAGLSGIELGRSVQALAVK------ACVEGNIFVGSALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVS
        CA L+ + LG       +K      +  E +IFVG++L+DMY KCG ++E    F++M ER+ VSWNA++ G+A  G+ + A+ L  EM  +G     ++
Subjt:  CAGLSGIELGRSVQALAVK------ACVEGNIFVGSALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVS

Query:  LVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDS
        ++ VLSAC  AG ++ G   F SM   + V    +HY C+VDLLGR G +E A   I+ MP  P   IWG+LL AC++H    LGK  AEKL E++P +S
Subjt:  LVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDS

Query:  GNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQ
        G +V+LSNM+A  G+WE+V  VR  M++ G+ K  G SWI +     +F  +D SH +   I  +L  L  EM+
Subjt:  GNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQ

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-8939.59Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MY+K G  E+A KVF +MP R+  TW   IS    H RP D+ + F ++LR G +P+  T  + + A + +     G QLHGF ++ G   NV V + L+
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        D Y + G +  +++VFD +  RN VSW++LIA + + +  EKA  LF    ++  +P+ F  +S+  AC+    +E G+ V A  +K+  +   F G+ L
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYA
        +DMY K GSI +A + F  + +R++VSWN+LL  YA  G    AV   EEM   G+  +++S + VL+ACS +G +  G   +E MK +  +     HY 
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYA

Query:  CLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFS
         +VDLLGR G +  A  FI+ MP  PT +IW ALL ACRMH   ELG  AAE +FELDP D G HV+L N++AS GRW +   VR +MKE G+KK    S
Subjt:  CLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFS

Query:  WITVNSKIQIFQARDSSHEKDSAI----QDMLAKLRK
        W+ + + I +F A D  H +   I    +++LAK+++
Subjt:  WITVNSKIQIFQARDSSHEKDSAI----QDMLAKLRK

AT4G14850.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-16362.73Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI
        MY K  L++DA K+F E+P RNLETWNA+ISNSV  GRP+++  AFIE  R  G+P+SITFCAFLNACSD L L  G QLHG ++RSG   +VSV NGLI
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLI

Query:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL
        DFYGKC +++ SE++F  MG +N+VSW SL+AAYVQN+E+EKA  L+ R+RK+ ++ +DFM+SSVL ACAG++G+ELGRS+ A AVKACVE  IFVGSAL
Subjt:  DFYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSAL

Query:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD--VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEH
        VDMYGKCG I+++E+AF EMPE+NLV+ N+L+GGYAHQG  D A+AL EEMA  G   +   ++ V +LSACSRAG ++ GM+IF+SM++ Y +E G EH
Subjt:  VDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD--VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEH

Query:  YACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAG
        Y+C+VD+LGR GMVE A++FIK MP  PTIS+WGAL  ACRMHGKP+LG LAAE LF+LDPKDSGNHV+LSN FA+ GRW E   VR E+K VGIKKGAG
Subjt:  YACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAG

Query:  FSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA
        +SWITV +++  FQA+D SH  +  IQ  LAKLR EM+ A
Subjt:  FSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA

AT5G04780.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-8736.47Show/hide
Query:  YSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLID
        YSK G  E A +VF  M  R+L +WN  I     +    ++   F+E+   G      T  + L+AC          +LH   +++    N+ V   L+D
Subjt:  YSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLID

Query:  FYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSALV
         Y KCG +K +  VF+ M +++SV+WSS++A YVQN   E+A  L+ RA++  ++   F +SSV+CAC+ L+ +  G+ + A+  K+    N+FV S+ V
Subjt:  FYGKCGEVKCSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSALV

Query:  DMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYAC
        DMY KCGS+ E+   F E+ E+NL  WN ++ G+A        + L E+M   GM  ++V+   +LS C   G ++ G + F+ M+  Y +     HY+C
Subjt:  DMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYAC

Query:  LVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSW
        +VD+LGR G++  A++ IK++PF PT SIWG+LL +CR++   EL ++AAEKLFEL+P+++GNHV+LSN++A+  +WEE+   R  +++  +KK  G SW
Subjt:  LVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSW

Query:  ITVNSKIQIFQARDSSH----EKDSAIQDMLAKLRK
        I +  K+  F   +S H    E  S + +++ K RK
Subjt:  ITVNSKIQIFQARDSSH----EKDSAIQDMLAKLRK

AT5G09950.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.0e-9139.73Show/hide
Query:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRP-KDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGL
        +Y++ G   +  K+F  MP  +  +WN+ I       R   ++ + F+   RAG   + ITF + L+A S     E G Q+HG  +++      +  N L
Subjt:  MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRP-KDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGL

Query:  IDFYGKCGEVKCSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGS
        I  YGKCGE+   E +F RM E R++V+W+S+I+ Y+ N    KA +L +   +   +   FM ++VL A A ++ +E G  V A +V+AC+E ++ VGS
Subjt:  IDFYGKCGEVKCSEMVFDRMGE-RNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGS

Query:  ALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD-VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPE
        ALVDMY KCG +D A R F  MP RN  SWN+++ GYA  G  + A+ L E M   G    D V+ V VLSACS AG ++ G + FESM   Y +    E
Subjt:  ALVDMYGKCGSIDEAERAFKEMPERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSD-VSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPE

Query:  HYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKK
        H++C+ D+LGR G ++   DFI+ MP  P + IW  +LGA CR +G K ELGK AAE LF+L+P+++ N+V+L NM+A+ GRWE++   R +MK+  +KK
Subjt:  HYACLVDLLGRGGMVECAHDFIKNMPFPPTISIWGALLGA-CRMHG-KPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKK

Query:  GAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA
         AG+SW+T+   + +F A D SH     I   L +L ++M++A
Subjt:  GAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACAGCAAACTGGGTCTTCAAGAGGACGCATACAAGGTGTTTGTTGAAATGCCTCACCGAAACCTCGAGACGTGGAATGCATATATATCCAATTCGGTGCTCCATGG
GCGGCCTAAAGACTCCGCCATTGCATTTATTGAGCTACTCCGAGCTGGTGGGAACCCCGATTCAATCACATTCTGCGCTTTTCTCAATGCATGTTCAGACAAACTAGGCT
TGGAGCCTGGGTGTCAGCTACATGGGTTCATTATTCGAAGTGGGTGTGGGCAGAATGTCTCCGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAGGTTAAA
TGTTCTGAGATGGTTTTTGACAGAATGGGAGAGCGGAACAGCGTCTCCTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAACGAGGAAGAGAAGGCTTGCAACTTATT
CTTTCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGCGCTGGCCTTTCAGGAATCGAGTTGGGAAGGTCAGTTCAAGCAC
TGGCCGTCAAGGCTTGTGTAGAGGGTAACATCTTTGTTGGAAGTGCACTGGTTGATATGTATGGAAAATGTGGAAGCATTGATGAAGCAGAGCGAGCCTTCAAGGAGATG
CCAGAGAGAAACTTGGTGTCTTGGAATGCATTATTAGGCGGATACGCTCACCAAGGGCACGCAGACAGGGCTGTGGCATTGCTGGAGGAGATGGCGTCGGCGGGCATGGC
GCTGAGCGACGTGAGTTTGGTGTGTGTATTATCAGCTTGCAGTAGAGCAGGAGATATGAAGATGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGATGTAGAGG
CAGGGCCAGAGCATTACGCTTGCTTGGTGGACTTGCTTGGGCGTGGTGGAATGGTAGAGTGTGCGCATGATTTTATAAAGAACATGCCATTTCCTCCAACAATCTCAATC
TGGGGCGCTCTGTTAGGGGCTTGTCGAATGCATGGAAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTATTTGAACTTGATCCAAAAGACTCTGGAAACCACGTTGT
GCTGTCCAATATGTTTGCTTCAACTGGCAGGTGGGAAGAAGTGACTGTTGTAAGAAATGAGATGAAAGAAGTTGGGATTAAGAAGGGAGCTGGGTTCAGTTGGATAACTG
TAAACAGTAAAATTCAAATATTCCAAGCGAGAGACAGTAGCCATGAGAAGGACTCTGCAATTCAGGACATGCTGGCCAAGCTGAGGAAGGAGATGCAGGAAGCTCCTGGG
TGCATTGCAACACCAATTATGCTCTTTTTGAAGAAACCAGCAAGTGTCAAGCAGCTCGCAGTCCTTTCTCCTCTCTGGATTCTCAAGCTTTCGCCGATCGCCATCTGTAC
AGAAGATCATCAAATCACCATCCTTCTCCATAACTTGTCTGAACATGGCCCGCCCCACCGCACGTCTTCGGCAGGCCCATCCCTTCATATAGCGAAGAAAGAACATCCAG
CCGCCATTGCAGAAAGGTCGGCACGTTGTTTGGTTTTCTTTCGATTTTGTGTTGGAATCATGAGTCGGGAGCAGCCGTGCTGGGACGAAGAACAGCAGCAGTCACGCCGG
GACGACGACGAAAGGCAGGAACCCATCAAGTACGGAGACGTTTTCAACGTCTCCGGCGACCTGGCTTCGAACCCCATCGCACCTCAGGATGCTCGCATGATGGGCAGTGC
CGAAAGCAGGGTTTTGGGGCAAGTGCCACAGGCTGGTCCGGCCCATGTCATGCGAGCCGCCGCCGCCCATAACGTCCGAACCGGTCTTATCGGAAGCCGCGACGTTAGCG
ATGCTGCTAAAAACCAAGGCATTAACATCAGCGAGACCGACGTCCCTGGAGCCCGCATCGTGACCGAACACGTCGCCGGACAGAGATTCTACGCTGAACAATACCCGCTG
CTGATTGGACAGCTGCTGTCCACTACTAAATGTGGTTGGCCAGTACATGGACACCACGGCGGCGACGGCACAGAGTGCGATGATCACCATTGGACAAGCACTAGAAGCGG
CGGGTCAAACGGCAGGAAACAAGCCAGTGGAGCAGAGCGACGCTGCAGCAATTCAAGCCGCAGAGTCAGAGCAACCGGCAGTAAAGTCATAGTGCCAGGTGGGCTCGCCG
CCACGGCTCAGTCGGCGGCGACTTACAACGCCGGATTGGAACGAGACGAGGACAAGATCAAGCTCAACTTTATCCTAACGGACGCAACCGGGAAGCTGCCGTCGGATAAA
CCGGTGAGCCGGCAGGATGCGGAAGGAGTGGTGAACGCGGAGCTGAGGAACAATCCGAACCTGACGACGCACCCAGGTGCGGTGGCGGCCTCCATCACCGCCGCCGCGAG
GCTGAATGAGAGCGGAAATGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTACAGCAAACTGGGTCTTCAAGAGGACGCATACAAGGTGTTTGTTGAAATGCCTCACCGAAACCTCGAGACGTGGAATGCATATATATCCAATTCGGTGCTCCATGG
GCGGCCTAAAGACTCCGCCATTGCATTTATTGAGCTACTCCGAGCTGGTGGGAACCCCGATTCAATCACATTCTGCGCTTTTCTCAATGCATGTTCAGACAAACTAGGCT
TGGAGCCTGGGTGTCAGCTACATGGGTTCATTATTCGAAGTGGGTGTGGGCAGAATGTCTCCGTTTCAAATGGGTTGATTGATTTTTATGGGAAATGTGGGGAGGTTAAA
TGTTCTGAGATGGTTTTTGACAGAATGGGAGAGCGGAACAGCGTCTCCTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAACGAGGAAGAGAAGGCTTGCAACTTATT
CTTTCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGCGCTGGCCTTTCAGGAATCGAGTTGGGAAGGTCAGTTCAAGCAC
TGGCCGTCAAGGCTTGTGTAGAGGGTAACATCTTTGTTGGAAGTGCACTGGTTGATATGTATGGAAAATGTGGAAGCATTGATGAAGCAGAGCGAGCCTTCAAGGAGATG
CCAGAGAGAAACTTGGTGTCTTGGAATGCATTATTAGGCGGATACGCTCACCAAGGGCACGCAGACAGGGCTGTGGCATTGCTGGAGGAGATGGCGTCGGCGGGCATGGC
GCTGAGCGACGTGAGTTTGGTGTGTGTATTATCAGCTTGCAGTAGAGCAGGAGATATGAAGATGGGGATGCAGATTTTTGAGTCCATGAAAGCAAGGTACGATGTAGAGG
CAGGGCCAGAGCATTACGCTTGCTTGGTGGACTTGCTTGGGCGTGGTGGAATGGTAGAGTGTGCGCATGATTTTATAAAGAACATGCCATTTCCTCCAACAATCTCAATC
TGGGGCGCTCTGTTAGGGGCTTGTCGAATGCATGGAAAGCCAGAGTTGGGAAAGTTGGCCGCTGAGAAGCTATTTGAACTTGATCCAAAAGACTCTGGAAACCACGTTGT
GCTGTCCAATATGTTTGCTTCAACTGGCAGGTGGGAAGAAGTGACTGTTGTAAGAAATGAGATGAAAGAAGTTGGGATTAAGAAGGGAGCTGGGTTCAGTTGGATAACTG
TAAACAGTAAAATTCAAATATTCCAAGCGAGAGACAGTAGCCATGAGAAGGACTCTGCAATTCAGGACATGCTGGCCAAGCTGAGGAAGGAGATGCAGGAAGCTCCTGGG
TGCATTGCAACACCAATTATGCTCTTTTTGAAGAAACCAGCAAGTGTCAAGCAGCTCGCAGTCCTTTCTCCTCTCTGGATTCTCAAGCTTTCGCCGATCGCCATCTGTAC
AGAAGATCATCAAATCACCATCCTTCTCCATAACTTGTCTGAACATGGCCCGCCCCACCGCACGTCTTCGGCAGGCCCATCCCTTCATATAGCGAAGAAAGAACATCCAG
CCGCCATTGCAGAAAGGTCGGCACGTTGTTTGGTTTTCTTTCGATTTTGTGTTGGAATCATGAGTCGGGAGCAGCCGTGCTGGGACGAAGAACAGCAGCAGTCACGCCGG
GACGACGACGAAAGGCAGGAACCCATCAAGTACGGAGACGTTTTCAACGTCTCCGGCGACCTGGCTTCGAACCCCATCGCACCTCAGGATGCTCGCATGATGGGCAGTGC
CGAAAGCAGGGTTTTGGGGCAAGTGCCACAGGCTGGTCCGGCCCATGTCATGCGAGCCGCCGCCGCCCATAACGTCCGAACCGGTCTTATCGGAAGCCGCGACGTTAGCG
ATGCTGCTAAAAACCAAGGCATTAACATCAGCGAGACCGACGTCCCTGGAGCCCGCATCGTGACCGAACACGTCGCCGGACAGAGATTCTACGCTGAACAATACCCGCTG
CTGATTGGACAGCTGCTGTCCACTACTAAATGTGGTTGGCCAGTACATGGACACCACGGCGGCGACGGCACAGAGTGCGATGATCACCATTGGACAAGCACTAGAAGCGG
CGGGTCAAACGGCAGGAAACAAGCCAGTGGAGCAGAGCGACGCTGCAGCAATTCAAGCCGCAGAGTCAGAGCAACCGGCAGTAAAGTCATAGTGCCAGGTGGGCTCGCCG
CCACGGCTCAGTCGGCGGCGACTTACAACGCCGGATTGGAACGAGACGAGGACAAGATCAAGCTCAACTTTATCCTAACGGACGCAACCGGGAAGCTGCCGTCGGATAAA
CCGGTGAGCCGGCAGGATGCGGAAGGAGTGGTGAACGCGGAGCTGAGGAACAATCCGAACCTGACGACGCACCCAGGTGCGGTGGCGGCCTCCATCACCGCCGCCGCGAG
GCTGAATGAGAGCGGAAATGAATGA
Protein sequenceShow/hide protein sequence
MYSKLGLQEDAYKVFVEMPHRNLETWNAYISNSVLHGRPKDSAIAFIELLRAGGNPDSITFCAFLNACSDKLGLEPGCQLHGFIIRSGCGQNVSVSNGLIDFYGKCGEVK
CSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKACNLFFRARKEDIKPTDFMVSSVLCACAGLSGIELGRSVQALAVKACVEGNIFVGSALVDMYGKCGSIDEAERAFKEM
PERNLVSWNALLGGYAHQGHADRAVALLEEMASAGMALSDVSLVCVLSACSRAGDMKMGMQIFESMKARYDVEAGPEHYACLVDLLGRGGMVECAHDFIKNMPFPPTISI
WGALLGACRMHGKPELGKLAAEKLFELDPKDSGNHVVLSNMFASTGRWEEVTVVRNEMKEVGIKKGAGFSWITVNSKIQIFQARDSSHEKDSAIQDMLAKLRKEMQEAPG
CIATPIMLFLKKPASVKQLAVLSPLWILKLSPIAICTEDHQITILLHNLSEHGPPHRTSSAGPSLHIAKKEHPAAIAERSARCLVFFRFCVGIMSREQPCWDEEQQQSRR
DDDERQEPIKYGDVFNVSGDLASNPIAPQDARMMGSAESRVLGQVPQAGPAHVMRAAAAHNVRTGLIGSRDVSDAAKNQGINISETDVPGARIVTEHVAGQRFYAEQYPL
LIGQLLSTTKCGWPVHGHHGGDGTECDDHHWTSTRSGGSNGRKQASGAERRCSNSSRRVRATGSKVIVPGGLAATAQSAATYNAGLERDEDKIKLNFILTDATGKLPSDK
PVSRQDAEGVVNAELRNNPNLTTHPGAVAASITAAARLNESGNE