; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G5782 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G5782
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionPentatricopeptide repeat-containing protein
Genome locationctg1402:1746232..1748215
RNA-Seq ExpressionCucsat.G5782
SyntenyCucsat.G5782
Gene Ontology termsGO:0000373 - Group II intron splicing (biological process)
GO:0000963 - mitochondrial RNA processing (biological process)
GO:0032981 - mitochondrial respiratory chain complex I assembly (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035033.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]2.53e-14995.09Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAPHSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRRI
        IVEAGTGLGEYLIRKLQAS SRRI
Subjt:  IVEAGTGLGEYLIRKLQASISRRI

XP_004138384.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Cucumis sativus]5.74e-159100Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRRI
        IVEAGTGLGEYLIRKLQASISRRI
Subjt:  IVEAGTGLGEYLIRKLQASISRRI

XP_008463091.1 PREDICTED: pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucumis melo]1.77e-15095.09Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAPHSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRRI
        IVEAGTGLGEYLIRKLQAS SRRI
Subjt:  IVEAGTGLGEYLIRKLQASISRRI

XP_022974029.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like [Cucurbita maxima]5.39e-14090.13Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQV GAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPD+AHYNTVI GFCREGRALDACKILEDMQ NGCLPNLVSY+SL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAK YVEEMTL GF PHFSVIH LVKGF ++GRI +SCSVLEDMLK GKAPHS+TWEIIISGICEVEDT K CE+ EKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRR
        IVEAG+GLGEYLIRKLQAS SRR
Subjt:  IVEAGTGLGEYLIRKLQASISRR

XP_038886671.1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial [Benincasa hispida]1.42e-14289.73Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQVNGAVDLLEDMLNKGYIPD+LSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPD+AHYNT I+GFCREGRALDACKILEDMQSNGCLPNLVSY+SL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAK YVEEMTLKGF PHFS+IHALVKGF ++GRI ESCS+LEDML  GKAPHSDTWEIIISGICEVEDT K CE+  KILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRRI
        IVEAG+GLGEYLIRKLQAS SRR+
Subjt:  IVEAGTGLGEYLIRKLQASISRRI

TrEMBL top hitse value%identityAlignment
A0A0A0K8U0 Uncharacterized protein2.78e-159100Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRRI
        IVEAGTGLGEYLIRKLQASISRRI
Subjt:  IVEAGTGLGEYLIRKLQASISRRI

A0A1S3CIG1 pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like8.56e-15195.09Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAPHSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRRI
        IVEAGTGLGEYLIRKLQAS SRRI
Subjt:  IVEAGTGLGEYLIRKLQASISRRI

A0A5A7SWW3 Pentatricopeptide repeat-containing protein1.23e-14995.09Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKL+EAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFH++GR+ ESCSVLE MLK GKAPHSDTWEIIISGICEVEDT KFCE+ EKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRRI
        IVEAGTGLGEYLIRKLQAS SRRI
Subjt:  IVEAGTGLGEYLIRKLQASISRRI

A0A6J1EXV3 pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like5.24e-14089.24Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQV GAVDLLEDMLNKGY+PDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPD+AHYNTVI GFCREGRALDACKILEDMQSN CLPNLVSY+SL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAK YVEEMTLKGF PHFSVIH LVKGF ++GRI +SCSVLEDMLK GKAPHS+TWE+IISG+CEVEDT K CE+ EKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRR
        IVEAG+GLGEYLIRKLQAS SRR
Subjt:  IVEAGTGLGEYLIRKLQASISRR

A0A6J1ICW5 pentatricopeptide repeat-containing protein At4g01400, mitochondrial-like2.61e-14090.13Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RKNQV GAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPD+AHYNTVI GFCREGRALDACKILEDMQ NGCLPNLVSY+SL
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
        TNGLCDQGMFELAK YVEEMTL GF PHFSVIH LVKGF ++GRI +SCSVLEDMLK GKAPHS+TWEIIISGICEVEDT K CE+ EKILKKDVRRDTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQASISRR
        IVEAG+GLGEYLIRKLQAS SRR
Subjt:  IVEAGTGLGEYLIRKLQASISRR

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200901.1e-2631.13Show/hide
Query:  KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT
        K +++ AV LLE M++   IP+ ++Y TL+N L ++++  +A +LL  M+ +G + +   Y+ +I G  +EG+A +A  +   M   GC PN+V Y  L 
Subjt:  KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT

Query:  NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDT--
        +GLC +G    AK  +  M   G  P+     +L+KGF   G   E+  V ++M K G + +   + ++I G+C V    +   VW K+L   ++ DT  
Subjt:  NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDT--

Query:  --RIVEAGTGLG
           I++   G+G
Subjt:  --RIVEAGTGLG

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099001.1e-2633.89Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RK  +  A+D+LE M   G  P++LSY  LL+  C++KK+  A + L RM  +GC PDI  YNT++   C++G+  DA +IL  + S GC P L++Y ++
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDT
         +GL   G    A   ++EM  K   P      +LV G    G++ E+     +  + G  P++ T+  I+ G+C+   T
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDT

Q8LDU5 Pentatricopeptide repeat-containing protein At4g01400, mitochondrial3.5e-7862.21Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RK QVNGA++LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT+I+GFCRE RA+DA K+L+DM SNGC PN VSY +L
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
          GLCDQGMF+  K Y+EEM  KGF PHFSV + LVKGF S G++ E+C V+E ++K G+  HSDTWE++I  IC  +++ K     E  +K+++  DTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQ
        IV+ G GLG YL  KLQ
Subjt:  IVEAGTGLGEYLIRKLQ

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.5e-2530.85Show/hide
Query:  SRLRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSY
        S  +   +N A++ L+ M  +G  P+  +Y TL++   +K  + EAY++L  M   G +P +  YN +I G C  G+  DA  +LEDM+  G  P++VSY
Subjt:  SRLRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSY

Query:  ESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRR
         ++ +G C     + A     EM  KG  P      +L++GF    R  E+C + E+ML+ G  P   T+  +I+  C   D  K  ++  ++++K V  
Subjt:  ESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRR

Query:  D
        D
Subjt:  D

Q9FNL2 Pentatricopeptide repeat-containing protein At5g461008.2e-2731.18Show/hide
Query:  SSRLRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVS
        S   R  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  MK KG  P++  Y++++ G C++GR+L A ++ E M + GC PN+V+
Subjt:  SSRLRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVS

Query:  YESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI-------IISGIC
        Y +L  GLC +   + A   ++ M L+G  P   +   ++ GF +I +  E+ + L++M+  G  P+  TW I       ++ G+C
Subjt:  YESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI-------IISGIC

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein7.6e-2833.89Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RK  +  A+D+LE M   G  P++LSY  LL+  C++KK+  A + L RM  +GC PDI  YNT++   C++G+  DA +IL  + S GC P L++Y ++
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDT
         +GL   G    A   ++EM  K   P      +LV G    G++ E+     +  + G  P++ T+  I+ G+C+   T
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDT

AT4G01400.1 FUNCTIONS IN: molecular_function unknown2.1e-4141.41Show/hide
Query:  AVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQ
        A +L +     G +P+T SY  L+ + C    L  AY+L  +M  +   PD+  Y  +I GFCR+G+   A ++L+DM + G +P+     +L  GLCDQ
Subjt:  AVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCDQ

Query:  GMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAG
        GMF+  K Y+EEM  KGF PHFSV + LVKGF S G++ E+C V+E ++K G+  HSDTWE++I  IC  +++ K     E  +K+++  DTRIV+ G
Subjt:  GMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAG

AT4G01400.3 FUNCTIONS IN: molecular_function unknown2.5e-7962.21Show/hide
Query:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL
        RK QVNGA++LL+DMLNKG++PD LSY TLLNSLCRK +LREAYKLLCRMK+KGCNPD+ HYNT+I+GFCRE RA+DA K+L+DM SNGC PN VSY +L
Subjt:  RKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESL

Query:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR
          GLCDQGMF+  K Y+EEM  KGF PHFSV + LVKGF S G++ E+C V+E ++K G+  HSDTWE++I  IC  +++ K     E  +K+++  DTR
Subjt:  TNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTR

Query:  IVEAGTGLGEYLIRKLQ
        IV+ G GLG YL  KLQ
Subjt:  IVEAGTGLGEYLIRKLQ

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein7.6e-2831.13Show/hide
Query:  KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT
        K +++ AV LLE M++   IP+ ++Y TL+N L ++++  +A +LL  M+ +G + +   Y+ +I G  +EG+A +A  +   M   GC PN+V Y  L 
Subjt:  KNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLT

Query:  NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDT--
        +GLC +G    AK  +  M   G  P+     +L+KGF   G   E+  V ++M K G + +   + ++I G+C V    +   VW K+L   ++ DT  
Subjt:  NGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDT--

Query:  --RIVEAGTGLG
           I++   G+G
Subjt:  --RIVEAGTGLG

AT5G46100.1 Pentatricopeptide repeat (PPR) superfamily protein5.8e-2831.18Show/hide
Query:  SSRLRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVS
        S   R  +++ A  L  +M+ K   P  ++Y +L+N LC  K + EA + L  MK KG  P++  Y++++ G C++GR+L A ++ E M + GC PN+V+
Subjt:  SSRLRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVS

Query:  YESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI-------IISGIC
        Y +L  GLC +   + A   ++ M L+G  P   +   ++ GF +I +  E+ + L++M+  G  P+  TW I       ++ G+C
Subjt:  YESLTNGLCDQGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEI-------IISGIC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCTTCCCGACTCCGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTTGAAGATATGTTGAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGTTAAATAG
TTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATATTGCTCATTACAATACAGTTATAATGGGATTTT
GCAGAGAAGGGCGTGCTCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCCTACGAGAGTTTGACTAATGGATTATGTGAT
CAAGGAATGTTTGAATTGGCAAAGGGTTATGTTGAGGAGATGACATTAAAGGGTTTTTACCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCCATAGCATTGG
CAGAATCCACGAGTCGTGTAGTGTTCTTGAAGACATGCTAAAGCGTGGGAAAGCCCCTCATTCCGATACTTGGGAGATTATTATATCTGGGATTTGTGAAGTTGAGGACA
CTGCCAAATTTTGTGAAGTTTGGGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCACTGGTTTGGGTGAGTATTTAATTAGGAAGCTA
CAAGCTTCCATATCACGAAGGATTTGA
mRNA sequenceShow/hide mRNA sequence
TCTTCCCGACTCCGGAAGAATCAAGTGAATGGTGCTGTTGACTTGCTTGAAGATATGTTGAACAAAGGATACATTCCAGACACATTGAGCTATGCCACTTTGTTAAATAG
TTTATGTAGGAAGAAAAAGCTAAGGGAAGCTTATAAACTTCTCTGTAGAATGAAGGTTAAAGGTTGCAATCCTGATATTGCTCATTACAATACAGTTATAATGGGATTTT
GCAGAGAAGGGCGTGCTCTTGATGCTTGTAAGATTCTGGAGGATATGCAGTCAAATGGTTGTTTGCCTAATTTAGTATCCTACGAGAGTTTGACTAATGGATTATGTGAT
CAAGGAATGTTTGAATTGGCAAAGGGTTATGTTGAGGAGATGACATTAAAGGGTTTTTACCCACATTTCTCTGTCATTCATGCTTTGGTTAAGGGTTTCCATAGCATTGG
CAGAATCCACGAGTCGTGTAGTGTTCTTGAAGACATGCTAAAGCGTGGGAAAGCCCCTCATTCCGATACTTGGGAGATTATTATATCTGGGATTTGTGAAGTTGAGGACA
CTGCCAAATTTTGTGAAGTTTGGGAGAAGATTTTGAAGAAAGATGTAAGAAGAGACACTAGAATAGTTGAAGCAGGCACTGGTTTGGGTGAGTATTTAATTAGGAAGCTA
CAAGCTTCCATATCACGAAGGATTTGA
Protein sequenceShow/hide protein sequence
SSRLRKNQVNGAVDLLEDMLNKGYIPDTLSYATLLNSLCRKKKLREAYKLLCRMKVKGCNPDIAHYNTVIMGFCREGRALDACKILEDMQSNGCLPNLVSYESLTNGLCD
QGMFELAKGYVEEMTLKGFYPHFSVIHALVKGFHSIGRIHESCSVLEDMLKRGKAPHSDTWEIIISGICEVEDTAKFCEVWEKILKKDVRRDTRIVEAGTGLGEYLIRKL
QASISRRI