CuGenDBv2

CSPI07G05170 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G05170
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: pentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome location: Chr7:3830975..3843528
RNA-Seq Expression: CSPI07G05170
Synteny: CSPI07G05170
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR011990 - Tetratricopeptide-like helical domain superfamily
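The genome location string can be split into a chromosome name and a coordinate pair for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `Locus` and `parse_location` are hypothetical helper names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A chromosome interval parsed from a 'Chr:start..end' string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a location such as 'Chr7:3830975..3843528'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is larger than end coordinate")
    return Locus(chrom, start, end)


locus = parse_location("Chr7:3830975..3843528")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene spans 12,554 bp on Chr7, which is much longer than the mRNA sequence listed on this page.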


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571702.1 Mediator of RNA polymerase II transcription subunit 15a, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-133; % identity: 80.73)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++ ADKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFKDLEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKY        K KRKSTKG +DNS L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

XP_004136857.2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis sativus] (e value: 1.5e-169; % identity: 100)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
        CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK

Query:  SE
        SE
Subjt:  SE

XP_008455250.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo] (e value: 1.5e-153; % identity: 91.39)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR +RAAAWATPLLRHPTVG+TMELGVSRLQVG S YCT IQDQM +QLADKDRK+KDV++SKALGHISEQNIGDIRKH+IG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPIA LKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
        CRSM+AIYYRNK LEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEK+  MKKYKR+SFEK KRKRKSTKG+EDNSNLVK
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK

Query:  SE
        SE
Subjt:  SE

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima] (e value: 4.4e-134; % identity: 80.73)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++ ADKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI+FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFKDLEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKY        K KRKSTKG +DNS+L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida] (e value: 1.2e-144; % identity: 86.14)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        ML+RR HRA AWATPLLR  TVGQ MELGVSRLQVGS CYCT IQDQM +QLA KD K+KD N+SKALG  SEQNIGD+RKHQIGKN+ RKDKI+FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPI  LKHVL  LEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM+AIYYRNK LEDLVKLFKDLEAFGRKPP+KSIVQRVADACE+LGLLEEKERVL+KYKYLF DEKEG +KKYKR+SFEKSK KRKSTK TEDNSNL+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  KSE
        K++
Subjt:  KSE
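The % identity column can be related to the alignment text itself. The sketch below assumes the usual BLAST convention that % identity is the number of identical columns divided by the total number of alignment columns, and that the unlabelled middle line marks identities with a letter and conservative substitutions with `+`. The function name `alignment_stats` is hypothetical. It reads the middle line rather than the `Subjt:` line.

```python
def alignment_stats(blocks):
    """Summarise identities and positives over aligned (query, midline) blocks.

    Each block is a pair of strings with the 'Query:  ' prefix and the
    matching leading indentation already removed.
    """
    identities = positives = length = 0
    for query, midline in blocks:
        # Trailing spaces of the midline are often lost in copied text,
        # so pad it back to the query width before counting columns.
        midline = midline.ljust(len(query))
        if len(midline) != len(query):
            raise ValueError("midline is longer than its query line")
        block_identities = sum(ch.isalpha() for ch in midline)
        identities += block_identities
        positives += block_identities + midline.count("+")
        length += len(query)  # gap characters count as alignment columns
    if length == 0:
        raise ValueError("no alignment columns supplied")
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "percent_identity": 100.0 * identities / length,
    }


# Final block of the XP_038887984.1 alignment above: query 'KSE', midline 'K++'.
print(alignment_stats([("KSE", "K++")]))
```

Passing every block of one hit in order gives the statistics for the whole alignment, which can then be compared with the value reported in the table.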

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C174 pentatricopeptide repeat-containing protein At4g18975, chloroplastic (e value: 7.1e-154; % identity: 91.39)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR +RAAAWATPLLRHPTVG+TMELGVSRLQVG S YCT IQDQM +QLADKDRK+KDV++SKALGHISEQNIGDIRKH+IG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPIA LKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
        CRSM+AIYYRNK LEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEK+  MKKYKR+SFEK KRKRKSTKG+EDNSNLVK
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK

Query:  SE
        SE
Subjt:  SE

A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 1.4e-130; % identity: 78.22)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        ML+RR HRA  W TPLLR  T GQ M+LGVSRLQVG+SCYCT +Q QMCQQLAD+D K+KDVN+SKAL   SEQN GD+RKHQIG+N+SRKDKI+FLV T
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLR SKEAVYGALDAWVAWEQ+FPIA LK VLA LEKEQQWHR+VQVIKWMLSKGQGTTM VYGQLIRALDMDHR EE+HKFWVMKIG+DLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK L++LVKLFKDLEAFGRKPP+KSIVQRVADA EMLGL EEKERVL KYK LF DE++GP++KY +ISFEKSKR+RK TK ++DN +L 
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  KSE
        K +
Subjt:  KSE

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3 (e value: 1.4e-130; % identity: 78.22)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        ML+RR HRA  W TPLLR  T GQ M+LGVSRLQVG+SCYCT +Q QMCQQLAD+D K+KDVN+SKAL   SEQN GD+RKHQIG+N+SRKDKI+FLV T
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLR SKEAVYGALDAWVAWEQ+FPIA LK VLA LEKEQQWHR+VQVIKWMLSKGQGTTM VYGQLIRALDMDHR EE+HKFWVMKIG+DLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK L++LVKLFKDLEAFGRKPP+KSIVQRVADA EMLGL EEKERVL KYK LF DE++GP++KY +ISFEKSKR+RK TK ++DN +L 
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  KSE
        K +
Subjt:  KSE

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 3.1e-133; % identity: 80.07)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++  DKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFK+LEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKY        K KRKSTKG +DNS+L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 2.1e-134; % identity: 80.73)
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++ ADKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI+FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFKDLEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKY        K KRKSTKG +DNS+L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

SwissProt top hits (e value, % identity, alignment)
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic (e value: 5.6e-31; % identity: 36.99)
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY

Q8LG95 Pentatricopeptide repeat-containing protein At4g21190 (e value: 7.5e-28; % identity: 37.71)
Query:  KNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFW
        K I    K   ++  +  L + KE VYGALD+++AWE +FP+  +K  L  LE E++W +I+QV KWMLSKGQG TM  Y  L+ AL  D+R +EA + W
Subjt:  KNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFW

Query:  VMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKY
               L   P +    M++IYY+    + L ++F D+E  G K P+ +IV  V      L + ++ E+++ KY
Subjt:  VMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKY

Arabidopsis top hits (e value, % identity, alignment)
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1) (e value: 1.5e-76; % identity: 55.93)
Query:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH
        L+V S  Y          +   K+  ++D + S   G     N  + RKHQIG+NI +KDKI FLVNTLLD+ D+KEAVYGALDAWVAWE++FPIA LK 
Subjt:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH

Query:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPP
        V+A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD R EEAH  W  K+G+DLHSVPWQ+C  MM IY+RN  L++LVKLFKDLE++ RKPP
Subjt:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPP

Query:  DKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE
        DK IVQ VADA E+LG+L+EKERV+ KY +L        K  +    +K    R     TE   +  K+E
Subjt:  DKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4) (e value: 6.5e-75; % identity: 55.31)
Query:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH
        L+V S  Y          +   K+  ++D + S   G     N  + RKHQIG+NI +KDKI FLVNTLLD+ D+KEAVYGALDAWVAWE++FPIA LK 
Subjt:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH

Query:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLV---KLFKDLEAFGR
        V+A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD R EEAH  W  K+G+DLHSVPWQ+C  MM IY+RN  L++LV   KLFKDLE++ R
Subjt:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLV---KLFKDLEAFGR

Query:  KPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE
        KPPDK IVQ VADA E+LG+L+EKERV+ KY +L        K  +    +K    R     TE   +  K+E
Subjt:  KPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.0e-32; % identity: 36.99)
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.0e-32; % identity: 36.99)
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.0e-32; % identity: 36.99)
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY


Sequences
CDS sequence
ATGCTCATCCGGAGGATTCATCGAGCCGCGGCATGGGCGACGCCTCTGTTGCGACATCCAACCGTAGGGCAAACCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGTC
CTCTTGTTACTGCACAACGATACAAGATCAAATGTGTCAACAGCTTGCTGATAAAGATAGAAAAGATAAGGATGTTAACAGTAGTAAAGCTTTGGGCCACATTTCAGAGC
AAAATATTGGAGACATTAGAAAGCACCAAATTGGGAAAAACATTTCACGGAAAGACAAAATTCACTTTCTTGTAAATACGCTGCTTGATCTGAGAGATAGTAAGGAGGCT
GTTTATGGTGCTCTTGACGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCACCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAATTGT
ACAGGTAATCAAATGGATGCTAAGCAAGGGCCAGGGAACCACAATGAATGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGGGGAAGAAGCACACAAAT
TTTGGGTCATGAAGATTGGTTCGGATCTTCATTCAGTTCCCTGGCAAGTGTGCAGAAGCATGATGGCAATATACTACCGAAATAAAAGGCTAGAAGACCTTGTAAAGCTT
TTTAAGGATCTCGAAGCCTTCGGACGTAAACCCCCAGACAAATCAATAGTTCAGAGGGTTGCAGATGCTTGTGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGGTACT
CGTAAAGTACAAATATCTTTTTGATGAGAAGGAAGGACCCATGAAGAAATATAAGAGGATTTCGTTTGAAAAATCAAAGAGAAAACGAAAATCAACAAAGGGCACTGAAG
ACAATAGCAACCTTGTGAAGTCTGAATGA
mRNA sequence
TGCCTAATTACCACTCGGGACTGTGGCGCCCTTCACTCTACAATGCTCATCCGGAGGATTCATCGAGCCGCGGCATGGGCGACGCCTCTGTTGCGACATCCAACCGTAGG
GCAAACCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGTCCTCTTGTTACTGCACAACGATACAAGATCAAATGTGTCAACAGCTTGCTGATAAAGATAGAAAAGATA
AGGATGTTAACAGTAGTAAAGCTTTGGGCCACATTTCAGAGCAAAATATTGGAGACATTAGAAAGCACCAAATTGGGAAAAACATTTCACGGAAAGACAAAATTCACTTT
CTTGTAAATACGCTGCTTGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGACGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCACCCCTTAAGCATGT
ATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAATTGTACAGGTAATCAAATGGATGCTAAGCAAGGGCCAGGGAACCACAATGAATGTCTATGGGCAGTTAATAC
GGGCTTTAGACATGGACCATCGAGGGGAAGAAGCACACAAATTTTGGGTCATGAAGATTGGTTCGGATCTTCATTCAGTTCCCTGGCAAGTGTGCAGAAGCATGATGGCA
ATATACTACCGAAATAAAAGGCTAGAAGACCTTGTAAAGCTTTTTAAGGATCTCGAAGCCTTCGGACGTAAACCCCCAGACAAATCAATAGTTCAGAGGGTTGCAGATGC
TTGTGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGGTACTCGTAAAGTACAAATATCTTTTTGATGAGAAGGAAGGACCCATGAAGAAATATAAGAGGATTTCGTTTG
AAAAATCAAAGAGAAAACGAAAATCAACAAAGGGCACTGAAGACAATAGCAACCTTGTGAAGTCTGAATGAACGAAAGAAACAGGTAAAAATATAAAAGAAATTATGTGT
TAGTTAACTTGTTCATTTGTCTCGGCAACCCAGATTATTGTGTTGACGTCTTGTGGGTATAGATATTTTGCTTATCCATATTAATTCAAAAGTATTTGTAGTAAATGATA
AAATTAGCTAATTTCGGGCATACCAAGTTGG
Protein sequence
MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEA
VYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKL
FKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE
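The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper, not a CuGenDB function. The two fragments used in the example are the first eight and the final four codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCTCATCCGGAGGATTCATCGA"))  # start of the CDS
print(translate("AAGTCTGAATGA"))              # end of the CDS, including the stop codon
```

Pasting the full CDS into `translate` and comparing the result with the protein sequence is a quick consistency check for the record.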