; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G9213 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G9213
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Descriptionpentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome locationctg1658:1629546..1633883
RNA-Seq ExpressionCucsat.G9213
SyntenyCucsat.G9213
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136857.2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis sativus]2.46e-217100Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
        CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK

Query:  SE
        SE
Subjt:  SE

XP_008455250.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo]2.62e-19691.39Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR +RAAAWATPLLRHPTVG+TMELGVSRLQVG S YCT IQDQM +QLADKDRK+KDV++SKALGHISEQNIGDIRKH+IG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPIA LKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
        CRSM+AIYYRNK LEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEK+  MKKYKR+SFEK KRKRKSTKG+EDNSNLVK
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK

Query:  SE
        SE
Subjt:  SE

XP_022963531.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita moschata]1.90e-16980.07Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++  DKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFK+LEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKYK         KRKSTKG +DNS+L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima]5.70e-17180.73Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++ ADKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI+FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFKDLEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKYK         KRKSTKG +DNS+L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida]1.46e-18486.14Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        ML+RR HRA AWATPLLR  TVGQ MELGVSRLQVGS CYCT IQDQM +QLA KD K+KD N+SKALG  SEQNIGD+RKHQIGKN+ RKDKI+FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPI  LKHVL  LEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM+AIYYRNK LEDLVKLFKDLEAFGRKPP+KSIVQRVADACE+LGLLEEKERVL+KYKYLF DEKEG +KKYKR+SFEKSK KRKSTK TEDNSNL+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  KSE
        K++
Subjt:  KSE

TrEMBL top hitse value%identityAlignment
A0A1S3C174 pentatricopeptide repeat-containing protein At4g18975, chloroplastic1.27e-19691.39Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR +RAAAWATPLLRHPTVG+TMELGVSRLQVG S YCT IQDQM +QLADKDRK+KDV++SKALGHISEQNIGDIRKH+IG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        LLDLRDSKEAVYGALDAWVAWEQDFPIA LKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK
        CRSM+AIYYRNK LEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEK+  MKKYKR+SFEK KRKRKSTKG+EDNSNLVK
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVK

Query:  SE
        SE
Subjt:  SE

A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X13.80e-16778.43Show/hide
Query:  ALHSTMLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIH
        AL S+ML+RR HRA  W TPLLR  T GQ M+LGVSRLQVG+SCYCT +Q QMCQQLAD+D K+KDVN+SKAL   SEQN GD+RKHQIG+N+SRKDKI+
Subjt:  ALHSTMLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIH

Query:  FLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHS
        FLV TL+DLR SKEAVYGALDAWVAWEQ+FPIA LK VLA LEKEQQWHR+VQVIKWMLSKGQGTTM VYGQLIRALDMDHR EE+HKFWVMKIG+DLHS
Subjt:  FLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHS

Query:  VPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTED
        VPWQ+CRSM++IYYRNK L++LVKLFKDLEAFGRKPP+KSIVQRVADA EMLGL EEKERVL KYK LF DE++GP++KY +ISFEKSKR+RK TK ++D
Subjt:  VPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTED

Query:  NSNLVK
        N +L K
Subjt:  NSNLVK

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X33.75e-16678.74Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        ML+RR HRA  W TPLLR  T GQ M+LGVSRLQVG+SCYCT +Q QMCQQLAD+D K+KDVN+SKAL   SEQN GD+RKHQIG+N+SRKDKI+FLV T
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLR SKEAVYGALDAWVAWEQ+FPIA LK VLA LEKEQQWHR+VQVIKWMLSKGQGTTM VYGQLIRALDMDHR EE+HKFWVMKIG+DLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK L++LVKLFKDLEAFGRKPP+KSIVQRVADA EMLGL EEKERVL KYK LF DE++GP++KY +ISFEKSKR+RK TK ++DN +L 
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X19.18e-17080.07Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++  DKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFK+LEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKYK         KRKSTKG +DNS+L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X12.76e-17180.73Show/hide
Query:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT
        MLIRR HRAA WATPLLR  TVGQ MELGV++LQ+G+SCYCT +Q+QM ++ ADKD  DKDVN+SK L   SE+NIGDIRKHQIG+N+SRKDKI+FLVNT
Subjt:  MLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV
        L+DLRDSKEAVYGALDAWVAWEQDFPIA LKH LA LEKE QWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQV

Query:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV
        CRSM++IYYRNK LEDLVKLFKDLEAFGRKPP+KSIVQRVADACEMLGL+EEKERVLVKY YLF DEK+G +KKYK         KRKSTKG +DNS+L+
Subjt:  CRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLF-DEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLV

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic6.3e-3136.99Show/hide
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY

Q8LG95 Pentatricopeptide repeat-containing protein At4g211908.5e-2837.71Show/hide
Query:  KNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFW
        K I    K   ++  +  L + KE VYGALD+++AWE +FP+  +K  L  LE E++W +I+QV KWMLSKGQG TM  Y  L+ AL  D+R +EA + W
Subjt:  KNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFW

Query:  VMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKY
               L   P +    M++IYY+    + L ++F D+E  G K P+ +IV  V      L + ++ E+++ KY
Subjt:  VMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKY

Arabidopsis top hitse value%identityAlignment
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1)3.8e-7655.93Show/hide
Query:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH
        L+V S  Y          +   K+  ++D + S   G     N  + RKHQIG+NI +KDKI FLVNTLLD+ D+KEAVYGALDAWVAWE++FPIA LK 
Subjt:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH

Query:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPP
        V+A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD R EEAH  W  K+G+DLHSVPWQ+C  MM IY+RN  L++LVKLFKDLE++ RKPP
Subjt:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPP

Query:  DKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE
        DK IVQ VADA E+LG+L+EKERV+ KY +L        K  +    +K    R     TE   +  K+E
Subjt:  DKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4)1.6e-7455.31Show/hide
Query:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH
        L+V S  Y          +   K+  ++D + S   G     N  + RKHQIG+NI +KDKI FLVNTLLD+ D+KEAVYGALDAWVAWE++FPIA LK 
Subjt:  LQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHISEQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKH

Query:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLV---KLFKDLEAFGR
        V+A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD R EEAH  W  K+G+DLHSVPWQ+C  MM IY+RN  L++LV   KLFKDLE++ R
Subjt:  VLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLV---KLFKDLEAFGR

Query:  KPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE
        KPPDK IVQ VADA E+LG+L+EKERV+ KY +L        K  +    +K    R     TE   +  K+E
Subjt:  KPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGTEDNSNLVKSE

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein4.4e-3236.99Show/hide
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein4.4e-3236.99Show/hide
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein4.4e-3236.99Show/hide
Query:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT
        G+++  N  +I+K     H + K   +     K   LV  L  L + KEAVYGAL+ WVAWE +FPI      L  L K  QWHR++Q+ KWMLSKGQG 
Subjt:  GHISEQNIGDIRK-----HQIGK---NISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGT

Query:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV
        TM  Y  L+ A DMD R +EA   W M + +   S+P ++   M+A+Y  +   + ++++F D+E   +  PD+   +RVA A   L   E ++    R 
Subjt:  TMNVYGQLIRALDMDHRGEEAHKFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKE----RV

Query:  LVKYKYL-FDEKEGPMKKY
        L +YKY+ F+ +   +K+Y
Subjt:  LVKYKYL-FDEKEGPMKKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACATGGGTTGTTTTACAAATCTCTCTCTCTCTATATATTTTACATCTATTGGCGGTTTCTTTAAGAGCGTATTGCCTAATTACCACTCGGGACTGTGGCGCCCTTCACTC
TACAATGCTCATCCGGAGGATTCATCGAGCCGCGGCATGGGCGACGCCTCTGTTGCGACATCCAACCGTAGGGCAAACCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTG
GGTCCTCTTGTTACTGCACAACGATACAAGATCAAATGTGTCAACAGCTTGCTGATAAAGATAGAAAAGATAAGGATGTTAACAGTAGTAAAGCTTTGGGCCACATTTCA
GAGCAAAATATTGGAGACATTAGAAAGCACCAAATTGGGAAAAACATTTCACGGAAAGACAAAATTCACTTTCTTGTAAATACGCTGCTTGATCTGAGAGATAGTAAGGA
GGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCACCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAA
TTGTACAGGTAATCAAATGGATGCTAAGCAAGGGCCAGGGAACCACAATGAATGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGGGGAAGAAGCACAC
AAATTTTGGGTCATGAAGATTGGTTCGGATCTTCATTCAGTTCCCTGGCAAGTGTGCAGAAGCATGATGGCAATATACTACCGAAATAAAAGGCTAGAAGACCTTGTAAA
GCTTTTTAAGGATCTCGAAGCCTTCGGACGTAAACCCCCAGACAAATCAATAGTTCAGAGGGTTGCAGATGCTTGTGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGG
TACTCGTAAAGTACAAATACCTTTTTGATGAGAAGGAAGGACCCATGAAGAAATATAAGAGGATTTCGTTTGAAAAATCAAAGAGAAAACGAAAATCAACAAAGGGCACT
GAAGACAATAGCAACCTTGTGAAGTCTGAATGA
mRNA sequenceShow/hide mRNA sequence
ACATGGGTTGTTTTACAAATCTCTCTCTCTCTATATATTTTACATCTATTGGCGGTTTCTTTAAGAGCGTATTGCCTAATTACCACTCGGGACTGTGGCGCCCTTCACTC
TACAATGCTCATCCGGAGGATTCATCGAGCCGCGGCATGGGCGACGCCTCTGTTGCGACATCCAACCGTAGGGCAAACCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTG
GGTCCTCTTGTTACTGCACAACGATACAAGATCAAATGTGTCAACAGCTTGCTGATAAAGATAGAAAAGATAAGGATGTTAACAGTAGTAAAGCTTTGGGCCACATTTCA
GAGCAAAATATTGGAGACATTAGAAAGCACCAAATTGGGAAAAACATTTCACGGAAAGACAAAATTCACTTTCTTGTAAATACGCTGCTTGATCTGAGAGATAGTAAGGA
GGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGCACCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAACAGTGGCATAGAA
TTGTACAGGTAATCAAATGGATGCTAAGCAAGGGCCAGGGAACCACAATGAATGTCTATGGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGGGGAAGAAGCACAC
AAATTTTGGGTCATGAAGATTGGTTCGGATCTTCATTCAGTTCCCTGGCAAGTGTGCAGAAGCATGATGGCAATATACTACCGAAATAAAAGGCTAGAAGACCTTGTAAA
GCTTTTTAAGGATCTCGAAGCCTTCGGACGTAAACCCCCAGACAAATCAATAGTTCAGAGGGTTGCAGATGCTTGTGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGG
TACTCGTAAAGTACAAATACCTTTTTGATGAGAAGGAAGGACCCATGAAGAAATATAAGAGGATTTCGTTTGAAAAATCAAAGAGAAAACGAAAATCAACAAAGGGCACT
GAAGACAATAGCAACCTTGTGAAGTCTGAATGA
Protein sequenceShow/hide protein sequence
TWVVLQISLSLYILHLLAVSLRAYCLITTRDCGALHSTMLIRRIHRAAAWATPLLRHPTVGQTMELGVSRLQVGSSCYCTTIQDQMCQQLADKDRKDKDVNSSKALGHIS
EQNIGDIRKHQIGKNISRKDKIHFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIAPLKHVLAALEKEQQWHRIVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRGEEAH
KFWVMKIGSDLHSVPWQVCRSMMAIYYRNKRLEDLVKLFKDLEAFGRKPPDKSIVQRVADACEMLGLLEEKERVLVKYKYLFDEKEGPMKKYKRISFEKSKRKRKSTKGT
EDNSNLVKSE