; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G14660 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G14660
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionpentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome locationClcChr09:19336470..19366125
RNA-Seq ExpressionClc09G14660
SyntenyClc09G14660
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571702.1 Mediator of RNA polymerase II transcription subunit 15a, partial [Cucurbita argyrosperma subsp. sororia]9.2e-14084.39Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        MLIRRFHRAA WATPLLRD TVGQIMELGV++LQ+GNSCYCT++Q+QMSK+ A KD+ +KDVNNSK L  TSE+NIGDIRKHQIGENVSR DKI+FLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPI SLKH LA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM+SIYYRNKMLEDLVKLFKDLE FGRKPPEKSIVQRVADACE+LGL+EEKER+L+KY YLFTDEK GSIKKY        K KRKSTKG + NS L 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  K
        K
Subjt:  K

XP_004136857.2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis sativus]1.0e-14687.46Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        MLIRR HRAAAWATPLLR  TVGQ MELGVSRLQVG+SCYCT IQDQM +QLA KD K+KDVN+SKALGH SEQNIGDIRKHQIG+N+SR DKI+FLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVYGALDAWVAWEQDFPI  LKHVLAALEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHR EEAHKFWVMKIGSDLHSVPWQ+
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSMM+IYYRNK LEDLVKLFKDLE FGRKPP+KSIVQRVADACE+LGLLEEKER+L+KYKYLF DEK+G +KKYKR+SFEKSKRKRKSTKGTE NSNL 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  KAQ
        K++
Subjt:  KAQ

XP_008455250.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucumis melo]5.0e-14688.45Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        MLIRRF+RAAAWATPLLR  TVG+ MELGVSRLQVG S YCT+IQDQM KQLA KD KNKDV+NSKALGH SEQNIGDIRKH+IGENVSR DKI+FLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVYGALDAWVAWEQDFPI SLKHVLAALEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM++IYYRNKMLEDLVKLFKDLE FGRKPP+KSIVQRVADACE+LGLLEEKER+L+KYKYLF DEK  S+KKYKRVSFEK KRKRKSTKG+E NSNL 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  KAQ
        K++
Subjt:  KAQ

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima]5.4e-14084.05Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        MLIRRFHRAA WATPLLRD TVGQ+MELGV++LQ+GNSCYCT++Q+QM K+ A KD+ +KDVNNSK L  TSE+NIGDIRKHQIGENVSR DKINFLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPI SLKH LA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM+SIYYRNKMLEDLVKLFKDLE FGRKPPEKSIVQRVADACE+LGL+EEKER+L+KY YLFTDEK GSIKKY        K KRKSTKG + NS+L 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  K
        K
Subjt:  K

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida]7.7e-15592.08Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        ML+RRFHRA AWATPLLRDLTVGQIMELGVSRLQVG+ CYCT+IQDQMSKQLA KDIKNKD NNSKALG TSEQNIGD+RKHQIG+NV R DKINFLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVYGALDAWVAWEQDFPI SLKHVL  LEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM++IYYRNKMLEDLVKLFKDLE FGRKPPEKSIVQRVADACEILGLLEEKER+LMKYKYLFTDEK+GSIKKYKRVSFEKSK KRKSTK TE NSNL 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  KAQ
        KAQ
Subjt:  KAQ

TrEMBL top hitse value%identityAlignment
A0A1S3C174 pentatricopeptide repeat-containing protein At4g18975, chloroplastic2.4e-14688.45Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        MLIRRF+RAAAWATPLLR  TVG+ MELGVSRLQVG S YCT+IQDQM KQLA KD KNKDV+NSKALGH SEQNIGDIRKH+IGENVSR DKI+FLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        LLDLRDSKEAVYGALDAWVAWEQDFPI SLKHVLAALEKEQQWHR+VQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM++IYYRNKMLEDLVKLFKDLE FGRKPP+KSIVQRVADACE+LGLLEEKER+L+KYKYLF DEK  S+KKYKRVSFEK KRKRKSTKG+E NSNL 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  KAQ
        K++
Subjt:  KAQ

A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X11.9e-13581.19Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        ML+RRFHRA  W TPLLRDLT GQIM+LGVSRLQVGNSCYCT++Q QM +QLA +D+KNKDVNNSKAL   SEQN GD+RKHQIGENVSR DKINFLV T
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLR SKEAVYGALDAWVAWEQ+FPI SLK VLA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM+SIYYRNKML++LVKLFKDLE FGRKPPEKSIVQRVADA E+LGL EEKER+L KYK LFTDE+ G I+KY ++SFEKSKR+RK TK ++ N +L 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  KAQ
        K Q
Subjt:  KAQ

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X31.9e-13581.19Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        ML+RRFHRA  W TPLLRDLT GQIM+LGVSRLQVGNSCYCT++Q QM +QLA +D+KNKDVNNSKAL   SEQN GD+RKHQIGENVSR DKINFLV T
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLR SKEAVYGALDAWVAWEQ+FPI SLK VLA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM+SIYYRNKML++LVKLFKDLE FGRKPPEKSIVQRVADA E+LGL EEKER+L KYK LFTDE+ G I+KY ++SFEKSKR+RK TK ++ N +L 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  KAQ
        K Q
Subjt:  KAQ

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X12.2e-13983.72Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        MLIRRFHRAA WATPLLRD TVGQIMELGV++LQ+GNSCYCT++Q+QMSK+   KD+ +KDVNNSK L  TSE+NIGDIRKHQIGENVSR DKI+FLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPI SLKH LA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM+SIYYRNKMLEDLVKLFK+LE FGRKPPEKSIVQRVADACE+LGL+EEKER+L+KY YLFTDEK GSIKKY        K KRKSTKG + NS+L 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  K
        K
Subjt:  K

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X12.6e-14084.05Show/hide
Query:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT
        MLIRRFHRAA WATPLLRD TVGQ+MELGV++LQ+GNSCYCT++Q+QM K+ A KD+ +KDVNNSK L  TSE+NIGDIRKHQIGENVSR DKINFLVNT
Subjt:  MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNT

Query:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
        L+DLRDSKEAVYGALDAWVAWEQDFPI SLKH LA LEKE QWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL
Subjt:  LLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQL

Query:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT
        CRSM+SIYYRNKMLEDLVKLFKDLE FGRKPPEKSIVQRVADACE+LGL+EEKER+L+KY YLFTDEK GSIKKY        K KRKSTKG + NS+L 
Subjt:  CRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLT

Query:  K
        K
Subjt:  K

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic2.7e-3341.53Show/hide
Query:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVYGAL+ WVAWE +FPI++    L  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY
        P +L   M+++Y  + + + ++++F D+E     P E S  +RVA A   L   E ++    R L +YKY++ + +   +K+Y
Subjt:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY

Q8LG95 Pentatricopeptide repeat-containing protein At4g211901.4e-2938.29Show/hide
Query:  ENVSRMDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFW
        + +  + K   ++  +  L + KE VYGALD+++AWE +FP+V +K  L  LE E++W +++QV KWMLSKGQG TM  Y  L+ AL  D+R +EA + W
Subjt:  ENVSRMDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFW

Query:  VMKIGSDLHSVPWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKY
               L   P +    M+SIYY+  M + L ++F D+E  G K P  +IV  V      L + ++ E+L+ KY
Subjt:  VMKIGSDLHSVPWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKY

Arabidopsis top hitse value%identityAlignment
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1)1.3e-7859.04Show/hide
Query:  LQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKH
        L+V +  Y  +     S +   K+   +D ++S   G     N  + RKHQIGEN+ + DKI FLVNTLLD+ D+KEAVYGALDAWVAWE++FPI SLK 
Subjt:  LQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKH

Query:  VLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPP
        V+A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  MM IY+RN ML++LVKLFKDLE + RKPP
Subjt:  VLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPP

Query:  EKSIVQRVADACEILGLLEEKERLLMKYKYLF----TDEKDGSIKKYKR
        +K IVQ VADA E+LG+L+EKER++ KY +L     +D+K     + K+
Subjt:  EKSIVQRVADACEILGLLEEKERLLMKYKYLF----TDEKDGSIKKYKR

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4)5.3e-7758.33Show/hide
Query:  LQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKH
        L+V +  Y  +     S +   K+   +D ++S   G     N  + RKHQIGEN+ + DKI FLVNTLLD+ D+KEAVYGALDAWVAWE++FPI SLK 
Subjt:  LQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKH

Query:  VLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMMSIYYRNKMLEDLV---KLFKDLEGFGR
        V+A+LEKE QWHR+VQVIKW+LSKGQG TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  MM IY+RN ML++LV   KLFKDLE + R
Subjt:  VLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMMSIYYRNKMLEDLV---KLFKDLEGFGR

Query:  KPPEKSIVQRVADACEILGLLEEKERLLMKYKYLF----TDEKDGSIKKYKR
        KPP+K IVQ VADA E+LG+L+EKER++ KY +L     +D+K     + K+
Subjt:  KPPEKSIVQRVADACEILGLLEEKERLLMKYKYLF----TDEKDGSIKKYKR

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-3441.53Show/hide
Query:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVYGAL+ WVAWE +FPI++    L  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY
        P +L   M+++Y  + + + ++++F D+E     P E S  +RVA A   L   E ++    R L +YKY++ + +   +K+Y
Subjt:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein1.9e-3441.53Show/hide
Query:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVYGAL+ WVAWE +FPI++    L  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY
        P +L   M+++Y  + + + ++++F D+E     P E S  +RVA A   L   E ++    R L +YKY++ + +   +K+Y
Subjt:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein1.9e-3441.53Show/hide
Query:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV
        LV  L  L + KEAVYGAL+ WVAWE +FPI++    L  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+
Subjt:  LVNTLLDLRDSKEAVYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSV

Query:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY
        P +L   M+++Y  + + + ++++F D+E     P E S  +RVA A   L   E ++    R L +YKY++ + +   +K+Y
Subjt:  PWQLCRSMMSIYYRNKMLEDLVKLFKDLEGFGRKPPEKSIVQRVADACEILGLLEEKE----RLLMKYKYLFTDEKDGSIKKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCATCCGGAGGTTTCATCGAGCCGCGGCATGGGCGACGCCTCTATTGCGAGACCTAACTGTAGGACAAATCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGAA
CTCTTGTTACTGCACATTGATACAAGATCAAATGTCTAAACAGCTTGCTTATAAAGATATAAAAAATAAGGATGTTAACAATAGTAAAGCTTTGGGCCACACTTCAGAGC
AAAATATTGGAGACATTAGAAAGCACCAAATTGGGGAAAATGTTTCTCGGATGGACAAAATTAACTTTCTTGTAAATACGCTTCTCGATCTGAGAGATAGTAAGGAGGCT
GTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGTATCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGT
ACAGGTAATCAAATGGATGTTAAGCAAGGGGCAGGGTACCACAATGAATGTTTATGGGCAGTTAATACGGGCTTTGGACATGGACCATCGAGCGGAAGAAGCACACAAGT
TTTGGGTCATGAAAATTGGTTCGGATCTACATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATGTCAATATACTACCGAAATAAAATGCTAGAAGATCTTGTAAAGCTG
TTTAAGGATCTTGAAGGTTTCGGGCGTAAACCTCCAGAAAAATCCATAGTGCAGAGGGTAGCAGATGCTTGTGAGATTCTGGGCTTGCTTGAAGAGAAAGAGAGGCTACT
AATGAAGTACAAATACCTTTTTACAGATGAGAAGGATGGGTCCATCAAGAAATATAAGAGGGTTTCGTTTGAGAAGTCAAAGAGAAAAAGAAAATCGACCAAGGGCACTG
AACTCAATAGCAACCTTACGAAGGCTCAATGA
mRNA sequenceShow/hide mRNA sequence
TTTTTTTGTTTTTTTGTTTTGCCATTGCAAGTTTCATAAAGCATTTGAGTATTTTGAGCGTTTTGCCCCATTAGCATTTGGGACTGTGGCGCCCTTTACTCTACAATGCT
CATCCGGAGGTTTCATCGAGCCGCGGCATGGGCGACGCCTCTATTGCGAGACCTAACTGTAGGACAAATCATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGAACTCTT
GTTACTGCACATTGATACAAGATCAAATGTCTAAACAGCTTGCTTATAAAGATATAAAAAATAAGGATGTTAACAATAGTAAAGCTTTGGGCCACACTTCAGAGCAAAAT
ATTGGAGACATTAGAAAGCACCAAATTGGGGAAAATGTTTCTCGGATGGACAAAATTAACTTTCTTGTAAATACGCTTCTCGATCTGAGAGATAGTAAGGAGGCTGTTTA
TGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAGACTTTCCAATAGTATCCCTTAAGCATGTATTGGCTGCCCTTGAGAAGGAACAGCAGTGGCATAGAGTTGTACAGG
TAATCAAATGGATGTTAAGCAAGGGGCAGGGTACCACAATGAATGTTTATGGGCAGTTAATACGGGCTTTGGACATGGACCATCGAGCGGAAGAAGCACACAAGTTTTGG
GTCATGAAAATTGGTTCGGATCTACATTCAGTTCCTTGGCAATTGTGCAGAAGCATGATGTCAATATACTACCGAAATAAAATGCTAGAAGATCTTGTAAAGCTGTTTAA
GGATCTTGAAGGTTTCGGGCGTAAACCTCCAGAAAAATCCATAGTGCAGAGGGTAGCAGATGCTTGTGAGATTCTGGGCTTGCTTGAAGAGAAAGAGAGGCTACTAATGA
AGTACAAATACCTTTTTACAGATGAGAAGGATGGGTCCATCAAGAAATATAAGAGGGTTTCGTTTGAGAAGTCAAAGAGAAAAAGAAAATCGACCAAGGGCACTGAACTC
AATAGCAACCTTACGAAGGCTCAATGAATATGAAAGAAAATTATGTATTAGTTAACTTATTCATTTATCTCATCTGGTTACGTCTCCTGAAGGCCTTACCGACTGTAATA
GGTATTAATCTTTTGCTTTGGCTTTGTAAATAATTGGTTTTGATTGTTGAATTATAGTCCTCGAACTTGTAGTACTGACATGTATTTAGCCCATAAACAATTAC
Protein sequenceShow/hide protein sequence
MLIRRFHRAAAWATPLLRDLTVGQIMELGVSRLQVGNSCYCTLIQDQMSKQLAYKDIKNKDVNNSKALGHTSEQNIGDIRKHQIGENVSRMDKINFLVNTLLDLRDSKEA
VYGALDAWVAWEQDFPIVSLKHVLAALEKEQQWHRVVQVIKWMLSKGQGTTMNVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMMSIYYRNKMLEDLVKL
FKDLEGFGRKPPEKSIVQRVADACEILGLLEEKERLLMKYKYLFTDEKDGSIKKYKRVSFEKSKRKRKSTKGTELNSNLTKAQ