; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006309 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006309
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionpentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome locationscaffold4:3543969..3555588
RNA-Seq ExpressionSpg006309
SyntenySpg006309
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571702.1 Mediator of RNA polymerase II transcription subunit 15a, partial [Cucurbita argyrosperma subsp. sororia]1.1e-12167.13Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQIMELGV+KLQ+GNS YCTM+Q QM K+ ADKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKI+FLVNT
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLRDSKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM+VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLE+L
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK
        VKLFKDLEAFGRKPP+KSIVQRVADA EMLGL+EEKERVLVKY  LFT EKK SIKKYK
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK

XP_022154414.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Momordica charantia]2.6e-12866.94Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRD+T GQIM+LGVS+LQVGNS YCTM+QAQMC+QLAD+DMKNKD+N++KALCQ SEQN GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLR SKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML+NL
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI
        VKLFKDLEAFGRKPP+KSIVQRVADAYEMLGL EEKERVL KYKDLFT E+K  I+KY +IS EKSKR++K+
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI

XP_022154416.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3 [Momordica charantia]2.6e-12866.94Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRD+T GQIM+LGVS+LQVGNS YCTM+QAQMC+QLAD+DMKNKD+N++KALCQ SEQN GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLR SKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML+NL
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI
        VKLFKDLEAFGRKPP+KSIVQRVADAYEMLGL EEKERVL KYKDLFT E+K  I+KY +IS EKSKR++K+
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima]6.3e-12267.13Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQ+MELGV+KLQ+GNS YCTM+Q QM K+ ADKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKINFLVNT
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLRDSKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM+VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLE+L
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK
        VKLFKDLEAFGRKPP+KSIVQRVADA EMLGL+EEKERVLVKY  LFT EKK SIKKYK
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida]1.3e-12266.31Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA  WATPLLRD+TVGQIMELGVS+LQVG+  YCTMIQ QM KQLA KD+KNKD N++KAL QTSEQN GD+RKHQIG+NV RKDKINFLVNT
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           LLDLRDSKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQ+FPI SLK  L  LEKEQQWHRVVQVIKWMLSKGQGTTM+VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMI+IYYRNKMLE+L
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKK
        VKLFKDLEAFGRKPP+KSIVQRVADA E+LGLLEEKERVL+KYK LFT EK+ SIKKYKR+S EKSK K+K
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKK

TrEMBL top hitse value%identityAlignment
A0A6J1DJJ2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X23.4e-12165.05Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRD+T GQIM+LGVS+LQVGNS YCTM+QAQMC+QLAD+DMKNK           SEQN GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLR SKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML+NL
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI
        VKLFKDLEAFGRKPP+KSIVQRVADAYEMLGL EEKERVL KYKDLFT E+K  I+KY +IS EKSKR++K+
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI

A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X11.3e-12866.94Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRD+T GQIM+LGVS+LQVGNS YCTM+QAQMC+QLAD+DMKNKD+N++KALCQ SEQN GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLR SKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML+NL
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI
        VKLFKDLEAFGRKPP+KSIVQRVADAYEMLGL EEKERVL KYKDLFT E+K  I+KY +IS EKSKR++K+
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X31.3e-12866.94Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        ML+RR HRA TW TPLLRD+T GQIM+LGVS+LQVGNS YCTM+QAQMC+QLAD+DMKNKD+N++KALCQ SEQN GDMRKHQIGENVSRKDKINFLV T
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLR SKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTM VYGQLIRALDMDHRAEE+HKFWVMKIG+DLHSVPWQLCRSMISIYYRNKML+NL
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI
        VKLFKDLEAFGRKPP+KSIVQRVADAYEMLGL EEKERVL KYKDLFT E+K  I+KY +IS EKSKR++K+
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKI

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X15.7e-12166.57Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQIMELGV+KLQ+GNS YCTM+Q QM K+  DKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKI+FLVNT
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLRDSKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM+VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLE+L
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK
        VKLFK+LEAFGRKPP+KSIVQRVADA EMLGL+EEKERVLVKY  LFT EKK SIKKYK
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X13.0e-12267.13Show/hide
Query:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT
        MLIRR HRAATWATPLLRD TVGQ+MELGV+KLQ+GNS YCTM+Q QM K+ ADKDM +KD+N++K L QTSE+N GD+RKHQIGENVSRKDKINFLVNT
Subjt:  MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNT

Query:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA
                                                                                           L+DLRDSKEAVYGALDA
Subjt:  ANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDA

Query:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL
        WVAWEQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM+VYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLE+L
Subjt:  WVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENL

Query:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK
        VKLFKDLEAFGRKPP+KSIVQRVADA EMLGL+EEKERVLVKY  LFT EKK SIKKYK
Subjt:  VKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKYKDLFTGEKKASIKKYK

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic5.7e-3341.21Show/hide
Query:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
        +R+L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P
Subjt:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP

Query:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY
         +L   MI++Y  + + + ++++F D+E   +  PD+   +RVA A+  L   E ++    R L +YK ++   ++  +K+Y
Subjt:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY

Q8LG95 Pentatricopeptide repeat-containing protein At4g211907.7e-3042.04Show/hide
Query:  LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRS
        L + KE VYGALD+++AWE  FP+  +K+AL  LE E++W +++QV KWMLSKGQG TM  Y  L+ AL  D+R +EA + W       L   P +    
Subjt:  LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRS

Query:  MISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKY
        MISIYY+  M + L ++F D+E  G K P+ +IV  V   +  L + ++ E+++ KY
Subjt:  MISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERVLVKY

Arabidopsis top hitse value%identityAlignment
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1)6.6e-6946.57Show/hide
Query:  IQAQMCKQLADKDMKNKDINHNKALCQTSEQ----NTGDMRKHQIGENVSRKDKINFLVNTANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRR
        +Q+   + +AD     K I  N+     S+     N  + RKHQIGEN+ +KDKI FLVNT                                       
Subjt:  IQAQMCKQLADKDMKNKDINHNKALCQTSEQ----NTGDMRKHQIGENVSRKDKINFLVNTANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRR

Query:  SEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK
                                                    LLD+ D+KEAVYGALDAWVAWE+NFPIASLK  +A+LEKE QWHR+VQVIKW+LSK
Subjt:  SEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK

Query:  GQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERV
        GQG TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  M+ IY+RN ML+ LVKLFKDLE++ RKPPDK IVQ VADAYE+LG+L+EKERV
Subjt:  GQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKERV

Query:  LVKYKDLFTGEKKASIKKYKRISSEKSKRKKKIDE
        + KY  L  G    S  K  R S +K K + +I E
Subjt:  LVKYKDLFTGEKKASIKKYKRISSEKSKRKKKIDE

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4)2.8e-6746.15Show/hide
Query:  IQAQMCKQLADKDMKNKDINHNKALCQTSEQ----NTGDMRKHQIGENVSRKDKINFLVNTANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRR
        +Q+   + +AD     K I  N+     S+     N  + RKHQIGEN+ +KDKI FLVNT                                       
Subjt:  IQAQMCKQLADKDMKNKDINHNKALCQTSEQ----NTGDMRKHQIGENVSRKDKINFLVNTANSRDEMFAFGGQNALGDFLVARSFIMEEDLDHLLWGRR

Query:  SEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK
                                                    LLD+ D+KEAVYGALDAWVAWE+NFPIASLK  +A+LEKE QWHR+VQVIKW+LSK
Subjt:  SEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK

Query:  GQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENLV---KLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEK
        GQG TM  YGQLIRALDMD RAEEAH  W  K+G+DLHSVPWQLC  M+ IY+RN ML+ LV   KLFKDLE++ RKPPDK IVQ VADAYE+LG+L+EK
Subjt:  GQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENLV---KLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEK

Query:  ERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKIDE
        ERV+ KY  L  G    S  K  R S +K K + +I E
Subjt:  ERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKIDE

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-3441.21Show/hide
Query:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
        +R+L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P
Subjt:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP

Query:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY
         +L   MI++Y  + + + ++++F D+E   +  PD+   +RVA A+  L   E ++    R L +YK ++   ++  +K+Y
Subjt:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein4.0e-3441.21Show/hide
Query:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
        +R+L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P
Subjt:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP

Query:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY
         +L   MI++Y  + + + ++++F D+E   +  PD+   +RVA A+  L   E ++    R L +YK ++   ++  +K+Y
Subjt:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein4.0e-3441.21Show/hide
Query:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP
        +R+L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + +   S+P
Subjt:  IRLLD-LRDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVP

Query:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY
         +L   MI++Y  + + + ++++F D+E   +  PD+   +RVA A+  L   E ++    R L +YK ++   ++  +K+Y
Subjt:  WQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEMLGLLEEKE----RVLVKYKDLFTGEKKASIKKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCATCAGGAGGCTTCATCGAGCAGCGACATGGGCAACGCCTCTGTTGCGAGACATAACTGTAGGACAAATAATGGAGCTTGGTGTAAGCAAGCTGCAAGTTGGGAA
CTCCTTTTACTGCACAATGATACAAGCTCAAATGTGTAAACAGCTTGCTGATAAAGATATGAAAAATAAGGATATTAACCATAATAAAGCTCTGTGCCAGACTTCAGAAC
AAAATACTGGAGACATGAGGAAGCACCAAATTGGGGAAAATGTATCACGGAAGGACAAAATTAACTTCCTTGTAAATACGGCAAACTCGAGAGATGAGATGTTCGCTTTT
GGAGGCCAGAATGCTCTGGGGGATTTTCTTGTAGCTCGTTCTTTCATTATGGAGGAAGATCTTGATCATTTGCTTTGGGGTCGTCGAAGCGAGGGAGATAATCGAGGATT
TTCTTCTCCATATGCCTTTTCGGGAGAAAGAGAGATTTTTATGGCTGGTGGGATTTCTGCTTTTTGTTGGGATTTGTGCTGTGAGACGTACAATGGAATCTTCATAAGGC
TTCTCGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCCCTT
GAGAAGGAACAGCAGTGGCATAGAGTTGTTCAGGTAATCAAATGGATGTTAAGCAAGGGACAGGGAACCACAATGAGCGTCTATGGGCAGTTAATACGGGCTTTAGACAT
GGACCATCGAGCGGAAGAGGCACACAAGTTTTGGGTCATGAAAATTGGTTCAGATCTTCATTCAGTCCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTATCGAA
ATAAAATGCTAGAAAATCTCGTAAAGCTTTTTAAGGATCTCGAAGCTTTCGGTCGTAAACCCCCAGACAAATCAATAGTGCAGAGGGTAGCAGATGCTTATGAGATGCTA
GGCTTGCTTGAAGAGAAAGAGAGGGTGTTAGTAAAGTACAAAGACCTTTTCACTGGTGAGAAGAAAGCGTCCATAAAAAAATATAAGAGGATTTCGTCTGAGAAATCAAA
GAGAAAAAAGAAAATCGATGAAGGGCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCATCAGGAGGCTTCATCGAGCAGCGACATGGGCAACGCCTCTGTTGCGAGACATAACTGTAGGACAAATAATGGAGCTTGGTGTAAGCAAGCTGCAAGTTGGGAA
CTCCTTTTACTGCACAATGATACAAGCTCAAATGTGTAAACAGCTTGCTGATAAAGATATGAAAAATAAGGATATTAACCATAATAAAGCTCTGTGCCAGACTTCAGAAC
AAAATACTGGAGACATGAGGAAGCACCAAATTGGGGAAAATGTATCACGGAAGGACAAAATTAACTTCCTTGTAAATACGGCAAACTCGAGAGATGAGATGTTCGCTTTT
GGAGGCCAGAATGCTCTGGGGGATTTTCTTGTAGCTCGTTCTTTCATTATGGAGGAAGATCTTGATCATTTGCTTTGGGGTCGTCGAAGCGAGGGAGATAATCGAGGATT
TTCTTCTCCATATGCCTTTTCGGGAGAAAGAGAGATTTTTATGGCTGGTGGGATTTCTGCTTTTTGTTGGGATTTGTGCTGTGAGACGTACAATGGAATCTTCATAAGGC
TTCTCGATCTGAGAGATAGTAAGGAGGCTGTTTATGGTGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCCCTT
GAGAAGGAACAGCAGTGGCATAGAGTTGTTCAGGTAATCAAATGGATGTTAAGCAAGGGACAGGGAACCACAATGAGCGTCTATGGGCAGTTAATACGGGCTTTAGACAT
GGACCATCGAGCGGAAGAGGCACACAAGTTTTGGGTCATGAAAATTGGTTCAGATCTTCATTCAGTCCCTTGGCAATTGTGCAGAAGCATGATATCAATATACTATCGAA
ATAAAATGCTAGAAAATCTCGTAAAGCTTTTTAAGGATCTCGAAGCTTTCGGTCGTAAACCCCCAGACAAATCAATAGTGCAGAGGGTAGCAGATGCTTATGAGATGCTA
GGCTTGCTTGAAGAGAAAGAGAGGGTGTTAGTAAAGTACAAAGACCTTTTCACTGGTGAGAAGAAAGCGTCCATAAAAAAATATAAGAGGATTTCGTCTGAGAAATCAAA
GAGAAAAAAGAAAATCGATGAAGGGCACTGA
Protein sequenceShow/hide protein sequence
MLIRRLHRAATWATPLLRDITVGQIMELGVSKLQVGNSFYCTMIQAQMCKQLADKDMKNKDINHNKALCQTSEQNTGDMRKHQIGENVSRKDKINFLVNTANSRDEMFAF
GGQNALGDFLVARSFIMEEDLDHLLWGRRSEGDNRGFSSPYAFSGEREIFMAGGISAFCWDLCCETYNGIFIRLLDLRDSKEAVYGALDAWVAWEQNFPIASLKQALAAL
EKEQQWHRVVQVIKWMLSKGQGTTMSVYGQLIRALDMDHRAEEAHKFWVMKIGSDLHSVPWQLCRSMISIYYRNKMLENLVKLFKDLEAFGRKPPDKSIVQRVADAYEML
GLLEEKERVLVKYKDLFTGEKKASIKKYKRISSEKSKRKKKIDEGH