CuGenDBv2

CsGy3G016120 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy3G016120
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Gy14Chr3:11974959..11976775
RNA-Seq Expression: CsGy3G016120
Synteny: CsGy3G016120
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
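
Note: the genome location above follows the pattern chromosome:start..end. As a minimal sketch, the Python snippet below splits such a string into its parts and computes the span length. The function names are hypothetical (not part of CuGenDB), and the length calculation assumes 1-based, inclusive coordinates, which this page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chromosome:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


chrom, start, end = parse_location("Gy14Chr3:11974959..11976775")
print(chrom, start, end, span_length(start, end))
```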


Homology
GenBank top hits (e value, % identity, alignment)
KAA0067680.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 0.0; % identity: 95.65)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL LLTPPS+SLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSS TPLLIH KPFFQSKIQALDAVLTDLE SIDNGL IDPEIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LELCYQL+AIHHGIRIHRLIPTNLLRRNVGISSKLLRLYAS GYMEDAHQVFDEMG RNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQI YKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGY+PDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LSTLLSNI S+KFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFN+ EALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCAHLGLVKEGG+LY LMKGKY IRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLH +VDIAEIAAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS
        LLMKIYGNAGRSEDEKRVKLMMAERGLNS
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS

KAG6600165.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0; % identity: 83.52)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL     P +SL LFCSS PKKSKKERRKLL +KL+RISKAK++T L FPKSS TPLLIH KPF QSKIQALDAVL DLEAS+ NG+ ID EIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LE CYQL+A+ HGIRIHRLIPTN LRRNVG+SSKLLRLYASFGYMEDAHQVFDEM  RN SAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGIGSI++GEAVHRHVVRSGFAGD+FVLNALVDMY+KCG I+RARKVFDQI  KD VSWNSMLTGYTRHGL  EAL+ FDQMIQEGYEPDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LST++SNISS KFKLHIHGW IRHG+EWNLSIANSLI MYA  GK+NRA+WLF+QMP++D+VSWN+IISAH N+++ALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCAHL LVKEGGKLY +MKGKYGIRPTIEHYACMVNLYGRAG+IEEAY+IIT GME+EAGPT+WGALLYACYLH +VDIAE+AAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLN
        LLMKIYGNAGR EDEKRV+LMMAERGL+
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLN

XP_016898932.1 PREDICTED: pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucumis melo] (e value: 0.0; % identity: 95.46)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL LLTPPS+SLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSS TPLLIH KPFFQSKIQALDAVLTDLE SIDNGL IDPEIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LELCYQL+AIHHGIRIHRLIPTNLLRRNVGISSKLLRLYAS GYMEDAHQVFDEMG RNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQI YKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGY+PDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LSTLLSNI S+KFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFN+ EALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCAHLGLVKEGG+LY LMKGKY IRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLH +VDIAEIAAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS
        LLMKIYGNAGRS+DEKRVKLMMAERGLNS
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS

XP_031738497.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucumis sativus] (e value: 0.0; % identity: 100)
Query:  MNDHFPLLLQRFVFPMLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEAS
        MNDHFPLLLQRFVFPMLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEAS
Subjt:  MNDHFPLLLQRFVFPMLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEAS

Query:  IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALY
        IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALY
Subjt:  IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALY

Query:  FQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDI
        FQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDI
Subjt:  FQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDI

Query:  FDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVME
        FDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVME
Subjt:  FDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVME

Query:  SLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAA
        SLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAA
Subjt:  SLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAA

Query:  ERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS
        ERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS
Subjt:  ERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS

XP_038901542.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Benincasa hispida] (e value: 0.0; % identity: 89.79)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL L   PS+SLLLFCSSKPKKSKKER+KLLHQKLLRISKA+++TDL FPKSS TPLLIHPKPFFQ+KIQALDA+LTDLE S+DNGL  DPEIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LE+CYQL++IHHGIRIHRLIPTNLLRRNVG+SSKLLRLYASFGYME AHQVFDEM  RN SAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGI S+QIGEAVHRHV+RSGFAGDVFVLNALVDMYSKCG IVRARKVFDQI  KD VSWNSMLTGYTRHGL  EALDIFDQMIQEGYEPDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LST+LSNISS+KF+LHIHGWVIRHGVEWNLSIANSLIVMYA CGK+NRAKWLFQQMPQKD VSWNSIISAHFNS EALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCA+LGLVKEGGKLY LMKGKY IRPT EHYACMVNLYGRAG+IEEAY+IITK MEIEAGPT+WGALLYACYLHS+VDIAEIAAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS
        LLMKIYGNAGRSEDEKRVKLMMAERGL+S
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS
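
Note: the % identity listed for each hit can be approximated from the alignment text. In BLAST-style pairwise output, the middle line repeats the residue where query and subject are identical, shows "+" for a conservative substitution, and is blank otherwise. The Python sketch below counts identical columns on that basis; the function name percent_identity is hypothetical, and the sketch assumes the alignment rows of one hit have already been concatenated into single strings.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns marked identical on a BLAST-style midline.

    A column counts as identical when the midline holds a letter.
    '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    """
    if not query:
        raise ValueError("empty alignment")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query row")
    # Scraped text often loses trailing spaces, so pad the midline back out.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# First 18 columns of the first GenBank alignment shown above.
query_row = "MLISLHLLTPPSSSLLLF"
mid_row = "MLISL LLTPPS+SLLLF"
print(round(percent_identity(query_row, mid_row), 2))
```

For a full hit, join all "Query:" rows and all midline rows (after stripping the 8-character row prefix) before calling the function.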

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5M0 Uncharacterized protein (e value: 0.0; % identity: 100)
Query:  MNDHFPLLLQRFVFPMLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEAS
        MNDHFPLLLQRFVFPMLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEAS
Subjt:  MNDHFPLLLQRFVFPMLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEAS

Query:  IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALY
        IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALY
Subjt:  IDNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALY

Query:  FQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDI
        FQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDI
Subjt:  FQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDI

Query:  FDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVME
        FDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVME
Subjt:  FDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVME

Query:  SLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAA
        SLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAA
Subjt:  SLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAA

Query:  ERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS
        ERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS
Subjt:  ERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS

A0A1S4DSF7 pentatricopeptide repeat-containing protein At4g25270, chloroplastic (e value: 0.0; % identity: 95.46)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL LLTPPS+SLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSS TPLLIH KPFFQSKIQALDAVLTDLE SIDNGL IDPEIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LELCYQL+AIHHGIRIHRLIPTNLLRRNVGISSKLLRLYAS GYMEDAHQVFDEMG RNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQI YKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGY+PDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LSTLLSNI S+KFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFN+ EALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCAHLGLVKEGG+LY LMKGKY IRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLH +VDIAEIAAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS
        LLMKIYGNAGRS+DEKRVKLMMAERGLNS
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS

A0A5D3DJ70 Pentatricopeptide repeat-containing protein (e value: 0.0; % identity: 95.65)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL LLTPPS+SLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSS TPLLIH KPFFQSKIQALDAVLTDLE SIDNGL IDPEIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LELCYQL+AIHHGIRIHRLIPTNLLRRNVGISSKLLRLYAS GYMEDAHQVFDEMG RNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQI YKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGY+PDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LSTLLSNI S+KFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFN+ EALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCAHLGLVKEGG+LY LMKGKY IRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLH +VDIAEIAAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS
        LLMKIYGNAGRSEDEKRVKLMMAERGLNS
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLNS

A0A6J1FPF9 pentatricopeptide repeat-containing protein At4g25270, chloroplastic (e value: 0.0; % identity: 83.14)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL     P +SL L CSS PKKSKKERRKLL +KL+RISKAK++T L FPKSS TPLLIH KPF QSKIQALDAVL DLEAS+ NG+ ID EIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LE CYQL+A+ HGIRIHRLIPTN LRRNVG+SSKLLRLYASFGYMEDAHQVFDEM  RN SAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGIGSI++GEAVHRH+VRSGFAGD+FVLNALVDMY+KCG I+RARKVFDQI  KD VSWNSMLTGYTRHGL  EAL+ FDQMIQEGYEPDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LST++SNISS KFKLHIHGW IRHG+EWNLSIANSLI MYA  GK+NRA+WLF+QMP++D+VSWN+IISAH N+++ALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCAHL LVKEGGKLY  MKGKYGIRPTIEHYACMVNLYGRAG+IEEAY+IIT GME+EAGPT+WGALLYACYLH +VDIAE+AAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLN
        LLMKIYGNAGR EDEKRV+LMMAERGL+
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLN

A0A6J1JZI1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic (e value: 0.0; % identity: 83.14)
Query:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL
        MLISL     P +SL LFCSS PKKSKKERRKLL +KL+RISKAK++T L FPKSS TPLLIH KPF +SKIQALDAVL DLEAS+DNG+ ID EIFSSL
Subjt:  MLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPEIFSSL

Query:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF
        LE CYQL+A+ HGIRIHRLIPTN LRRNVG+SSKLLRLYASFGYMEDAHQVFDEM  RN SAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPD+FTF
Subjt:  LELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTF

Query:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA
        PRVLKACGGIGSI++GEAVHRH+VRSGFAGD+FVLNALVDMY+KCG I+RARKVFDQI +KD VSWNSMLTGYTRHGL  EAL+ FDQMIQEGYEPDSVA
Subjt:  PRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVA

Query:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL
        LST++SNI+S KFKLHIHGW IRHG+EWNLSIANSLI MYA  GK+NRA+WLF+QMPQ+D+VSWN+IISAH N+++ALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTLLSNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLL

Query:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE
        STCAHL LVKEGGKLY +MKGKYGIRPTIEHYACMVNLYGRAG+IEEAY+II  GME+EAGPT+WGALLYACYLH +VDIAE+AAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVKLMMAERGLN
        LLMKIYGNAGR EDEKRV+LMMAERGL+
Subjt:  LLMKIYGNAGRSEDEKRVKLMMAERGLN

SwissProt top hits (e value, % identity, alignment)
Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic (e value: 3.4e-79; % identity: 34.68)
Query:  GLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQM
        G  +D  + +SL+ +  Q   +    ++    P     R+V   + L++ YAS GY+E+A ++FDE+  ++    +WN++ISGYAE G Y++AL L+  M
Subjt:  GLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQM

Query:  EEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQ
         +  V PD  T   V+ AC   GSI++G  VH  +   GF  ++ ++NAL+D+YSKCG +  A  +F+++ YKD++SWN+++ GYT   L+ EAL +F +
Subjt:  EEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQ

Query:  MIQEGYEPDSVALSTLL---SNISSMKFKLHIHGWVIRH--GVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSII---SAHFNSAEALTY
        M++ G  P+ V + ++L   +++ ++     IH ++ +   GV    S+  SLI MYAKCG +  A  +F  +  K + SWN++I   + H  +  +   
Subjt:  MIQEGYEPDSVALSTLL---SNISSMKFKLHIHGWVIRH--GVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSII---SAHFNSAEALTY

Query:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI
        F  M  +G+ PD +TFV LLS C+H G++  G  ++  M   Y + P +EHY CM++L G +G+ +EA ++I   ME+E    IW +LL AC +H +V++
Subjt:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
         E  AE L ++EP+N  ++ LL  IY +AGR  +  + + ++ ++G+
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

Q9SB36 Pentatricopeptide repeat-containing protein At4g25270, chloroplastic (e value: 2.8e-182; % identity: 60.38)
Query:  PSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLELCYQLQA
        PS S     SS  KK  +  ++L   +  + +     T LSF K SPTPLLI  +   +++++ALD+V+TDLE S   G+ + +PEIF+SLLE CY L+A
Subjt:  PSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLELCYQLQA

Query:  IHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGG
        I HG+R+H LIP  LLR N+GISSKL+RLYAS GY E AH+VFD M  R+ S FAWNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGG
Subjt:  IHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGG

Query:  IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNIS
        IGS+QIGEA+HR +V+ GF  DV+VLNALV MY+KCG IV+AR VFD I +KD VSWNSMLTGY  HGL  EALDIF  M+Q G EPD VA+S++L+ + 
Subjt:  IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNIS

Query:  SMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLGLV
        S K    +HGWVIR G+EW LS+AN+LIV+Y+K G+L +A ++F QM ++D VSWN+IISAH  ++  L YFE M      PDG+TFVS+LS CA+ G+V
Subjt:  SMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLGLV

Query:  KEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNA
        ++G +L+ LM  +YGI P +EHYACMVNLYGRAGM+EEAY +I + M +EAGPT+WGALLYACYLH + DI E+AA+RLFELEPDNE NFELL++IY  A
Subjt:  KEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNA

Query:  GRSEDEKRVKLMMAERGLNS
         R+ED +RV+ MM +RGL +
Subjt:  GRSEDEKRVKLMMAERGLNS

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic (e value: 8.6e-83; % identity: 37.36)
Query:  NGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQ
        +G+ ID     S+   C   + I  G  +H +       R     + LL +Y+  G ++ A  VF EM +R  S  ++ S+I+GYA  GL  +A+ L+ +
Subjt:  NGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQ

Query:  MEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  +  +    D+FV NAL+DMY+KCG +  A  VF ++  KDI+SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFD

Query:  QMIQE-GYEPDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTY
         +++E  + PD   ++ +L   +++S+      IHG+++R+G   +  +ANSL+ MYAKCG L  A  LF  +  KD+VSW  +I+    H    EA+  
Subjt:  QMIQE-GYEPDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTY

Query:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI
        F  M   G+  D ++FVSLL  C+H GLV EG + + +M+ +  I PT+EHYAC+V++  R G + +AY+ I + M I    TIWGALL  C +H DV +
Subjt:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
        AE  AE++FELEP+N   + L+  IY  A + E  KR++  + +RGL
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

Q9SR82 Putative pentatricopeptide repeat-containing protein At3g08820 (e value: 1.1e-80; % identity: 36.48)
Query:  DLEASI-DNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYE
        DL  SI  +GL++    F  +L+ C +  +   GI +H L+       +V   + LL +Y+  G + DAH++FDE+ +R  S   W +L SGY   G + 
Subjt:  DLEASI-DNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYE

Query:  DALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLH
        +A+ L+ +M E GV+PD++   +VL AC  +G +  GE + +++       + FV   LV++Y+KCG + +AR VFD +  KDIV+W++M+ GY  +   
Subjt:  DALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLH

Query:  FEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVI----RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA
         E +++F QM+QE  +PD  ++   LS+ +S+   L +  W I    RH    NL +AN+LI MYAKCG + R   +F++M +KD+V  N+ IS    + 
Subjt:  FEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVI----RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA

Query:  EALTYFEVM---ESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYAC
             F V    E LG+SPDG TF+ LL  C H GL+++G + +  +   Y ++ T+EHY CMV+L+GRAGM+++AY++I   M +     +WGALL  C
Subjt:  EALTYFEVM---ESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYAC

Query:  YLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
         L  D  +AE   + L  LEP N  N+  L  IY   GR ++   V+ MM ++G+
Subjt:  YLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic (e value: 1.3e-86; % identity: 37.53)
Query:  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVE
        + +  L+  C    ++   +R+HR I  N   ++  +++KL+ +Y+  G ++ A +VFD+   R  + + WN+L       G  E+ L LY++M   GVE
Subjt:  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVE

Query:  PDNFTFPRVLKACGG----IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMI
         D FT+  VLKAC      +  +  G+ +H H+ R G++  V+++  LVDMY++ GC+  A  VF  +  +++VSW++M+  Y ++G  FEAL  F +M+
Subjt:  PDNFTFPRVLKACGG----IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMI

Query:  QEGYE--PDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTYFE
        +E  +  P+SV + ++L   +++++++    IHG+++R G++  L + ++L+ MY +CGKL   + +F +M  +D+VSWNS+IS+   H    +A+  FE
Subjt:  QEGYE--PDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTYFE

Query:  VMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAE
         M + G SP  VTFVS+L  C+H GLV+EG +L+  M   +GI+P IEHYACMV+L GRA  ++EA K++ + M  E GP +WG+LL +C +H +V++AE
Subjt:  VMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAE

Query:  IAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
         A+ RLF LEP N  N+ LL  IY  A   ++ KRVK ++  RGL
Subjt:  IAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

Arabidopsis top hits (e value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.4e-80; % identity: 34.68)
Query:  GLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQM
        G  +D  + +SL+ +  Q   +    ++    P     R+V   + L++ YAS GY+E+A ++FDE+  ++    +WN++ISGYAE G Y++AL L+  M
Subjt:  GLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQM

Query:  EEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQ
         +  V PD  T   V+ AC   GSI++G  VH  +   GF  ++ ++NAL+D+YSKCG +  A  +F+++ YKD++SWN+++ GYT   L+ EAL +F +
Subjt:  EEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQ

Query:  MIQEGYEPDSVALSTLL---SNISSMKFKLHIHGWVIRH--GVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSII---SAHFNSAEALTY
        M++ G  P+ V + ++L   +++ ++     IH ++ +   GV    S+  SLI MYAKCG +  A  +F  +  K + SWN++I   + H  +  +   
Subjt:  MIQEGYEPDSVALSTLL---SNISSMKFKLHIHGWVIRH--GVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSII---SAHFNSAEALTY

Query:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI
        F  M  +G+ PD +TFV LLS C+H G++  G  ++  M   Y + P +EHY CM++L G +G+ +EA ++I   ME+E    IW +LL AC +H +V++
Subjt:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
         E  AE L ++EP+N  ++ LL  IY +AGR  +  + + ++ ++G+
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

AT3G08820.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 7.5e-82; % identity: 36.48)
Query:  DLEASI-DNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYE
        DL  SI  +GL++    F  +L+ C +  +   GI +H L+       +V   + LL +Y+  G + DAH++FDE+ +R  S   W +L SGY   G + 
Subjt:  DLEASI-DNGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYE

Query:  DALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLH
        +A+ L+ +M E GV+PD++   +VL AC  +G +  GE + +++       + FV   LV++Y+KCG + +AR VFD +  KDIV+W++M+ GY  +   
Subjt:  DALALYFQMEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLH

Query:  FEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVI----RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA
         E +++F QM+QE  +PD  ++   LS+ +S+   L +  W I    RH    NL +AN+LI MYAKCG + R   +F++M +KD+V  N+ IS    + 
Subjt:  FEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKLHIHGWVI----RHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSA

Query:  EALTYFEVM---ESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYAC
             F V    E LG+SPDG TF+ LL  C H GL+++G + +  +   Y ++ T+EHY CMV+L+GRAGM+++AY++I   M +     +WGALL  C
Subjt:  EALTYFEVM---ESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYAC

Query:  YLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
         L  D  +AE   + L  LEP N  N+  L  IY   GR ++   V+ MM ++G+
Subjt:  YLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.2e-88; % identity: 37.53)
Query:  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVE
        + +  L+  C    ++   +R+HR I  N   ++  +++KL+ +Y+  G ++ A +VFD+   R  + + WN+L       G  E+ L LY++M   GVE
Subjt:  EIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVE

Query:  PDNFTFPRVLKACGG----IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMI
         D FT+  VLKAC      +  +  G+ +H H+ R G++  V+++  LVDMY++ GC+  A  VF  +  +++VSW++M+  Y ++G  FEAL  F +M+
Subjt:  PDNFTFPRVLKACGG----IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMI

Query:  QEGYE--PDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTYFE
        +E  +  P+SV + ++L   +++++++    IHG+++R G++  L + ++L+ MY +CGKL   + +F +M  +D+VSWNS+IS+   H    +A+  FE
Subjt:  QEGYE--PDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTYFE

Query:  VMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAE
         M + G SP  VTFVS+L  C+H GLV+EG +L+  M   +GI+P IEHYACMV+L GRA  ++EA K++ + M  E GP +WG+LL +C +H +V++AE
Subjt:  VMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAE

Query:  IAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
         A+ RLF LEP N  N+ LL  IY  A   ++ KRVK ++  RGL
Subjt:  IAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value: 6.1e-84; identity: 37.36%)
Query:  NGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQ
        +G+ ID     S+   C   + I  G  +H +       R     + LL +Y+  G ++ A  VF EM +R  S  ++ S+I+GYA  GL  +A+ L+ +
Subjt:  NGLFIDPEIFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQ

Query:  MEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  +  +    D+FV NAL+DMY+KCG +  A  VF ++  KDI+SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDNFTFPRVLKACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFD

Query:  QMIQE-GYEPDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTY
         +++E  + PD   ++ +L   +++S+      IHG+++R+G   +  +ANSL+ MYAKCG L  A  LF  +  KD+VSW  +I+    H    EA+  
Subjt:  QMIQE-GYEPDSVALSTLL---SNISSMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISA---HFNSAEALTY

Query:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI
        F  M   G+  D ++FVSLL  C+H GLV EG + + +M+ +  I PT+EHYAC+V++  R G + +AY+ I + M I    TIWGALL  C +H DV +
Subjt:  FEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL
        AE  AE++FELEP+N   + L+  IY  A + E  KR++  + +RGL
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGL

AT4G25270.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 2.0e-183; identity: 60.38%)
Query:  PSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLELCYQLQA
        PS S     SS  KK  +  ++L   +  + +     T LSF K SPTPLLI  +   +++++ALD+V+TDLE S   G+ + +PEIF+SLLE CY L+A
Subjt:  PSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFI-DPEIFSSLLELCYQLQA

Query:  IHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGG
        I HG+R+H LIP  LLR N+GISSKL+RLYAS GY E AH+VFD M  R+ S FAWNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGG
Subjt:  IHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLKACGG

Query:  IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNIS
        IGS+QIGEA+HR +V+ GF  DV+VLNALV MY+KCG IV+AR VFD I +KD VSWNSMLTGY  HGL  EALDIF  M+Q G EPD VA+S++L+ + 
Subjt:  IGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNIS

Query:  SMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLGLV
        S K    +HGWVIR G+EW LS+AN+LIV+Y+K G+L +A ++F QM ++D VSWN+IISAH  ++  L YFE M      PDG+TFVS+LS CA+ G+V
Subjt:  SMKFKLHIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLGLV

Query:  KEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNA
        ++G +L+ LM  +YGI P +EHYACMVNLYGRAGM+EEAY +I + M +EAGPT+WGALLYACYLH + DI E+AA+RLFELEPDNE NFELL++IY  A
Subjt:  KEGGKLYFLMKGKYGIRPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNA

Query:  GRSEDEKRVKLMMAERGLNS
         R+ED +RV+ MM +RGL +
Subjt:  GRSEDEKRVKLMMAERGLNS


Sequences
CDS sequence
ATGAACGACCATTTCCCTCTTCTTCTCCAACGGTTTGTTTTTCCAATGCTGATTTCTTTGCACTTGTTAACTCCACCCTCATCTTCTCTTCTTCTCTTCTGTTCTTCCAA
ACCCAAAAAATCCAAGAAAGAGAGAAGGAAACTTCTCCACCAAAAACTTCTCCGCATTAGCAAAGCTAAACAGTCCACTGATCTCTCCTTCCCCAAATCCTCGCCAACCC
CTCTCTTAATCCACCCCAAACCCTTCTTCCAGTCCAAAATTCAAGCCCTTGATGCTGTTCTCACCGACCTTGAAGCTTCCATCGACAATGGCCTCTTTATTGATCCTGAA
ATTTTCTCTTCCCTTTTGGAACTTTGTTACCAATTGCAAGCTATTCACCATGGTATTCGGATTCATCGCTTAATACCCACCAATCTTTTAAGGAGAAATGTGGGTATTTC
TTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGGATGCACACCAGGTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTA
TTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAG
GCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAGTTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATGTA
TTCCAAATGTGGTTGCATTGTGAGGGCTAGGAAAGTGTTTGATCAGATTGAGTATAAGGATATAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTC
ACTTTGAGGCATTAGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCCGATTCGGTTGCTTTGTCCACCCTACTTTCTAACATTTCGTCAATGAAATTCAAGTTA
CATATTCATGGATGGGTAATTCGACATGGAGTCGAATGGAATTTGTCCATTGCTAACTCCTTGATAGTCATGTATGCCAAATGTGGTAAGCTTAACAGAGCAAAATGGCT
GTTCCAGCAAATGCCTCAAAAGGACATGGTCTCATGGAACTCCATAATCTCTGCTCATTTCAATAGCGCAGAAGCTTTGACATATTTCGAAGTGATGGAAAGCCTTGGTG
TTTCGCCAGACGGTGTAACATTTGTGTCATTGTTATCAACTTGTGCTCATCTGGGGTTGGTGAAGGAAGGGGGGAAATTGTATTTTTTGATGAAGGGGAAGTACGGAATA
AGACCAACAATTGAACATTATGCTTGTATGGTGAATCTTTACGGGAGAGCAGGGATGATTGAAGAAGCTTATAAAATCATAACGAAAGGGATGGAGATCGAGGCAGGTCC
GACCATATGGGGGGCGTTGTTGTATGCGTGTTATCTCCATAGCGATGTAGATATCGCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATT
TTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCGAGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAG
mRNA sequence
GGTGTCATAAAATTACGATGGGGTGTGACTGTGGGCTGCGAGAACGAAACCCCAAATGAACGACCATTTCCCTCTTCTTCTCCAACGGTTTGTTTTTCCAATGCTGATTT
CTTTGCACTTGTTAACTCCACCCTCATCTTCTCTTCTTCTCTTCTGTTCTTCCAAACCCAAAAAATCCAAGAAAGAGAGAAGGAAACTTCTCCACCAAAAACTTCTCCGC
ATTAGCAAAGCTAAACAGTCCACTGATCTCTCCTTCCCCAAATCCTCGCCAACCCCTCTCTTAATCCACCCCAAACCCTTCTTCCAGTCCAAAATTCAAGCCCTTGATGC
TGTTCTCACCGACCTTGAAGCTTCCATCGACAATGGCCTCTTTATTGATCCTGAAATTTTCTCTTCCCTTTTGGAACTTTGTTACCAATTGCAAGCTATTCACCATGGTA
TTCGGATTCATCGCTTAATACCCACCAATCTTTTAAGGAGAAATGTGGGTATTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGGATGCACACCAG
GTGTTCGACGAAATGGGTAATCGTAATTTCTCTGCATTTGCTTGGAATTCTCTTATTTCTGGATACGCGGAACTTGGTCTTTATGAAGATGCTCTGGCGCTTTACTTTCA
AATGGAGGAAGAAGGTGTTGAACCTGACAATTTCACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCAAATCGGAGAGGCGGTGCATCGGCATGTAG
TTCGTTCTGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTTTAGTTGACATGTATTCCAAATGTGGTTGCATTGTGAGGGCTAGGAAAGTGTTTGATCAGATTGAGTAT
AAGGATATAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCACTTTGAGGCATTAGACATCTTTGATCAAATGATTCAAGAAGGTTACGAGCCCGA
TTCGGTTGCTTTGTCCACCCTACTTTCTAACATTTCGTCAATGAAATTCAAGTTACATATTCATGGATGGGTAATTCGACATGGAGTCGAATGGAATTTGTCCATTGCTA
ACTCCTTGATAGTCATGTATGCCAAATGTGGTAAGCTTAACAGAGCAAAATGGCTGTTCCAGCAAATGCCTCAAAAGGACATGGTCTCATGGAACTCCATAATCTCTGCT
CATTTCAATAGCGCAGAAGCTTTGACATATTTCGAAGTGATGGAAAGCCTTGGTGTTTCGCCAGACGGTGTAACATTTGTGTCATTGTTATCAACTTGTGCTCATCTGGG
GTTGGTGAAGGAAGGGGGGAAATTGTATTTTTTGATGAAGGGGAAGTACGGAATAAGACCAACAATTGAACATTATGCTTGTATGGTGAATCTTTACGGGAGAGCAGGGA
TGATTGAAGAAGCTTATAAAATCATAACGAAAGGGATGGAGATCGAGGCAGGTCCGACCATATGGGGGGCGTTGTTGTATGCGTGTTATCTCCATAGCGATGTAGATATC
GCTGAGATTGCTGCTGAAAGACTCTTCGAATTGGAGCCAGATAATGAGCTCAATTTTGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCGGAGGACGAGAAGCG
AGTGAAATTAATGATGGCAGAACGAGGACTGAATTCATAGTGTAGATAATAATTGAAATCTATTCACATTCATGGATGGATAATGAATTCATCTGGATAGATAACAGGTT
CAGTGTGAAGTTAATATGAGAATTCACACAAAGTTGGAAATGGATTCCACTGAGAGT
Protein sequence
MNDHFPLLLQRFVFPMLISLHLLTPPSSSLLLFCSSKPKKSKKERRKLLHQKLLRISKAKQSTDLSFPKSSPTPLLIHPKPFFQSKIQALDAVLTDLEASIDNGLFIDPE
IFSSLLELCYQLQAIHHGIRIHRLIPTNLLRRNVGISSKLLRLYASFGYMEDAHQVFDEMGNRNFSAFAWNSLISGYAELGLYEDALALYFQMEEEGVEPDNFTFPRVLK
ACGGIGSIQIGEAVHRHVVRSGFAGDVFVLNALVDMYSKCGCIVRARKVFDQIEYKDIVSWNSMLTGYTRHGLHFEALDIFDQMIQEGYEPDSVALSTLLSNISSMKFKL
HIHGWVIRHGVEWNLSIANSLIVMYAKCGKLNRAKWLFQQMPQKDMVSWNSIISAHFNSAEALTYFEVMESLGVSPDGVTFVSLLSTCAHLGLVKEGGKLYFLMKGKYGI
RPTIEHYACMVNLYGRAGMIEEAYKIITKGMEIEAGPTIWGALLYACYLHSDVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVKLMMAERGLNS