; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg002983 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg002983
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold6:2208355..2209947
RNA-Seq ExpressionSpg002983
SyntenySpg002983
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600165.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.5e-27088.45Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI  L L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV++DLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LETCY LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSIR+GEAVHRH+VRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPD+VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRIIT GMEVEAGPTVWGALLYACYLHGNVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLD
        LLMKIYGNAGR EDEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLD

KAG7030831.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-27088.45Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI  L L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV++DLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LETCY LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSIR+GEAVHRH+VRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPD+VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRIIT GMEVEAGPTVWGALLYACYLHGNVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLD
        LLMKIYGNAGR EDEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLD

XP_022942651.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita moschata]4.3e-27088.64Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI  L LLCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV++DLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LETCY LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPD+VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLY+ MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRIIT GMEVEAGPTVWGALLYACYLHGNVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLD
        LLMKIYGNAGR EDEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLD

XP_022993795.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita maxima]2.2e-26988.07Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI  L L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFS++KIQALDAV++DLEAS+DNGV IDAEIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LETCY LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQI+ KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST++S+ +S KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMPQRDIVSWN+IISAH NTS+ALTYFEVMESLGVLPD+VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRII  GMEVEAGPTVWGALLYACYLHGNVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLD
        LLMKIYGNAGR EDEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLD

XP_038901542.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Benincasa hispida]9.7e-27088.85Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISLQLS+SP   LLL CSSKPKKSKKER+KLL +KL+RISKA+E TDL FPKSSSTPLLIH KPF QTKIQALDA++ DLE S+DNG+  D EIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LE CY LR+I HGIRIHRLIPT+LLRRNVGVSSKLLRLYASFGYME AHQVFDEM KRNVSAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGI S++IGEAVHRH++RSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KDTVSWNSMLTGYTRHGLL EALDIFDQMI+EGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LSTILS+ SSLKF+LHIHGWVIR+GVEWNLSIANSLIVMYANCGKI+RAKWLFQQMPQ+D VSWNSIISAHFN+ EALTYFEVMESLGVLPD+VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST A+LGLVKEG KLYSLMKGKY IRPT EHYACMVNLYGRAGLIEEAYRIITK ME+EAGPTVWGALLYACYLH NVDIAEIAAERLFELEPDNELNFE
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLDS
        LLMKIYGNAGRSEDEKRV+LMMAERGLDS
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLDS

TrEMBL top hitse value%identityAlignment
A0A1S4DSF7 pentatricopeptide repeat-containing protein At4g25270, chloroplastic7.5e-26887.9Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISLQL   P   LLL CSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSSSTPLLIH KPF Q+KIQALDAV+ DLE SIDNG+ ID EIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LE CY LRAI HGIRIHRLIPT+LLRRNVG+SSKLLRLYAS GYME+AHQVFDEMGKRN SAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSI+IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KD VSWNSMLTGYTRHGL  EALDIFDQMI+EGY+PDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST+LS+  SLKFKLHIHGWVIR+GVEWNLSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFNT+EALTYFEVMESLGVLPD VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHLGLVKEG +LYSLMKGKY IRPT+EHYACMVNLYGRAG+IEEAY+IITKGME+EAGPT+WGALLYACYLH NVDIAEIAAERLFELEPDNELNFE
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLDS
        LLMKIYGNAGRS+DEKRV+LMMAERGL+S
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLDS

A0A5D3DJ70 Pentatricopeptide repeat-containing protein3.4e-26888.09Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISLQL   P   LLL CSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSSSTPLLIH KPF Q+KIQALDAV+ DLE SIDNG+ ID EIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LE CY LRAI HGIRIHRLIPT+LLRRNVG+SSKLLRLYAS GYME+AHQVFDEMGKRN SAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSI+IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KD VSWNSMLTGYTRHGL  EALDIFDQMI+EGY+PDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST+LS+  SLKFKLHIHGWVIR+GVEWNLSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFNT+EALTYFEVMESLGVLPD VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHLGLVKEG +LYSLMKGKY IRPT+EHYACMVNLYGRAG+IEEAY+IITKGME+EAGPT+WGALLYACYLH NVDIAEIAAERLFELEPDNELNFE
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLDS
        LLMKIYGNAGRSEDEKRV+LMMAERGL+S
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLDS

A0A6J1CGA6 pentatricopeptide repeat-containing protein At4g25270, chloroplastic1.1e-26689.6Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISLQLS S    L   CSSKPKKSKKERRKLLQEKLIRISK+K +T LSFPKSSSTPLLIHHKPFSQTKIQAL+AV+DDLEAS++NGV +DAEIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LETCY LRAI + IRIHRLIPT LLRRNVGVSSKLLRLYASFGYME+AHQVFDEM KRNVSAF+WNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSI+IGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIV KDTVSWNSMLTGYTRHGLLLEALDIFDQ+IREGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LSTILS+ SSLKFKLHIHGWVIR GVEWNLSIANSLIV+YAN GKIDRAKWLFQQMPQ+D +SWNS+ISAH NTSEAL YFE MES GVLPDTVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYL-HGNVDIAEIAAERLFELEPDNELNF
        ST AHLGLVKEGE+L+S+MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRII +GME+EAGPTVWGALLYAC+L +G+VDIAEIAAE+LFELEPDNELNF
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYL-HGNVDIAEIAAERLFELEPDNELNF

Query:  ELLMKIYGNAGRSEDEKRVRLMMAERGLD
        ELLMKIYGNAGRSEDEKRVRLMM ERGLD
Subjt:  ELLMKIYGNAGRSEDEKRVRLMMAERGLD

A0A6J1FPF9 pentatricopeptide repeat-containing protein At4g25270, chloroplastic2.1e-27088.64Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI  L LLCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV++DLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LETCY LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPD+VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLY+ MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRIIT GMEVEAGPTVWGALLYACYLHGNVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLD
        LLMKIYGNAGR EDEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLD

A0A6J1JZI1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic1.0e-26988.07Show/hide
Query:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI  L L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFS++KIQALDAV++DLEAS+DNGV IDAEIFSSL
Subjt:  MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSL

Query:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
        LETCY LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQI+ KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYEPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL
        LST++S+ +S KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMPQRDIVSWN+IISAH NTS+ALTYFEVMESLGVLPD+VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRII  GMEVEAGPTVWGALLYACYLHGNVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSEDEKRVRLMMAERGLD
        LLMKIYGNAGR EDEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSEDEKRVRLMMAERGLD

SwissProt top hitse value%identityAlignment
Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic6.7e-8035.79Show/hide
Query:  GVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQM
        G  +D  + +SL+        ++   ++    P     R+V   + L++ YAS GY+ENA ++FDE+  ++V   SWN++ISGYAE G Y++AL L+  M
Subjt:  GVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQM

Query:  EEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQ
         +  V PD  T   V+ AC   GSI +G  VH  I   GF  ++ ++NAL+D+YSKCG++  A  +F+++  KD +SWN+++ GYT   L  EAL +F +
Subjt:  EEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQ

Query:  MIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIR--NGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY
        M+R G  P+ V + +IL + +   ++     IH ++ +   GV    S+  SLI MYA CG I+ A  +F  +  + + SWN++I   + H     +   
Subjt:  MIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIR--NGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY

Query:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI
        F  M  +G+ PD +TFV LLS  +H G++  G  ++  M   Y + P +EHY CM++L G +GL +EA  +I   ME+E    +W +LL AC +HGNV++
Subjt:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL
         E  AE L ++EP+N  ++ LL  IY +AGR  +  + R ++ ++G+
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL

Q9SB36 Pentatricopeptide repeat-containing protein At4g25270, chloroplastic2.5e-18361.84Show/hide
Query:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRI-DAEIFSSLLETCYNLRAIDHGIRIHR
        SS  KK  +  ++L Q +  + +     T LSF K S TPLLI  +   +T+++ALD+V+ DLE S   G+ + + EIF+SLLETCY+LRAIDHG+R+H 
Subjt:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRI-DAEIFSSLLETCYNLRAIDHGIRIHR

Query:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA
        LIP  LLR N+G+SSKL+RLYAS GY E AH+VFD M KR+ S F+WNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS++IGEA
Subjt:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA

Query:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVALSTILSSTSSLKFKLHIH
        +HR +V+ GF  DV+VLNALV MY+KCGDIV+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G EPD VA+S++L+   S K    +H
Subjt:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVALSTILSSTSSLKFKLHIH

Query:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSL
        GWVIR G+EW LS+AN+LIV+Y+  G++ +A ++F QM +RD VSWN+IISAH   S  L YFE M      PD +TFVS+LS  A+ G+V++GE+L+SL
Subjt:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSL

Query:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRV
        M  +YGI P MEHYACMVNLYGRAG++EEAY +I + M +EAGPTVWGALLYACYLHGN DI E+AA+RLFELEPDNE NFELL++IY  A R+ED +RV
Subjt:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRV

Query:  RLMMAERGLDS
        R MM +RGL++
Subjt:  RLMMAERGLDS

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial1.9e-7935.16Show/hide
Query:  RIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEE
        R D   +  L + C +LR    G  I   +    L     V +  + ++AS G MENA +VFDE   R++   SWN LI+GY ++G  E A+ +Y  ME 
Subjt:  RIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEE

Query:  EGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLL-----------
        EGV+PD  T   ++ +C  +G +  G+  + ++  +G    + ++NAL+DM+SKCGDI  AR++FD +  +  VSW +M++GY R GLL           
Subjt:  EGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLL-----------

Query:  --------------------LEALDIFDQMIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQM
                             +AL +F +M     +PD + +   LS+ S   +L   + IH ++ +  +  N+++  SL+ MYA CG I  A  +F  +
Subjt:  --------------------LEALDIFDQMIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQM

Query:  PQRDIVSWNSII---SAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIIT
          R+ +++ +II   + H + S A++YF  M   G+ PD +TF+ LLS   H G+++ G   +S MK ++ + P ++HY+ MV+L GRAGL+EEA R++ 
Subjt:  PQRDIVSWNSII---SAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIIT

Query:  KGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGLD
        + M +EA   VWGALL+ C +HGNV++ E AA++L EL+P +   + LL  +YG A   ED KR R MM ERG++
Subjt:  KGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGLD

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic2.1e-8137.14Show/hide
Query:  NGVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQ
        +G+ ID     S+   C + R I  G  +H +       R     + LL +Y+  G +++A  VF EM  R+V   S+ S+I+GYA  GL  +A+ L+ +
Subjt:  NGVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQ

Query:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  I  +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD

Query:  QMIRE-GYEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY
         ++ E  + PD   ++ +L + +SL        IHG+++RNG   +  +ANSL+ MYA CG +  A  LF  +  +D+VSW  +I+    H    EA+  
Subjt:  QMIRE-GYEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY

Query:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI
        F  M   G+  D ++FVSLL   +H GLV EG + +++M+ +  I PT+EHYAC+V++  R G + +AYR I + M +    T+WGALL  C +H +V +
Subjt:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL
        AE  AE++FELEP+N   + L+  IY  A + E  KR+R  + +RGL
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic5.3e-8537.67Show/hide
Query:  EIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVE
        + +  L+  C +  ++   +R+HR I  +   ++  +++KL+ +Y+  G ++ A +VFD+  KR +  + WN+L       G  E+ L LY++M   GVE
Subjt:  EIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVE

Query:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI
         D FT+  VLKAC      +  +  G+ +H H+ R G++  V+++  LVDMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F +M+
Subjt:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI

Query:  REGYE--PDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF
        RE  +  P+SV + ++L + +SL    + KL IHG+++R G++  L + ++L+ MY  CGK++  + +F +M  RD+VSWNS+IS+   H    +A+  F
Subjt:  REGYE--PDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF

Query:  EVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIA
        E M + G  P  VTFVS+L   +H GLV+EG++L+  M   +GI+P +EHYACMV+L GRA  ++EA +++ + M  E GP VWG+LL +C +HGNV++A
Subjt:  EVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIA

Query:  EIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL
        E A+ RLF LEP N  N+ LL  IY  A   ++ KRV+ ++  RGL
Subjt:  EIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.7e-8135.79Show/hide
Query:  GVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQM
        G  +D  + +SL+        ++   ++    P     R+V   + L++ YAS GY+ENA ++FDE+  ++V   SWN++ISGYAE G Y++AL L+  M
Subjt:  GVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQM

Query:  EEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQ
         +  V PD  T   V+ AC   GSI +G  VH  I   GF  ++ ++NAL+D+YSKCG++  A  +F+++  KD +SWN+++ GYT   L  EAL +F +
Subjt:  EEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQ

Query:  MIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIR--NGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY
        M+R G  P+ V + +IL + +   ++     IH ++ +   GV    S+  SLI MYA CG I+ A  +F  +  + + SWN++I   + H     +   
Subjt:  MIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIR--NGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY

Query:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI
        F  M  +G+ PD +TFV LLS  +H G++  G  ++  M   Y + P +EHY CM++L G +GL +EA  +I   ME+E    +W +LL AC +HGNV++
Subjt:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL
         E  AE L ++EP+N  ++ LL  IY +AGR  +  + R ++ ++G+
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL

AT2G22410.1 SLOW GROWTH 11.4e-8035.16Show/hide
Query:  RIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEE
        R D   +  L + C +LR    G  I   +    L     V +  + ++AS G MENA +VFDE   R++   SWN LI+GY ++G  E A+ +Y  ME 
Subjt:  RIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEE

Query:  EGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLL-----------
        EGV+PD  T   ++ +C  +G +  G+  + ++  +G    + ++NAL+DM+SKCGDI  AR++FD +  +  VSW +M++GY R GLL           
Subjt:  EGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLL-----------

Query:  --------------------LEALDIFDQMIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQM
                             +AL +F +M     +PD + +   LS+ S   +L   + IH ++ +  +  N+++  SL+ MYA CG I  A  +F  +
Subjt:  --------------------LEALDIFDQMIREGYEPDSVALSTILSSTS---SLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQM

Query:  PQRDIVSWNSII---SAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIIT
          R+ +++ +II   + H + S A++YF  M   G+ PD +TF+ LLS   H G+++ G   +S MK ++ + P ++HY+ MV+L GRAGL+EEA R++ 
Subjt:  PQRDIVSWNSII---SAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIIT

Query:  KGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGLD
        + M +EA   VWGALL+ C +HGNV++ E AA++L EL+P +   + LL  +YG A   ED KR R MM ERG++
Subjt:  KGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGLD

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-8637.67Show/hide
Query:  EIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVE
        + +  L+  C +  ++   +R+HR I  +   ++  +++KL+ +Y+  G ++ A +VFD+  KR +  + WN+L       G  E+ L LY++M   GVE
Subjt:  EIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVE

Query:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI
         D FT+  VLKAC      +  +  G+ +H H+ R G++  V+++  LVDMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F +M+
Subjt:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI

Query:  REGYE--PDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF
        RE  +  P+SV + ++L + +SL    + KL IHG+++R G++  L + ++L+ MY  CGK++  + +F +M  RD+VSWNS+IS+   H    +A+  F
Subjt:  REGYE--PDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF

Query:  EVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIA
        E M + G  P  VTFVS+L   +H GLV+EG++L+  M   +GI+P +EHYACMV+L GRA  ++EA +++ + M  E GP VWG+LL +C +HGNV++A
Subjt:  EVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIA

Query:  EIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL
        E A+ RLF LEP N  N+ LL  IY  A   ++ KRV+ ++  RGL
Subjt:  EIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-8237.14Show/hide
Query:  NGVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQ
        +G+ ID     S+   C + R I  G  +H +       R     + LL +Y+  G +++A  VF EM  R+V   S+ S+I+GYA  GL  +A+ L+ +
Subjt:  NGVRIDAEIFSSLLETCYNLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQ

Query:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  I  +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD

Query:  QMIRE-GYEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY
         ++ E  + PD   ++ +L + +SL        IHG+++RNG   +  +ANSL+ MYA CG +  A  LF  +  +D+VSW  +I+    H    EA+  
Subjt:  QMIRE-GYEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY

Query:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI
        F  M   G+  D ++FVSLL   +H GLV EG + +++M+ +  I PT+EHYAC+V++  R G + +AYR I + M +    T+WGALL  C +H +V +
Subjt:  FEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL
        AE  AE++FELEP+N   + L+  IY  A + E  KR+R  + +RGL
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGL

AT4G25270.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-18461.84Show/hide
Query:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRI-DAEIFSSLLETCYNLRAIDHGIRIHR
        SS  KK  +  ++L Q +  + +     T LSF K S TPLLI  +   +T+++ALD+V+ DLE S   G+ + + EIF+SLLETCY+LRAIDHG+R+H 
Subjt:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRI-DAEIFSSLLETCYNLRAIDHGIRIHR

Query:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA
        LIP  LLR N+G+SSKL+RLYAS GY E AH+VFD M KR+ S F+WNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS++IGEA
Subjt:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA

Query:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVALSTILSSTSSLKFKLHIH
        +HR +V+ GF  DV+VLNALV MY+KCGDIV+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G EPD VA+S++L+   S K    +H
Subjt:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVALSTILSSTSSLKFKLHIH

Query:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSL
        GWVIR G+EW LS+AN+LIV+Y+  G++ +A ++F QM +RD VSWN+IISAH   S  L YFE M      PD +TFVS+LS  A+ G+V++GE+L+SL
Subjt:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSL

Query:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRV
        M  +YGI P MEHYACMVNLYGRAG++EEAY +I + M +EAGPTVWGALLYACYLHGN DI E+AA+RLFELEPDNE NFELL++IY  A R+ED +RV
Subjt:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRV

Query:  RLMMAERGLDS
        R MM +RGL++
Subjt:  RLMMAERGLDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGATCTCTTTGCAACTGTCAGTTTCTCCCATAGCTCCGCTTCTTCTCCTCTGTTCTTCCAAACCCAAGAAATCCAAGAAAGAAAGAAGGAAACTTCTTCAGGAAAA
ACTGATTCGCATTAGCAAAGCCAAAGAGGCTACTGATCTCTCTTTCCCTAAATCCTCGTCAACCCCACTCTTAATCCACCACAAACCCTTCTCCCAAACCAAAATTCAAG
CCCTTGATGCTGTTGTCGATGACCTTGAAGCTTCCATCGACAATGGCGTCCGTATTGATGCTGAAATTTTCTCTTCCCTCTTGGAAACTTGTTACAATTTGCGAGCCATT
GATCATGGTATTCGGATTCATCGGCTCATACCCACTGATCTTTTACGTAGAAATGTGGGTGTTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGAA
TGCACACCAGGTGTTTGATGAAATGGGTAAACGAAATGTCTCTGCTTTTTCTTGGAATTCTCTTATTTCTGGATATGCTGAACTTGGTCTTTATGAAGATGCTCTGGCTC
TGTACTTCCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTTACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCGAATCGGGGAGGCGGTGCAC
CGGCATATCGTTCGTTCGGGCTTTGCTGGAGACGTCTTTGTCCTCAATGCTCTAGTCGACATGTATTCAAAATGTGGTGACATTGTGAGAGCTAGAAAAGTTTTCGACCA
GATTGTCTCTAAGGATACAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCGAGAAGGGT
ACGAGCCCGACTCGGTTGCTTTATCCACCATTCTTTCTAGCACTTCGTCGCTGAAATTCAAGTTACACATCCATGGATGGGTGATTCGAAACGGAGTCGAATGGAATTTG
TCCATTGCTAACTCCTTGATTGTCATGTATGCCAACTGTGGTAAGATTGACAGAGCAAAATGGCTGTTCCAGCAGATGCCTCAAAGGGACATAGTCTCATGGAACTCCAT
AATCTCTGCTCATTTCAATACCTCAGAAGCTTTGACATATTTTGAAGTGATGGAGAGCCTTGGTGTTTTGCCAGACACTGTGACATTTGTGTCATTGTTGTCAACTTCTG
CTCATCTGGGCTTGGTGAAGGAAGGGGAAAAGTTATATTCTCTTATGAAGGGGAAGTATGGAATAAGACCAACCATGGAACATTATGCTTGTATGGTGAATCTTTATGGG
AGGGCAGGGCTGATTGAAGAAGCTTATAGAATCATAACAAAAGGGATGGAGGTCGAGGCAGGTCCGACCGTATGGGGGGCGCTGTTGTATGCGTGCTATCTTCATGGCAA
TGTAGATATCGCTGAGATTGCTGCTGAAAGACTTTTCGAGTTGGAGCCCGATAACGAGCTCAATTTCGAGCTTCTGATGAAGATTTATGGCAATGCTGGGAGATCCGAAG
ATGAGAAGAGAGTGAGATTAATGATGGCAGAACGAGGATTGGATTCATCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTGATCTCTTTGCAACTGTCAGTTTCTCCCATAGCTCCGCTTCTTCTCCTCTGTTCTTCCAAACCCAAGAAATCCAAGAAAGAAAGAAGGAAACTTCTTCAGGAAAA
ACTGATTCGCATTAGCAAAGCCAAAGAGGCTACTGATCTCTCTTTCCCTAAATCCTCGTCAACCCCACTCTTAATCCACCACAAACCCTTCTCCCAAACCAAAATTCAAG
CCCTTGATGCTGTTGTCGATGACCTTGAAGCTTCCATCGACAATGGCGTCCGTATTGATGCTGAAATTTTCTCTTCCCTCTTGGAAACTTGTTACAATTTGCGAGCCATT
GATCATGGTATTCGGATTCATCGGCTCATACCCACTGATCTTTTACGTAGAAATGTGGGTGTTTCTTCTAAGCTTCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGAA
TGCACACCAGGTGTTTGATGAAATGGGTAAACGAAATGTCTCTGCTTTTTCTTGGAATTCTCTTATTTCTGGATATGCTGAACTTGGTCTTTATGAAGATGCTCTGGCTC
TGTACTTCCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTTACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCGAATCGGGGAGGCGGTGCAC
CGGCATATCGTTCGTTCGGGCTTTGCTGGAGACGTCTTTGTCCTCAATGCTCTAGTCGACATGTATTCAAAATGTGGTGACATTGTGAGAGCTAGAAAAGTTTTCGACCA
GATTGTCTCTAAGGATACAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCGAGAAGGGT
ACGAGCCCGACTCGGTTGCTTTATCCACCATTCTTTCTAGCACTTCGTCGCTGAAATTCAAGTTACACATCCATGGATGGGTGATTCGAAACGGAGTCGAATGGAATTTG
TCCATTGCTAACTCCTTGATTGTCATGTATGCCAACTGTGGTAAGATTGACAGAGCAAAATGGCTGTTCCAGCAGATGCCTCAAAGGGACATAGTCTCATGGAACTCCAT
AATCTCTGCTCATTTCAATACCTCAGAAGCTTTGACATATTTTGAAGTGATGGAGAGCCTTGGTGTTTTGCCAGACACTGTGACATTTGTGTCATTGTTGTCAACTTCTG
CTCATCTGGGCTTGGTGAAGGAAGGGGAAAAGTTATATTCTCTTATGAAGGGGAAGTATGGAATAAGACCAACCATGGAACATTATGCTTGTATGGTGAATCTTTATGGG
AGGGCAGGGCTGATTGAAGAAGCTTATAGAATCATAACAAAAGGGATGGAGGTCGAGGCAGGTCCGACCGTATGGGGGGCGCTGTTGTATGCGTGCTATCTTCATGGCAA
TGTAGATATCGCTGAGATTGCTGCTGAAAGACTTTTCGAGTTGGAGCCCGATAACGAGCTCAATTTCGAGCTTCTGATGAAGATTTATGGCAATGCTGGGAGATCCGAAG
ATGAGAAGAGAGTGAGATTAATGATGGCAGAACGAGGATTGGATTCATCGTAA
Protein sequenceShow/hide protein sequence
MLISLQLSVSPIAPLLLLCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVDDLEASIDNGVRIDAEIFSSLLETCYNLRAI
DHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELGLYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVH
RHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYEPDSVALSTILSSTSSLKFKLHIHGWVIRNGVEWNL
SIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDTVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYG
RAGLIEEAYRIITKGMEVEAGPTVWGALLYACYLHGNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSEDEKRVRLMMAERGLDSS