CuGenDBv2

Lag0019919 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019919
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr5:46716343..46721154
RNA-Seq Expression: Lag0019919
Synteny: Lag0019919
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR029472 - Retrotransposon Copia-like, N-terminal
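
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not something this page states), the string can be split into its parts and the locus span computed as follows. The names `Locus` and `parse_location` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'chrom:start..end' string such as 'chr5:46716343..46721154'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        # Normalise so that start <= end regardless of how it was written.
        start, end = end, start
    return Locus(chrom, start, end)


locus = parse_location("chr5:46716343..46721154")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the Lag0019919 locus spans 4812 bp on chr5.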


Homology
GenBank top hits (accession and description | e-value | % identity, each followed by its alignment)
KAG6600165.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 3.8e-269 | identity: 88.26%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAVLNDLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LETCY+LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSIR+GEAVHRH+VRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIE AYRIIT GMEVEAGPTVWGALL+ACYLH NVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLD
        LLMKIYGNAGR +DEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLD

KAG7030831.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 3.8e-269 | identity: 88.26%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAVLNDLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LETCY+LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSIR+GEAVHRH+VRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIE AYRIIT GMEVEAGPTVWGALL+ACYLH NVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLD
        LLMKIYGNAGR +DEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLD

XP_022942651.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita moschata] | e-value: 9.4e-268 | identity: 88.07%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI SL L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAVLNDLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LETCY+LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLY+ MKGKYGIRPT+EHYACMVNLYGRAGLIE AYRIIT GMEVEAGPTVWGALL+ACYLH NVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLD
        LLMKIYGNAGR +DEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLD

XP_022993795.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita maxima] | e-value: 3.2e-268 | identity: 87.88%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFS++KIQALDAVLNDLEAS+DNGV IDAEIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LETCY+LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQI+ KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++S+ +S KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMPQRDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIE AYRII  GMEVEAGPTVWGALL+ACYLH NVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLD
        LLMKIYGNAGR +DEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLD

XP_038901542.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Benincasa hispida] | e-value: 6.5e-269 | identity: 88.66%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISLQLS+SP  SLLLFCSSKPKKSKKER+KLL +KL+RISKA+E TDL FPKSSSTPLLIH KPF QTKIQALDA+L DLE S+DNG+  D EIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LE CY+LR+I HGIRIHRLIPT+LLRRNVGVSSKLLRLYASFGYME AHQVFDEM KRNVSAF+WNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGI S++IGEAVHRH++RSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KDTVSWNSMLTGYTRHGLL EALDIFDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LSTILS+ SSLKF+LHIHGWVIR+GVEWNLSIANSLIVMYANCGKI+RAKWLFQQMPQ+D VSWNSIISAHFN+ EALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST A+LGLVKEG KLYSLMKGKY IRPT EHYACMVNLYGRAGLIE AYRIITK ME+EAGPTVWGALL+ACYLH NVDIAEIAAERLFELEPDNELNFE
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS
        LLMKIYGNAGRS+DEKRV+LMMAERGLDS
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS
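
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. As a rough sketch, the identity of a single alignment block can be recomputed from the Query line and that middle line. The function name `midline_identity` is a hypothetical helper, and the example uses only the final block of the first GenBank hit (KAG6600165.1), so its value is not expected to match the whole-alignment %identity reported for that hit.

```python
from typing import Tuple


def midline_identity(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical positions, aligned columns) for one alignment block.

    Both strings must already have the 'Query:' label and leading padding
    removed so that column i of one corresponds to column i of the other.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identical, len(query)


# Final block of the KAG6600165.1 alignment, labels and padding stripped.
query = "LLMKIYGNAGRSKDEKRVRLMMAERGLD"
midline = "LLMKIYGNAGR +DEKRVRLMMAERGLD"

identical, aligned = midline_identity(query, midline)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.2f}%)")
```

Summing the two counts over every block of a hit before dividing would give a figure comparable to the reported %identity, assuming the page computes identity over aligned columns.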

TrEMBL top hits (accession and description | e-value | % identity, each followed by its alignment)
A0A0A0L5M0 Uncharacterized protein | e-value: 1.0e-264 | identity: 86.77%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISL L   P +SLLLFCSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSS TPLLIH KPF Q+KIQALDAVL DLEASIDNG+ ID EIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LE CY+L+AI HGIRIHRLIPT+LLRRNVG+SSKLLRLYASFGYME+AHQVFDEMG RN SAF+WNSLISGYAEL LYEDALALYFQMEEEGVEPD+FTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSI+IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQI  KD VSWNSMLTGYTRHGL  EALDIFDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST+LS+ SS+KFKLHIHGWVIR+GVEWNLSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFN++EALTYFEVMESLGV PD VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHLGLVKEG KLY LMKGKYGIRPT+EHYACMVNLYGRAG+IE AY+IITKGME+EAGPT+WGALL+ACYLH +VDIAEIAAERLFELEPDNELNFE
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS
        LLMKIYGNAGRS+DEKRV+LMMAERGL+S
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS

A0A1S4DSF7 pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 8.6e-267 | identity: 87.71%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISLQL   P  SLLLFCSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSSSTPLLIH KPF Q+KIQALDAVL DLE SIDNG+ ID EIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LE CY+LRAI HGIRIHRLIPT+LLRRNVG+SSKLLRLYAS GYME+AHQVFDEMGKRN SAF+WNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSI+IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KD VSWNSMLTGYTRHGL  EALDIFDQMI+EG++PDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST+LS+  SLKFKLHIHGWVIR+GVEWNLSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFNT+EALTYFEVMESLGVLPD VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHLGLVKEG +LYSLMKGKY IRPT+EHYACMVNLYGRAG+IE AY+IITKGME+EAGPT+WGALL+ACYLH NVDIAEIAAERLFELEPDNELNFE
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS
        LLMKIYGNAGRS DEKRV+LMMAERGL+S
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS

A0A5D3DJ70 Pentatricopeptide repeat-containing protein | e-value: 5.0e-267 | identity: 87.71%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISLQL   P  SLLLFCSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSSSTPLLIH KPF Q+KIQALDAVL DLE SIDNG+ ID EIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LE CY+LRAI HGIRIHRLIPT+LLRRNVG+SSKLLRLYAS GYME+AHQVFDEMGKRN SAF+WNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSI+IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KD VSWNSMLTGYTRHGL  EALDIFDQMI+EG++PDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST+LS+  SLKFKLHIHGWVIR+GVEWNLSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFNT+EALTYFEVMESLGVLPD VTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHLGLVKEG +LYSLMKGKY IRPT+EHYACMVNLYGRAG+IE AY+IITKGME+EAGPT+WGALL+ACYLH NVDIAEIAAERLFELEPDNELNFE
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS
        LLMKIYGNAGRS+DEKRV+LMMAERGL+S
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLDS

A0A6J1FPF9 pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 4.5e-268 | identity: 88.07%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI SL L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAVLNDLEAS+ NGV IDAEIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LETCY+LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++S+ SS KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLY+ MKGKYGIRPT+EHYACMVNLYGRAGLIE AYRIIT GMEVEAGPTVWGALL+ACYLH NVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLD
        LLMKIYGNAGR +DEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLD

A0A6J1JZI1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 1.6e-268 | identity: 87.88%
Query:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL
        MLISL+ S+SPI SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFS++KIQALDAVLNDLEAS+DNGV IDAEIFSSL
Subjt:  MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSL

Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        LETCY+LRA+DHGIRIHRLIPT+ LRRNVGVSSKLLRLYASFGYME+AHQVFDEM +RN+SAFSWNSLISGYAEL LYEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA
        PRVLKACGGIGSIR+GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQI+ KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EG+EPDSVA
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVA

Query:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++S+ +S KFKLHIHGW IR+G+EWNLSIANSLI MYAN GKI+RA+WLF+QMPQRDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE
        ST AHL LVKEG KLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIE AYRII  GMEVEAGPTVWGALL+ACYLH NVDIAE+AAE+LFE EPDNELNF+
Subjt:  STSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRSKDEKRVRLMMAERGLD
        LLMKIYGNAGR +DEKRVRLMMAERGLD
Subjt:  LLMKIYGNAGRSKDEKRVRLMMAERGLD

SwissProt top hits (accession and description | e-value | % identity, each followed by its alignment)
O23337 Pentatricopeptide repeat-containing protein At4g14820 | e-value: 2.5e-77 | identity: 33.33%
Query:  GVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQM
        G R+D   F  +L+   K+ A+  G+ +H +        +  V +  + +YAS G +  A  VFDEM  R+V   +WN++I  Y    L ++A  L+ +M
Subjt:  GVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQM

Query:  EEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDM-------------------------------YSKCGDIVRARKVFDQ
        ++  V PD      ++ ACG  G++R   A++  ++ +    D  +L ALV M                               YSKCG +  A+ +FDQ
Subjt:  EEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDM-------------------------------YSKCGDIVRARKVFDQ

Query:  IVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQ
           KD V W +M++ Y       EAL +F++M   G +PD V++ +++S+ ++L        +H  +  NG+E  LSI N+LI MYA CG +D  + +F+
Subjt:  IVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQ

Query:  QMPQRDIVSWNSIISA---HFNTSEALTYFEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRI
        +MP+R++VSW+S+I+A   H   S+AL+ F  M+   V P+ VTFV +L   +H GLV+EG+K+++ M  +Y I P +EHY CMV+L+GRA L+  A  +
Subjt:  QMPQRDIVSWNSIISA---HFNTSEALTYFEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRI

Query:  ITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGLDSSENLASQIAQN
        I + M V +   +WG+L+ AC +H  +++ + AA+R+ ELEPD++    L+  IY    R +D + +R +M E+ +   + L S+I QN
Subjt:  ITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGLDSSENLASQIAQN

P0C899 Putative pentatricopeptide repeat-containing protein At3g49142 | e-value: 1.9e-77 | identity: 32.19%
Query:  LIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNV
        L+H   F + + + + + L  LE       +    +   +L+T   +R +     +H  I  + LR N  +  KL+R YAS   + +A +VFDE+ +RNV
Subjt:  LIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNV

Query:  SAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVS
             N +I  Y     Y + + ++  M    V PDH+TFP VLKAC   G+I IG  +H    + G +  +FV N LV MY KCG +  AR V D++  
Subjt:  SAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVS

Query:  KDTVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDQMIRE
        +D VSWNS++ GY                                                                     ++ + +EA++++ +M  +
Subjt:  KDTVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDQMIRE

Query:  GFEPDSVALSTIL---SSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTS---EALTYFEVMES
        GFEPD+V+++++L     TS+L     IHG++ R  +  NL + N+LI MYA CG +++A+ +F+ M  RD+VSW ++ISA+  +    +A+  F  ++ 
Subjt:  GFEPDSVALSTIL---SSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTS---EALTYFEVMES

Query:  LGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAE
         G++PDS+ FV+ L+  +H GL++EG   + LM   Y I P +EH ACMV+L GRAG ++ AYR I + M +E    VWGALL AC +H + DI  +AA+
Subjt:  LGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAE

Query:  RLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGLDSSENLASQIAQNLI
        +LF+L P+    + LL  IY  AGR ++   +R +M  +GL  +   AS +  N I
Subjt:  RLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGLDSSENLASQIAQNLI

Q9SB36 Pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 5.9e-180 | identity: 60.86%
Query:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRI-DAEIFSSLLETCYKLRAIDHGIRIHR
        SS  KK  +  ++L Q +  + +     T LSF K S TPLLI  +   +T+++ALD+V+ DLE S   G+ + + EIF+SLLETCY LRAIDHG+R+H 
Subjt:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRI-DAEIFSSLLETCYKLRAIDHGIRIHR

Query:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA
        LIP  LLR N+G+SSKL+RLYAS GY E AH+VFD M KR+ S F+WNSLISGYAEL  YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS++IGEA
Subjt:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA

Query:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVALSTILSSTSSLKFKLHIH
        +HR +V+ GF  DV+VLNALV MY+KCGDIV+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G EPD VA+S++L+   S K    +H
Subjt:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVALSTILSSTSSLKFKLHIH

Query:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSL
        GWVIR G+EW LS+AN+LIV+Y+  G++ +A ++F QM +RD VSWN+IISAH   S  L YFE M      PD +TFVS+LS  A+ G+V++GE+L+SL
Subjt:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSL

Query:  MKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRV
        M  +YGI P MEHYACMVNLYGRAG++E AY +I + M +EAGPTVWGALL+ACYLH N DI E+AA+RLFELEPDNE NFELL++IY  A R++D +RV
Subjt:  MKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRV

Query:  RLMMAERGLDS
        R MM +RGL++
Subjt:  RLMMAERGLDS

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | e-value: 9.0e-80 | identity: 36.91%
Query:  NGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQ
        +G+ ID     S+   C   R I  G  +H +       R     + LL +Y+  G +++A  VF EM  R+V   S+ S+I+GYA   L  +A+ L+ +
Subjt:  NGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQ

Query:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  I  +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD

Query:  QMIRE-GFEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY
         ++ E  F PD   ++ +L + +SL        IHG+++RNG   +  +ANSL+ MYA CG +  A  LF  +  +D+VSW  +I+    H    EA+  
Subjt:  QMIRE-GFEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY

Query:  FEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDI
        F  M   G+  D ++FVSLL   +H GLV EG + +++M+ +  I PT+EHYAC+V++  R G +  AYR I + M +    T+WGALL  C +H +V +
Subjt:  FEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL
        AE  AE++FELEP+N   + L+  IY  A + +  KR+R  + +RGL
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic | e-value: 1.3e-81 | identity: 37%
Query:  EIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVE
        + +  L+  C    ++   +R+HR I  +   ++  +++KL+ +Y+  G ++ A +VFD+  KR +  + WN+L          E+ L LY++M   GVE
Subjt:  EIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVE

Query:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI
         D FT+  VLKAC      +  +  G+ +H H+ R G++  V+++  LVDMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F +M+
Subjt:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI

Query:  RE--GFEPDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF
        RE     P+SV + ++L + +SL    + KL IHG+++R G++  L + ++L+ MY  CGK++  + +F +M  RD+VSWNS+IS+   H    +A+  F
Subjt:  RE--GFEPDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF

Query:  EVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIA
        E M + G  P  VTFVS+L   +H GLV+EG++L+  M   +GI+P +EHYACMV+L GRA  ++ A +++ + M  E GP VWG+LL +C +H NV++A
Subjt:  EVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIA

Query:  EIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL
        E A+ RLF LEP N  N+ LL  IY  A    + KRV+ ++  RGL
Subjt:  EIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL

Arabidopsis top hits (accession and description | e-value | % identity, each followed by its alignment)
AT1G06140.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 1.7e-78 | identity: 38.57%
Query:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF
        ++ C  L  +++GI IH L   + L ++  V+  L+ +YA  G ME+A +VFDE+  RN  +  W  L+ GY + +   +   L+  M + G+  D  T 
Subjt:  LETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNA-LVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSV
          ++KACG + + ++G+ VH   +R  F      L A ++DMY KC  +  ARK+F+  V ++ V W ++++G+ +    +EA D+F QM+RE   P+  
Subjt:  PRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNA-LVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSV

Query:  ALSTILSSTSSLKFKLH---IHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYFEVMESLGVLPDS
         L+ IL S SSL    H   +HG++IRNG+E +     S I MYA CG I  A+ +F  MP+R+++SW+S+I+A   +    EAL  F  M+S  V+P+S
Subjt:  ALSTILSSTSSLKFKLH---IHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYFEVMESLGVLPDS

Query:  VTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEP
        VTFVSLLS  +H G VKEG K +  M   YG+ P  EHYACMV+L GRAG I G  +     M V+   + WGALL AC +H  VD+A   AE+L  +EP
Subjt:  VTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEP

Query:  DNELNFELLMKIYGNAGRSKDEKRVRLMMAERG
        +    + LL  IY +AG  +    VR  M  +G
Subjt:  DNELNFELLMKIYGNAGRSKDEKRVRLMMAERG

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 8.9e-83 | identity: 37%
Query:  EIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVE
        + +  L+  C    ++   +R+HR I  +   ++  +++KL+ +Y+  G ++ A +VFD+  KR +  + WN+L          E+ L LY++M   GVE
Subjt:  EIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVE

Query:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI
         D FT+  VLKAC      +  +  G+ +H H+ R G++  V+++  LVDMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F +M+
Subjt:  PDHFTFPRVLKACGG----IGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMI

Query:  RE--GFEPDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF
        RE     P+SV + ++L + +SL    + KL IHG+++R G++  L + ++L+ MY  CGK++  + +F +M  RD+VSWNS+IS+   H    +A+  F
Subjt:  RE--GFEPDSVALSTILSSTSSL----KFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF

Query:  EVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIA
        E M + G  P  VTFVS+L   +H GLV+EG++L+  M   +GI+P +EHYACMV+L GRA  ++ A +++ + M  E GP VWG+LL +C +H NV++A
Subjt:  EVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIA

Query:  EIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL
        E A+ RLF LEP N  N+ LL  IY  A    + KRV+ ++  RGL
Subjt:  EIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 1.3e-78 | identity: 32.19%
Query:  LIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNV
        L+H   F + + + + + L  LE       +    +   +L+T   +R +     +H  I  + LR N  +  KL+R YAS   + +A +VFDE+ +RNV
Subjt:  LIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNV

Query:  SAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVS
             N +I  Y     Y + + ++  M    V PDH+TFP VLKAC   G+I IG  +H    + G +  +FV N LV MY KCG +  AR V D++  
Subjt:  SAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVS

Query:  KDTVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDQMIRE
        +D VSWNS++ GY                                                                     ++ + +EA++++ +M  +
Subjt:  KDTVSWNSMLTGYT--------------------------------------------------------------------RHGLLLEALDIFDQMIRE

Query:  GFEPDSVALSTIL---SSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTS---EALTYFEVMES
        GFEPD+V+++++L     TS+L     IHG++ R  +  NL + N+LI MYA CG +++A+ +F+ M  RD+VSW ++ISA+  +    +A+  F  ++ 
Subjt:  GFEPDSVALSTIL---SSTSSLKFKLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTS---EALTYFEVMES

Query:  LGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAE
         G++PDS+ FV+ L+  +H GL++EG   + LM   Y I P +EH ACMV+L GRAG ++ AYR I + M +E    VWGALL AC +H + DI  +AA+
Subjt:  LGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAE

Query:  RLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGLDSSENLASQIAQNLI
        +LF+L P+    + LL  IY  AGR ++   +R +M  +GL  +   AS +  N I
Subjt:  RLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGLDSSENLASQIAQNLI

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 6.4e-81 | %identity: 36.91
Query:  NGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQ
        +G+ ID     S+   C   R I  G  +H +       R     + LL +Y+  G +++A  VF EM  R+V   S+ S+I+GYA   L  +A+ L+ +
Subjt:  NGVRIDAEIFSSLLETCYKLRAIDHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQ

Query:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  I  +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFD

Query:  QMIRE-GFEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY
         ++ E  F PD   ++ +L + +SL        IHG+++RNG   +  +ANSL+ MYA CG +  A  LF  +  +D+VSW  +I+    H    EA+  
Subjt:  QMIRE-GFEPDSVALSTILSSTSSLKF---KLHIHGWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY

Query:  FEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDI
        F  M   G+  D ++FVSLL   +H GLV EG + +++M+ +  I PT+EHYAC+V++  R G +  AYR I + M +    T+WGALL  C +H +V +
Subjt:  FEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDI

Query:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL
        AE  AE++FELEP+N   + L+  IY  A + +  KR+R  + +RGL
Subjt:  AEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGL

AT4G25270.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 4.2e-181 | %identity: 60.86
Query:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRI-DAEIFSSLLETCYKLRAIDHGIRIHR
        SS  KK  +  ++L Q +  + +     T LSF K S TPLLI  +   +T+++ALD+V+ DLE S   G+ + + EIF+SLLETCY LRAIDHG+R+H 
Subjt:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRI-DAEIFSSLLETCYKLRAIDHGIRIHR

Query:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA
        LIP  LLR N+G+SSKL+RLYAS GY E AH+VFD M KR+ S F+WNSLISGYAEL  YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS++IGEA
Subjt:  LIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEA

Query:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVALSTILSSTSSLKFKLHIH
        +HR +V+ GF  DV+VLNALV MY+KCGDIV+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G EPD VA+S++L+   S K    +H
Subjt:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVALSTILSSTSSLKFKLHIH

Query:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSL
        GWVIR G+EW LS+AN+LIV+Y+  G++ +A ++F QM +RD VSWN+IISAH   S  L YFE M      PD +TFVS+LS  A+ G+V++GE+L+SL
Subjt:  GWVIRNGVEWNLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSL

Query:  MKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRV
        M  +YGI P MEHYACMVNLYGRAG++E AY +I + M +EAGPTVWGALL+ACYLH N DI E+AA+RLFELEPDNE NFELL++IY  A R++D +RV
Subjt:  MKGKYGIRPTMEHYACMVNLYGRAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRV

Query:  RLMMAERGLDS
        R MM +RGL++
Subjt:  RLMMAERGLDS


Sequences
CDS sequence
ATGCTGATCTCTTTGCAACTGTCAGTTTCTCCCATAGCTTCGCTTCTTCTCTTCTGTTCTTCCAAACCCAAGAAATCCAAGAAAGAAAGAAGGAAACTTCTTCAGGAAAA
ACTGATTCGCATTAGCAAAGCCAAAGAGGCCACTGATCTCTCTTTCCCTAAATCCTCATCAACCCCACTCTTAATCCACCACAAACCCTTCTCCCAAACCAAAATTCAAG
CCCTTGATGCTGTTCTCAATGACCTTGAAGCTTCCATCGACAATGGCGTCCGTATTGATGCTGAAATTTTCTCTTCCCTCTTGGAAACTTGTTACAAATTGCGAGCCATT
GACCATGGTATTCGGATTCATCGCCTCATACCCACTGATCTCTTACGTAGAAATGTGGGTGTTTCTTCTAAGCTGCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGAA
TGCACACCAGGTGTTTGATGAAATGGGTAAACGAAATGTCTCTGCTTTTTCGTGGAATTCTCTTATTTCTGGATATGCTGAACTTGCTCTTTATGAAGATGCTCTGGCTC
TGTACTTCCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTTACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCGAATCGGGGAGGCGGTGCAC
CGACATATCGTTCGTTCGGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTCTAGTCGACATGTATTCCAAATGTGGTGACATTGTGAGAGCTAGAAAAGTTTTCGATCA
GATTGTCTCTAAGGATACAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCGAGAAGGGT
TCGAGCCCGATTCGGTTGCTTTATCCACCATTCTTTCTAGCACTTCGTCGCTGAAATTCAAGTTACACATCCATGGATGGGTGATTCGAAACGGAGTCGAGTGGAATTTG
TCCATTGCTAACTCTTTGATTGTCATGTATGCCAACTGTGGTAAGATTGACAGAGCAAAATGGCTGTTCCAGCAGATGCCTCAAAGGGACATAGTCTCATGGAACTCCAT
AATCTCTGCTCATTTCAATACCTCAGAAGCTTTGACATATTTTGAAGTGATGGAGAGCCTTGGTGTTTTGCCAGACAGTGTAACATTTGTGTCATTGTTGTCAACTTCTG
CTCATCTGGGCTTGGTGAAGGAAGGGGAAAAGTTATATTCTCTTATGAAGGGGAAGTATGGAATAAGACCAACCATGGAACATTATGCTTGTATGGTGAATCTTTATGGG
AGGGCAGGGCTGATTGAAGGAGCTTATAGAATCATAACAAAAGGGATGGAGGTTGAGGCAGGTCCGACCGTATGGGGGGCGCTGTTGTTTGCGTGCTATCTTCACGACAA
TGTAGATATCGCCGAGATTGCTGCTGAAAGACTTTTCGAGTTGGAGCCCGACAATGAGCTCAATTTCGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCCAAAG
ATGAGAAGAGAGTGAGATTAATGATGGCGGAACGAGGATTGGATTCGTCAGAGAATCTTGCTTCCCAAATAGCTCAGAATCTTATTGCTCCTCAGTCCACCAGTCAATCC
GTCGTCGACCTATACACCAATCCCTATTACCTGCATCATTCGAATCGGACAAACCTCGTTCTCGTCTCAGAGCTTCTCACCGAGAACAATTACACATCCTGGCATCAAGC
TTTGATGATAGGTCTCACAGTCAAGAACAAACTGGGGTTTATCGATGGATCTTTACCGCGTCCAACTGGCGAACTCCTTCGTTCTTGGACGATTTGTAACAGTGTGGTCA
AGGCATGGATTTTGAATGCCATATCCAAAGAAATTGCTGCTAGTGTCAACTATGCGGACTCTGCTTTCGATATGTGGAGTGATCTCCAACAAAGATACCAACGGAAAAAT
CGCCCGGGCATTTTTCAGCTCCGGCGCGAAATCTCCAATCTGGCACAAGATCAGCTCTCAGTCTTTGCATACTTCGCCAAATTAAAGTCACTATGGAATGAATTAATCTC
TTACCGCTCTTCTTGTTCCTGCGGCCTTTGTTCCTACGGCAGTGTCAAAGATTTGGCAAAATAA
mRNA sequence
ATGCTGATCTCTTTGCAACTGTCAGTTTCTCCCATAGCTTCGCTTCTTCTCTTCTGTTCTTCCAAACCCAAGAAATCCAAGAAAGAAAGAAGGAAACTTCTTCAGGAAAA
ACTGATTCGCATTAGCAAAGCCAAAGAGGCCACTGATCTCTCTTTCCCTAAATCCTCATCAACCCCACTCTTAATCCACCACAAACCCTTCTCCCAAACCAAAATTCAAG
CCCTTGATGCTGTTCTCAATGACCTTGAAGCTTCCATCGACAATGGCGTCCGTATTGATGCTGAAATTTTCTCTTCCCTCTTGGAAACTTGTTACAAATTGCGAGCCATT
GACCATGGTATTCGGATTCATCGCCTCATACCCACTGATCTCTTACGTAGAAATGTGGGTGTTTCTTCTAAGCTGCTTCGTCTGTATGCTTCTTTTGGGTACATGGAGAA
TGCACACCAGGTGTTTGATGAAATGGGTAAACGAAATGTCTCTGCTTTTTCGTGGAATTCTCTTATTTCTGGATATGCTGAACTTGCTCTTTATGAAGATGCTCTGGCTC
TGTACTTCCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTTACTTTTCCTCGTGTGCTCAAGGCCTGTGGTGGCATTGGGTCGATTCGAATCGGGGAGGCGGTGCAC
CGACATATCGTTCGTTCGGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTCTAGTCGACATGTATTCCAAATGTGGTGACATTGTGAGAGCTAGAAAAGTTTTCGATCA
GATTGTCTCTAAGGATACAGTTTCCTGGAACTCAATGCTCACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCGAGAAGGGT
TCGAGCCCGATTCGGTTGCTTTATCCACCATTCTTTCTAGCACTTCGTCGCTGAAATTCAAGTTACACATCCATGGATGGGTGATTCGAAACGGAGTCGAGTGGAATTTG
TCCATTGCTAACTCTTTGATTGTCATGTATGCCAACTGTGGTAAGATTGACAGAGCAAAATGGCTGTTCCAGCAGATGCCTCAAAGGGACATAGTCTCATGGAACTCCAT
AATCTCTGCTCATTTCAATACCTCAGAAGCTTTGACATATTTTGAAGTGATGGAGAGCCTTGGTGTTTTGCCAGACAGTGTAACATTTGTGTCATTGTTGTCAACTTCTG
CTCATCTGGGCTTGGTGAAGGAAGGGGAAAAGTTATATTCTCTTATGAAGGGGAAGTATGGAATAAGACCAACCATGGAACATTATGCTTGTATGGTGAATCTTTATGGG
AGGGCAGGGCTGATTGAAGGAGCTTATAGAATCATAACAAAAGGGATGGAGGTTGAGGCAGGTCCGACCGTATGGGGGGCGCTGTTGTTTGCGTGCTATCTTCACGACAA
TGTAGATATCGCCGAGATTGCTGCTGAAAGACTTTTCGAGTTGGAGCCCGACAATGAGCTCAATTTCGAGCTTTTGATGAAGATTTATGGCAATGCTGGGAGATCCAAAG
ATGAGAAGAGAGTGAGATTAATGATGGCGGAACGAGGATTGGATTCGTCAGAGAATCTTGCTTCCCAAATAGCTCAGAATCTTATTGCTCCTCAGTCCACCAGTCAATCC
GTCGTCGACCTATACACCAATCCCTATTACCTGCATCATTCGAATCGGACAAACCTCGTTCTCGTCTCAGAGCTTCTCACCGAGAACAATTACACATCCTGGCATCAAGC
TTTGATGATAGGTCTCACAGTCAAGAACAAACTGGGGTTTATCGATGGATCTTTACCGCGTCCAACTGGCGAACTCCTTCGTTCTTGGACGATTTGTAACAGTGTGGTCA
AGGCATGGATTTTGAATGCCATATCCAAAGAAATTGCTGCTAGTGTCAACTATGCGGACTCTGCTTTCGATATGTGGAGTGATCTCCAACAAAGATACCAACGGAAAAAT
CGCCCGGGCATTTTTCAGCTCCGGCGCGAAATCTCCAATCTGGCACAAGATCAGCTCTCAGTCTTTGCATACTTCGCCAAATTAAAGTCACTATGGAATGAATTAATCTC
TTACCGCTCTTCTTGTTCCTGCGGCCTTTGTTCCTACGGCAGTGTCAAAGATTTGGCAAAATAA
Protein sequence
MLISLQLSVSPIASLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVLNDLEASIDNGVRIDAEIFSSLLETCYKLRAI
DHGIRIHRLIPTDLLRRNVGVSSKLLRLYASFGYMENAHQVFDEMGKRNVSAFSWNSLISGYAELALYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSIRIGEAVH
RHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVSKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGFEPDSVALSTILSSTSSLKFKLHIHGWVIRNGVEWNL
SIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTSAHLGLVKEGEKLYSLMKGKYGIRPTMEHYACMVNLYG
RAGLIEGAYRIITKGMEVEAGPTVWGALLFACYLHDNVDIAEIAAERLFELEPDNELNFELLMKIYGNAGRSKDEKRVRLMMAERGLDSSENLASQIAQNLIAPQSTSQS
VVDLYTNPYYLHHSNRTNLVLVSELLTENNYTSWHQALMIGLTVKNKLGFIDGSLPRPTGELLRSWTICNSVVKAWILNAISKEIAASVNYADSAFDMWSDLQQRYQRKN
RPGIFQLRREISNLAQDQLSVFAYFAKLKSLWNELISYRSSCSCGLCSYGSVKDLAK