CuGenDBv2

Tan0020323 (gene) of Snake gourd v1 genome

Gene ID: Tan0020323
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG01:3167556..3169277
RNA-Seq Expression: Tan0020323
Synteny: Tan0020323
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (columns: hit | e-value | % identity | alignment)
KAG6600165.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 1.5e-273 | identity: 88.64%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISL+ SISP+ SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV+NDLEAS+ NGVPIDAEIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LETCYQLRA+DHG RIHRLIPTN LRRNVGVSSKLLRLYASFGY+E+AHQVFDEM +RNLSAF+WNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI +GEAVHRH+VRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIV KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYE DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++SNISS KFKLHIHGW IRHG+EW+LSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHL LVKEGGKLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRI+T  +E+EAGPTVWGALLYACYLHGNVDIAEVAAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLD
        LLMKIYGNAGR+EDEKRVRLMM ERGLD
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLD

KAG7030831.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.5e-273 | identity: 88.64%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISL+ SISP+ SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV+NDLEAS+ NGVPIDAEIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LETCYQLRA+DHG RIHRLIPTN LRRNVGVSSKLLRLYASFGY+E+AHQVFDEM +RNLSAF+WNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI +GEAVHRH+VRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIV KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYE DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++SNISS KFKLHIHGW IRHG+EW+LSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHL LVKEGGKLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRI+T  +E+EAGPTVWGALLYACYLHGNVDIAEVAAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLD
        LLMKIYGNAGR+EDEKRVRLMM ERGLD
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLD

XP_022942651.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita moschata] | e-value: 3.7e-272 | identity: 88.45%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISL+ SISP+ SL L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV+NDLEAS+ NGVPIDAEIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LETCYQLRA+DHG RIHRLIPTN LRRNVGVSSKLLRLYASFGY+E+AHQVFDEM +RNLSAF+WNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI +GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIV KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYE DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++SNISS KFKLHIHGW IRHG+EW+LSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHL LVKEGGKLY+ MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRI+T  +E+EAGPTVWGALLYACYLHGNVDIAEVAAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLD
        LLMKIYGNAGR+EDEKRVRLMM ERGLD
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLD

XP_022993795.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Cucurbita maxima] | e-value: 5.8e-273 | identity: 88.45%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISL+ SISP+ SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFS++KIQALDAV+NDLEAS+DNGVPIDAEIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LETCYQLRA+DHG RIHRLIPTN LRRNVGVSSKLLRLYASFGY+E+AHQVFDEM +RNLSAF+WNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI +GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQI+ KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYE DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++SNI+S KFKLHIHGW IRHG+EW+LSIANSLI MYAN GKI+RA+WLF+QMPQRDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHL LVKEGGKLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRI+   +E+EAGPTVWGALLYACYLHGNVDIAEVAAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLD
        LLMKIYGNAGR+EDEKRVRLMM ERGLD
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLD

XP_038901542.1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic [Benincasa hispida] | e-value: 1.7e-272 | identity: 89.22%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISLQ SISP  SLLLFCSSKPKKSKKER+KLL +KL+RISKA+E TDL FPKSSSTPLLIH KPF QTKIQALDA++ DLE SVDNG+  D EIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LE CYQLR+I HG RIHRLIPTNLLRRNVGVSSKLLRLYASFGY+E AHQVFDEM KRN+SAFAWNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGI S+ IGEAVHRH++RSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIVCKDTVSWNSMLTGYTRHGLL EALDIFDQMI+EGYE DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LSTILSNISSLKF+LHIHGWVIRHGVEW+LSIANSLIVMYANCGKI+RAKWLFQQMPQ+D VSWNSIISAHFN+ EALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCA+LGLVKEGGKLYSLMKGKY IRPT EHYACMVNLYGRAGLIEEAYRI+T  +EIEAGPTVWGALLYACYLH NVDIAE+AAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLDS
        LLMKIYGNAGR EDEKRV+LMM ERGLDS
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLDS
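
In the alignment blocks above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows one way to tally those marks for a single block; it uses the final block of the first GenBank hit (KAG6600165.1) as input. The function name `midline_stats` is hypothetical and not part of any BLAST or CuGenDB tooling, and the per-block percentage it prints is not the whole-alignment %identity reported in the hit header.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumes `query` and `midline` are the residue strings only, with the
    'Query:' label and leading padding already stripped, so that columns
    line up one-to-one.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A midline letter equal to the query residue marks an identical column.
    identities = sum(1 for q, m in zip(query, midline) if m == q and q != "-")
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Final alignment block of hit KAG6600165.1, copied from the listing above.
query = "LLMKIYGNAGRLEDEKRVRLMMVERGLD"
midline = "LLMKIYGNAGR+EDEKRVRLMM ERGLD"

ident, pos, length = midline_stats(query, midline)
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%), "
      f"{pos}/{length} positive")
# prints: 26/28 identical (92.86%), 27/28 positive
```

Summing the three counts over every block of a hit, then dividing identities by the total aligned length, should approximate the header's %identity, provided the blocks cover the full alignment.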

TrEMBL top hits (columns: hit | e-value | % identity | alignment)
A0A0A0L5M0 Uncharacterized protein | e-value: 2.4e-269 | identity: 85.48%
Query:  NDRFGTAIFQRFHCPMLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEAS
        ND F   + QRF  PMLISL     P +SLLLFCSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSS TPLLIH KPF Q+KIQALDAV+ DLEAS
Subjt:  NDRFGTAIFQRFHCPMLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEAS

Query:  VDNGVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALY
        +DNG+ ID EIFSSLLE CYQL+AI HG RIHRLIPTNLLRRNVG+SSKLLRLYASFGY+E+AHQVFDEMG RN SAFAWNSLISGYAELG YEDALALY
Subjt:  VDNGVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALY

Query:  FQMEEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDI
        FQMEEEGVEPD+FTFPRVLKACGGIGSI IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQI  KD VSWNSMLTGYTRHGL  EALDI
Subjt:  FQMEEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDI

Query:  FDQMIREGYESDSVALSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVME
        FDQMI+EGYE DSVALST+LSNISS+KFKLHIHGWVIRHGVEW+LSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFN++EALTYFEVME
Subjt:  FDQMIREGYESDSVALSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVME

Query:  SLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAA
        SLGV PD VTFVSLLSTCAHLGLVKEGGKLY LMKGKYGIRPT+EHYACMVNLYGRAG+IEEAY+I+T  +EIEAGPT+WGALLYACYLH +VDIAE+AA
Subjt:  SLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAA

Query:  ERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGLDS
        ERLFELEPDNELNFELLMKIYGNAGR EDEKRV+LMM ERGL+S
Subjt:  ERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGLDS

A0A1S4DSF7 pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 2.7e-268 | identity: 87.33%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISLQ    P  SLLLFCSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSSSTPLLIH KPF Q+KIQALDAV+ DLE S+DNG+ ID EIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LE CYQLRAI HG RIHRLIPTNLLRRNVG+SSKLLRLYAS GY+E+AHQVFDEMGKRN SAFAWNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KD VSWNSMLTGYTRHGL  EALDIFDQMI+EGY+ DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST+LSNI SLKFKLHIHGWVIRHGVEW+LSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFNT+EALTYFEVMESLGVLPD VTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHLGLVKEGG+LYSLMKGKY IRPT+EHYACMVNLYGRAG+IEEAY+I+T  +EIEAGPT+WGALLYACYLH NVDIAE+AAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLDS
        LLMKIYGNAGR +DEKRV+LMM ERGL+S
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLDS

A0A5D3DJ70 Pentatricopeptide repeat-containing protein | e-value: 1.2e-268 | identity: 87.52%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISLQ    P  SLLLFCSSKPKKSKKERRKLL +KL+RISKAK++TDLSFPKSSSTPLLIH KPF Q+KIQALDAV+ DLE S+DNG+ ID EIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LE CYQLRAI HG RIHRLIPTNLLRRNVG+SSKLLRLYAS GY+E+AHQVFDEMGKRN SAFAWNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI IGEAVHRH+VRSGFAGDVFVLNALVDMYSKCG IVRARKVFDQIV KD VSWNSMLTGYTRHGL  EALDIFDQMI+EGY+ DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST+LSNI SLKFKLHIHGWVIRHGVEW+LSIANSLIVMYA CGK++RAKWLFQQMPQ+D+VSWNSIISAHFNT+EALTYFEVMESLGVLPD VTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHLGLVKEGG+LYSLMKGKY IRPT+EHYACMVNLYGRAG+IEEAY+I+T  +EIEAGPT+WGALLYACYLH NVDIAE+AAERLFELEPDNELNFE
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLDS
        LLMKIYGNAGR EDEKRV+LMM ERGL+S
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLDS

A0A6J1FPF9 pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 1.8e-272 | identity: 88.45%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISL+ SISP+ SL L CSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFSQ+KIQALDAV+NDLEAS+ NGVPIDAEIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LETCYQLRA+DHG RIHRLIPTN LRRNVGVSSKLLRLYASFGY+E+AHQVFDEM +RNLSAF+WNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI +GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQIV KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYE DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++SNISS KFKLHIHGW IRHG+EW+LSIANSLI MYAN GKI+RA+WLF+QMP+RDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHL LVKEGGKLY+ MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRI+T  +E+EAGPTVWGALLYACYLHGNVDIAEVAAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLD
        LLMKIYGNAGR+EDEKRVRLMM ERGLD
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLD

A0A6J1JZI1 pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 2.8e-273 | identity: 88.45%
Query:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL
        MLISL+ SISP+ SL LFCSS PKKSKKERRKLLQEKLIRISKAKEAT L FPKSSSTPLLIHHKPFS++KIQALDAV+NDLEAS+DNGVPIDAEIFSSL
Subjt:  MLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPIDAEIFSSL

Query:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF
        LETCYQLRA+DHG RIHRLIPTN LRRNVGVSSKLLRLYASFGY+E+AHQVFDEM +RNLSAF+WNSLISGYAELG YEDALALYFQMEEEGVEPDHFTF
Subjt:  LETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTF

Query:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA
        PRVLKACGGIGSI +GEAVHRHIVRSGFAGD+FVLNALVDMY+KCGDI+RARKVFDQI+ KDTVSWNSMLTGYTRHGLLLEAL+ FDQMI+EGYE DSVA
Subjt:  PRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVA

Query:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL
        LST++SNI+S KFKLHIHGW IRHG+EW+LSIANSLI MYAN GKI+RA+WLF+QMPQRDIVSWN+IISAH NTS+ALTYFEVMESLGVLPDSVTFVSLL
Subjt:  LSTILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLL

Query:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE
        STCAHL LVKEGGKLYS+MKGKYGIRPT+EHYACMVNLYGRAGLIEEAYRI+   +E+EAGPTVWGALLYACYLHGNVDIAEVAAE+LFE EPDNELNF+
Subjt:  STCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFE

Query:  LLMKIYGNAGRLEDEKRVRLMMVERGLD
        LLMKIYGNAGR+EDEKRVRLMM ERGLD
Subjt:  LLMKIYGNAGRLEDEKRVRLMMVERGLD

SwissProt top hits (columns: hit | e-value | % identity | alignment)
Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | e-value: 2.4e-80 | identity: 35.35%
Query:  GVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM
        G  +D  + +SL+    Q   ++   ++    P     R+V   + L++ YAS GYIENA ++FDE+  +++   +WN++ISGYAE G Y++AL L+  M
Subjt:  GVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM

Query:  EEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ
         +  V PD  T   V+ AC   GSI +G  VH  I   GF  ++ ++NAL+D+YSKCG++  A  +F+++  KD +SWN+++ GYT   L  EAL +F +
Subjt:  EEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ

Query:  MIREGYESDSVALSTIL---SNISSLKFKLHIHGWVIRH--GVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY
        M+R G   + V + +IL   +++ ++     IH ++ +   GV    S+  SLI MYA CG I+ A  +F  +  + + SWN++I   + H     +   
Subjt:  MIREGYESDSVALSTIL---SNISSLKFKLHIHGWVIRH--GVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY

Query:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI
        F  M  +G+ PD +TFV LLS C+H G++  G  ++  M   Y + P +EHY CM++L G +GL +EA  ++   +E+E    +W +LL AC +HGNV++
Subjt:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI

Query:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
         E  AE L ++EP+N  ++ LL  IY +AGR  +  + R ++ ++G+
Subjt:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial | e-value: 7.7e-79 | identity: 35.87%
Query:  FSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM------EE
        F   ++ C  L  I  G + H+       + ++ VSS L+ +Y++ G +E+A +VFDE+ KRN+   +W S+I GY   G   DA++L+  +      ++
Subjt:  FSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM------EE

Query:  EGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGD--IVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ
        + +  D      V+ AC  + +  + E++H  +++ GF   V V N L+D Y+K G+  +  ARK+FDQIV KD VS+NS+++ Y + G+  EA ++F +
Subjt:  EGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGD--IVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ

Query:  MIREGYES-DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF
        +++    + +++ LST+L   S+  +L+    IH  VIR G+E D+ +  S+I MY  CG+++ A+  F +M  +++ SW ++I+    H + ++AL  F
Subjt:  MIREGYES-DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF

Query:  EVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIA
          M   GV P+ +TFVS+L+ C+H GL  EG + ++ MKG++G+ P +EHY CMV+L GRAG +++AY ++  R++++    +W +LL AC +H NV++A
Subjt:  EVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIA

Query:  EVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
        E++  RLFEL+  N   + LL  IY +AGR +D +RVR++M  RGL
Subjt:  EVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

Q9SB36 Pentatricopeptide repeat-containing protein At4g25270, chloroplastic | e-value: 3.4e-183 | identity: 61.84%
Query:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPI-DAEIFSSLLETCYQLRAIDHGFRIHR
        SS  KK  +  ++L Q +  + +     T LSF K S TPLLI  +   +T+++ALD+V+ DLE S   G+ + + EIF+SLLETCY LRAIDHG R+H 
Subjt:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPI-DAEIFSSLLETCYQLRAIDHGFRIHR

Query:  LIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSILIGEA
        LIP  LLR N+G+SSKL+RLYAS GY E AH+VFD M KR+ S FAWNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS+ IGEA
Subjt:  LIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSILIGEA

Query:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVALSTILSNISSLKFKLHIH
        +HR +V+ GF  DV+VLNALV MY+KCGDIV+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G E D VA+S++L+ + S K    +H
Subjt:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVALSTILSNISSLKFKLHIH

Query:  GWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSL
        GWVIR G+EW+LS+AN+LIV+Y+  G++ +A ++F QM +RD VSWN+IISAH   S  L YFE M      PD +TFVS+LS CA+ G+V++G +L+SL
Subjt:  GWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSL

Query:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRV
        M  +YGI P MEHYACMVNLYGRAG++EEAY ++   + +EAGPTVWGALLYACYLHGN DI EVAA+RLFELEPDNE NFELL++IY  A R ED +RV
Subjt:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRV

Query:  RLMMVERGLDS
        R MMV+RGL++
Subjt:  RLMMVERGLDS

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | e-value: 7.0e-80 | identity: 36.24%
Query:  NGVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQ
        +G+ ID     S+   C   R I  G  +H +       R     + LL +Y+  G +++A  VF EM  R  S  ++ S+I+GYA  G   +A+ L+ +
Subjt:  NGVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQ

Query:  MEEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  I  +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFD

Query:  QMIRE-GYESDSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY
         ++ E  +  D   ++ +L   +++S+      IHG+++R+G   D  +ANSL+ MYA CG +  A  LF  +  +D+VSW  +I+    H    EA+  
Subjt:  QMIRE-GYESDSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY

Query:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI
        F  M   G+  D ++FVSLL  C+H GLV EG + +++M+ +  I PT+EHYAC+V++  R G + +AYR +   + I    T+WGALL  C +H +V +
Subjt:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI

Query:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
        AE  AE++FELEP+N   + L+  IY  A + E  KR+R  + +RGL
Subjt:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

Q9STF3 Pentatricopeptide repeat-containing protein At3g46790, chloroplastic | e-value: 3.8e-86 | identity: 37.53%
Query:  EIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVE
        + +  L+  C    ++    R+HR I  N   ++  +++KL+ +Y+  G ++ A +VFD+  KR +  + WN+L       G  E+ L LY++M   GVE
Subjt:  EIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVE

Query:  PDHFTFPRVLKACGG----IGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMI
         D FT+  VLKAC      +  ++ G+ +H H+ R G++  V+++  LVDMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F +M+
Subjt:  PDHFTFPRVLKACGG----IGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMI

Query:  REGYES--DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYFE
        RE  +S  +SV + ++L   +++++L+    IHG+++R G++  L + ++L+ MY  CGK++  + +F +M  RD+VSWNS+IS+   H    +A+  FE
Subjt:  REGYES--DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYFE

Query:  VMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAE
         M + G  P  VTFVS+L  C+H GLV+EG +L+  M   +GI+P +EHYACMV+L GRA  ++EA ++V   +  E GP VWG+LL +C +HGNV++AE
Subjt:  VMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAE

Query:  VAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
         A+ RLF LEP N  N+ LL  IY  A   ++ KRV+ ++  RGL
Subjt:  VAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

Arabidopsis top hits (columns: hit | e-value | % identity | alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 1.7e-81 | identity: 35.35%
Query:  GVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM
        G  +D  + +SL+    Q   ++   ++    P     R+V   + L++ YAS GYIENA ++FDE+  +++   +WN++ISGYAE G Y++AL L+  M
Subjt:  GVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM

Query:  EEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ
         +  V PD  T   V+ AC   GSI +G  VH  I   GF  ++ ++NAL+D+YSKCG++  A  +F+++  KD +SWN+++ GYT   L  EAL +F +
Subjt:  EEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ

Query:  MIREGYESDSVALSTIL---SNISSLKFKLHIHGWVIRH--GVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY
        M+R G   + V + +IL   +++ ++     IH ++ +   GV    S+  SLI MYA CG I+ A  +F  +  + + SWN++I   + H     +   
Subjt:  MIREGYESDSVALSTIL---SNISSLKFKLHIHGWVIRH--GVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSII---SAHFNTSEALTY

Query:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI
        F  M  +G+ PD +TFV LLS C+H G++  G  ++  M   Y + P +EHY CM++L G +GL +EA  ++   +E+E    +W +LL AC +HGNV++
Subjt:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI

Query:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
         E  AE L ++EP+N  ++ LL  IY +AGR  +  + R ++ ++G+
Subjt:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 5.5e-80 | identity: 35.87%
Query:  FSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM------EE
        F   ++ C  L  I  G + H+       + ++ VSS L+ +Y++ G +E+A +VFDE+ KRN+   +W S+I GY   G   DA++L+  +      ++
Subjt:  FSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQM------EE

Query:  EGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGD--IVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ
        + +  D      V+ AC  + +  + E++H  +++ GF   V V N L+D Y+K G+  +  ARK+FDQIV KD VS+NS+++ Y + G+  EA ++F +
Subjt:  EGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGD--IVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQ

Query:  MIREGYES-DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF
        +++    + +++ LST+L   S+  +L+    IH  VIR G+E D+ +  S+I MY  CG+++ A+  F +M  +++ SW ++I+    H + ++AL  F
Subjt:  MIREGYES-DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYF

Query:  EVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIA
          M   GV P+ +TFVS+L+ C+H GL  EG + ++ MKG++G+ P +EHY CMV+L GRAG +++AY ++  R++++    +W +LL AC +H NV++A
Subjt:  EVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIA

Query:  EVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
        E++  RLFEL+  N   + LL  IY +AGR +D +RVR++M  RGL
Subjt:  EVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

AT3G46790.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 2.7e-87 | identity: 37.53%
Query:  EIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVE
        + +  L+  C    ++    R+HR I  N   ++  +++KL+ +Y+  G ++ A +VFD+  KR +  + WN+L       G  E+ L LY++M   GVE
Subjt:  EIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVE

Query:  PDHFTFPRVLKACGG----IGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMI
         D FT+  VLKAC      +  ++ G+ +H H+ R G++  V+++  LVDMY++ G +  A  VF  +  ++ VSW++M+  Y ++G   EAL  F +M+
Subjt:  PDHFTFPRVLKACGG----IGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMI

Query:  REGYES--DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYFE
        RE  +S  +SV + ++L   +++++L+    IHG+++R G++  L + ++L+ MY  CGK++  + +F +M  RD+VSWNS+IS+   H    +A+  FE
Subjt:  REGYES--DSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTYFE

Query:  VMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAE
         M + G  P  VTFVS+L  C+H GLV+EG +L+  M   +GI+P +EHYACMV+L GRA  ++EA ++V   +  E GP VWG+LL +C +HGNV++AE
Subjt:  VMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAE

Query:  VAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
         A+ RLF LEP N  N+ LL  IY  A   ++ KRV+ ++  RGL
Subjt:  VAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 5.0e-81; %identity: 36.24)
Query:  NGVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQ
        +G+ ID     S+   C   R I  G  +H +       R     + LL +Y+  G +++A  VF EM  R  S  ++ S+I+GYA  G   +A+ L+ +
Subjt:  NGVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQ

Query:  MEEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFD
        MEEEG+ PD +T   VL  C     +  G+ VH  I  +    D+FV NAL+DMY+KCG +  A  VF ++  KD +SWN+++ GY+++    EAL +F+
Subjt:  MEEEGVEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFD

Query:  QMIRE-GYESDSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY
         ++ E  +  D   ++ +L   +++S+      IHG+++R+G   D  +ANSL+ MYA CG +  A  LF  +  +D+VSW  +I+    H    EA+  
Subjt:  QMIRE-GYESDSVALSTIL---SNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISA---HFNTSEALTY

Query:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI
        F  M   G+  D ++FVSLL  C+H GLV EG + +++M+ +  I PT+EHYAC+V++  R G + +AYR +   + I    T+WGALL  C +H +V +
Subjt:  FEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDI

Query:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL
        AE  AE++FELEP+N   + L+  IY  A + E  KR+R  + +RGL
Subjt:  AEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMMVERGL

AT4G25270.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.4e-184; %identity: 61.84)
Query:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPI-DAEIFSSLLETCYQLRAIDHGFRIHR
        SS  KK  +  ++L Q +  + +     T LSF K S TPLLI  +   +T+++ALD+V+ DLE S   G+ + + EIF+SLLETCY LRAIDHG R+H 
Subjt:  SSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDLEASVDNGVPI-DAEIFSSLLETCYQLRAIDHGFRIHR

Query:  LIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSILIGEA
        LIP  LLR N+G+SSKL+RLYAS GY E AH+VFD M KR+ S FAWNSLISGYAELG YEDA+ALYFQM E+GV+PD FTFPRVLKACGGIGS+ IGEA
Subjt:  LIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEGVEPDHFTFPRVLKACGGIGSILIGEA

Query:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVALSTILSNISSLKFKLHIH
        +HR +V+ GF  DV+VLNALV MY+KCGDIV+AR VFD I  KD VSWNSMLTGY  HGLL EALDIF  M++ G E D VA+S++L+ + S K    +H
Subjt:  VHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVALSTILSNISSLKFKLHIH

Query:  GWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSL
        GWVIR G+EW+LS+AN+LIV+Y+  G++ +A ++F QM +RD VSWN+IISAH   S  L YFE M      PD +TFVS+LS CA+ G+V++G +L+SL
Subjt:  GWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTCAHLGLVKEGGKLYSL

Query:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRV
        M  +YGI P MEHYACMVNLYGRAG++EEAY ++   + +EAGPTVWGALLYACYLHGN DI EVAA+RLFELEPDNE NFELL++IY  A R ED +RV
Subjt:  MKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRV

Query:  RLMMVERGLDS
        R MMV+RGL++
Subjt:  RLMMVERGLDS
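
In BLAST-style pairwise output, the middle line of each block repeats the residue where query and hit agree, shows `+` for a conservative substitution, and is blank otherwise. As a rough sketch, the identity of a single block can be estimated from the query line and the middle line alone. The function name `midline_identity` is hypothetical, and the value for one block will generally differ from the whole-alignment %identity reported for the hit.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns where the midline repeats the query residue.

    Assumes both strings cover the same columns (leading labels such as
    'Query:' and the matching indentation already stripped).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # A column counts as identical only when the midline shows the same letter;
    # '+' (similar residue) and ' ' (mismatch or gap) do not count.
    matches = sum(1 for q, m in zip(query, midline) if q.isalpha() and q == m)
    return 100.0 * matches / len(query)


# Last block of the AT4G25270.1 alignment (query line and middle line).
last_query = "RLMMVERGLDS"
last_midline = "R MMV+RGL++"
last_block_identity = midline_identity(last_query, last_midline)
```

Averaging per-block values is not equivalent to the overall figure; for that, concatenate all blocks of one hit before calling the function.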


Sequences
CDS sequence
ATGCTGTGTGGCCTGCGACAATGGATCTATGAAACCCCAAATGACCGCTTTGGAACCGCCATTTTCCAACGGTTTCATTGCCCAATGCTGATATCTTTGCAACGGTCAAT
TTCACCCTTAAATTCGCTTCTTCTCTTCTGTTCTTCCAAACCCAAGAAATCCAAGAAAGAAAGAAGAAAACTTCTTCAGGAAAAACTGATTCGCATTAGCAAAGCCAAAG
AGGCAACTGATCTCTCTTTCCCTAAATCCTCATCAACCCCACTCTTAATTCACCACAAACCCTTCTCCCAAACCAAAATTCAAGCCCTTGACGCTGTTGTCAATGACCTC
GAAGCTTCCGTCGACAATGGCGTCCCTATTGATGCTGAAATTTTCTCTTCCCTTTTGGAAACTTGTTACCAATTGCGAGCCATTGACCATGGTTTTCGGATTCATCGCCT
TATACCCACCAATCTTTTACGCAGAAATGTGGGTGTTTCTTCCAAGCTTCTTCGTCTCTATGCTTCTTTTGGGTACATAGAGAATGCACACCAGGTGTTTGATGAAATGG
GTAAACGAAATCTCTCTGCTTTTGCTTGGAATTCTCTTATTTCTGGCTATGCTGAACTTGGTTTTTATGAAGATGCTCTGGCTCTCTACTTTCAAATGGAGGAAGAAGGT
GTTGAACCTGACCACTTCACTTTTCCTCGTGTACTCAAGGCCTGTGGCGGCATTGGGTCGATTCTAATTGGGGAGGCCGTGCATCGGCATATCGTTCGTTCGGGCTTTGC
TGGAGATGTCTTTGTCCTCAATGCTCTAGTCGACATGTATTCCAAATGTGGTGACATTGTGAGGGCTAGGAAAGTGTTTGATCAGATTGTTTGTAAGGATACAGTTTCCT
GGAACTCGATGCTTACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCGAGAAGGGTACGAGTCGGATTCGGTTGCTTTGTCC
ACCATTCTTTCTAACATTTCTTCGTTGAAATTCAAGTTACACATCCATGGATGGGTGATTCGACACGGAGTCGAATGGGATTTGTCCATTGCTAACTCCTTGATTGTCAT
GTATGCCAACTGTGGTAAGATTGACAGAGCAAAATGGCTGTTCCAGCAGATGCCTCAAAGAGACATAGTCTCATGGAACTCCATAATCTCTGCTCATTTCAATACCTCAG
AAGCTTTGACATATTTTGAAGTGATGGAGAGCCTTGGTGTTTTGCCAGATAGTGTAACATTTGTATCATTGTTGTCAACTTGTGCTCATCTGGGCTTGGTAAAGGAAGGG
GGAAAGTTATATTCCCTGATGAAAGGGAAGTATGGAATAAGACCAACCATGGAGCATTACGCTTGTATGGTGAATCTTTATGGGAGGGCAGGGCTGATTGAAGAAGCTTA
TAGAATCGTAACAGGACGAATCGAGATCGAGGCTGGTCCGACCGTATGGGGGGCGCTGTTGTATGCCTGCTATCTTCACGGCAATGTAGATATCGCTGAGGTTGCTGCTG
AAAGACTTTTCGAATTGGAGCCGGATAACGAGCTCAATTTCGAGCTTCTAATGAAGATTTATGGCAATGCTGGGAGGTTGGAAGACGAGAAGAGAGTGAGATTAATGATG
GTGGAACGAGGATTAGATTCATAG
mRNA sequence
TTTGAACCCAAATTTTCTTGGCATAAGATTACGATGCTGTGTGGCCTGCGACAATGGATCTATGAAACCCCAAATGACCGCTTTGGAACCGCCATTTTCCAACGGTTTCA
TTGCCCAATGCTGATATCTTTGCAACGGTCAATTTCACCCTTAAATTCGCTTCTTCTCTTCTGTTCTTCCAAACCCAAGAAATCCAAGAAAGAAAGAAGAAAACTTCTTC
AGGAAAAACTGATTCGCATTAGCAAAGCCAAAGAGGCAACTGATCTCTCTTTCCCTAAATCCTCATCAACCCCACTCTTAATTCACCACAAACCCTTCTCCCAAACCAAA
ATTCAAGCCCTTGACGCTGTTGTCAATGACCTCGAAGCTTCCGTCGACAATGGCGTCCCTATTGATGCTGAAATTTTCTCTTCCCTTTTGGAAACTTGTTACCAATTGCG
AGCCATTGACCATGGTTTTCGGATTCATCGCCTTATACCCACCAATCTTTTACGCAGAAATGTGGGTGTTTCTTCCAAGCTTCTTCGTCTCTATGCTTCTTTTGGGTACA
TAGAGAATGCACACCAGGTGTTTGATGAAATGGGTAAACGAAATCTCTCTGCTTTTGCTTGGAATTCTCTTATTTCTGGCTATGCTGAACTTGGTTTTTATGAAGATGCT
CTGGCTCTCTACTTTCAAATGGAGGAAGAAGGTGTTGAACCTGACCACTTCACTTTTCCTCGTGTACTCAAGGCCTGTGGCGGCATTGGGTCGATTCTAATTGGGGAGGC
CGTGCATCGGCATATCGTTCGTTCGGGCTTTGCTGGAGATGTCTTTGTCCTCAATGCTCTAGTCGACATGTATTCCAAATGTGGTGACATTGTGAGGGCTAGGAAAGTGT
TTGATCAGATTGTTTGTAAGGATACAGTTTCCTGGAACTCGATGCTTACTGGTTACACACGCCATGGGCTTCTCTTGGAGGCATTGGACATCTTTGATCAAATGATTCGA
GAAGGGTACGAGTCGGATTCGGTTGCTTTGTCCACCATTCTTTCTAACATTTCTTCGTTGAAATTCAAGTTACACATCCATGGATGGGTGATTCGACACGGAGTCGAATG
GGATTTGTCCATTGCTAACTCCTTGATTGTCATGTATGCCAACTGTGGTAAGATTGACAGAGCAAAATGGCTGTTCCAGCAGATGCCTCAAAGAGACATAGTCTCATGGA
ACTCCATAATCTCTGCTCATTTCAATACCTCAGAAGCTTTGACATATTTTGAAGTGATGGAGAGCCTTGGTGTTTTGCCAGATAGTGTAACATTTGTATCATTGTTGTCA
ACTTGTGCTCATCTGGGCTTGGTAAAGGAAGGGGGAAAGTTATATTCCCTGATGAAAGGGAAGTATGGAATAAGACCAACCATGGAGCATTACGCTTGTATGGTGAATCT
TTATGGGAGGGCAGGGCTGATTGAAGAAGCTTATAGAATCGTAACAGGACGAATCGAGATCGAGGCTGGTCCGACCGTATGGGGGGCGCTGTTGTATGCCTGCTATCTTC
ACGGCAATGTAGATATCGCTGAGGTTGCTGCTGAAAGACTTTTCGAATTGGAGCCGGATAACGAGCTCAATTTCGAGCTTCTAATGAAGATTTATGGCAATGCTGGGAGG
TTGGAAGACGAGAAGAGAGTGAGATTAATGATGGTGGAACGAGGATTAGATTCATAGTGTGTTGATGATGTG
Protein sequence
MLCGLRQWIYETPNDRFGTAIFQRFHCPMLISLQRSISPLNSLLLFCSSKPKKSKKERRKLLQEKLIRISKAKEATDLSFPKSSSTPLLIHHKPFSQTKIQALDAVVNDL
EASVDNGVPIDAEIFSSLLETCYQLRAIDHGFRIHRLIPTNLLRRNVGVSSKLLRLYASFGYIENAHQVFDEMGKRNLSAFAWNSLISGYAELGFYEDALALYFQMEEEG
VEPDHFTFPRVLKACGGIGSILIGEAVHRHIVRSGFAGDVFVLNALVDMYSKCGDIVRARKVFDQIVCKDTVSWNSMLTGYTRHGLLLEALDIFDQMIREGYESDSVALS
TILSNISSLKFKLHIHGWVIRHGVEWDLSIANSLIVMYANCGKIDRAKWLFQQMPQRDIVSWNSIISAHFNTSEALTYFEVMESLGVLPDSVTFVSLLSTCAHLGLVKEG
GKLYSLMKGKYGIRPTMEHYACMVNLYGRAGLIEEAYRIVTGRIEIEAGPTVWGALLYACYLHGNVDIAEVAAERLFELEPDNELNFELLMKIYGNAGRLEDEKRVRLMM
VERGLDS
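
The CDS and protein sequences listed above are related by codon-by-codon translation. The following is a minimal sketch of that step, assuming the standard nuclear genetic code (the page does not state which table was used). The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped listings
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight and last three codons of the CDS listed for Tan0020323.
start = translate("ATGCTGTGTGGCCTGCGACAATGG")
end = translate("GATTCATAG")
```

Here `start` is `MLCGLRQW` and `end` is `DS`, matching the first and last residues of the protein sequence. Passing the full CDS (with or without its line breaks) should reproduce the whole protein.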