; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016655 (gene) of Snake gourd v1 genome

Gene IDTan0016655
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG01:39555319..39557622
RNA-Seq ExpressionTan0016655
SyntenyTan0016655
Gene Ontology termsGO:0007018 - microtubule-based movement (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0030424 - axon (cellular component)
GO:0030425 - dendrite (cellular component)
GO:0003777 - microtubule motor activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008017 - microtubule binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022159320.1 pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia]1.1e-25487.05Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MC L  KLTKY+LCSALNSCAK  NLF GLQIHAQIVKIG EENLFLNSALVDLY+KCNAIVDAKR+F  M+ HDQVSWTS+ISGLS+NG GSEAI MFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL
        +MLVTQ RPNCFTYAT+ISSCPTL+D+LQ+RLA LLHAHV+K GF+FSSFVISSTIDCYSKLGRI +AALLFYET  KDNIIFNSMISGYSQNL+GEEAL
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR  +L+PTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSL+DMYSKCG VDEAFFIFNQ VEKNSVLSTSMIMAFAQCGRGSDA
Subjt:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE  L EEGFLPDHVCFTAVLTACNHAG LDEAVEYFNKMGSEY L+PQIDHYACLIDLYARNGH+EKAKQL+EQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK
        VELGREVA RLIEMD SNAAPY+TLAHIYARAGLW Q+ DIRK MQQ++V+KS GWSWIEIDKK+HVFS GDATHPK +EIYSKLDQLDLDMK+AEHASK
Subjt:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK

Query:  AL
        AL
Subjt:  AL

XP_022948515.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata]3.2e-26289.57Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLTKYSLCSALNSCAK  NLFLGLQIHAQIVKIG E+NL+LNS LV+LYSKCNAIVDAKRIF HMK HDQVSWTSIISGLSQNGAG EAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA
         MLVTQ RPNCFTYAT+ISSCP+LKDEL I L TL HAHV+KLGF  FSSFVISS IDCYSKLGRIEEAALLFYE NVKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR  NL+PTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFES LTEEGFLPDHVCFTAVLTACNHAGLL+EAVEYFNKMGSEYRL+PQIDHYACLIDLYARNGHVEKAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS
        EVELGREVA RLIEMD  NAAPY+TLAHIYARAGLW QL DIR  MQQKRV+KSAGWSWIEIDKKAHVFSVGDATHPK  EIYSKL+QLDLDM+ AEHAS
Subjt:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS

Query:  KALEYVEF
        KALE+VEF
Subjt:  KALEYVEF

XP_022998877.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita maxima]5.1e-26088.98Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLTKYSLCSALNSCAK  NLFLGLQIHAQIVKIG E+NL+LNS LV+LYSKCNAIVDAKRIF HMK HDQVSWTSIISGLSQNGAG EAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA
         MLVTQ RPNCFTYAT+ISSCP+LKDEL   L TL HAHV+KLGF  FSSFVISS IDCYSKLGRIEEAALLFYE NVKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR  NL+ TDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFES LTEEGFLPDHVCFTAVLTACNHAGLL+EAVEYFNKMGSEYRL+PQIDHYACLIDLYARNGHVEKAK+LME++PYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS
        EVELGREVA RLIEMD SNAAPY+TLAHIYARAGLW QL DIRK MQQKRV+KSAGWSWIEIDKKAHVFSVGDATHPK  EIYSKL+QLDLDM+  EHAS
Subjt:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS

Query:  KALEYVEF
        KALE++EF
Subjt:  KALEYVEF

XP_023521522.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp. pepo]1.0e-26089.37Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLTKYSLCSALNSCAK  NLFLGLQIHAQIVKIG E+NL+LNSALV+LYSKCNAIVDAKRIF HMK HDQVSWTSIISGLSQNGAG EAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA
         MLVTQ RPNCFTYAT+ISSCP+LKDEL   L TL HAHV+KLGF  FSSFVISS IDCYSKLGRIEEAALLFYE NVKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR  NL+PTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFES LTEEGFLPDHVCFTAVLTACNHAGLL+EAVEYFNKMGSEYRL+PQIDHYACLIDLYARNGHV KAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS
         VELGREVA RLIEMD SNAAPY+TLAHIYARAGLW QL DIRK MQQKRV+KSAGWSWIEIDKKAHVFSVGDATHPK  EIYSKL+QLDLDM+  EHAS
Subjt:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS

Query:  KALEYVEF
        KALE+VEF
Subjt:  KALEYVEF

XP_038896055.1 pentatricopeptide repeat-containing protein At3g12770-like [Benincasa hispida]4.8e-26690.87Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLTKYSLCSALNSCAK HNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIV+AKR+FSHMK HDQVSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL
         MLVTQVRPNCFTYAT+ISSC TLKDELQI LATLLHAHV+KLGF+FS+FVISSTIDCYSKLGR++EAALLFYET VKDNIIFNSMISGYSQNLYGEEAL
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR  NL+PTDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFF+FNQT+EKNSVLSTSMIMAFAQCGRGS+A
Subjt:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFES LTEEGFLPDHVCF AVLTACNHAGLL+EAVEYFNKM  EYRL+PQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
Subjt:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK
        VELGREVA +LIEMD SNAAPY+TLAHIYARAGLW QL +IRK MQQKRV+KSAGWSWIEIDKK HVFSVGDA HPK  EIYSKLDQL+LDMKAAEHASK
Subjt:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK

Query:  ALEY
        ALEY
Subjt:  ALEY

TrEMBL top hitse value%identityAlignment
A0A1S3CML0 pentatricopeptide repeat-containing protein At2g13600-like1.5e-25486.9Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLT YSLCSAL+SCAK HNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIV+AKR+FS MK HDQVSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL
        KMLVTQVRPNCFTYAT+ISSCPTLK+ELQI LATLLHAHV+K GF+FSSFVISSTIDCYSKLGRI+EA+LLF ET+VKDNIIFNSMISGYSQNL GEEAL
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR  NL+PTDHTLTSVLNACG LTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGS+DEAF +FNQTV+KNSVLSTSMIMAFAQCGRG +A
Subjt:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE   TE+ F+PDH+CFTAVLTACNHAGLLDEAVEYFNKM  EY+L+PQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK
        VELGREVA RLIEMD  NAAPY+TLAHIYARAGLW Q+ +IRK MQQKRV+KSAGWSWIEIDKK HVFSVGDA HPK  EIYSKLDQL+LDMKAAE + K
Subjt:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK

Query:  ALEY
        ALEY
Subjt:  ALEY

A0A5A7UEE6 Pentatricopeptide repeat-containing protein1.5e-25486.9Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLT YSLCSAL+SCAK HNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIV+AKR+FS MK HDQVSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL
        KMLVTQVRPNCFTYAT+ISSCPTLK+ELQI LATLLHAHV+K GF+FSSFVISSTIDCYSKLGRI+EA+LLF ET+VKDNIIFNSMISGYSQNL GEEAL
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR  NL+PTDHTLTSVLNACG LTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGS+DEAF +FNQTV+KNSVLSTSMIMAFAQCGRG +A
Subjt:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE   TE+ F+PDH+CFTAVLTACNHAGLLDEAVEYFNKM  EY+L+PQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK
        VELGREVA RLIEMD  NAAPY+TLAHIYARAGLW Q+ +IRK MQQKRV+KSAGWSWIEIDKK HVFSVGDA HPK  EIYSKLDQL+LDMKAAE + K
Subjt:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK

Query:  ALEY
        ALEY
Subjt:  ALEY

A0A6J1DYD1 pentatricopeptide repeat-containing protein At2g13600-like1.2e-25487.05Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MC L  KLTKY+LCSALNSCAK  NLF GLQIHAQIVKIG EENLFLNSALVDLY+KCNAIVDAKR+F  M+ HDQVSWTS+ISGLS+NG GSEAI MFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL
        +MLVTQ RPNCFTYAT+ISSCPTL+D+LQ+RLA LLHAHV+K GF+FSSFVISSTIDCYSKLGRI +AALLFYET  KDNIIFNSMISGYSQNL+GEEAL
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR  +L+PTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSL+DMYSKCG VDEAFFIFNQ VEKNSVLSTSMIMAFAQCGRGSDA
Subjt:  KLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE  L EEGFLPDHVCFTAVLTACNHAG LDEAVEYFNKMGSEY L+PQIDHYACLIDLYARNGH+EKAKQL+EQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK
        VELGREVA RLIEMD SNAAPY+TLAHIYARAGLW Q+ DIRK MQQ++V+KS GWSWIEIDKK+HVFS GDATHPK +EIYSKLDQLDLDMK+AEHASK
Subjt:  VELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASK

Query:  AL
        AL
Subjt:  AL

A0A6J1GA23 pentatricopeptide repeat-containing protein At2g13600-like1.5e-26289.57Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLTKYSLCSALNSCAK  NLFLGLQIHAQIVKIG E+NL+LNS LV+LYSKCNAIVDAKRIF HMK HDQVSWTSIISGLSQNGAG EAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA
         MLVTQ RPNCFTYAT+ISSCP+LKDEL I L TL HAHV+KLGF  FSSFVISS IDCYSKLGRIEEAALLFYE NVKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR  NL+PTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFES LTEEGFLPDHVCFTAVLTACNHAGLL+EAVEYFNKMGSEYRL+PQIDHYACLIDLYARNGHVEKAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS
        EVELGREVA RLIEMD  NAAPY+TLAHIYARAGLW QL DIR  MQQKRV+KSAGWSWIEIDKKAHVFSVGDATHPK  EIYSKL+QLDLDM+ AEHAS
Subjt:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS

Query:  KALEYVEF
        KALE+VEF
Subjt:  KALEYVEF

A0A6J1K975 pentatricopeptide repeat-containing protein At2g13600-like2.5e-26088.98Show/hide
Query:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK
        MCSL  KLTKYSLCSALNSCAK  NLFLGLQIHAQIVKIG E+NL+LNS LV+LYSKCNAIVDAKRIF HMK HDQVSWTSIISGLSQNGAG EAILMFK
Subjt:  MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFK

Query:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA
         MLVTQ RPNCFTYAT+ISSCP+LKDEL   L TL HAHV+KLGF  FSSFVISS IDCYSKLGRIEEAALLFYE NVKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLVTQVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGF-SFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR  NL+ TDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFES LTEEGFLPDHVCFTAVLTACNHAGLL+EAVEYFNKMGSEYRL+PQIDHYACLIDLYARNGHVEKAK+LME++PYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS
        EVELGREVA RLIEMD SNAAPY+TLAHIYARAGLW QL DIRK MQQKRV+KSAGWSWIEIDKKAHVFSVGDATHPK  EIYSKL+QLDLDM+  EHAS
Subjt:  EVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHAS

Query:  KALEYVEF
        KALE++EF
Subjt:  KALEYVEF

SwissProt top hitse value%identityAlignment
Q5G1T1 Pentatricopeptide repeat-containing protein At3g49170, chloroplastic6.1e-9135.38Show/hide
Query:  KYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVDAKRIFSHMKAHDQVSWTSIISGLSQN-GAGSEAILMFKKMLVT
        K++L S  ++CA+  NL LG Q+H+  ++ GL ++  +  +LVD+Y+KC+A   + D +++F  M+ H  +SWT++I+G  +N    +EAI +F +M+  
Subjt:  KYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVDAKRIFSHMKAHDQVSWTSIISGLSQN-GAGSEAILMFKKMLVT

Query:  -QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFV
          V PN FT+++   +C  L D    R+   +     K G + +S V +S I  + K  R+E+A   F   + K+ + +N+ + G  +NL  E+A KL  
Subjt:  -QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFV

Query:  EMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        E+    L  +  T  S+L+   ++  + +G Q+HS V K+G   N  V  +L+ MYSKCGS+D A  +FN    +N +  TSMI  FA+ G     L+ F
Subjt:  EMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
         + + EEG  P+ V + A+L+AC+H GL+ E   +FN M  +++++P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH   ELG
Subjt:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK
        +  A +++E+D +  A YI L++IYA AG W +  ++R+ M+++ + K  G SWIE+  K H F VGD  HP  ++IY +LD+L  ++K
Subjt:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial2.5e-9234.38Show/hide
Query:  SLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNC
        SL     +CA    L  GLQI+   +K  L  ++ + +A +D+Y KC A+ +A R+F  M+  D VSW +II+   QNG G E + +F  ML +++ P+ 
Subjt:  SLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNC

Query:  FTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEA----ALLFYETNVKDN----------------IIFNSMISGYS
        FT+ +I+ +C        +     +H+ +VK G + +S V  S ID YSK G IEEA    +  F   NV                   + +NS+ISGY 
Subjt:  FTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEA----ALLFYETNVKDN----------------IIFNSMISGYS

Query:  QNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAF
             E+A  LF  M    + P   T  +VL+ C +L     G+Q+H+ V K   +++V++  +L+DMYSKCG + ++  +F +++ ++ V   +MI  +
Subjt:  QNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAF

Query:  AQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSL
        A  G+G +A++LFE  +  E   P+HV F ++L AC H GL+D+ +EYF  M  +Y L+PQ+ HY+ ++D+  ++G V++A +L+ +MP+E++ V+W +L
Subjt:  AQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSL

Query:  LGACKVH-EEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDL
        LG C +H   VE+  E    L+ +D  +++ Y  L+++YA AG+W ++ D+R+ M+  ++KK  G SW+E+  + HVF VGD  HP++ EIY +L  +  
Subjt:  LGACKVH-EEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDL

Query:  DMKAAEHAS
        +MK  + +S
Subjt:  DMKAAEHAS

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127701.3e-8833.19Show/hide
Query:  LNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQ--VSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNCFTY
        L +C+   +L +G  +HAQ+ ++G + ++F+ + L+ LY+KC  +  A+ +F  +   ++  VSWT+I+S  +QNG   EA+ +F +M    V+P+    
Subjt:  LNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQ--VSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNCFTY

Query:  ATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVEMRGGNLNPT
         +++++   L+D   ++    +HA VVK+G      ++ S    Y+K G++  A +LF +    + I++N+MISGY++N Y  EA+ +F EM   ++ P 
Subjt:  ATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVEMRGGNLNPT

Query:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFL
          ++TS ++AC  +  LEQ R ++  V +    ++VF+  +L+DM++KCGSV+ A  +F++T++++ V+ ++MI+ +   GR  +A+ L+  ++   G  
Subjt:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFL

Query:  PDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEM
        P+ V F  +L ACNH+G++ E   +FN+M +++++ PQ  HYAC+IDL  R GH+++A ++++ MP +    +W +LL ACK H  VELG   A +L  +
Subjt:  PDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEM

Query:  DTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK
        D SN   Y+ L+++YA A LW ++ ++R  M++K + K  G SW+E+  +   F VGD +HP++ EI  +++ ++  +K
Subjt:  DTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136001.8e-9836.83Show/hide
Query:  LTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQV
        L +YS  S L++C+  +++  G+Q+H+ I K     ++++ SALVD+YSKC  + DA+R+F  M   + VSW S+I+   QNG   EA+ +F+ ML ++V
Subjt:  LTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQV

Query:  RPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISST-IDCYSKLGRIEEAALLFYETNVKD---------------------------
         P+  T A++IS+C +L     I++   +H  VVK     +  ++S+  +D Y+K  RI+EA  +F    +++                           
Subjt:  RPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISST-IDCYSKLGRIEEAALLFYETNVKD---------------------------

Query:  ---NII-FNSMISGYSQNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSVDEA
           N++ +N++I+GY+QN   EEAL LF  ++  ++ PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG V+E 
Subjt:  ---NII-FNSMISGYSQNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSVDEA

Query:  FFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHV
        + +F + +E++ V   +MI+ FAQ G G++AL+LF   L E G  PDH+    VL+AC HAG ++E   YF+ M  ++ + P  DHY C++DL  R G +
Subjt:  FFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHV

Query:  EKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFS
        E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L+E++ SN+ PY+ L+++YA  G W  ++++RK M+++ V K  G SWI+I    HVF 
Subjt:  EKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFS

Query:  VGDATHPKFYEIYSKLDQLDLDMK
        V D +HP+  +I+S LD L  +M+
Subjt:  VGDATHPKFYEIYSKLDQLDLDMK

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276105.2e-9036.2Show/hide
Query:  KLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMK-AHDQVSWTSIISGLSQNGAGSEAILMFKKMLVT
        +L++ S  S +  CA    L    Q+H  +VK G   +  + +AL+  YSKC A++DA R+F  +    + VSWT++ISG  QN    EA+ +F +M   
Subjt:  KLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMK-AHDQVSWTSIISGLSQNGAGSEAILMFKKMLVT

Query:  QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+ I+++ P +         + +HA VVK  +  SS V ++ +D Y KLG++EEAA +F   + KD + +++M++GY+Q    E A+K+F E
Subjt:  QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRGGNLNPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        +  G + P + T +S+LN C +    + QG+Q H    K   ++++ V  +LL MY+K G+++ A  +F +  EK+ V   SMI  +AQ G+   AL +F
Subjt:  MRGGNLNPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
        +  + +     D V F  V  AC HAGL++E  +YF+ M  + ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK
        R  A ++I M   ++A Y+ L+++YA +G W +   +RK M ++ VKK  G+SWIE+  K + F  GD +HP   +IY KL+ L   +K
Subjt:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK

Arabidopsis top hitse value%identityAlignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-9936.83Show/hide
Query:  LTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQV
        L +YS  S L++C+  +++  G+Q+H+ I K     ++++ SALVD+YSKC  + DA+R+F  M   + VSW S+I+   QNG   EA+ +F+ ML ++V
Subjt:  LTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQV

Query:  RPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISST-IDCYSKLGRIEEAALLFYETNVKD---------------------------
         P+  T A++IS+C +L     I++   +H  VVK     +  ++S+  +D Y+K  RI+EA  +F    +++                           
Subjt:  RPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISST-IDCYSKLGRIEEAALLFYETNVKD---------------------------

Query:  ---NII-FNSMISGYSQNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSVDEA
           N++ +N++I+GY+QN   EEAL LF  ++  ++ PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG V+E 
Subjt:  ---NII-FNSMISGYSQNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSVDEA

Query:  FFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHV
        + +F + +E++ V   +MI+ FAQ G G++AL+LF   L E G  PDH+    VL+AC HAG ++E   YF+ M  ++ + P  DHY C++DL  R G +
Subjt:  FFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHV

Query:  EKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFS
        E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L+E++ SN+ PY+ L+++YA  G W  ++++RK M+++ V K  G SWI+I    HVF 
Subjt:  EKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFS

Query:  VGDATHPKFYEIYSKLDQLDLDMK
        V D +HP+  +I+S LD L  +M+
Subjt:  VGDATHPKFYEIYSKLDQLDLDMK

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-9136.2Show/hide
Query:  KLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMK-AHDQVSWTSIISGLSQNGAGSEAILMFKKMLVT
        +L++ S  S +  CA    L    Q+H  +VK G   +  + +AL+  YSKC A++DA R+F  +    + VSWT++ISG  QN    EA+ +F +M   
Subjt:  KLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMK-AHDQVSWTSIISGLSQNGAGSEAILMFKKMLVT

Query:  QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+ I+++ P +         + +HA VVK  +  SS V ++ +D Y KLG++EEAA +F   + KD + +++M++GY+Q    E A+K+F E
Subjt:  QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRGGNLNPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        +  G + P + T +S+LN C +    + QG+Q H    K   ++++ V  +LL MY+K G+++ A  +F +  EK+ V   SMI  +AQ G+   AL +F
Subjt:  MRGGNLNPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
        +  + +     D V F  V  AC HAGL++E  +YF+ M  + ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK
        R  A ++I M   ++A Y+ L+++YA +G W +   +RK M ++ VKK  G+SWIE+  K + F  GD +HP   +IY KL+ L   +K
Subjt:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-9334.38Show/hide
Query:  SLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNC
        SL     +CA    L  GLQI+   +K  L  ++ + +A +D+Y KC A+ +A R+F  M+  D VSW +II+   QNG G E + +F  ML +++ P+ 
Subjt:  SLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNC

Query:  FTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEA----ALLFYETNVKDN----------------IIFNSMISGYS
        FT+ +I+ +C        +     +H+ +VK G + +S V  S ID YSK G IEEA    +  F   NV                   + +NS+ISGY 
Subjt:  FTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEA----ALLFYETNVKDN----------------IIFNSMISGYS

Query:  QNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAF
             E+A  LF  M    + P   T  +VL+ C +L     G+Q+H+ V K   +++V++  +L+DMYSKCG + ++  +F +++ ++ V   +MI  +
Subjt:  QNLYGEEALKLFVEMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAF

Query:  AQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSL
        A  G+G +A++LFE  +  E   P+HV F ++L AC H GL+D+ +EYF  M  +Y L+PQ+ HY+ ++D+  ++G V++A +L+ +MP+E++ V+W +L
Subjt:  AQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSL

Query:  LGACKVH-EEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDL
        LG C +H   VE+  E    L+ +D  +++ Y  L+++YA AG+W ++ D+R+ M+  ++KK  G SW+E+  + HVF VGD  HP++ EIY +L  +  
Subjt:  LGACKVH-EEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDL

Query:  DMKAAEHAS
        +MK  + +S
Subjt:  DMKAAEHAS

AT3G12770.1 mitochondrial editing factor 229.1e-9033.19Show/hide
Query:  LNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQ--VSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNCFTY
        L +C+   +L +G  +HAQ+ ++G + ++F+ + L+ LY+KC  +  A+ +F  +   ++  VSWT+I+S  +QNG   EA+ +F +M    V+P+    
Subjt:  LNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQ--VSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPNCFTY

Query:  ATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVEMRGGNLNPT
         +++++   L+D   ++    +HA VVK+G      ++ S    Y+K G++  A +LF +    + I++N+MISGY++N Y  EA+ +F EM   ++ P 
Subjt:  ATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVEMRGGNLNPT

Query:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFL
          ++TS ++AC  +  LEQ R ++  V +    ++VF+  +L+DM++KCGSV+ A  +F++T++++ V+ ++MI+ +   GR  +A+ L+  ++   G  
Subjt:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFL

Query:  PDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEM
        P+ V F  +L ACNH+G++ E   +FN+M +++++ PQ  HYAC+IDL  R GH+++A ++++ MP +    +W +LL ACK H  VELG   A +L  +
Subjt:  PDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEM

Query:  DTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK
        D SN   Y+ L+++YA A LW ++ ++R  M++K + K  G SW+E+  +   F VGD +HP++ EI  +++ ++  +K
Subjt:  DTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK

AT3G49170.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.4e-9235.38Show/hide
Query:  KYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVDAKRIFSHMKAHDQVSWTSIISGLSQN-GAGSEAILMFKKMLVT
        K++L S  ++CA+  NL LG Q+H+  ++ GL ++  +  +LVD+Y+KC+A   + D +++F  M+ H  +SWT++I+G  +N    +EAI +F +M+  
Subjt:  KYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVDAKRIFSHMKAHDQVSWTSIISGLSQN-GAGSEAILMFKKMLVT

Query:  -QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFV
          V PN FT+++   +C  L D    R+   +     K G + +S V +S I  + K  R+E+A   F   + K+ + +N+ + G  +NL  E+A KL  
Subjt:  -QVRPNCFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFV

Query:  EMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        E+    L  +  T  S+L+   ++  + +G Q+HS V K+G   N  V  +L+ MYSKCGS+D A  +FN    +N +  TSMI  FA+ G     L+ F
Subjt:  EMRGGNLNPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
         + + EEG  P+ V + A+L+AC+H GL+ E   +FN M  +++++P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH   ELG
Subjt:  ESSLTEEGFLPDHVCFTAVLTACNHAGLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK
        +  A +++E+D +  A YI L++IYA AG W +  ++R+ M+++ + K  G SWIE+  K H F VGD  HP  ++IY +LD+L  ++K
Subjt:  REVACRLIEMDTSNAAPYITLAHIYARAGLWAQLIDIRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCAGCTTAGCCACAAAATTGACCAAATATTCTCTTTGCTCTGCTCTTAATTCCTGTGCTAAAGCACATAATTTGTTTTTGGGTTTGCAAATTCATGCCCAGATTGT
CAAAATTGGACTTGAAGAGAACTTATTTTTGAACAGTGCACTTGTTGATTTATACTCCAAATGTAATGCCATTGTGGATGCAAAGAGGATCTTCTCACATATGAAAGCTC
ATGACCAAGTTTCTTGGACCTCTATAATATCTGGGCTCTCCCAAAATGGGGCTGGGAGCGAAGCCATCTTGATGTTTAAGAAAATGTTGGTAACTCAGGTTAGACCCAAC
TGTTTTACTTATGCCACTATTATTAGTTCATGTCCAACTCTGAAGGATGAACTTCAGATTCGTCTTGCAACTTTGCTTCATGCTCATGTTGTCAAACTTGGTTTTAGTTT
TAGCAGTTTTGTAATTAGCTCCACTATTGATTGTTACTCAAAACTAGGAAGAATAGAAGAAGCTGCTCTGCTCTTTTATGAGACAAATGTGAAGGATAATATCATATTTA
ATTCTATGATATCAGGGTATTCTCAAAACTTGTATGGGGAAGAGGCATTAAAACTGTTTGTAGAAATGAGAGGTGGTAATTTGAACCCAACTGATCATACATTAACTAGT
GTCTTAAATGCTTGTGGGAGTCTAACAGTACTTGAACAAGGAAGGCAAGTGCACTCTCTAGTTACAAAAATGGGATCGGAAAATAATGTGTTTGTGGTCTGTTCTTTGCT
AGACATGTACTCAAAATGTGGCAGTGTCGATGAGGCATTTTTCATATTCAATCAGACGGTTGAAAAGAACAGTGTGTTGTCAACTTCGATGATAATGGCTTTTGCTCAAT
GTGGTAGAGGCTCAGATGCCTTAAAGCTCTTTGAGAGTTCGTTGACTGAAGAAGGTTTCTTGCCTGATCATGTCTGTTTTACTGCAGTTTTAACTGCCTGCAATCATGCA
GGATTACTAGATGAGGCAGTTGAATATTTCAATAAAATGGGCAGTGAATACAGATTAGAGCCTCAAATTGATCATTATGCTTGTTTGATTGATCTCTATGCCAGAAATGG
GCATGTAGAAAAAGCTAAGCAATTGATGGAGCAAATGCCTTACGAGTCTAATTACGTAATGTGGTGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTCG
GGAGGGAGGTCGCATGTCGACTCATCGAAATGGATACAAGTAATGCTGCACCCTATATAACACTTGCTCATATCTATGCTAGAGCTGGTTTATGGGCACAGTTGATTGAT
ATTAGAAAATATATGCAACAAAAAAGGGTAAAGAAAAGTGCTGGGTGGAGCTGGATTGAGATAGATAAGAAAGCTCATGTCTTCTCAGTTGGTGATGCTACTCATCCTAA
ATTTTATGAGATTTATTCAAAACTTGACCAACTGGACTTGGATATGAAAGCAGCTGAACATGCATCAAAAGCACTTGAATATGTTGAGTTTTAA
mRNA sequenceShow/hide mRNA sequence
CTAAAGGGGGGTCCTTTACAAATTACACTGAACCACCCACGACTCAGCCCACCCAGCGGCCGGATCAGCCACAGCACAGCGGTCCGACGATAGGCTCATTTGGTTCGGCC
GAAACCCTCATATTTCTCCTCTCCTCTCCTCTCTCCTTGGCCGTCACCAATGAAGAAGAAGAATCAGTAGAAAGCGATCACCGAAAAATCCTAATTACTCGAAGTCTCGA
ACTGCACTGCTCTTCGGCAGGGGAAAATTCCAGTGACTCGAAGTTTTTCTCGGAGTACGAGTAATCAACTGATGTTGAGTTTTGTGATTTCCGTTGGAATTTGAGTTCTC
TATGATTTTTTTGCTATGCTTCGACCTTGTTTCGGCCGCGATCGCCTATTTTTCTTTCTTTGGCCTGTTTATCATTCCCAGCGGGCCGAAATCGAGGGAATTCAAGGCAA
CGACTGGATTGCAAATGTTGCGAGTATCTTGCTATGATACGATCAACGTTCATCGGAACAAAAATCTCAGCGAAAGAGATGTCAGAATGTTACTAATTAGGAGATGAAAT
TTGGCAACTGTGGTCATGCCTCTGAAGCATTGGGTTACTTTGCAGAATGTGCAGCTTAGCCACAAAATTGACCAAATATTCTCTTTGCTCTGCTCTTAATTCCTGTGCTA
AAGCACATAATTTGTTTTTGGGTTTGCAAATTCATGCCCAGATTGTCAAAATTGGACTTGAAGAGAACTTATTTTTGAACAGTGCACTTGTTGATTTATACTCCAAATGT
AATGCCATTGTGGATGCAAAGAGGATCTTCTCACATATGAAAGCTCATGACCAAGTTTCTTGGACCTCTATAATATCTGGGCTCTCCCAAAATGGGGCTGGGAGCGAAGC
CATCTTGATGTTTAAGAAAATGTTGGTAACTCAGGTTAGACCCAACTGTTTTACTTATGCCACTATTATTAGTTCATGTCCAACTCTGAAGGATGAACTTCAGATTCGTC
TTGCAACTTTGCTTCATGCTCATGTTGTCAAACTTGGTTTTAGTTTTAGCAGTTTTGTAATTAGCTCCACTATTGATTGTTACTCAAAACTAGGAAGAATAGAAGAAGCT
GCTCTGCTCTTTTATGAGACAAATGTGAAGGATAATATCATATTTAATTCTATGATATCAGGGTATTCTCAAAACTTGTATGGGGAAGAGGCATTAAAACTGTTTGTAGA
AATGAGAGGTGGTAATTTGAACCCAACTGATCATACATTAACTAGTGTCTTAAATGCTTGTGGGAGTCTAACAGTACTTGAACAAGGAAGGCAAGTGCACTCTCTAGTTA
CAAAAATGGGATCGGAAAATAATGTGTTTGTGGTCTGTTCTTTGCTAGACATGTACTCAAAATGTGGCAGTGTCGATGAGGCATTTTTCATATTCAATCAGACGGTTGAA
AAGAACAGTGTGTTGTCAACTTCGATGATAATGGCTTTTGCTCAATGTGGTAGAGGCTCAGATGCCTTAAAGCTCTTTGAGAGTTCGTTGACTGAAGAAGGTTTCTTGCC
TGATCATGTCTGTTTTACTGCAGTTTTAACTGCCTGCAATCATGCAGGATTACTAGATGAGGCAGTTGAATATTTCAATAAAATGGGCAGTGAATACAGATTAGAGCCTC
AAATTGATCATTATGCTTGTTTGATTGATCTCTATGCCAGAAATGGGCATGTAGAAAAAGCTAAGCAATTGATGGAGCAAATGCCTTACGAGTCTAATTACGTAATGTGG
TGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTCGGGAGGGAGGTCGCATGTCGACTCATCGAAATGGATACAAGTAATGCTGCACCCTATATAACACT
TGCTCATATCTATGCTAGAGCTGGTTTATGGGCACAGTTGATTGATATTAGAAAATATATGCAACAAAAAAGGGTAAAGAAAAGTGCTGGGTGGAGCTGGATTGAGATAG
ATAAGAAAGCTCATGTCTTCTCAGTTGGTGATGCTACTCATCCTAAATTTTATGAGATTTATTCAAAACTTGACCAACTGGACTTGGATATGAAAGCAGCTGAACATGCA
TCAAAAGCACTTGAATATGTTGAGTTTTAATACCAATGATGGAATAGTGTTATTGTTTGAAGTTCTGATAGACTAGTTGGAAATGAGAATCTTGTTTTAAATGAGTGTCC
AAGCGATTCATGTTTGCAATGAATGAGTGAGGAACCATGAAATTTGTTTAGTAGTTGAATTATTTTTGTAAGGCATATATGATGAATAAATTATTCAATCTGTT
Protein sequenceShow/hide protein sequence
MCSLATKLTKYSLCSALNSCAKAHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVDAKRIFSHMKAHDQVSWTSIISGLSQNGAGSEAILMFKKMLVTQVRPN
CFTYATIISSCPTLKDELQIRLATLLHAHVVKLGFSFSSFVISSTIDCYSKLGRIEEAALLFYETNVKDNIIFNSMISGYSQNLYGEEALKLFVEMRGGNLNPTDHTLTS
VLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESSLTEEGFLPDHVCFTAVLTACNHA
GLLDEAVEYFNKMGSEYRLEPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVACRLIEMDTSNAAPYITLAHIYARAGLWAQLID
IRKYMQQKRVKKSAGWSWIEIDKKAHVFSVGDATHPKFYEIYSKLDQLDLDMKAAEHASKALEYVEF