CuGenDBv2

Sgr020964 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020964
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153577:1058819..1060342
RNA-Seq Expression: Sgr020964
Synteny: Sgr020964
Gene Ontology terms:
GO:0007018 - microtubule-based movement (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0030424 - axon (cellular component)
GO:0030425 - dendrite (cellular component)
GO:0003777 - microtubule motor activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008017 - microtubule binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_022159320.1 pentatricopeptide repeat-containing protein At2g13600-like [Momordica charantia] | e value: 6.2e-258 | % identity: 88.84
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MC LGAKLTKY+LCSALNSCAKT NLF GLQIHAQIVKIG EENLFLNSALVDLYAKCNAIVDAKRVF  M+THD VSWTS+ISGLS+NGRGSEAI MFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL
        +ML T  RPNCFTYAT ISSCPTLEDDLQ+RLA LLHAHVIK GFT SSFVISSTIDCYSKLGRI +AALLF+ET  KDNIIFNSMISGYSQNL+GEEAL
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMRSS LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSL+DMYSKCG VDE FFIFNQ VEKNSVLSTSMIMAFAQCGRGSDA
Subjt:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE
        LKLFE LL EEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGS+Y LDPQIDHYACLIDLYARNGH+EKAKQL+EQMPYESN+VMWCSLLGACKVH E
Subjt:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE

Query:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK
        VELGREVAY+L EMDPSNAAPYVTLA IYARAGLW +V DIRK+MQQ+ VRK+ GWSWIEIDKK+HVFS GDATHPKS EIYSKLDQLDLDMK+AE+ASK
Subjt:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK

Query:  AL
        AL
Subjt:  AL

XP_022948515.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata] | e value: 2.5e-259 | % identity: 87.99
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIGFE+NL+LNS LV+LY+KCNAIVDAKR+F HMKTHD VSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA
         ML T  RPNCFTYAT ISSCP+L+D+L I L TL HAHVIKLGF   SSFVISS IDCYSKLGRIEEAALLF+E  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR+S+LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+ F IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE
        ALKLFE LLTEEGFLPDHVCFTAVLTACNHAG L+EAVEYFNKMGS+YRLDPQIDHYACLIDLYARNGHVEKAK+LME+MPYESN+VMWCSLLGACKVH 
Subjt:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE

Query:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS
        EVELGREVAY+L EMDP NAAPYVTLA IYARAGLW ++ADIR QMQQK VRK+AGWSWIEIDKKAHVFSVGDATHPKSCEIYSKL+QLDLDM+ AE+AS
Subjt:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS

Query:  KALKYVEF
        KAL++VEF
Subjt:  KALKYVEF

XP_022998877.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita maxima] | e value: 4.0e-257 | % identity: 87.4
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIGFE+NL+LNS LV+LY+KCNAIVDAKR+F HMKTHD VSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA
         ML T  RPNCFTYAT ISSCP+L+D+L   L TL HAHVIKLGF   SSFVISS IDCYSKLGRIEEAALLF+E  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR+S+LS TDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+ F IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE
        ALKLFE LLTEEGFLPDHVCFTAVLTACNHAG L+EAVEYFNKMGS+YRLDPQIDHYACLIDLYARNGHVEKAK+LME++PYESN+VMWCSLLGACKVH 
Subjt:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE

Query:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS
        EVELGREVAY+L EMDPSNAAPYVTLA IYARAGLW ++ADIRKQMQQK VRK+AGWSWIEIDKKAHVFSVGDATHPKSCEIYSKL+QLDLDM+  E+AS
Subjt:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS

Query:  KALKYVEF
        KAL+++EF
Subjt:  KALKYVEF

XP_023521522.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp. pepo] | e value: 2.8e-258 | % identity: 87.99
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIGFE+NL+LNSALV+LY+KCNAIVDAKR+F HMKTHD VSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTL-SSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA
         ML T  RPNCFTYAT ISSCP+L+D+L   L TL HAHVIKLGF L SSFVISS IDCYSKLGRIEEAALLF+E  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTL-SSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR+S+LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+ F IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE
        ALKLFE LLTEEGFLPDHVCFTAVLTACNHAG L+EAVEYFNKMGS+YRLDPQIDHYACLIDLYARNGHV KAK+LME+MPYESN+VMWCSLLGACKVH 
Subjt:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE

Query:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS
         VELGREVAY+L EMDPSNAAPYVTLA IYARAGLW ++ADIRKQMQQK VRK+AGWSWIEIDKKAHVFSVGDATHPKSCEIYSKL+QLDLDM+  E+AS
Subjt:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS

Query:  KALKYVEF
        KAL++VEF
Subjt:  KALKYVEF

XP_038896055.1 pentatricopeptide repeat-containing protein At3g12770-like [Benincasa hispida] | e value: 9.0e-265 | % identity: 90.28
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIGFEENLFLNS+LVDLY+KCNAIV+AKRVFSHMKTHD VSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL
         ML T VRPNCFTYAT ISSC TL+D+LQI LATLLHAHVIKLGFT S+FVISSTIDCYSKLGR++EAALLF+ETTVKDNIIFNSMISGYSQNLYGEEAL
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR+S+LSPTDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDE FF+FNQT+EKNSVLSTSMIMAFAQCGRGS+A
Subjt:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE
        LKLFE LLTEEGFLPDHVCF AVLTACNHAG L+EAVEYFNKM  +YRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESN+VMWCSLLGACKVHEE
Subjt:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE

Query:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK
        VELGREVAYQL EMDPSNAAPYVTLA IYARAGLW ++ +IRK+MQQK VRK+AGWSWIEIDKK HVFSVGDA HPKSCEIYSKLDQL+LDMKAAE+ASK
Subjt:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK

Query:  ALKY
        AL+Y
Subjt:  ALKY

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CML0 pentatricopeptide repeat-containing protein At2g13600-like | e value: 3.1e-255 | % identity: 86.71
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLT YSLCSAL+SCAKT NLFLGLQIHAQIVKIGFEENLFLNS+LVDLY+KCNAIV+AKRVFS MKTHD VSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL
        KML T VRPNCFTYAT ISSCPTL+++LQI LATLLHAHVIK GFT SSFVISSTIDCYSKLGRI+EA+LLF ET+VKDNIIFNSMISGYSQNL GEEAL
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR+S+LSPTDHTLTSVLNACG LTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGS+DE F +FNQTV+KNSVLSTSMIMAFAQCGRG +A
Subjt:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE
        LKLFECL TE+ F+PDH+CFTAVLTACNHAG LDEAVEYFNKM  +Y+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESN+VMWCSLLGACKVH E
Subjt:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE

Query:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK
        VELGREVAY+L EMDP NAAPY+TLA IYARAGLW +V +IRK+MQQK VRK+AGWSWIEIDKK HVFSVGDA HPKSCEIYSKLDQL+LDMKAAE + K
Subjt:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK

Query:  ALKY
        AL+Y
Subjt:  ALKY

A0A5A7UEE6 Pentatricopeptide repeat-containing protein | e value: 3.1e-255 | % identity: 86.71
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLT YSLCSAL+SCAKT NLFLGLQIHAQIVKIGFEENLFLNS+LVDLY+KCNAIV+AKRVFS MKTHD VSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL
        KML T VRPNCFTYAT ISSCPTL+++LQI LATLLHAHVIK GFT SSFVISSTIDCYSKLGRI+EA+LLF ET+VKDNIIFNSMISGYSQNL GEEAL
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR+S+LSPTDHTLTSVLNACG LTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGS+DE F +FNQTV+KNSVLSTSMIMAFAQCGRG +A
Subjt:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE
        LKLFECL TE+ F+PDH+CFTAVLTACNHAG LDEAVEYFNKM  +Y+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESN+VMWCSLLGACKVH E
Subjt:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE

Query:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK
        VELGREVAY+L EMDP NAAPY+TLA IYARAGLW +V +IRK+MQQK VRK+AGWSWIEIDKK HVFSVGDA HPKSCEIYSKLDQL+LDMKAAE + K
Subjt:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK

Query:  ALKY
        AL+Y
Subjt:  ALKY

A0A6J1DYD1 pentatricopeptide repeat-containing protein At2g13600-like | e value: 6.7e-258 | % identity: 88.84
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MC LGAKLTKY+LCSALNSCAKT NLF GLQIHAQIVKIG EENLFLNSALVDLYAKCNAIVDAKRVF  M+THD VSWTS+ISGLS+NGRGSEAI MFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL
        +ML T  RPNCFTYAT ISSCPTLEDDLQ+RLA LLHAHVIK GFT SSFVISSTIDCYSKLGRI +AALLF+ET  KDNIIFNSMISGYSQNL+GEEAL
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMRSS LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSL+DMYSKCG VDE FFIFNQ VEKNSVLSTSMIMAFAQCGRGSDA
Subjt:  KLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE
        LKLFE LL EEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGS+Y LDPQIDHYACLIDLYARNGH+EKAKQL+EQMPYESN+VMWCSLLGACKVH E
Subjt:  LKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEE

Query:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK
        VELGREVAY+L EMDPSNAAPYVTLA IYARAGLW +V DIRK+MQQ+ VRK+ GWSWIEIDKK+HVFS GDATHPKS EIYSKLDQLDLDMK+AE+ASK
Subjt:  VELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASK

Query:  AL
        AL
Subjt:  AL

A0A6J1GA23 pentatricopeptide repeat-containing protein At2g13600-like | e value: 1.2e-259 | % identity: 87.99
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIGFE+NL+LNS LV+LY+KCNAIVDAKR+F HMKTHD VSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA
         ML T  RPNCFTYAT ISSCP+L+D+L I L TL HAHVIKLGF   SSFVISS IDCYSKLGRIEEAALLF+E  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR+S+LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+ F IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE
        ALKLFE LLTEEGFLPDHVCFTAVLTACNHAG L+EAVEYFNKMGS+YRLDPQIDHYACLIDLYARNGHVEKAK+LME+MPYESN+VMWCSLLGACKVH 
Subjt:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE

Query:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS
        EVELGREVAY+L EMDP NAAPYVTLA IYARAGLW ++ADIR QMQQK VRK+AGWSWIEIDKKAHVFSVGDATHPKSCEIYSKL+QLDLDM+ AE+AS
Subjt:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS

Query:  KALKYVEF
        KAL++VEF
Subjt:  KALKYVEF

A0A6J1K975 pentatricopeptide repeat-containing protein At2g13600-like | e value: 2.0e-257 | % identity: 87.4
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIGFE+NL+LNS LV+LY+KCNAIVDAKR+F HMKTHD VSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA
         ML T  RPNCFTYAT ISSCP+L+D+L   L TL HAHVIKLGF   SSFVISS IDCYSKLGRIEEAALLF+E  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMR+S+LS TDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+ F IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE
        ALKLFE LLTEEGFLPDHVCFTAVLTACNHAG L+EAVEYFNKMGS+YRLDPQIDHYACLIDLYARNGHVEKAK+LME++PYESN+VMWCSLLGACKVH 
Subjt:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE

Query:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS
        EVELGREVAY+L EMDPSNAAPYVTLA IYARAGLW ++ADIRKQMQQK VRK+AGWSWIEIDKKAHVFSVGDATHPKSCEIYSKL+QLDLDM+  E+AS
Subjt:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS

Query:  KALKYVEF
        KAL+++EF
Subjt:  KALKYVEF

SwissProt top hits (e value, % identity, alignment)
Q5G1T1 Pentatricopeptide repeat-containing protein At3g49170, chloroplastic | e value: 1.1e-92 | % identity: 35.34
Query:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNA---IVDAKRVFSHMKTHDHVSWTSIISGLSQN-GRGSEAILMFK
        G +  K++L S  ++CA+ +NL LG Q+H+  ++ G  ++  +  +LVD+YAKC+A   + D ++VF  M+ H  +SWT++I+G  +N    +EAI +F 
Subjt:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNA---IVDAKRVFSHMKTHDHVSWTSIISGLSQN-GRGSEAILMFK

Query:  KMLAT-HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA
        +M+   HV PN FT+++   +C  L D    R+   +     K G   +S V +S I  + K  R+E+A   F   + K+ + +N+ + G  +NL  E+A
Subjt:  KMLAT-HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
         KL  E+   +L  +  T  S+L+   ++  + +G Q+HS V K+G   N  V  +L+ MYSKCGS+D    +FN    +N +  TSMI  FA+ G    
Subjt:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE
         L+ F  ++ EEG  P+ V + A+L+AC+H G + E   +FN M   +++ P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH 
Subjt:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE

Query:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY
          ELG+  A ++ E+DP+  A Y+ L+ IYA AG W    ++R++M+++N+ K  G SWIE+  K H F VGD  HP + +IY +LD+L  ++K   Y
Subjt:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial | e value: 3.6e-91 | % identity: 33.65
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        + S G    + SL     +CA  + L  GLQI+   +K     ++ + +A +D+Y KC A+ +A RVF  M+  D VSW +II+   QNG+G E + +F 
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFHETTVKDN----------------
         ML + + P+ FT+ + + +C        +     +H+ ++K G   +S V  S ID YSK G IEEA    +  F    V                   
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFHETTVKDN----------------

Query:  IIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKN
        + +NS+ISGY      E+A  LF  M    ++P   T  +VL+ C +L     G+Q+H+ V K   +++V++  +L+DMYSKCG + +   +F +++ ++
Subjt:  IIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKN

Query:  SVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP
         V   +MI  +A  G+G +A++LFE ++  E   P+HV F ++L AC H G +D+ +EYF  M   Y LDPQ+ HY+ ++D+  ++G V++A +L+ +MP
Subjt:  SVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP

Query:  YESNHVMWCSLLGACKVH-EEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSC
        +E++ V+W +LLG C +H   VE+  E    L  +DP +++ Y  L+ +YA AG+W +V+D+R+ M+   ++K  G SW+E+  + HVF VGD  HP+  
Subjt:  YESNHVMWCSLLGACKVH-EEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSC

Query:  EIYSKLDQLDLDMKAAEYAS
        EIY +L  +  +MK  + +S
Subjt:  EIYSKLDQLDLDMKAAEYAS

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | e value: 9.5e-92 | % identity: 34.43
Query:  LNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDH--VSWTSIISGLSQNGRGSEAILMFKKMLATHVRPNCFTY
        L +C+   +L +G  +HAQ+ ++GF+ ++F+ + L+ LYAKC  +  A+ VF  +   +   VSWT+I+S  +QNG   EA+ +F +M    V+P+    
Subjt:  LNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDH--VSWTSIISGLSQNGRGSEAILMFKKMLATHVRPNCFTY

Query:  ATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPT
         + +++   L+D   ++    +HA V+K+G  +   ++ S    Y+K G++  A +LF +    + I++N+MISGY++N Y  EA+ +F EM + D+ P 
Subjt:  ATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPT

Query:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFL
          ++TS ++AC  +  LEQ R ++  V +    ++VF+  +L+DM++KCGSV+    +F++T++++ V+ ++MI+ +   GR  +A+ L+   +   G  
Subjt:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFL

Query:  PDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEM
        P+ V F  +L ACNH+G + E   +FN+M + ++++PQ  HYAC+IDL  R GH+++A ++++ MP +    +W +LL ACK H  VELG   A QL  +
Subjt:  PDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEM

Query:  DPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS
        DPSN   YV L+ +YA A LW RVA++R +M++K + K  G SW+E+  +   F VGD +HP+  EI  +++ ++  +K   + +
Subjt:  DPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 | e value: 2.5e-100 | % identity: 37.57
Query:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFKKMLA
        G  L +YS  S L++C+   ++  G+Q+H+ I K  F  ++++ SALVD+Y+KC  + DA+RVF  M   + VSW S+I+   QNG   EA+ +F+ ML 
Subjt:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFKKMLA

Query:  THVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISST-IDCYSKLGRIEEAALLFHETTVKD------------------------
        + V P+  T A+ IS+C +L     I++   +H  V+K     +  ++S+  +D Y+K  RI+EA  +F    +++                        
Subjt:  THVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISST-IDCYSKLGRIEEAALLFHETTVKD------------------------

Query:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV
              N++ +N++I+GY+QN   EEAL LF  ++   + PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG V
Subjt:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV

Query:  DEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARN
        +E + +F + +E++ V   +MI+ FAQ G G++AL+LF  +L E G  PDH+    VL+AC HAGF++E   YF+ M   + + P  DHY C++DL  R 
Subjt:  DEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARN

Query:  GHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAH
        G +E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L E++PSN+ PYV L+ +YA  G W  V ++RK M+++ V K  G SWI+I    H
Subjt:  GHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAH

Query:  VFSVGDATHPKSCEIYSKLDQLDLDMK
        VF V D +HP+  +I+S LD L  +M+
Subjt:  VFSVGDATHPKSCEIYSKLDQLDLDMK

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g27610 | e value: 1.3e-88 | % identity: 35.5
Query:  KLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMK-THDHVSWTSIISGLSQNGRGSEAILMFKKMLAT
        +L++ S  S +  CA  + L    Q+H  +VK GF  +  + +AL+  Y+KC A++DA R+F  +    + VSWT++ISG  QN    EA+ +F +M   
Subjt:  KLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMK-THDHVSWTSIISGLSQNGRGSEAILMFKKMLAT

Query:  HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+  +++ P +         + +HA V+K  +  SS V ++ +D Y KLG++EEAA +F     KD + +++M++GY+Q    E A+K+F E
Subjt:  HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRSSDLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        +    + P + T +S+LN C +    + QG+Q H    K   ++++ V  +LL MY+K G+++    +F +  EK+ V   SMI  +AQ G+   AL +F
Subjt:  MRSSDLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELG
        +  + +     D V F  V  AC HAG ++E  +YF+ M    ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELG

Query:  REVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY
        R  A ++  M P ++A YV L+ +YA +G W   A +RK M ++NV+K  G+SWIE+  K + F  GD +HP   +IY KL+ L   +K   Y
Subjt:  REVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY

Arabidopsis top hits (e value, % identity, alignment)
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.8e-101 | % identity: 37.57
Query:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFKKMLA
        G  L +YS  S L++C+   ++  G+Q+H+ I K  F  ++++ SALVD+Y+KC  + DA+RVF  M   + VSW S+I+   QNG   EA+ +F+ ML 
Subjt:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFKKMLA

Query:  THVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISST-IDCYSKLGRIEEAALLFHETTVKD------------------------
        + V P+  T A+ IS+C +L     I++   +H  V+K     +  ++S+  +D Y+K  RI+EA  +F    +++                        
Subjt:  THVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISST-IDCYSKLGRIEEAALLFHETTVKD------------------------

Query:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV
              N++ +N++I+GY+QN   EEAL LF  ++   + PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG V
Subjt:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV

Query:  DEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARN
        +E + +F + +E++ V   +MI+ FAQ G G++AL+LF  +L E G  PDH+    VL+AC HAGF++E   YF+ M   + + P  DHY C++DL  R 
Subjt:  DEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARN

Query:  GHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAH
        G +E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L E++PSN+ PYV L+ +YA  G W  V ++RK M+++ V K  G SWI+I    H
Subjt:  GHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAH

Query:  VFSVGDATHPKSCEIYSKLDQLDLDMK
        VF V D +HP+  +I+S LD L  +M+
Subjt:  VFSVGDATHPKSCEIYSKLDQLDLDMK

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 9.1e-90 | % identity: 35.5
Query:  KLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMK-THDHVSWTSIISGLSQNGRGSEAILMFKKMLAT
        +L++ S  S +  CA  + L    Q+H  +VK GF  +  + +AL+  Y+KC A++DA R+F  +    + VSWT++ISG  QN    EA+ +F +M   
Subjt:  KLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMK-THDHVSWTSIISGLSQNGRGSEAILMFKKMLAT

Query:  HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+  +++ P +         + +HA V+K  +  SS V ++ +D Y KLG++EEAA +F     KD + +++M++GY+Q    E A+K+F E
Subjt:  HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRSSDLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        +    + P + T +S+LN C +    + QG+Q H    K   ++++ V  +LL MY+K G+++    +F +  EK+ V   SMI  +AQ G+   AL +F
Subjt:  MRSSDLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELG
        +  + +     D V F  V  AC HAG ++E  +YF+ M    ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELG

Query:  REVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY
        R  A ++  M P ++A YV L+ +YA +G W   A +RK M ++NV+K  G+SWIE+  K + F  GD +HP   +IY KL+ L   +K   Y
Subjt:  REVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.6e-92 | % identity: 33.65
Query:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK
        + S G    + SL     +CA  + L  GLQI+   +K     ++ + +A +D+Y KC A+ +A RVF  M+  D VSW +II+   QNG+G E + +F 
Subjt:  MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFK

Query:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFHETTVKDN----------------
         ML + + P+ FT+ + + +C        +     +H+ ++K G   +S V  S ID YSK G IEEA    +  F    V                   
Subjt:  KMLATHVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFHETTVKDN----------------

Query:  IIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKN
        + +NS+ISGY      E+A  LF  M    ++P   T  +VL+ C +L     G+Q+H+ V K   +++V++  +L+DMYSKCG + +   +F +++ ++
Subjt:  IIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKN

Query:  SVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP
         V   +MI  +A  G+G +A++LFE ++  E   P+HV F ++L AC H G +D+ +EYF  M   Y LDPQ+ HY+ ++D+  ++G V++A +L+ +MP
Subjt:  SVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP

Query:  YESNHVMWCSLLGACKVH-EEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSC
        +E++ V+W +LLG C +H   VE+  E    L  +DP +++ Y  L+ +YA AG+W +V+D+R+ M+   ++K  G SW+E+  + HVF VGD  HP+  
Subjt:  YESNHVMWCSLLGACKVH-EEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSC

Query:  EIYSKLDQLDLDMKAAEYAS
        EIY +L  +  +MK  + +S
Subjt:  EIYSKLDQLDLDMKAAEYAS

AT3G12770.1 mitochondrial editing factor 22; e value: 6.7e-93; %identity: 34.43
Query:  LNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDH--VSWTSIISGLSQNGRGSEAILMFKKMLATHVRPNCFTY
        L +C+   +L +G  +HAQ+ ++GF+ ++F+ + L+ LYAKC  +  A+ VF  +   +   VSWT+I+S  +QNG   EA+ +F +M    V+P+    
Subjt:  LNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDH--VSWTSIISGLSQNGRGSEAILMFKKMLATHVRPNCFTY

Query:  ATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPT
         + +++   L+D   ++    +HA V+K+G  +   ++ S    Y+K G++  A +LF +    + I++N+MISGY++N Y  EA+ +F EM + D+ P 
Subjt:  ATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPT

Query:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFL
          ++TS ++AC  +  LEQ R ++  V +    ++VF+  +L+DM++KCGSV+    +F++T++++ V+ ++MI+ +   GR  +A+ L+   +   G  
Subjt:  DHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFL

Query:  PDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEM
        P+ V F  +L ACNH+G + E   +FN+M + ++++PQ  HYAC+IDL  R GH+++A ++++ MP +    +W +LL ACK H  VELG   A QL  +
Subjt:  PDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEM

Query:  DPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS
        DPSN   YV L+ +YA A LW RVA++R +M++K + K  G SW+E+  +   F VGD +HP+  EI  +++ ++  +K   + +
Subjt:  DPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYAS

AT3G49170.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value: 8.0e-94; %identity: 35.34
Query:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNA---IVDAKRVFSHMKTHDHVSWTSIISGLSQN-GRGSEAILMFK
        G +  K++L S  ++CA+ +NL LG Q+H+  ++ G  ++  +  +LVD+YAKC+A   + D ++VF  M+ H  +SWT++I+G  +N    +EAI +F 
Subjt:  GAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNA---IVDAKRVFSHMKTHDHVSWTSIISGLSQN-GRGSEAILMFK

Query:  KMLAT-HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA
        +M+   HV PN FT+++   +C  L D    R+   +     K G   +S V +S I  + K  R+E+A   F   + K+ + +N+ + G  +NL  E+A
Subjt:  KMLAT-HVRPNCFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
         KL  E+   +L  +  T  S+L+   ++  + +G Q+HS V K+G   N  V  +L+ MYSKCGS+D    +FN    +N +  TSMI  FA+ G    
Subjt:  LKLFVEMRSSDLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE
         L+ F  ++ EEG  P+ V + A+L+AC+H G + E   +FN M   +++ P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH 
Subjt:  ALKLFECLLTEEGFLPDHVCFTAVLTACNHAGFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHE

Query:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY
          ELG+  A ++ E+DP+  A Y+ L+ IYA AG W    ++R++M+++N+ K  G SWIE+  K H F VGD  HP + +IY +LD+L  ++K   Y
Subjt:  EVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVADIRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEY


Sequences
CDS sequence
ATGTGCAGCTTAGGAGCAAAGCTGACAAAGTATTCTCTCTGCTCTGCTCTTAATTCTTGTGCTAAAACACAAAATTTGTTTCTGGGTCTGCAAATTCACGCCCAGATTGT
TAAAATTGGATTTGAAGAGAACTTATTTTTGAACAGTGCACTTGTTGATTTATACGCCAAATGTAATGCCATCGTGGATGCAAAAAGGGTCTTCTCTCACATGAAGACTC
ACGATCATGTATCTTGGACCTCTATAATATCTGGACTTTCCCAAAATGGGCGTGGGAGCGAAGCCATCTTGATGTTTAAGAAAATGTTGGCAACTCATGTTAGACCCAAC
TGTTTTACTTATGCTACTGGTATTAGTTCATGCCCAACACTGGAGGATGACCTTCAGATTCGTCTCGCAACTTTGCTTCATGCACATGTTATCAAACTTGGTTTTACTTT
AAGCAGTTTTGTTATTAGCTCCACCATTGATTGTTACTCAAAACTAGGAAGAATAGAAGAAGCTGCTCTGCTCTTTCATGAGACAACAGTGAAGGATAATATCATATTTA
ATTCTATGATATCAGGGTATTCTCAAAACTTATATGGGGAAGAGGCATTAAAACTGTTTGTAGAAATGAGATCTAGTGATTTGAGCCCAACTGATCATACACTAACTAGT
GTTTTGAATGCTTGTGGAAGTCTAACAGTACTTGAACAAGGAAGGCAAGTGCACTCACTAGTTACAAAAATGGGATCAGAAAATAATGTGTTTGTGGTCTGTTCTCTGCT
AGATATGTATTCAAAGTGTGGGAGTGTTGATGAGGTCTTTTTCATATTCAATCAGACAGTCGAAAAGAACAGTGTGTTGTCAACTTCGATGATAATGGCTTTTGCTCAAT
GTGGTAGAGGCTCAGATGCATTAAAGCTCTTTGAGTGTTTGTTGACTGAAGAAGGTTTCTTGCCAGATCATGTCTGTTTTACTGCAGTTTTAACTGCCTGCAATCATGCT
GGATTTCTAGATGAGGCAGTTGAATATTTCAATAAAATGGGCAGTAAATACAGATTGGATCCTCAAATTGATCATTATGCTTGTTTGATTGATCTTTATGCCAGAAATGG
GCACGTAGAAAAAGCTAAGCAATTAATGGAGCAAATGCCTTACGAATCTAATCACGTTATGTGGTGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTTG
GGAGGGAGGTGGCATATCAACTAAACGAGATGGATCCAAGTAATGCTGCACCCTATGTAACACTTGCTCAAATCTATGCTAGAGCTGGTTTATGGGCTCGGGTGGCTGAT
ATTAGAAAACAGATGCAACAAAAAAATGTAAGGAAAACTGCTGGGTGGAGTTGGATTGAGATAGATAAGAAAGCTCATGTCTTCTCAGTTGGTGATGCAACTCATCCTAA
ATCTTGTGAGATTTATTCAAAACTGGACCAACTGGACTTGGATATGAAAGCAGCTGAATATGCATCAAAAGCACTTAAGTATGTTGAGTTTTAA
mRNA sequence
ATGTGCAGCTTAGGAGCAAAGCTGACAAAGTATTCTCTCTGCTCTGCTCTTAATTCTTGTGCTAAAACACAAAATTTGTTTCTGGGTCTGCAAATTCACGCCCAGATTGT
TAAAATTGGATTTGAAGAGAACTTATTTTTGAACAGTGCACTTGTTGATTTATACGCCAAATGTAATGCCATCGTGGATGCAAAAAGGGTCTTCTCTCACATGAAGACTC
ACGATCATGTATCTTGGACCTCTATAATATCTGGACTTTCCCAAAATGGGCGTGGGAGCGAAGCCATCTTGATGTTTAAGAAAATGTTGGCAACTCATGTTAGACCCAAC
TGTTTTACTTATGCTACTGGTATTAGTTCATGCCCAACACTGGAGGATGACCTTCAGATTCGTCTCGCAACTTTGCTTCATGCACATGTTATCAAACTTGGTTTTACTTT
AAGCAGTTTTGTTATTAGCTCCACCATTGATTGTTACTCAAAACTAGGAAGAATAGAAGAAGCTGCTCTGCTCTTTCATGAGACAACAGTGAAGGATAATATCATATTTA
ATTCTATGATATCAGGGTATTCTCAAAACTTATATGGGGAAGAGGCATTAAAACTGTTTGTAGAAATGAGATCTAGTGATTTGAGCCCAACTGATCATACACTAACTAGT
GTTTTGAATGCTTGTGGAAGTCTAACAGTACTTGAACAAGGAAGGCAAGTGCACTCACTAGTTACAAAAATGGGATCAGAAAATAATGTGTTTGTGGTCTGTTCTCTGCT
AGATATGTATTCAAAGTGTGGGAGTGTTGATGAGGTCTTTTTCATATTCAATCAGACAGTCGAAAAGAACAGTGTGTTGTCAACTTCGATGATAATGGCTTTTGCTCAAT
GTGGTAGAGGCTCAGATGCATTAAAGCTCTTTGAGTGTTTGTTGACTGAAGAAGGTTTCTTGCCAGATCATGTCTGTTTTACTGCAGTTTTAACTGCCTGCAATCATGCT
GGATTTCTAGATGAGGCAGTTGAATATTTCAATAAAATGGGCAGTAAATACAGATTGGATCCTCAAATTGATCATTATGCTTGTTTGATTGATCTTTATGCCAGAAATGG
GCACGTAGAAAAAGCTAAGCAATTAATGGAGCAAATGCCTTACGAATCTAATCACGTTATGTGGTGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTTG
GGAGGGAGGTGGCATATCAACTAAACGAGATGGATCCAAGTAATGCTGCACCCTATGTAACACTTGCTCAAATCTATGCTAGAGCTGGTTTATGGGCTCGGGTGGCTGAT
ATTAGAAAACAGATGCAACAAAAAAATGTAAGGAAAACTGCTGGGTGGAGTTGGATTGAGATAGATAAGAAAGCTCATGTCTTCTCAGTTGGTGATGCAACTCATCCTAA
ATCTTGTGAGATTTATTCAAAACTGGACCAACTGGACTTGGATATGAAAGCAGCTGAATATGCATCAAAAGCACTTAAGTATGTTGAGTTTTAA
Protein sequence
MCSLGAKLTKYSLCSALNSCAKTQNLFLGLQIHAQIVKIGFEENLFLNSALVDLYAKCNAIVDAKRVFSHMKTHDHVSWTSIISGLSQNGRGSEAILMFKKMLATHVRPN
CFTYATGISSCPTLEDDLQIRLATLLHAHVIKLGFTLSSFVISSTIDCYSKLGRIEEAALLFHETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRSSDLSPTDHTLTS
VLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEVFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFECLLTEEGFLPDHVCFTAVLTACNHA
GFLDEAVEYFNKMGSKYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNHVMWCSLLGACKVHEEVELGREVAYQLNEMDPSNAAPYVTLAQIYARAGLWARVAD
IRKQMQQKNVRKTAGWSWIEIDKKAHVFSVGDATHPKSCEIYSKLDQLDLDMKAAEYASKALKYVEF