; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0023042 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0023042
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr7:43267377..43268903
RNA-Seq ExpressionLag0023042
SyntenyLag0023042
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464861.1 PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo]6.2e-25887.9Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLT YSLCSAL+SCAKTHN  LGLQIHAQIVKIGFE+NLFLNSSLVDLYSKCNAI +AKRVFS MKTHDQVSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL
         MLVTQVRPNCFTYATVISSCPTLK+ELQI L+TLLHAHV K GFT SSFVISSTIDCYSKLGRI+EA+LLF E++VKDNIIFNSMISGYSQNL GEEAL
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMRA NLSPTDHTLTSVLNACG LTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGSIDEAF +FNQTV+KNSVLSTSMIMAFAQCGRG +A
Subjt:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE L TE+ F+PDH+CFTAVLTACNHAGLLDEAV YFNKM  EY+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK
        VELGREVAYRLIEMDP NAAPY+TLAHIYARAGLW Q+ +IRK+MQQK+VRKSAGWSWIEIDKK HVFSVGD  HP+SCEIYSKL+QL+LDMKAAE + K
Subjt:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK

Query:  ALDY
        AL+Y
Subjt:  ALDY

XP_022948515.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata]6.7e-26890.94Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT N  LGLQIHAQIVKIGFEDNL+LNS LV+LYSKCNAI DAKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA
        NMLVTQ RPNCFTYATVISSCP+LKDEL I L+TL HAHV KLGF   SSFVISS IDCYSKLGRIEEAALLFYE+ VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMRA NLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGS+D+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLL+EAV YFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS
        EVELGREVAYRLIEMDP NAAPYVTLAHIYARAGLW QLADIR QMQQK+VRKSAGWSWIEIDKKAHVFSVGD THP+SCEIYSKLNQLDLDM+ AE+AS
Subjt:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS

Query:  KALDYVEF
        KAL++VEF
Subjt:  KALDYVEF

XP_022998877.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita maxima]1.1e-26590.35Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT N  LGLQIHAQIVKIGFEDNL+LNS LV+LYSKCNAI DAKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA
        NMLVTQ RPNCFTYATVISSCP+LKDEL   L+TL HAHV KLGF   SSFVISS IDCYSKLGRIEEAALLFYE+ VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMRA NLS TDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGS+D+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLL+EAV YFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAK+LME++PYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS
        EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLW QLADIRKQMQQK+VRKSAGWSWIEIDKKAHVFSVGD THP+SCEIYSKLNQLDLDM+  E+AS
Subjt:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS

Query:  KALDYVEF
        KAL+++EF
Subjt:  KALDYVEF

XP_023521522.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp. pepo]1.6e-26690.75Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT N  LGLQIHAQIVKIGFEDNL+LNS+LV+LYSKCNAI DAKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTL-SSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA
        NMLVTQ RPNCFTYATVISSCP+LKDEL   L+TL HAHV KLGF L SSFVISS IDCYSKLGRIEEAALLFYE+ VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTL-SSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMRA NLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGS+D+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLL+EAV YFNKMGSEYRLDPQIDHYACLIDLYARNGHV KAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS
         VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLW QLADIRKQMQQK+VRKSAGWSWIEIDKKAHVFSVGD THP+SCEIYSKLNQLDLDM+  E+AS
Subjt:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS

Query:  KALDYVEF
        KAL++VEF
Subjt:  KALDYVEF

XP_038896055.1 pentatricopeptide repeat-containing protein At3g12770-like [Benincasa hispida]8.4e-27192.06Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKTHN  LGLQIHAQIVKIGFE+NLFLNSSLVDLYSKCNAI +AKRVFSHMKTHDQVSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL
        NMLVTQVRPNCFTYATVISSC TLKDELQI L+TLLHAHV KLGFT S+FVISSTIDCYSKLGR++EAALLFYE+TVKDNIIFNSMISGYSQNLYGEEAL
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMRA NLSPTDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGS+DEAFF+FNQT+EKNSVLSTSMIMAFAQCGRGS+A
Subjt:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFESLLTEEGFLPDHVCF AVLTACNHAGLL+EAV YFNKM  EYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK
        VELGREVAY+LIEMDPSNAAPYVTLAHIYARAGLW QL +IRK+MQQK+VRKSAGWSWIEIDKK HVFSVGD  HP+SCEIYSKL+QL+LDMKAAE+ASK
Subjt:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK

Query:  ALDY
        AL+Y
Subjt:  ALDY

TrEMBL top hitse value%identityAlignment
A0A1S3CML0 pentatricopeptide repeat-containing protein At2g13600-like3.0e-25887.9Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLT YSLCSAL+SCAKTHN  LGLQIHAQIVKIGFE+NLFLNSSLVDLYSKCNAI +AKRVFS MKTHDQVSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL
         MLVTQVRPNCFTYATVISSCPTLK+ELQI L+TLLHAHV K GFT SSFVISSTIDCYSKLGRI+EA+LLF E++VKDNIIFNSMISGYSQNL GEEAL
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMRA NLSPTDHTLTSVLNACG LTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGSIDEAF +FNQTV+KNSVLSTSMIMAFAQCGRG +A
Subjt:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE L TE+ F+PDH+CFTAVLTACNHAGLLDEAV YFNKM  EY+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK
        VELGREVAYRLIEMDP NAAPY+TLAHIYARAGLW Q+ +IRK+MQQK+VRKSAGWSWIEIDKK HVFSVGD  HP+SCEIYSKL+QL+LDMKAAE + K
Subjt:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK

Query:  ALDY
        AL+Y
Subjt:  ALDY

A0A5A7UEE6 Pentatricopeptide repeat-containing protein3.0e-25887.9Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLT YSLCSAL+SCAKTHN  LGLQIHAQIVKIGFE+NLFLNSSLVDLYSKCNAI +AKRVFS MKTHDQVSWTSIISGLSQNG GSEAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL
         MLVTQVRPNCFTYATVISSCPTLK+ELQI L+TLLHAHV K GFT SSFVISSTIDCYSKLGRI+EA+LLF E++VKDNIIFNSMISGYSQNL GEEAL
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMRA NLSPTDHTLTSVLNACG LTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGSIDEAF +FNQTV+KNSVLSTSMIMAFAQCGRG +A
Subjt:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE L TE+ F+PDH+CFTAVLTACNHAGLLDEAV YFNKM  EY+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK
        VELGREVAYRLIEMDP NAAPY+TLAHIYARAGLW Q+ +IRK+MQQK+VRKSAGWSWIEIDKK HVFSVGD  HP+SCEIYSKL+QL+LDMKAAE + K
Subjt:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK

Query:  ALDY
        AL+Y
Subjt:  ALDY

A0A6J1DYD1 pentatricopeptide repeat-containing protein At2g13600-like1.1e-25587.45Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MC LGAKLTKY+LCSALNSCAKT N   GLQIHAQIVKIG E+NLFLNS+LVDLY+KCNAI DAKRVF  M+THDQVSWTS+ISGLS+NG GSEAI MFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL
         MLVTQ RPNCFTYATVISSCPTL+D+LQ+RL+ LLHAHV K GFT SSFVISSTIDCYSKLGRI +AALLFYE+  KDNIIFNSMISGYSQNL+GEEAL
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA
        KLFVEMR+ +LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSL+DMYSKCG +DEAFFIFNQ VEKNSVLSTSMIMAFAQCGRGSDA
Subjt:  KLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE LL EEGFLPDHVCFTAVLTACNHAG LDEAV YFNKMGSEY LDPQIDHYACLIDLYARNGH+EKAKQL+EQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK
        VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLW Q+ DIRK+MQQ+KVRKS GWSWIEIDKK+HVFS GD THP+S EIYSKL+QLDLDMK+AE+ASK
Subjt:  VELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASK

Query:  AL
        AL
Subjt:  AL

A0A6J1GA23 pentatricopeptide repeat-containing protein At2g13600-like3.2e-26890.94Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT N  LGLQIHAQIVKIGFEDNL+LNS LV+LYSKCNAI DAKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA
        NMLVTQ RPNCFTYATVISSCP+LKDEL I L+TL HAHV KLGF   SSFVISS IDCYSKLGRIEEAALLFYE+ VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMRA NLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGS+D+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLL+EAV YFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS
        EVELGREVAYRLIEMDP NAAPYVTLAHIYARAGLW QLADIR QMQQK+VRKSAGWSWIEIDKKAHVFSVGD THP+SCEIYSKLNQLDLDM+ AE+AS
Subjt:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS

Query:  KALDYVEF
        KAL++VEF
Subjt:  KALDYVEF

A0A6J1K975 pentatricopeptide repeat-containing protein At2g13600-like5.1e-26690.35Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT N  LGLQIHAQIVKIGFEDNL+LNS LV+LYSKCNAI DAKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA
        NMLVTQ RPNCFTYATVISSCP+LKDEL   L+TL HAHV KLGF   SSFVISS IDCYSKLGRIEEAALLFYE+ VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFT-LSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
        LKLFVEMRA NLS TDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGS+D+AF IFNQTVEKNSVLSTSMIMAFAQCGRGSD
Subjt:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLL+EAV YFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAK+LME++PYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS
        EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLW QLADIRKQMQQK+VRKSAGWSWIEIDKKAHVFSVGD THP+SCEIYSKLNQLDLDM+  E+AS
Subjt:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENAS

Query:  KALDYVEF
        KAL+++EF
Subjt:  KALDYVEF

SwissProt top hitse value%identityAlignment
Q5G1T1 Pentatricopeptide repeat-containing protein At3g49170, chloroplastic2.5e-9235.63Show/hide
Query:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNA---IWDAKRVFSHMKTHDQVSWTSIISGLSQNGH-GSEAILMFK
        G +  K++L S  ++CA+  N  LG Q+H+  ++ G  D+  +  SLVD+Y+KC+A   + D ++VF  M+ H  +SWT++I+G  +N +  +EAI +F 
Subjt:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNA---IWDAKRVFSHMKTHDQVSWTSIISGLSQNGH-GSEAILMFK

Query:  NMLVT-QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA
         M+    V PN FT+++   +C  L D    R+   +    FK G   +S V +S I  + K  R+E+A   F   + K+ + +N+ + G  +NL  E+A
Subjt:  NMLVT-QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
         KL  E+    L  +  T  S+L+   ++  + +G Q+HS V K+G   N  V  +L+ MYSKCGSID A  +FN    +N +  TSMI  FA+ G    
Subjt:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
         L+ F  ++ EEG  P+ V + A+L+AC+H GL+ E   +FN M  ++++ P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK
          ELG+  A +++E+DP+  A Y+ L++IYA AG W +  ++R++M+++ + K  G SWIE+  K H F VGD  HP + +IY +L++L  ++K
Subjt:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial2.8e-9133.27Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        + S G    + SL     +CA       GLQI+   +K     ++ + ++ +D+Y KC A+ +A RVF  M+  D VSW +II+   QNG G E + +F 
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFYESTVKDN----------------
        +ML +++ P+ FT+ +++ +C        +     +H+ + K G   +S V  S ID YSK G IEEA    +  F  + V                   
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFYESTVKDN----------------

Query:  IIFNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKN
        + +NS+ISGY      E+A  LF  M    ++P   T  +VL+ C +L     G+Q+H+ V K   +++V++  +L+DMYSKCG + ++  +F +++ ++
Subjt:  IIFNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKN

Query:  SVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP
         V   +MI  +A  G+G +A++LFE ++  E   P+HV F ++L AC H GL+D+ + YF  M  +Y LDPQ+ HY+ ++D+  ++G V++A +L+ +MP
Subjt:  SVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP

Query:  YESNYVMWCSLLGACKVH-EEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSC
        +E++ V+W +LLG C +H   VE+  E    L+ +DP +++ Y  L+++YA AG+W +++D+R+ M+  K++K  G SW+E+  + HVF VGD  HP+  
Subjt:  YESNYVMWCSLLGACKVH-EEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSC

Query:  EIYSKLNQLDLDMKAAENAS
        EIY +L  +  +MK  +++S
Subjt:  EIYSKLNQLDLDMKAAENAS

Q9SIT7 Pentatricopeptide repeat-containing protein At2g136002.5e-10036.33Show/hide
Query:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLV
        G  L +YS  S L++C+  ++   G+Q+H+ I K  F  ++++ S+LVD+YSKC  + DA+RVF  M   + VSW S+I+   QNG   EA+ +F+ ML 
Subjt:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLV

Query:  TQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISST-IDCYSKLGRIEEAALLFYESTVKD------------------------
        ++V P+  T A+VIS+C +L     I++   +H  V K     +  ++S+  +D Y+K  RI+EA  +F    +++                        
Subjt:  TQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISST-IDCYSKLGRIEEAALLFYESTVKD------------------------

Query:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSI
              N++ +N++I+GY+QN   EEAL LF  ++  ++ PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG +
Subjt:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSI

Query:  DEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARN
        +E + +F + +E++ V   +MI+ FAQ G G++AL+LF  +L E G  PDH+    VL+AC HAG ++E   YF+ M  ++ + P  DHY C++DL  R 
Subjt:  DEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARN

Query:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAH
        G +E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L+E++PSN+ PYV L+++YA  G W  + ++RK M+++ V K  G SWI+I    H
Subjt:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAH

Query:  VFSVGDGTHPQSCEIYSKLNQLDLDMKAAEN-------ASKALDY
        VF V D +HP+  +I+S L+ L  +M+  ++       +S+ +DY
Subjt:  VFSVGDGTHPQSCEIYSKLNQLDLDMKAAEN-------ASKALDY

Q9STS9 Putative pentatricopeptide repeat-containing protein At3g478401.1e-9038.89Show/hide
Query:  YSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLVTQVRPN
        Y+   AL +CA       G  IH  ++  GF   L + +SL  +Y++C  + D   +F +M   D VSWTS+I    + G   +A+  F  M  +QV PN
Subjt:  YSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLVTQVRPN

Query:  CFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRAGN
          T+A++ S+C +L    ++     LH +V  LG   S  V +S +  YS  G +  A++LF     +D I ++++I GY Q  +GEE  K F  MR   
Subjt:  CFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRAGN

Query:  LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTE
          PTD  L S+L+  G++ V+E GRQVH+L    G E N  V  SL++MYSKCGSI EA  IF +T   + V  T+MI  +A+ G+  +A+ LFE  L +
Subjt:  LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTE

Query:  EGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYR
         GF PD V F +VLTAC H+G LD    YFN M   Y + P  +HY C++DL  R G +  A++++ +M ++ + V+W +LL ACK   ++E GR  A R
Subjt:  EGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYR

Query:  LIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAE
        ++E+DP+ A   VTLA+IY+  G   + A++RK M+ K V K  GWS I+I      F  GD  HPQS +IY   N L+L +  AE
Subjt:  LIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAE

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276106.8e-9036.2Show/hide
Query:  KLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMK-THDQVSWTSIISGLSQNGHGSEAILMFKNMLVT
        +L++ S  S +  CA         Q+H  +VK GF  +  + ++L+  YSKC A+ DA R+F  +    + VSWT++ISG  QN    EA+ +F  M   
Subjt:  KLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMK-THDQVSWTSIISGLSQNGHGSEAILMFKNMLVT

Query:  QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+ ++++ P +         + +HA V K  +  SS V ++ +D Y KLG++EEAA +F     KD + +++M++GY+Q    E A+K+F E
Subjt:  QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRAGNLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        +  G + P + T +S+LN C +    + QG+Q H    K   ++++ V  +LL MY+K G+I+ A  +F +  EK+ V   SMI  +AQ G+   AL +F
Subjt:  MRAGNLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
        + +   +  + D V F  V  AC HAGL++E   YF+ M  + ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK
        R  A ++I M P ++A YV L+++YA +G W + A +RK M ++ V+K  G+SWIE+  K + F  GD +HP   +IY KL  L   +K
Subjt:  REVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK

Arabidopsis top hitse value%identityAlignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-10136.33Show/hide
Query:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLV
        G  L +YS  S L++C+  ++   G+Q+H+ I K  F  ++++ S+LVD+YSKC  + DA+RVF  M   + VSW S+I+   QNG   EA+ +F+ ML 
Subjt:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLV

Query:  TQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISST-IDCYSKLGRIEEAALLFYESTVKD------------------------
        ++V P+  T A+VIS+C +L     I++   +H  V K     +  ++S+  +D Y+K  RI+EA  +F    +++                        
Subjt:  TQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISST-IDCYSKLGRIEEAALLFYESTVKD------------------------

Query:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSI
              N++ +N++I+GY+QN   EEAL LF  ++  ++ PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG +
Subjt:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSI

Query:  DEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARN
        +E + +F + +E++ V   +MI+ FAQ G G++AL+LF  +L E G  PDH+    VL+AC HAG ++E   YF+ M  ++ + P  DHY C++DL  R 
Subjt:  DEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARN

Query:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAH
        G +E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L+E++PSN+ PYV L+++YA  G W  + ++RK M+++ V K  G SWI+I    H
Subjt:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAH

Query:  VFSVGDGTHPQSCEIYSKLNQLDLDMKAAEN-------ASKALDY
        VF V D +HP+  +I+S L+ L  +M+  ++       +S+ +DY
Subjt:  VFSVGDGTHPQSCEIYSKLNQLDLDMKAAEN-------ASKALDY

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.8e-9136.2Show/hide
Query:  KLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMK-THDQVSWTSIISGLSQNGHGSEAILMFKNMLVT
        +L++ S  S +  CA         Q+H  +VK GF  +  + ++L+  YSKC A+ DA R+F  +    + VSWT++ISG  QN    EA+ +F  M   
Subjt:  KLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMK-THDQVSWTSIISGLSQNGHGSEAILMFKNMLVT

Query:  QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+ ++++ P +         + +HA V K  +  SS V ++ +D Y KLG++EEAA +F     KD + +++M++GY+Q    E A+K+F E
Subjt:  QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRAGNLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF
        +  G + P + T +S+LN C +    + QG+Q H    K   ++++ V  +LL MY+K G+I+ A  +F +  EK+ V   SMI  +AQ G+   AL +F
Subjt:  MRAGNLSPTDHTLTSVLNACGSLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLF

Query:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
        + +   +  + D V F  V  AC HAGL++E   YF+ M  + ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK
        R  A ++I M P ++A YV L+++YA +G W + A +RK M ++ V+K  G+SWIE+  K + F  GD +HP   +IY KL  L   +K
Subjt:  REVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-9233.27Show/hide
Query:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK
        + S G    + SL     +CA       GLQI+   +K     ++ + ++ +D+Y KC A+ +A RVF  M+  D VSW +II+   QNG G E + +F 
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFK

Query:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFYESTVKDN----------------
        +ML +++ P+ FT+ +++ +C        +     +H+ + K G   +S V  S ID YSK G IEEA    +  F  + V                   
Subjt:  NMLVTQVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEA----ALLFYESTVKDN----------------

Query:  IIFNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKN
        + +NS+ISGY      E+A  LF  M    ++P   T  +VL+ C +L     G+Q+H+ V K   +++V++  +L+DMYSKCG + ++  +F +++ ++
Subjt:  IIFNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKN

Query:  SVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP
         V   +MI  +A  G+G +A++LFE ++  E   P+HV F ++L AC H GL+D+ + YF  M  +Y LDPQ+ HY+ ++D+  ++G V++A +L+ +MP
Subjt:  SVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP

Query:  YESNYVMWCSLLGACKVH-EEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSC
        +E++ V+W +LLG C +H   VE+  E    L+ +DP +++ Y  L+++YA AG+W +++D+R+ M+  K++K  G SW+E+  + HVF VGD  HP+  
Subjt:  YESNYVMWCSLLGACKVH-EEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSC

Query:  EIYSKLNQLDLDMKAAENAS
        EIY +L  +  +MK  +++S
Subjt:  EIYSKLNQLDLDMKAAENAS

AT3G47840.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.5e-9238.89Show/hide
Query:  YSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLVTQVRPN
        Y+   AL +CA       G  IH  ++  GF   L + +SL  +Y++C  + D   +F +M   D VSWTS+I    + G   +A+  F  M  +QV PN
Subjt:  YSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLVTQVRPN

Query:  CFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRAGN
          T+A++ S+C +L    ++     LH +V  LG   S  V +S +  YS  G +  A++LF     +D I ++++I GY Q  +GEE  K F  MR   
Subjt:  CFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRAGN

Query:  LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTE
          PTD  L S+L+  G++ V+E GRQVH+L    G E N  V  SL++MYSKCGSI EA  IF +T   + V  T+MI  +A+ G+  +A+ LFE  L +
Subjt:  LSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTE

Query:  EGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYR
         GF PD V F +VLTAC H+G LD    YFN M   Y + P  +HY C++DL  R G +  A++++ +M ++ + V+W +LL ACK   ++E GR  A R
Subjt:  EGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYR

Query:  LIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAE
        ++E+DP+ A   VTLA+IY+  G   + A++RK M+ K V K  GWS I+I      F  GD  HPQS +IY   N L+L +  AE
Subjt:  LIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAE

AT3G49170.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.8e-9335.63Show/hide
Query:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNA---IWDAKRVFSHMKTHDQVSWTSIISGLSQNGH-GSEAILMFK
        G +  K++L S  ++CA+  N  LG Q+H+  ++ G  D+  +  SLVD+Y+KC+A   + D ++VF  M+ H  +SWT++I+G  +N +  +EAI +F 
Subjt:  GAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNA---IWDAKRVFSHMKTHDQVSWTSIISGLSQNGH-GSEAILMFK

Query:  NMLVT-QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA
         M+    V PN FT+++   +C  L D    R+   +    FK G   +S V +S I  + K  R+E+A   F   + K+ + +N+ + G  +NL  E+A
Subjt:  NMLVT-QVRPNCFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD
         KL  E+    L  +  T  S+L+   ++  + +G Q+HS V K+G   N  V  +L+ MYSKCGSID A  +FN    +N +  TSMI  FA+ G    
Subjt:  LKLFVEMRAGNLSPTDHTLTSVLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSD

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
         L+ F  ++ EEG  P+ V + A+L+AC+H GL+ E   +FN M  ++++ P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK
          ELG+  A +++E+DP+  A Y+ L++IYA AG W +  ++R++M+++ + K  G SWIE+  K H F VGD  HP + +IY +L++L  ++K
Subjt:  EVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLADIRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCAGCTTAGGAGCAAAATTGACCAAGTATTCTCTGTGCTCTGCCCTTAATTCTTGTGCTAAAACACATAATTTTCTTTTGGGTTTGCAAATTCATGCCCAGATTGT
GAAAATTGGATTTGAAGATAACTTATTTTTGAACAGTTCACTTGTTGATTTATACTCCAAATGTAATGCCATTTGGGATGCAAAAAGGGTCTTCTCTCATATGAAGACTC
ATGATCAAGTATCTTGGACCTCTATAATATCTGGGCTCTCCCAAAATGGGCATGGTAGTGAAGCCATCTTGATGTTTAAGAATATGTTGGTAACCCAGGTTAGACCCAAC
TGTTTTACTTATGCCACTGTTATTAGTTCATGCCCTACTCTGAAGGATGAACTTCAGATTCGTCTTTCAACTTTGCTTCATGCTCATGTTTTCAAACTTGGATTTACTTT
AAGCAGCTTTGTAATTAGCTCTACTATTGATTGTTACTCAAAACTAGGAAGAATAGAAGAAGCTGCTCTGCTCTTTTATGAATCAACTGTGAAGGATAATATCATATTTA
ATTCTATGATATCAGGGTATTCTCAAAACTTGTATGGGGAGGAGGCTTTAAAACTGTTTGTAGAAATGAGAGCTGGTAATTTGAGCCCAACTGATCATACACTAACTAGT
GTTTTAAATGCTTGTGGGAGTTTGACAGTACTTGAACAAGGAAGGCAAGTGCACTCTCTGGTTACAAAAATGGGATCAGAAAATAATGTGTTTGTGGTCTGTTCTCTGCT
AGATATGTACTCGAAGTGTGGCAGTATCGACGAGGCATTTTTTATATTCAATCAAACGGTGGAAAAGAACAGTGTGTTGTCGACTTCAATGATAATGGCTTTTGCTCAAT
GTGGTAGAGGCTCAGATGCCTTAAAGCTCTTTGAGAGTTTGTTGACTGAAGAAGGTTTCTTGCCTGATCATGTCTGTTTTACTGCAGTTTTAACTGCCTGCAATCATGCA
GGATTACTAGATGAGGCAGTTGGATATTTCAATAAAATGGGCAGTGAATACAGATTAGATCCACAAATTGATCATTATGCTTGTTTGATTGATCTCTATGCCAGAAATGG
GCATGTAGAAAAAGCTAAGCAATTGATGGAGCAAATGCCTTACGAGTCTAATTACGTAATGTGGTGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTCG
GGAGGGAGGTGGCGTATCGACTCATCGAGATGGATCCAAGTAATGCTGCACCCTATGTAACACTTGCTCATATCTATGCTAGAGCTGGTTTATGGGCACAGTTGGCTGAT
ATCAGAAAACAGATGCAACAGAAAAAGGTAAGGAAAAGCGCAGGGTGGAGCTGGATTGAGATAGACAAGAAAGCTCATGTCTTCTCAGTTGGTGATGGTACTCATCCTCA
ATCCTGTGAGATTTATTCAAAACTTAACCAACTGGACTTGGATATGAAAGCAGCTGAAAATGCATCAAAAGCACTTGATTATGTTGAGTTTTCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTGCAGCTTAGGAGCAAAATTGACCAAGTATTCTCTGTGCTCTGCCCTTAATTCTTGTGCTAAAACACATAATTTTCTTTTGGGTTTGCAAATTCATGCCCAGATTGT
GAAAATTGGATTTGAAGATAACTTATTTTTGAACAGTTCACTTGTTGATTTATACTCCAAATGTAATGCCATTTGGGATGCAAAAAGGGTCTTCTCTCATATGAAGACTC
ATGATCAAGTATCTTGGACCTCTATAATATCTGGGCTCTCCCAAAATGGGCATGGTAGTGAAGCCATCTTGATGTTTAAGAATATGTTGGTAACCCAGGTTAGACCCAAC
TGTTTTACTTATGCCACTGTTATTAGTTCATGCCCTACTCTGAAGGATGAACTTCAGATTCGTCTTTCAACTTTGCTTCATGCTCATGTTTTCAAACTTGGATTTACTTT
AAGCAGCTTTGTAATTAGCTCTACTATTGATTGTTACTCAAAACTAGGAAGAATAGAAGAAGCTGCTCTGCTCTTTTATGAATCAACTGTGAAGGATAATATCATATTTA
ATTCTATGATATCAGGGTATTCTCAAAACTTGTATGGGGAGGAGGCTTTAAAACTGTTTGTAGAAATGAGAGCTGGTAATTTGAGCCCAACTGATCATACACTAACTAGT
GTTTTAAATGCTTGTGGGAGTTTGACAGTACTTGAACAAGGAAGGCAAGTGCACTCTCTGGTTACAAAAATGGGATCAGAAAATAATGTGTTTGTGGTCTGTTCTCTGCT
AGATATGTACTCGAAGTGTGGCAGTATCGACGAGGCATTTTTTATATTCAATCAAACGGTGGAAAAGAACAGTGTGTTGTCGACTTCAATGATAATGGCTTTTGCTCAAT
GTGGTAGAGGCTCAGATGCCTTAAAGCTCTTTGAGAGTTTGTTGACTGAAGAAGGTTTCTTGCCTGATCATGTCTGTTTTACTGCAGTTTTAACTGCCTGCAATCATGCA
GGATTACTAGATGAGGCAGTTGGATATTTCAATAAAATGGGCAGTGAATACAGATTAGATCCACAAATTGATCATTATGCTTGTTTGATTGATCTCTATGCCAGAAATGG
GCATGTAGAAAAAGCTAAGCAATTGATGGAGCAAATGCCTTACGAGTCTAATTACGTAATGTGGTGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTCG
GGAGGGAGGTGGCGTATCGACTCATCGAGATGGATCCAAGTAATGCTGCACCCTATGTAACACTTGCTCATATCTATGCTAGAGCTGGTTTATGGGCACAGTTGGCTGAT
ATCAGAAAACAGATGCAACAGAAAAAGGTAAGGAAAAGCGCAGGGTGGAGCTGGATTGAGATAGACAAGAAAGCTCATGTCTTCTCAGTTGGTGATGGTACTCATCCTCA
ATCCTGTGAGATTTATTCAAAACTTAACCAACTGGACTTGGATATGAAAGCAGCTGAAAATGCATCAAAAGCACTTGATTATGTTGAGTTTTCATGA
Protein sequenceShow/hide protein sequence
MCSLGAKLTKYSLCSALNSCAKTHNFLLGLQIHAQIVKIGFEDNLFLNSSLVDLYSKCNAIWDAKRVFSHMKTHDQVSWTSIISGLSQNGHGSEAILMFKNMLVTQVRPN
CFTYATVISSCPTLKDELQIRLSTLLHAHVFKLGFTLSSFVISSTIDCYSKLGRIEEAALLFYESTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRAGNLSPTDHTLTS
VLNACGSLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSIDEAFFIFNQTVEKNSVLSTSMIMAFAQCGRGSDALKLFESLLTEEGFLPDHVCFTAVLTACNHA
GLLDEAVGYFNKMGSEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYRLIEMDPSNAAPYVTLAHIYARAGLWAQLAD
IRKQMQQKKVRKSAGWSWIEIDKKAHVFSVGDGTHPQSCEIYSKLNQLDLDMKAAENASKALDYVEFS