CuGenDBv2

HG10022038 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022038
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr05:20118453..20119979
RNA-Seq Expression: HG10022038
Synteny: HG10022038
Gene Ontology terms:
GO:0016021 - integral component of membrane (cellular component)
GO:0043005 - neuron projection (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003774 - motor activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
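
The genome location field above uses a `chromosome:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts and turned into a span length. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of CuGenDB)."""
    chromosome: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both end points count towards the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr05:20118453..20119979'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end = match.groups()
    return Location(chromosome, int(start), int(end))


loc = parse_location("Chr05:20118453..20119979")
print(loc.chromosome, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span for this gene works out to 1527 bp.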


Homology

GenBank top hits (with e value, % identity, and alignment)
XP_008464861.1 PREDICTED: pentatricopeptide repeat-containing protein At2g13600-like [Cucumis melo] (e value: 9.6e-267; identity: 89.96%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLT YSLCSAL+SCAKTHNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIVNAKRVF  MKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL
         MLVT+VRPNCFTYATVISSCPTLK+ELQI LA LLHAHVIK GF+FS+FVISSTIDCYSKLGRIQEA+LLF ET+VKDNIIFNSMISGYSQNL GEEAL
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA
        KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGS+DEAF LFN+TV+KNSVLSTSMIMAFAQCGRG EA
Subjt:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE L TE+ F+PDH+CFTAVLTACNHAGLL+EAV+YFNKM CEY+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK
        VELGREVAY+LIEMDP NAAPY+TLAH+YARAGLWTQ+ EIRK+MQQKR RKSAGWSWIEIDKKTHVFSVGDA HPKSCEIYSKLDQL+LDMK AEQS K
Subjt:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK

Query:  ALEYDVEC
        ALEYDVEC
Subjt:  ALEYDVEC
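
In the alignment blocks on this page, the unlabelled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single block. `block_stats` is a hypothetical helper, and the reported % identity for a hit is computed over the whole alignment, so a single block will generally not reproduce it.

```python
from typing import Tuple


def block_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    query   -- the residues from a 'Query:' line (label and padding removed)
    midline -- the matching middle line, cut to start at the same column
    """
    length = len(query)
    # Trailing spaces of the midline are often lost in copied text,
    # so pad (or trim) it to the length of the query segment.
    midline = midline.ljust(length)[:length]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, length


# First 20 columns of the first block of the Cucumis melo hit above.
ident, pos, length = block_stats("MCSLGAKLTKYSLCSALNSC", "MCSLGAKLT YSLCSAL+SC")
print(f"{ident}/{length} identical, {pos}/{length} positive")
```

Summing the identities and lengths over all blocks of a hit, then dividing, gives a value that can be compared with the % identity listed for that hit.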

XP_011653616.1 pentatricopeptide repeat-containing protein At2g13600 [Cucumis sativus] (e value: 1.7e-263; identity: 89.37%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGA LTKYSLCSAL+SCAKTHNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIVNAKRVF  MKTHD VSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL
        NMLVT+VRPNCFTYATVISSCPTLK+ELQI LA LLHAHVIK GF+FS+FVISSTIDCYSKLGRI+EAALLF E++VKDNIIFNSMISGYSQNLYGEEAL
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA
        KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGS+DEAF +FN+TV+KNSVLSTSMI AFAQCGRG EA
Subjt:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFESLLTE+ F+PDH+CFTAVLTACNHAGLL+EAV+YFNKM  EY LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYV+ CSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK
        VELGREVA++LIEMDPSNAAPY+TLAH+ A+AGLWTQ+ EIRK+MQQKR RKSAGWSWIEIDKKTHVFSVGDA HPKSCEIYSKLDQL+LDMK AEQSSK
Subjt:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK

Query:  ALEYDVEC
        ALEYDVEC
Subjt:  ALEYDVEC

XP_022948515.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita moschata] (e value: 4.6e-261; identity: 89.11%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIG E+NL+LNS LV+LYSKCNAIV+AKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA
        NMLVT+ RPNCFTYATVISSCP+LKDEL I L  L HAHVIKLGF  FS+FVISS IDCYSKLGRI+EAALLFYE  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE
        LKLFVEMRASNLSPTDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF +FN+TVEKNSVLSTSMIMAFAQCGRGS+
Subjt:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAV+YFNKM  EYRLDPQIDHYACLIDLYARNGHVEKAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS
        EVELGREVAY+LIEMDP NAAPYVTLAH+YARAGLWTQLA+IR QMQQKR RKSAGWSWIEIDKK HVFSVGDA HPKSCEIYSKL+QL LDM+ AE +S
Subjt:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS

Query:  KALEY
        KALE+
Subjt:  KALEY

XP_023521522.1 pentatricopeptide repeat-containing protein At2g13600-like [Cucurbita pepo subsp. pepo] (e value: 1.5e-259; identity: 88.91%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIG E+NL+LNSALV+LYSKCNAIV+AKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA
        NMLVT+ RPNCFTYATVISSCP+LKDEL   L  L HAHVIKLGF  FS+FVISS IDCYSKLGRI+EAALLFYE  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE
        LKLFVEMRASNLSPTDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF +FN+TVEKNSVLSTSMIMAFAQCGRGS+
Subjt:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAV+YFNKM  EYRLDPQIDHYACLIDLYARNGHV KAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS
         VELGREVAY+LIEMDPSNAAPYVTLAH+YARAGLWTQLA+IRKQMQQKR RKSAGWSWIEIDKK HVFSVGDA HPKSCEIYSKL+QL LDM+  E +S
Subjt:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS

Query:  KALEY
        KALE+
Subjt:  KALEY

XP_038896055.1 pentatricopeptide repeat-containing protein At3g12770-like [Benincasa hispida] (e value: 6.9e-281; identity: 94.88%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIVNAKRVF HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL
        NMLVT+VRPNCFTYATVISSC TLKDELQI LA LLHAHVIKLGF+FSNFVISSTIDCYSKLGR+QEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA
        KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAF+LFN+T+EKNSVLSTSMIMAFAQCGRGSEA
Subjt:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFESLLTEEGFLPDHVCF AVLTACNHAGLLNEAV+YFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK
        VELGREVAYQLIEMDPSNAAPYVTLAH+YARAGLWTQL EIRK+MQQKR RKSAGWSWIEIDKK HVFSVGDA HPKSCEIYSKLDQL+LDMK AE +SK
Subjt:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK

Query:  ALEYDVEC
        ALEYDVEC
Subjt:  ALEYDVEC

TrEMBL top hits (with e value, % identity, and alignment)
A0A1S3CML0 pentatricopeptide repeat-containing protein At2g13600-like (e value: 4.7e-267; identity: 89.96%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLT YSLCSAL+SCAKTHNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIVNAKRVF  MKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL
         MLVT+VRPNCFTYATVISSCPTLK+ELQI LA LLHAHVIK GF+FS+FVISSTIDCYSKLGRIQEA+LLF ET+VKDNIIFNSMISGYSQNL GEEAL
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA
        KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGS+DEAF LFN+TV+KNSVLSTSMIMAFAQCGRG EA
Subjt:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE L TE+ F+PDH+CFTAVLTACNHAGLL+EAV+YFNKM CEY+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK
        VELGREVAY+LIEMDP NAAPY+TLAH+YARAGLWTQ+ EIRK+MQQKR RKSAGWSWIEIDKKTHVFSVGDA HPKSCEIYSKLDQL+LDMK AEQS K
Subjt:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK

Query:  ALEYDVEC
        ALEYDVEC
Subjt:  ALEYDVEC

A0A5A7UEE6 Pentatricopeptide repeat-containing protein (e value: 4.7e-267; identity: 89.96%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLT YSLCSAL+SCAKTHNLFLGLQIHAQIVKIG EENLFLNS+LVDLYSKCNAIVNAKRVF  MKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL
         MLVT+VRPNCFTYATVISSCPTLK+ELQI LA LLHAHVIK GF+FS+FVISSTIDCYSKLGRIQEA+LLF ET+VKDNIIFNSMISGYSQNL GEEAL
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA
        KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSL+TKMGSENNVFVVCSLLDMYSKCGS+DEAF LFN+TV+KNSVLSTSMIMAFAQCGRG EA
Subjt:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE L TE+ F+PDH+CFTAVLTACNHAGLL+EAV+YFNKM CEY+LDPQIDHYACLIDLYARNG+VEKAKQ+MEQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK
        VELGREVAY+LIEMDP NAAPY+TLAH+YARAGLWTQ+ EIRK+MQQKR RKSAGWSWIEIDKKTHVFSVGDA HPKSCEIYSKLDQL+LDMK AEQS K
Subjt:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK

Query:  ALEYDVEC
        ALEYDVEC
Subjt:  ALEYDVEC

A0A6J1DYD1 pentatricopeptide repeat-containing protein At2g13600-like (e value: 2.5e-252; identity: 86.25%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MC LGAKLTKY+LCSALNSCAKT NLF GLQIHAQIVKIG EENLFLNSALVDLY+KCNAIV+AKRVFF M+THDQVSWTS+ISGLS+NG G EAI MFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL
         MLVT+ RPNCFTYATVISSCPTL+D+LQ+RLA LLHAHVIK GF+FS+FVISSTIDCYSKLGRI +AALLFYET  KDNIIFNSMISGYSQNL+GEEAL
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEAL

Query:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA
        KLFVEMR+S+LSPTDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSL+DMYSKCG VDEAF++FN+ VEKNSVLSTSMIMAFAQCGRGS+A
Subjt:  KLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEA

Query:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE
        LKLFE LL EEGFLPDHVCFTAVLTACNHAG L+EAV+YFNKM  EY LDPQIDHYACLIDLYARNGH+EKAKQL+EQMPYESNYVMWCSLLGACKVH E
Subjt:  LKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEE

Query:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK
        VELGREVAY+LIEMDPSNAAPYVTLAH+YARAGLW Q+ +IRK+MQQ++ RKS GWSWIEIDKK+HVFS GDA HPKS EIYSKLDQL LDMK AE +SK
Subjt:  VELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSK

Query:  AL
        AL
Subjt:  AL

A0A6J1GA23 pentatricopeptide repeat-containing protein At2g13600-like (e value: 2.2e-261; identity: 89.11%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIG E+NL+LNS LV+LYSKCNAIV+AKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA
        NMLVT+ RPNCFTYATVISSCP+LKDEL I L  L HAHVIKLGF  FS+FVISS IDCYSKLGRI+EAALLFYE  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE
        LKLFVEMRASNLSPTDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF +FN+TVEKNSVLSTSMIMAFAQCGRGS+
Subjt:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAV+YFNKM  EYRLDPQIDHYACLIDLYARNGHVEKAK+LME+MPYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS
        EVELGREVAY+LIEMDP NAAPYVTLAH+YARAGLWTQLA+IR QMQQKR RKSAGWSWIEIDKK HVFSVGDA HPKSCEIYSKL+QL LDM+ AE +S
Subjt:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS

Query:  KALEY
        KALE+
Subjt:  KALEY

A0A6J1K975 pentatricopeptide repeat-containing protein At2g13600-like (e value: 1.6e-259; identity: 88.71%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        MCSLGAKLTKYSLCSALNSCAKT NLFLGLQIHAQIVKIG E+NL+LNS LV+LYSKCNAIV+AKR+F HMKTHDQVSWTSIISGLSQNG G EAILMFK
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA
        NMLVT+ RPNCFTYATVISSCP+LKDEL   L  L HAHVIKLGF  FS+FVISS IDCYSKLGRI+EAALLFYE  VKDN+IFNSMISG+SQNLYGEEA
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGF-SFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE
        LKLFVEMRASNLS TDHTLTSVLNACG LTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVD+AF +FN+TVEKNSVLSTSMIMAFAQCGRGS+
Subjt:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
        ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAV+YFNKM  EYRLDPQIDHYACLIDLYARNGHVEKAK+LME++PYESNYVMWCSLLGACKVH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS
        EVELGREVAY+LIEMDPSNAAPYVTLAH+YARAGLWTQLA+IRKQMQQKR RKSAGWSWIEIDKK HVFSVGDA HPKSCEIYSKL+QL LDM+  E +S
Subjt:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSS

Query:  KALEY
        KALE+
Subjt:  KALEY

SwissProt top hits (with e value, % identity, and alignment)
Q5G1T1 Pentatricopeptide repeat-containing protein At3g49170, chloroplastic (e value: 2.0e-89; identity: 34.82%)
Query:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVNAKRVFFHMKTHDQVSWTSIISGLSQNGH-GHEAILMFK
        G +  K++L S  ++CA+  NL LG Q+H+  ++ GL ++  +  +LVD+Y+KC+A   + + ++VF  M+ H  +SWT++I+G  +N +   EAI +F 
Subjt:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVNAKRVFFHMKTHDQVSWTSIISGLSQNGH-GHEAILMFK

Query:  NMLVT-KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA
         M+    V PN FT+++   +C  L D    R+   +     K G + ++ V +S I  + K  R+++A   F   + K+ + +N+ + G  +NL  E+A
Subjt:  NMLVT-KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE
         KL  E+    L  +  T  S+L+    +  + +G Q+HS V K+G   N  V  +L+ MYSKCGS+D A  +FN    +N +  TSMI  FA+ G    
Subjt:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
         L+ F  ++ EEG  P+ V + A+L+AC+H GL++E   +FN M  ++++ P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK
          ELG+  A +++E+DP+  A Y+ L+++YA AG W +  E+R++M+++   K  G SWIE+  K H F VGD  HP + +IY +LD+L  ++K
Subjt:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK

Q9FWA6 Pentatricopeptide repeat-containing protein At3g02330, mitochondrial (e value: 4.3e-92; identity: 33.33%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        + S G    + SL     +CA    L  GLQI+   +K  L  ++ + +A +D+Y KC A+  A RVF  M+  D VSW +II+   QNG G+E + +F 
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEA----ALLFYETTVKDN----------------
        +ML +++ P+ FT+ +++ +C        +   + +H+ ++K G + ++ V  S ID YSK G I+EA    +  F    V                   
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEA----ALLFYETTVKDN----------------

Query:  IIFNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKN
        + +NS+ISGY      E+A  LF  M    ++P   T  +VL+ C  L     G+Q+H+ V K   +++V++  +L+DMYSKCG + ++  +F +++ ++
Subjt:  IIFNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKN

Query:  SVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP
         V   +MI  +A  G+G EA++LFE ++  E   P+HV F ++L AC H GL+++ ++YF  M  +Y LDPQ+ HY+ ++D+  ++G V++A +L+ +MP
Subjt:  SVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP

Query:  YESNYVMWCSLLGACKVH-EEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSC
        +E++ V+W +LLG C +H   VE+  E    L+ +DP +++ Y  L++VYA AG+W +++++R+ M+  + +K  G SW+E+  + HVF VGD  HP+  
Subjt:  YESNYVMWCSLLGACKVH-EEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSC

Query:  EIYSKLDQLSLDMKVAEQSSKALEYDVE
        EIY +L  +  +MK  + SS     +VE
Subjt:  EIYSKLDQLSLDMKVAEQSSKALEYDVE

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 (e value: 3.1e-90; identity: 35.28%)
Query:  LNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQ--VSWTSIISGLSQNGHGHEAILMFKNMLVTKVRPNCFTY
        L +C+   +L +G  +HAQ+ ++G + ++F+ + L+ LY+KC  + +A+ VF  +   ++  VSWT+I+S  +QNG   EA+ +F  M    V+P+    
Subjt:  LNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQ--VSWTSIISGLSQNGHGHEAILMFKNMLVTKVRPNCFTY

Query:  ATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRASNLSPT
         +V+++   L+D  Q R    +HA V+K+G      ++ S    Y+K G++  A +LF +    + I++N+MISGY++N Y  EA+ +F EM   ++ P 
Subjt:  ATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRASNLSPT

Query:  DHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFL
          ++TS ++AC  +  LEQ R ++  V +    ++VF+  +L+DM++KCGSV+ A  +F+RT++++ V+ ++MI+ +   GR  EA+ L+ + +   G  
Subjt:  DHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFL

Query:  PDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEM
        P+ V F  +L ACNH+G++ E   +FN+M+ +++++PQ  HYAC+IDL  R GH+++A ++++ MP +    +W +LL ACK H  VELG   A QL  +
Subjt:  PDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEM

Query:  DPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK
        DPSN   YV L+++YA A LW ++AE+R +M++K   K  G SW+E+  +   F VGD  HP+  EI  +++ +   +K
Subjt:  DPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK

Q9SIT7 Pentatricopeptide repeat-containing protein At2g13600 (e value: 2.8e-99; identity: 36.51%)
Query:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFKNMLV
        G  L +YS  S L++C+  +++  G+Q+H+ I K     ++++ SALVD+YSKC  + +A+RVF  M   + VSW S+I+   QNG   EA+ +F+ ML 
Subjt:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFKNMLV

Query:  TKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISST-IDCYSKLGRIQEAALLFYETTVKD------------------------
        ++V P+  T A+VIS+C +L     I++   +H  V+K     ++ ++S+  +D Y+K  RI+EA  +F    +++                        
Subjt:  TKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISST-IDCYSKLGRIQEAALLFYETTVKD------------------------

Query:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV
              N++ +N++I+GY+QN   EEAL LF  ++  ++ PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG V
Subjt:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV

Query:  DEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARN
        +E + +F + +E++ V   +MI+ FAQ G G+EAL+LF  +L E G  PDH+    VL+AC HAG + E   YF+ M+ ++ + P  DHY C++DL  R 
Subjt:  DEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARN

Query:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTH
        G +E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L+E++PSN+ PYV L+++YA  G W  +  +RK M+++   K  G SWI+I    H
Subjt:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTH

Query:  VFSVGDADHPKSCEIYSKLDQLSLDMK-------VAEQSSKALEY
        VF V D  HP+  +I+S LD L  +M+       +   SS+ ++Y
Subjt:  VFSVGDADHPKSCEIYSKLDQLSLDMK-------VAEQSSKALEY

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g27610 (e value: 3.4e-89; identity: 35.79%)
Query:  KLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMK-THDQVSWTSIISGLSQNGHGHEAILMFKNMLVT
        +L++ S  S +  CA    L    Q+H  +VK G   +  + +AL+  YSKC A+++A R+F  +    + VSWT++ISG  QN    EA+ +F  M   
Subjt:  KLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMK-THDQVSWTSIISGLSQNGHGHEAILMFKNMLVT

Query:  KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+ ++++ P +           +HA V+K  +  S+ V ++ +D Y KLG+++EAA +F     KD + +++M++GY+Q    E A+K+F E
Subjt:  KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRASNLSPTDHTLTSVLNACGCLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLF
        +    + P + T +S+LN C      + QG+Q H    K   ++++ V  +LL MY+K G+++ A  +F R  EK+ V   SMI  +AQ G+  +AL +F
Subjt:  MRASNLSPTDHTLTSVLNACGCLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLF

Query:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
        + +   +  + D V F  V  AC HAGL+ E   YF+ M  + ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK
        R  A ++I M P ++A YV L+++YA +G W + A++RK M ++  +K  G+SWIE+  KT+ F  GD  HP   +IY KL+ LS  +K
Subjt:  REVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK

Arabidopsis top hits (with e value, % identity, and alignment)
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.0e-100; identity: 36.51%)
Query:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFKNMLV
        G  L +YS  S L++C+  +++  G+Q+H+ I K     ++++ SALVD+YSKC  + +A+RVF  M   + VSW S+I+   QNG   EA+ +F+ ML 
Subjt:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFKNMLV

Query:  TKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISST-IDCYSKLGRIQEAALLFYETTVKD------------------------
        ++V P+  T A+VIS+C +L     I++   +H  V+K     ++ ++S+  +D Y+K  RI+EA  +F    +++                        
Subjt:  TKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISST-IDCYSKLGRIQEAALLFYETTVKD------------------------

Query:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV
              N++ +N++I+GY+QN   EEAL LF  ++  ++ PT ++  ++L AC  L  L  G Q H  V K       G E+++FV  SL+DMY KCG V
Subjt:  ------NII-FNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTK------MGSENNVFVVCSLLDMYSKCGSV

Query:  DEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARN
        +E + +F + +E++ V   +MI+ FAQ G G+EAL+LF  +L E G  PDH+    VL+AC HAG + E   YF+ M+ ++ + P  DHY C++DL  R 
Subjt:  DEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARN

Query:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTH
        G +E+AK ++E+MP + + V+W SLL ACKVH  + LG+ VA +L+E++PSN+ PYV L+++YA  G W  +  +RK M+++   K  G SWI+I    H
Subjt:  GHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTH

Query:  VFSVGDADHPKSCEIYSKLDQLSLDMK-------VAEQSSKALEY
        VF V D  HP+  +I+S LD L  +M+       +   SS+ ++Y
Subjt:  VFSVGDADHPKSCEIYSKLDQLSLDMK-------VAEQSSKALEY

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.4e-90; identity: 35.79%)
Query:  KLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMK-THDQVSWTSIISGLSQNGHGHEAILMFKNMLVT
        +L++ S  S +  CA    L    Q+H  +VK G   +  + +AL+  YSKC A+++A R+F  +    + VSWT++ISG  QN    EA+ +F  M   
Subjt:  KLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMK-THDQVSWTSIISGLSQNGHGHEAILMFKNMLVT

Query:  KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVE
         VRPN FTY+ ++++ P +           +HA V+K  +  S+ V ++ +D Y KLG+++EAA +F     KD + +++M++GY+Q    E A+K+F E
Subjt:  KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVE

Query:  MRASNLSPTDHTLTSVLNACGCLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLF
        +    + P + T +S+LN C      + QG+Q H    K   ++++ V  +LL MY+K G+++ A  +F R  EK+ V   SMI  +AQ G+  +AL +F
Subjt:  MRASNLSPTDHTLTSVLNACGCLTV-LEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLF

Query:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG
        + +   +  + D V F  V  AC HAGL+ E   YF+ M  + ++ P  +H +C++DLY+R G +EKA +++E MP  +   +W ++L AC+VH++ ELG
Subjt:  ESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELG

Query:  REVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK
        R  A ++I M P ++A YV L+++YA +G W + A++RK M ++  +K  G+SWIE+  KT+ F  GD  HP   +IY KL+ LS  +K
Subjt:  REVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK

AT3G02330.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.0e-93; identity: 33.33%)
Query:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK
        + S G    + SL     +CA    L  GLQI+   +K  L  ++ + +A +D+Y KC A+  A RVF  M+  D VSW +II+   QNG G+E + +F 
Subjt:  MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFK

Query:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEA----ALLFYETTVKDN----------------
        +ML +++ P+ FT+ +++ +C        +   + +H+ ++K G + ++ V  S ID YSK G I+EA    +  F    V                   
Subjt:  NMLVTKVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEA----ALLFYETTVKDN----------------

Query:  IIFNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKN
        + +NS+ISGY      E+A  LF  M    ++P   T  +VL+ C  L     G+Q+H+ V K   +++V++  +L+DMYSKCG + ++  +F +++ ++
Subjt:  IIFNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKN

Query:  SVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP
         V   +MI  +A  G+G EA++LFE ++  E   P+HV F ++L AC H GL+++ ++YF  M  +Y LDPQ+ HY+ ++D+  ++G V++A +L+ +MP
Subjt:  SVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMP

Query:  YESNYVMWCSLLGACKVH-EEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSC
        +E++ V+W +LLG C +H   VE+  E    L+ +DP +++ Y  L++VYA AG+W +++++R+ M+  + +K  G SW+E+  + HVF VGD  HP+  
Subjt:  YESNYVMWCSLLGACKVH-EEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSC

Query:  EIYSKLDQLSLDMKVAEQSSKALEYDVE
        EIY +L  +  +MK  + SS     +VE
Subjt:  EIYSKLDQLSLDMKVAEQSSKALEYDVE

AT3G12770.1 mitochondrial editing factor 22 | e value: 2.2e-91 | %identity: 35.28
Query:  LNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQ--VSWTSIISGLSQNGHGHEAILMFKNMLVTKVRPNCFTY
        L +C+   +L +G  +HAQ+ ++G + ++F+ + L+ LY+KC  + +A+ VF  +   ++  VSWT+I+S  +QNG   EA+ +F  M    V+P+    
Subjt:  LNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQ--VSWTSIISGLSQNGHGHEAILMFKNMLVTKVRPNCFTY

Query:  ATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRASNLSPT
         +V+++   L+D  Q R    +HA V+K+G      ++ S    Y+K G++  A +LF +    + I++N+MISGY++N Y  EA+ +F EM   ++ P 
Subjt:  ATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRASNLSPT

Query:  DHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFL
          ++TS ++AC  +  LEQ R ++  V +    ++VF+  +L+DM++KCGSV+ A  +F+RT++++ V+ ++MI+ +   GR  EA+ L+ + +   G  
Subjt:  DHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFL

Query:  PDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEM
        P+ V F  +L ACNH+G++ E   +FN+M+ +++++PQ  HYAC+IDL  R GH+++A ++++ MP +    +W +LL ACK H  VELG   A QL  +
Subjt:  PDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEM

Query:  DPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK
        DPSN   YV L+++YA A LW ++AE+R +M++K   K  G SW+E+  +   F VGD  HP+  EI  +++ +   +K
Subjt:  DPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK

AT3G49170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.4e-90 | %identity: 34.82
Query:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVNAKRVFFHMKTHDQVSWTSIISGLSQNGH-GHEAILMFK
        G +  K++L S  ++CA+  NL LG Q+H+  ++ GL ++  +  +LVD+Y+KC+A   + + ++VF  M+ H  +SWT++I+G  +N +   EAI +F 
Subjt:  GAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNA---IVNAKRVFFHMKTHDQVSWTSIISGLSQNGH-GHEAILMFK

Query:  NMLVT-KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA
         M+    V PN FT+++   +C  L D    R+   +     K G + ++ V +S I  + K  R+++A   F   + K+ + +N+ + G  +NL  E+A
Subjt:  NMLVT-KVRPNCFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEA

Query:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE
         KL  E+    L  +  T  S+L+    +  + +G Q+HS V K+G   N  V  +L+ MYSKCGS+D A  +FN    +N +  TSMI  FA+ G    
Subjt:  LKLFVEMRASNLSPTDHTLTSVLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSE

Query:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE
         L+ F  ++ EEG  P+ V + A+L+AC+H GL++E   +FN M  ++++ P+++HYAC++DL  R G +  A + +  MP++++ ++W + LGAC+VH 
Subjt:  ALKLFESLLTEEGFLPDHVCFTAVLTACNHAGLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHE

Query:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK
          ELG+  A +++E+DP+  A Y+ L+++YA AG W +  E+R++M+++   K  G SWIE+  K H F VGD  HP + +IY +LD+L  ++K
Subjt:  EVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAEIRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMK


Sequences
CDS sequence
ATGTGCAGCTTAGGAGCAAAATTGACCAAGTATTCTCTATGCTCTGCTCTTAATTCTTGTGCTAAAACACATAATTTGTTCTTGGGTTTGCAAATTCATGCTCAGATTGT
CAAAATTGGACTTGAAGAGAACTTATTTTTGAACAGTGCACTTGTTGATTTATATTCCAAATGTAATGCCATTGTGAATGCCAAAAGAGTCTTCTTTCATATGAAGACTC
ATGACCAAGTATCTTGGACTTCTATAATATCAGGGCTCTCCCAAAATGGTCATGGTCATGAAGCCATCCTCATGTTCAAGAATATGTTGGTAACTAAGGTTAGACCCAAC
TGTTTTACTTATGCCACTGTTATTAGTTCATGCCCAACTTTGAAGGATGAACTTCAGATTCGTCTTGCAATTTTGCTTCATGCTCATGTTATCAAACTTGGTTTTAGTTT
TAGCAATTTTGTAATTAGCTCTACTATTGATTGTTACTCTAAACTAGGAAGAATACAAGAAGCTGCTCTGCTCTTTTATGAGACAACTGTGAAGGATAATATCATATTTA
ATTCTATGATATCAGGGTATTCTCAAAACTTGTATGGGGAAGAGGCATTGAAACTGTTTGTAGAAATGAGAGCTAGTAATTTGAGCCCAACTGATCATACATTAACTAGT
GTTTTAAATGCCTGTGGGTGTTTAACAGTACTTGAACAAGGAAGGCAAGTTCACTCTCTAGTTACAAAAATGGGATCAGAAAATAATGTGTTTGTAGTCTGTTCATTGCT
AGATATGTACTCGAAGTGTGGCAGTGTCGACGAAGCATTTTATTTATTCAATCGGACAGTAGAAAAGAACAGTGTGTTGTCGACATCGATGATAATGGCTTTTGCTCAAT
GTGGTAGAGGCTCAGAAGCATTAAAGCTCTTTGAGAGTTTGTTGACTGAAGAAGGTTTCTTGCCTGACCATGTCTGTTTTACTGCAGTTCTAACTGCCTGCAATCATGCA
GGATTACTAAATGAGGCAGTTGATTATTTCAATAAAATGAGCTGTGAATACAGATTAGATCCTCAAATTGATCATTATGCTTGTTTGATTGACCTCTATGCCAGAAATGG
GCATGTAGAAAAAGCCAAGCAATTGATGGAGCAAATGCCTTATGAATCTAATTACGTAATGTGGTGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTTG
GGAGGGAGGTGGCATATCAACTCATCGAGATGGATCCAAGTAATGCTGCACCCTATGTAACGCTTGCTCATGTCTATGCTAGAGCTGGTTTATGGACACAGTTGGCTGAG
ATTAGAAAACAGATGCAACAAAAAAGGGCAAGGAAAAGTGCAGGGTGGAGTTGGATTGAGATAGATAAGAAAACTCATGTCTTCTCAGTTGGTGATGCTGATCATCCTAA
ATCCTGTGAGATTTATTCAAAACTTGATCAACTGAGTTTGGATATGAAAGTAGCTGAACAATCATCAAAAGCACTTGAATATGATGTTGAGTGTTAA
mRNA sequence
ATGTGCAGCTTAGGAGCAAAATTGACCAAGTATTCTCTATGCTCTGCTCTTAATTCTTGTGCTAAAACACATAATTTGTTCTTGGGTTTGCAAATTCATGCTCAGATTGT
CAAAATTGGACTTGAAGAGAACTTATTTTTGAACAGTGCACTTGTTGATTTATATTCCAAATGTAATGCCATTGTGAATGCCAAAAGAGTCTTCTTTCATATGAAGACTC
ATGACCAAGTATCTTGGACTTCTATAATATCAGGGCTCTCCCAAAATGGTCATGGTCATGAAGCCATCCTCATGTTCAAGAATATGTTGGTAACTAAGGTTAGACCCAAC
TGTTTTACTTATGCCACTGTTATTAGTTCATGCCCAACTTTGAAGGATGAACTTCAGATTCGTCTTGCAATTTTGCTTCATGCTCATGTTATCAAACTTGGTTTTAGTTT
TAGCAATTTTGTAATTAGCTCTACTATTGATTGTTACTCTAAACTAGGAAGAATACAAGAAGCTGCTCTGCTCTTTTATGAGACAACTGTGAAGGATAATATCATATTTA
ATTCTATGATATCAGGGTATTCTCAAAACTTGTATGGGGAAGAGGCATTGAAACTGTTTGTAGAAATGAGAGCTAGTAATTTGAGCCCAACTGATCATACATTAACTAGT
GTTTTAAATGCCTGTGGGTGTTTAACAGTACTTGAACAAGGAAGGCAAGTTCACTCTCTAGTTACAAAAATGGGATCAGAAAATAATGTGTTTGTAGTCTGTTCATTGCT
AGATATGTACTCGAAGTGTGGCAGTGTCGACGAAGCATTTTATTTATTCAATCGGACAGTAGAAAAGAACAGTGTGTTGTCGACATCGATGATAATGGCTTTTGCTCAAT
GTGGTAGAGGCTCAGAAGCATTAAAGCTCTTTGAGAGTTTGTTGACTGAAGAAGGTTTCTTGCCTGACCATGTCTGTTTTACTGCAGTTCTAACTGCCTGCAATCATGCA
GGATTACTAAATGAGGCAGTTGATTATTTCAATAAAATGAGCTGTGAATACAGATTAGATCCTCAAATTGATCATTATGCTTGTTTGATTGACCTCTATGCCAGAAATGG
GCATGTAGAAAAAGCCAAGCAATTGATGGAGCAAATGCCTTATGAATCTAATTACGTAATGTGGTGTTCCCTCTTAGGTGCTTGCAAAGTTCATGAAGAGGTCGAGCTTG
GGAGGGAGGTGGCATATCAACTCATCGAGATGGATCCAAGTAATGCTGCACCCTATGTAACGCTTGCTCATGTCTATGCTAGAGCTGGTTTATGGACACAGTTGGCTGAG
ATTAGAAAACAGATGCAACAAAAAAGGGCAAGGAAAAGTGCAGGGTGGAGTTGGATTGAGATAGATAAGAAAACTCATGTCTTCTCAGTTGGTGATGCTGATCATCCTAA
ATCCTGTGAGATTTATTCAAAACTTGATCAACTGAGTTTGGATATGAAAGTAGCTGAACAATCATCAAAAGCACTTGAATATGATGTTGAGTGTTAA
Protein sequence
MCSLGAKLTKYSLCSALNSCAKTHNLFLGLQIHAQIVKIGLEENLFLNSALVDLYSKCNAIVNAKRVFFHMKTHDQVSWTSIISGLSQNGHGHEAILMFKNMLVTKVRPN
CFTYATVISSCPTLKDELQIRLAILLHAHVIKLGFSFSNFVISSTIDCYSKLGRIQEAALLFYETTVKDNIIFNSMISGYSQNLYGEEALKLFVEMRASNLSPTDHTLTS
VLNACGCLTVLEQGRQVHSLVTKMGSENNVFVVCSLLDMYSKCGSVDEAFYLFNRTVEKNSVLSTSMIMAFAQCGRGSEALKLFESLLTEEGFLPDHVCFTAVLTACNHA
GLLNEAVDYFNKMSCEYRLDPQIDHYACLIDLYARNGHVEKAKQLMEQMPYESNYVMWCSLLGACKVHEEVELGREVAYQLIEMDPSNAAPYVTLAHVYARAGLWTQLAE
IRKQMQQKRARKSAGWSWIEIDKKTHVFSVGDADHPKSCEIYSKLDQLSLDMKVAEQSSKALEYDVEC
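
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal sketch of how that relationship could be checked, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The function name `translate` and the table layout are illustrative choices and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order for each codon position.
# Assumption: this nuclear gene uses the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon.

    Whitespace and line breaks (as in the wrapped sequence above) are ignored.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the translation
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
print(translate("ATGTGCAGCTTAGGAGCAAAATTGACC"))  # MCSLGAKLT
```

Pasting the full CDS (line breaks included) into `translate` and comparing the result with the protein sequence is one way to confirm that the two records agree.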