CuGenDBv2

MELO3C005140 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C005140
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr09:16462527..16464819
RNA-Seq Expression: MELO3C005140
Synteny: MELO3C005140
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0050180.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e-value: 0.0e+00; identity: 99.82%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
        NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST

KAG7016643.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 3.1e-263; identity: 84.16%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKT +LDLAL+VYRDFQENGLVEE+ITFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELL RMRA  CKPDVFAYTAMVKV VS+ENLEGCLRVWDEM ADRVEPDVMAYGTLIIGLCKVGR +K +ELFQEMKGKRILIDRAIYG+LIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLA DL KDLVDSGYRADL IY SLIKGLCN+NQV +AYKLF++TIREDLKPDF TVKPIM+ YVE  RM+DLWKL++LLQKLE S+DD+LSKF+  MV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKIS ALD+F G+IDKGYGSVA+YN+I+GALHR+GQA KALEIYNDMKNSNIEPD +TYSI V C+VE+ +I+EACASHNKI+ELGSVPS+AAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLAN ESGP EFKYALTI+ ACKSGKAEMVIDVL EMV Q C  S+V YSAI+SGM KYGT + AKKVFLHL+E  Q+ EA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLS
        NCIVCEE+L+EHMKKKTADLVRCGLKFFNLES+LKAKG  LLS
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLS

XP_004140286.1 pentatricopeptide repeat-containing protein At4g20740 [Cucumis sativus] (e-value: 2.7e-299; identity: 95.04%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEES+TFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELLARMRA LCKPDVFAYTAMVKV  SK+NLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLACDL+KDLVDSGYRADLGIYHSLIKGLCN+NQV KAYKLFQLTIREDLKPDFETVKPIMMMYVE GRMDD WKLV+LLQKLEFSVDDVLSKFL FMV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKISVALD+FHGMIDKGYGSVALYNV++GALHRYGQANKALEIYNDMKNSNIEP+STTYSIA+LCFVEIGKI+EACASHNKI+ELGSVPS+AAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEI+AVMMLVRDCLAN+ESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDC PSSVAYSAIISGMSKYGT +EAKKVFLHLRER QLTEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
        NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST

XP_016898857.1 PREDICTED: pentatricopeptide repeat-containing protein At4g20740 [Cucumis melo] (e-value: 0.0e+00; identity: 100%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
        NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST

XP_038906442.1 pentatricopeptide repeat-containing protein At4g20740 [Benincasa hispida] (e-value: 3.6e-280; identity: 89.34%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKT+HLDLAL+VYRDFQENGLVEESITFMILIKGLCKAGR+DEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELLARMRA LCKPDVFAYTAMVKV VS+ENL+GCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKG+EL QEMKGKRILIDRAIYGTLIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLACDL+KDLVDSGYRADLGIYHSLIKGLCN+NQV KAYKLFQLTIREDLKPD ETVKPIMMMYVE  RM+DLWKL+TLLQKLE S DDVLSKFL FMV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKIS ALD+F GMIDKGYG+VA+YN+IMGALH+YGQA KALEIY+DMK+SNI+PDS+TYSIAV CFVE+GKI+EACASHNKI+ELGSVPS AAY SL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLAN+E+GP EFKYAL I+H CK GKAEMV DV+ EMV QDC PS+VAYSAIISGMSKYGT +EAKKVFLHLRE  QLTEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
        NCIVCEELLIEHMKKKTADLVRCGLKFFNLES+LKAKGC LLST
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KRS5 Uncharacterized protein (e-value: 1.3e-299; identity: 95.04%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEES+TFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELLARMRA LCKPDVFAYTAMVKV  SK+NLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLACDL+KDLVDSGYRADLGIYHSLIKGLCN+NQV KAYKLFQLTIREDLKPDFETVKPIMMMYVE GRMDD WKLV+LLQKLEFSVDDVLSKFL FMV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKISVALD+FHGMIDKGYGSVALYNV++GALHRYGQANKALEIYNDMKNSNIEP+STTYSIA+LCFVEIGKI+EACASHNKI+ELGSVPS+AAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEI+AVMMLVRDCLAN+ESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDC PSSVAYSAIISGMSKYGT +EAKKVFLHLRER QLTEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
        NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST

A0A1S4DS81 pentatricopeptide repeat-containing protein At4g20740 (e-value: 0.0e+00; identity: 100%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
        NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST

A0A5D3BGR3 Pentatricopeptide repeat-containing protein (e-value: 0.0e+00; identity: 99.82%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
        NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST

A0A6J1FN51 pentatricopeptide repeat-containing protein At4g20740 (e-value: 9.7e-263; identity: 83.61%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKT ++DLAL+VYRDFQENGLVEE+ITFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        ELL RMRA  CKPDVFAYTAMVKV VS+ENLEGCLRVWDEM ADRVEPDVMAYGTLIIGLCKVGR +K +ELFQEMKGKRILIDRAIYG+LIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLA DL KDLVDSGYRADL IY SLIKGLCN+NQV +AYKLF++TIREDLKPDF TVKPIM+ YVE  RM+DLWKL++LLQKLE S+DDVLSKF+  MV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKIS ALD+F G+IDKGYGSVA+YN+++GALHR+GQA KALEIYNDMKNSNI+PD +TYSI V C+VE+ +I+EACASHNKI+ELGSVPS+AAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLAN ESGP EFKYALTI+ ACKSGKAEMVIDVL EMV Q C  S+V YSAI+SGM KYGT + AKK FLHL+E  Q+ EA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLS
        NCIVCEE+L+EHMKKKTADLVRCGLKFFNLES+LKAKG  LLS
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLS

A0A6J1K1S7 pentatricopeptide repeat-containing protein At4g20740 (e-value: 6.3e-262; identity: 83.43%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQGKPPSEKQFEILIRMHCDA RGLRVYYVYEKMKKFGV+PRVFLYNRILDALVKT ++DLAL+VYRDFQENGLVEE+ITFMILIKGLCKAGRVDEML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        E L RMRA  CKPDVFAYTAMVKV +S+ENLEGCLRVWDEMRADRVEPDVMAYGTLI GLCKVG  QK +ELFQEMKGKRILIDRAIYG+LIEAFVQDEK
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        VGLA DL KDL+DSGYRADLGIY SLIKGLCN NQV +AYKLF++TIREDLKPDF TVKPIM+ YVE  RM+DLWKL++LLQKLE S+DDVL K +  MV
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
        EEEDKIS ALD+F G+IDKGYGSVA+YN+++GALHR+GQA KALEIYNDMK+SNIEPD +TYSI V C+VE+ +I+EACASHNKI+ELGSVPSMAAYCSL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        SEGLFKICEIDAVMMLVRDCLAN ESGP EFKYALTI+ ACKSGKAEMVIDVL EMV Q C  S+VAYSAI+SGM KYGT   AK VFLHL+E  Q+ EA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLS
        NCIVCEELL+EHMKKKTADLVRCGLKFFNLES+LKAKG  LLS
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLS

SwissProt top hits (e-value, % identity, alignment)
Q9CAN0 Pentatricopeptide repeat-containing protein At1g63130, mitochondrial (e-value: 4.1e-40; identity: 22.87%)
Query:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG
        E+M+  G+   ++ Y+ +++   +   L LAL V     + G   + +T   L+ G C   R+ + + L+ +M     +PD F +  ++           
Subjt:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG

Query:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL
         + + D M     +PD++ YG ++ GLCK G       L ++M+  +I     IY T+I+A    + V  A +L+ ++ + G R ++  Y+SLI+ LCN 
Subjt:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL

Query:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG
         +   A +L    I   + P+  T   ++  +V+ G++ +  KL   + K     D      L       D++  A  +F  MI K  + +V  YN ++ 
Subjt:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG

Query:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK
           +  + ++ +E++ +M    +  ++ TY+  +  F +  +   A     +++  G +P +  Y  L +GL    +++   ++V + L   +  P  + 
Subjt:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK

Query:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANCIVCEELLIEHM----KKKTADLVR
        Y + I   CK+GK E   D+   + L+   P+ V Y+ ++SG  + G   EA  +F  ++E   L ++       L+  H+    K  +A+L+R
Subjt:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANCIVCEELLIEHM----KKKTADLVR

Q9LQ14 Pentatricopeptide repeat-containing protein At1g62930, chloroplastic (e-value: 1.2e-39; identity: 23.79%)
Query:  QGKP-PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLEL
        Q +P PS  +F  L+       +   V  + E+M+   +   ++ YN +++   +   L LAL V     + G   + +T   L+ G C   R+ E + L
Subjt:  QGKP-PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLEL

Query:  LARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVG
        + +M     +P+   +  ++            + + D M A   +PD+  YGT++ GLCK G       L ++M+  +I  D  IY T+I+A    + V 
Subjt:  LARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVG

Query:  LACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEE
         A +L+ ++ + G R ++  Y+SLI+ LCN  +   A +L    I   + P+  T   ++  +V+ G++ +  KL   + K     D      L      
Subjt:  LACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEE

Query:  EDKISVALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLS
         D++  A  +F  MI K  + +V  YN ++    +  +  + +E++ +M    +  ++ TY+  +    + G    A     K++  G  P +  Y  L 
Subjt:  EDKISVALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLS

Query:  EGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL
        +GL K  +++   ++V + L   +  P  + Y + I   CK+GK E   D+   + L+   P+ + Y+ +ISG  + G   EA  +F  ++E   L
Subjt:  EGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL

Q9LQ16 Pentatricopeptide repeat-containing protein At1g62910 (e-value: 5.3e-40; identity: 23.63%)
Query:  PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMR
        PS  +F  L+       +   V  + E+M+  G+   ++ Y+  ++   +   L LAL V     + G   + +T   L+ G C + R+ + + L+ +M 
Subjt:  PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMR

Query:  ATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDL
            KPD F +T ++            + + D+M     +PD++ YGT++ GLCK G       L ++M+  +I  D  IY T+I+   + + +  A +L
Subjt:  ATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDL

Query:  YKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKIS
        + ++ + G R D+  Y SLI  LCN  +   A +L    I   + P+  T   ++  +V+ G++ +  KL   + K     D      L       D++ 
Subjt:  YKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKIS

Query:  VALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFK
         A  +F  MI K  + +V  Y+ ++    +  +  + +E++ +M    +  ++ TY+  +  F +      A     +++ +G  P++  Y  L +GL K
Subjt:  VALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFK

Query:  ICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL
          ++ A  M+V + L      P  + Y + I   CK+GK E   ++   + L+  +P+ +AY+ +ISG  + G+  EA  +   ++E   L
Subjt:  ICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL

Q9SVH3 Pentatricopeptide repeat-containing protein At4g20740 (e-value: 9.9e-196; identity: 61.81%)
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQG+PPSEKQFEILIRMH D  RGLRVYYVYEKMKKFG  PRVFLYNRI+DALVK  + DLAL VY DF+E+GLVEES TFMIL+KGLCKAGR++EML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        E+L RMR  LCKPDVFAYTAM+K  VS+ NL+  LRVWDEMR D ++PDVMAYGTL++GLCK GR ++GYELF EMKGK+ILIDR IY  LIE FV D K
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        V  AC+L++DLVDSGY AD+GIY+++IKGLC++NQV KAYKLFQ+ I E+L+PDFET+ PIM+ YV M R+ D   ++  + +L + V D L++F   + 
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
         +E+K ++ALD+F+ +  KG+GSV++YN++M AL++ G   K+L ++ +M+    EPDS++YSIA+ CFVE G ++ AC+ H KIIE+  VPS+AAY SL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        ++GL +I EIDAVM+LVR+CL NVESGP EFKYALT+ H CK   AE V+ V+ EM  +    + V Y AIISGMSK+GT   A++VF  L++R  +TEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLL
        + +V EE+LIE  KKKTADLV  G+KFF LES+L+AKGC LL
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLL

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial (e-value: 3.3e-42; identity: 24.04%)
Query:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG
        E+M+  G+    + Y+ +++   +   L LAL V     + G     +T   L+ G C + R+ E + L+ +M  T  +P+   +  ++           
Subjt:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG

Query:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL
         + + D M A   +PD++ YG ++ GLCK G     + L  +M+  ++     IY T+I+   + + +  A +L+K++   G R ++  Y SLI  LCN 
Subjt:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL

Query:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG
         +   A +L    I   + PD  T   ++  +V+ G++ +  KL   + K       V    L       D++  A  +F  M+ K  +  V  YN ++ 
Subjt:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG

Query:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK
           +Y +  + +E++ +M    +  ++ TY+I +    + G    A     +++  G  P++  Y +L +GL K  +++   M+V + L   +  P  + 
Subjt:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK

Query:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANC
        Y + I   CK+GK E   D+   + L+   P  VAY+ +ISG  + G+  EA  +F  ++E   L  + C
Subjt:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANC

Arabidopsis top hits (e-value, % identity, alignment)
AT1G62670.1 rna processing factor 2 (e-value: 2.4e-43; identity: 24.04%)
Query:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG
        E+M+  G+    + Y+ +++   +   L LAL V     + G     +T   L+ G C + R+ E + L+ +M  T  +P+   +  ++           
Subjt:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG

Query:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL
         + + D M A   +PD++ YG ++ GLCK G     + L  +M+  ++     IY T+I+   + + +  A +L+K++   G R ++  Y SLI  LCN 
Subjt:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL

Query:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG
         +   A +L    I   + PD  T   ++  +V+ G++ +  KL   + K       V    L       D++  A  +F  M+ K  +  V  YN ++ 
Subjt:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG

Query:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK
           +Y +  + +E++ +M    +  ++ TY+I +    + G    A     +++  G  P++  Y +L +GL K  +++   M+V + L   +  P  + 
Subjt:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK

Query:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANC
        Y + I   CK+GK E   D+   + L+   P  VAY+ +ISG  + G+  EA  +F  ++E   L  + C
Subjt:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANC

AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein (e-value: 3.8e-41; identity: 23.63%)
Query:  PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMR
        PS  +F  L+       +   V  + E+M+  G+   ++ Y+  ++   +   L LAL V     + G   + +T   L+ G C + R+ + + L+ +M 
Subjt:  PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMR

Query:  ATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDL
            KPD F +T ++            + + D+M     +PD++ YGT++ GLCK G       L ++M+  +I  D  IY T+I+   + + +  A +L
Subjt:  ATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDL

Query:  YKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKIS
        + ++ + G R D+  Y SLI  LCN  +   A +L    I   + P+  T   ++  +V+ G++ +  KL   + K     D      L       D++ 
Subjt:  YKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKIS

Query:  VALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFK
         A  +F  MI K  + +V  Y+ ++    +  +  + +E++ +M    +  ++ TY+  +  F +      A     +++ +G  P++  Y  L +GL K
Subjt:  VALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFK

Query:  ICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL
          ++ A  M+V + L      P  + Y + I   CK+GK E   ++   + L+  +P+ +AY+ +ISG  + G+  EA  +   ++E   L
Subjt:  ICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL

AT1G62930.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e-value: 8.4e-41; identity: 23.79%)
Query:  QGKP-PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLEL
        Q +P PS  +F  L+       +   V  + E+M+   +   ++ YN +++   +   L LAL V     + G   + +T   L+ G C   R+ E + L
Subjt:  QGKP-PSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLEL

Query:  LARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVG
        + +M     +P+   +  ++            + + D M A   +PD+  YGT++ GLCK G       L ++M+  +I  D  IY T+I+A    + V 
Subjt:  LARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVG

Query:  LACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEE
         A +L+ ++ + G R ++  Y+SLI+ LCN  +   A +L    I   + P+  T   ++  +V+ G++ +  KL   + K     D      L      
Subjt:  LACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEE

Query:  EDKISVALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLS
         D++  A  +F  MI K  + +V  YN ++    +  +  + +E++ +M    +  ++ TY+  +    + G    A     K++  G  P +  Y  L 
Subjt:  EDKISVALDLFHGMIDKG-YGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLS

Query:  EGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL
        +GL K  +++   ++V + L   +  P  + Y + I   CK+GK E   D+   + L+   P+ + Y+ +ISG  + G   EA  +F  ++E   L
Subjt:  EGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQL

AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 2.9e-41; % identity 22.87
Query:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG
        E+M+  G+   ++ Y+ +++   +   L LAL V     + G   + +T   L+ G C   R+ + + L+ +M     +PD F +  ++           
Subjt:  EKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATLCKPDVFAYTAMVKVFVSKENLEG

Query:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL
         + + D M     +PD++ YG ++ GLCK G       L ++M+  +I     IY T+I+A    + V  A +L+ ++ + G R ++  Y+SLI+ LCN 
Subjt:  CLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADLGIYHSLIKGLCNL

Query:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG
         +   A +L    I   + P+  T   ++  +V+ G++ +  KL   + K     D      L       D++  A  +F  MI K  + +V  YN ++ 
Subjt:  NQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKG-YGSVALYNVIMG

Query:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK
           +  + ++ +E++ +M    +  ++ TY+  +  F +  +   A     +++  G +P +  Y  L +GL    +++   ++V + L   +  P  + 
Subjt:  ALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFK

Query:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANCIVCEELLIEHM----KKKTADLVR
        Y + I   CK+GK E   D+   + L+   P+ V Y+ ++SG  + G   EA  +F  ++E   L ++       L+  H+    K  +A+L+R
Subjt:  YALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANCIVCEELLIEHM----KKKTADLVR

AT4G20740.1 Pentatricopeptide repeat (PPR-like) superfamily protein; e value 7.0e-197; % identity 61.81
Query:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML
        MDSQG+PPSEKQFEILIRMH D  RGLRVYYVYEKMKKFG  PRVFLYNRI+DALVK  + DLAL VY DF+E+GLVEES TFMIL+KGLCKAGR++EML
Subjt:  MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEML

Query:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK
        E+L RMR  LCKPDVFAYTAM+K  VS+ NL+  LRVWDEMR D ++PDVMAYGTL++GLCK GR ++GYELF EMKGK+ILIDR IY  LIE FV D K
Subjt:  ELLARMRATLCKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEK

Query:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV
        V  AC+L++DLVDSGY AD+GIY+++IKGLC++NQV KAYKLFQ+ I E+L+PDFET+ PIM+ YV M R+ D   ++  + +L + V D L++F   + 
Subjt:  VGLACDLYKDLVDSGYRADLGIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMV

Query:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL
         +E+K ++ALD+F+ +  KG+GSV++YN++M AL++ G   K+L ++ +M+    EPDS++YSIA+ CFVE G ++ AC+ H KIIE+  VPS+AAY SL
Subjt:  EEEDKISVALDLFHGMIDKGYGSVALYNVIMGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSL

Query:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA
        ++GL +I EIDAVM+LVR+CL NVESGP EFKYALT+ H CK   AE V+ V+ EM  +    + V Y AIISGMSK+GT   A++VF  L++R  +TEA
Subjt:  SEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHACKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEA

Query:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLL
        + +V EE+LIE  KKKTADLV  G+KFF LES+L+AKGC LL
Subjt:  NCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLL


Sequences
CDS sequence
ATGGATTCGCAAGGTAAGCCACCTAGTGAAAAACAGTTTGAGATTCTGATTAGGATGCATTGTGATGCCTATAGAGGTCTTAGAGTTTACTATGTTTATGAAAAGATGAA
GAAGTTTGGGGTTGTTCCCCGTGTCTTCTTGTATAACAGGATTCTCGATGCCTTAGTCAAAACAGATCATTTGGATTTAGCTTTAACTGTTTATAGGGATTTCCAGGAAA
ATGGGCTAGTGGAAGAGAGTATCACGTTTATGATTTTGATTAAAGGGTTGTGTAAAGCAGGGAGGGTTGATGAAATGCTTGAGCTTTTGGCTCGAATGAGGGCAACTTTG
TGTAAGCCTGATGTGTTTGCTTACACAGCAATGGTGAAGGTGTTCGTTTCTAAGGAGAATTTGGAGGGTTGTTTGAGAGTTTGGGATGAAATGAGAGCAGATAGAGTAGA
GCCTGATGTTATGGCATATGGGACTTTGATTATTGGATTGTGCAAAGTCGGGCGGGCACAAAAAGGGTATGAATTGTTTCAAGAGATGAAAGGGAAGAGGATTTTGATAG
ACAGAGCAATTTATGGGACTTTGATAGAGGCGTTTGTGCAGGATGAGAAAGTTGGATTGGCTTGTGATTTGTATAAGGATTTGGTAGATTCAGGGTATAGAGCTGATTTG
GGGATATATCATTCTCTCATTAAAGGTCTTTGTAATTTAAATCAAGTCCACAAGGCTTATAAACTCTTTCAGTTAACCATACGAGAGGATCTTAAGCCAGATTTCGAAAC
TGTGAAACCTATTATGATGATGTACGTGGAAATGGGAAGAATGGACGACTTGTGGAAGTTAGTAACCTTGTTGCAGAAGTTGGAATTTTCTGTGGATGATGTTCTTTCCA
AATTTTTGTGTTTTATGGTAGAAGAGGAGGACAAAATAAGCGTAGCTTTAGATTTATTTCACGGCATGATTGATAAGGGATATGGCAGTGTTGCCTTATACAATGTCATC
ATGGGGGCTCTTCATCGCTATGGGCAGGCAAATAAGGCTCTAGAAATCTACAATGACATGAAGAACTCAAATATCGAACCTGATTCAACAACTTACTCTATTGCCGTATT
ATGTTTTGTAGAAATTGGTAAAATCCGAGAGGCTTGTGCATCCCATAACAAAATAATTGAGCTGGGTTCAGTTCCTTCCATGGCTGCCTACTGTTCTCTTTCCGAAGGTC
TCTTCAAAATCTGCGAGATTGATGCAGTTATGATGCTTGTTCGAGATTGCCTGGCGAACGTCGAGAGTGGACCTCAGGAATTTAAATATGCTCTAACGATTGTCCATGCA
TGTAAGTCAGGTAAAGCAGAAATGGTGATTGATGTTCTTAAGGAAATGGTGCTACAAGATTGCGCTCCGAGCTCAGTCGCATACTCAGCTATCATATCTGGAATGTCGAA
GTATGGGACATTTAATGAGGCAAAGAAAGTGTTTTTGCATCTGAGAGAGAGGAGTCAGCTGACTGAAGCTAACTGCATTGTCTGTGAGGAATTGTTAATTGAACACATGA
AAAAGAAAACAGCAGATTTAGTAAGATGTGGATTGAAGTTTTTCAATCTTGAATCCAGATTGAAAGCAAAAGGTTGCAACTTGCTGTCAACTTAA
mRNA sequence
ATGCCCCCTCAGAAACCCCACAAGCAATATTTCTACTACGGCCACCGCCACCGCAACCCCCACCAGCACCGCCCCACCGTCTACGGCGGCTTCTTCACCAACCGCCGATC
CCTCCCTCCACCCAGTCCTCACCAACCCATTTCCCCAAAACCCCAACCCTTCCTTCTTCACAACTGGGATCCTGATCTCCCATCTCAAAAACGCTCCAACCTCCCGTCGT
TCACCTCCGATGCCTTCTTCTCCACCTCACTCCGCCTTTCCCCAATTGCTCGGTTTATCGTCGACGTCTTTCGGAAGAATCAAACCAGTGGGGCCCCCCGGTGCTCTCTG
AACTCAACAAGCTCCGCCGAGTTACTCCGGACCTTGTGGCGGAGGTTCTCAAGGCTTCTCACCGTCGTGATTCTAACCCTATTTTAGCCTCCAAGTTCTTCTACTGGGCT
GGTAAGCAAAAGGGGTTTCATCACACTTTTGCTTCTTACAATGCGTTTGCTTATTGTTTGAATCGCCACAATCGTTTCAGAGCTGCCGATCAGATTCCTGAGCTCATGGA
TTCGCAAGGTAAGCCACCTAGTGAAAAACAGTTTGAGATTCTGATTAGGATGCATTGTGATGCCTATAGAGGTCTTAGAGTTTACTATGTTTATGAAAAGATGAAGAAGT
TTGGGGTTGTTCCCCGTGTCTTCTTGTATAACAGGATTCTCGATGCCTTAGTCAAAACAGATCATTTGGATTTAGCTTTAACTGTTTATAGGGATTTCCAGGAAAATGGG
CTAGTGGAAGAGAGTATCACGTTTATGATTTTGATTAAAGGGTTGTGTAAAGCAGGGAGGGTTGATGAAATGCTTGAGCTTTTGGCTCGAATGAGGGCAACTTTGTGTAA
GCCTGATGTGTTTGCTTACACAGCAATGGTGAAGGTGTTCGTTTCTAAGGAGAATTTGGAGGGTTGTTTGAGAGTTTGGGATGAAATGAGAGCAGATAGAGTAGAGCCTG
ATGTTATGGCATATGGGACTTTGATTATTGGATTGTGCAAAGTCGGGCGGGCACAAAAAGGGTATGAATTGTTTCAAGAGATGAAAGGGAAGAGGATTTTGATAGACAGA
GCAATTTATGGGACTTTGATAGAGGCGTTTGTGCAGGATGAGAAAGTTGGATTGGCTTGTGATTTGTATAAGGATTTGGTAGATTCAGGGTATAGAGCTGATTTGGGGAT
ATATCATTCTCTCATTAAAGGTCTTTGTAATTTAAATCAAGTCCACAAGGCTTATAAACTCTTTCAGTTAACCATACGAGAGGATCTTAAGCCAGATTTCGAAACTGTGA
AACCTATTATGATGATGTACGTGGAAATGGGAAGAATGGACGACTTGTGGAAGTTAGTAACCTTGTTGCAGAAGTTGGAATTTTCTGTGGATGATGTTCTTTCCAAATTT
TTGTGTTTTATGGTAGAAGAGGAGGACAAAATAAGCGTAGCTTTAGATTTATTTCACGGCATGATTGATAAGGGATATGGCAGTGTTGCCTTATACAATGTCATCATGGG
GGCTCTTCATCGCTATGGGCAGGCAAATAAGGCTCTAGAAATCTACAATGACATGAAGAACTCAAATATCGAACCTGATTCAACAACTTACTCTATTGCCGTATTATGTT
TTGTAGAAATTGGTAAAATCCGAGAGGCTTGTGCATCCCATAACAAAATAATTGAGCTGGGTTCAGTTCCTTCCATGGCTGCCTACTGTTCTCTTTCCGAAGGTCTCTTC
AAAATCTGCGAGATTGATGCAGTTATGATGCTTGTTCGAGATTGCCTGGCGAACGTCGAGAGTGGACCTCAGGAATTTAAATATGCTCTAACGATTGTCCATGCATGTAA
GTCAGGTAAAGCAGAAATGGTGATTGATGTTCTTAAGGAAATGGTGCTACAAGATTGCGCTCCGAGCTCAGTCGCATACTCAGCTATCATATCTGGAATGTCGAAGTATG
GGACATTTAATGAGGCAAAGAAAGTGTTTTTGCATCTGAGAGAGAGGAGTCAGCTGACTGAAGCTAACTGCATTGTCTGTGAGGAATTGTTAATTGAACACATGAAAAAG
AAAACAGCAGATTTAGTAAGATGTGGATTGAAGTTTTTCAATCTTGAATCCAGATTGAAAGCAAAAGGTTGCAACTTGCTGTCAACTTAAAAACATAGACTAACCTGTTT
TATTTATCTCATTATTGTTTATCAATAAAAACTTTCTAATTTTATTCTGTTTAGGTTGACTTTTTAAGTATTTATATATATTTTTCTTTTAAC
Protein sequence
MDSQGKPPSEKQFEILIRMHCDAYRGLRVYYVYEKMKKFGVVPRVFLYNRILDALVKTDHLDLALTVYRDFQENGLVEESITFMILIKGLCKAGRVDEMLELLARMRATL
CKPDVFAYTAMVKVFVSKENLEGCLRVWDEMRADRVEPDVMAYGTLIIGLCKVGRAQKGYELFQEMKGKRILIDRAIYGTLIEAFVQDEKVGLACDLYKDLVDSGYRADL
GIYHSLIKGLCNLNQVHKAYKLFQLTIREDLKPDFETVKPIMMMYVEMGRMDDLWKLVTLLQKLEFSVDDVLSKFLCFMVEEEDKISVALDLFHGMIDKGYGSVALYNVI
MGALHRYGQANKALEIYNDMKNSNIEPDSTTYSIAVLCFVEIGKIREACASHNKIIELGSVPSMAAYCSLSEGLFKICEIDAVMMLVRDCLANVESGPQEFKYALTIVHA
CKSGKAEMVIDVLKEMVLQDCAPSSVAYSAIISGMSKYGTFNEAKKVFLHLRERSQLTEANCIVCEELLIEHMKKKTADLVRCGLKFFNLESRLKAKGCNLLST