; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr019700 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr019700
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153403:450086..452321
RNA-Seq ExpressionSgr019700
SyntenySgr019700
Gene Ontology termsGO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027261.1 Pentatricopeptide repeat-containing protein, mitochondrial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-25388.57Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT----GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        MLRS R  +ATAARRFS EA   AVENT LE  SG+    GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT----GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV
        EWMTLQKD+KLLPGDYAVHLDLI+KIRGL+SAEKFF DLP+KMRGQSA T+LLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSLY++NK+ EKV
Subjt:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV

Query:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+QELKKNTKPDVVTYNLLLNV TLQNDVEAAENIFLEMK  KIEPDWV+FSTL NLYSK+QLTEKAASTLK+MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY
        KDGV RIWKKM SSFRKM+DSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQ +QAESFYDRM LKGIVPSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI
        LKENQMEKV+ FFKNA+GSVKKWNADERLVK VCK+L EQGN EGAEQLLI+LRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERME+DNVQL+EE+REL+
Subjt:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI

Query:  KLTSKMCVSEVSSTLY
        KLTSKMCVSEVSST Y
Subjt:  KLTSKMCVSEVSSTLY

XP_022150265.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X1 [Momordica charantia]1.2e-26188.43Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---
        MLRSLRTS+ATAARRFS EA+MAAVENTA+EGGSG+   GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE   
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---

Query:  ------------------ICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
                          ICEW T QKD+KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLP+KMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
Subjt:  ------------------ICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS

Query:  PLSFNHMLSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA
        PLSFNHMLSL++SNKQ +KVPALIQ+L+KNTKPDVVTYNLLLNV TLQNDVEAAENI LEMKK KIE DWVT STLTNLYSKKQLTEKAASTLKEMEKMA
Subjt:  PLSFNHMLSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA

Query:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDR
        SKRNRITFSSLLSLYTNLGDKDG WRIWKKMK+SFRKMSDSEYTCMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFY+R
Subjt:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDR

Query:  MLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLI
        M LKGIVPSYTTWELLTWGYLKENQMEKV+HFFKNA+GSVKKWNADERLVK+VCKKL E+GNIEGAE+LLIVLRNAGHV+TEIYNSLLRTYAKAGKMPLI
Subjt:  MLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLI

Query:  VAERMEKDNVQLDEETRELIKLTSKMCVSEVSSTLY
        VAERMEKD+V+LDEETRELIKLTSKMCVSEVSST Y
Subjt:  VAERMEKDNVQLDEETRELIKLTSKMCVSEVSSTLY

XP_022150266.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X2 [Momordica charantia]2.4e-26592.04Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
        MLRSLRTS+ATAARRFS EA+MAAVENTA+EGGSG+   GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE

Query:  WMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVP
        W T QKD+KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLP+KMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSL++SNKQ +KVP
Subjt:  WMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVP

Query:  ALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
        ALIQ+L+KNTKPDVVTYNLLLNV TLQNDVEAAENI LEMKK KIE DWVT STLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
Subjt:  ALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK

Query:  DGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYL
        DG WRIWKKMK+SFRKMSDSEYTCMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFY+RM LKGIVPSYTTWELLTWGYL
Subjt:  DGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYL

Query:  KENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELIK
        KENQMEKV+HFFKNA+GSVKKWNADERLVK+VCKKL E+GNIEGAE+LLIVLRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERMEKD+V+LDEETRELIK
Subjt:  KENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELIK

Query:  LTSKMCVSEVSSTLY
        LTSKMCVSEVSST Y
Subjt:  LTSKMCVSEVSSTLY

XP_023515289.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo]3.6e-25388.57Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSG----TGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        MLRS R  +ATAARRFS EAY AAVENT LE  SG    +GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSG----TGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV
        EWMTLQKD+KLLPGDYAV LDLI+KIRGL+SAEKFF DLP+KMRGQSA TALLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSL+++NK+ EKV
Subjt:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV

Query:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+QELKKNTKPDVVTYNLLLNV TLQNDVEAAENIFLEMK  KIEPDWV+FSTL NLYSK+QLTEKAASTLK MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY
        KDGV RIWKKM SSFRKM+DSEY CMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQ +QAESFYDRM LKGIVPSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI
        LKENQMEKV+ FFKNA+GSVKKWNADERLVK VCK+L EQGN EGAEQLLI+LRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERME+DNVQL+EE+REL+
Subjt:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI

Query:  KLTSKMCVSEVSSTLY
        KLTSKMCVSEVSST Y
Subjt:  KLTSKMCVSEVSSTLY

XP_038900168.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial [Benincasa hispida]7.2e-25486.72Show/hide
Query:  MLRSLRTSMAT-AARRFSAEAYMAAVENTALEGG--------------SGTGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRK
        M RSLR S+AT AARRFS EA+MAA EN  LEGG              SGTGGGRDTLGRRLMSL FPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRK
Subjt:  MLRSLRTSMAT-AARRFSAEAYMAAVENTALEGG--------------SGTGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRK

Query:  LKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLS
        LKRYKHALE+CEWMTLQKD+KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLP+KMR QS CTALLH YVQNNL +KAEALMEKMSE GFLK PLSFNHMLS
Subjt:  LKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLS

Query:  LYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFS
        LY+SNKQ EKVP +IQ LKKNTKPDVVTYNLLLNV TLQNDVEAAENIFLEMKKTK EPDWV+FSTL NLYSKKQLTEKAASTLK+MEKMASKRNRI+FS
Subjt:  LYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFS

Query:  SLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPS
        SLLSLYTNLGDK+GV+RIWKKMKSSFRKMSDSEYTCMISSLVKL++LEEAEKLY EWESVSGTGDTRV NILLAAYINKNQMEQAE+FY+RM +KG+VPS
Subjt:  SLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPS

Query:  YTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDN
        YTTWELLTWGYLKENQMEKV+HF KNA+GSVKKWN DERLVKEVCKKL EQGNIEGAEQLL++LRN GHVDTEIYNSLLRTYAKAGKMPLIVAERME DN
Subjt:  YTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDN

Query:  VQLDEETRELIKLTSKMCVSEVSSTLY
        VQL++ETREL++LTSKMCVSEVS+TLY
Subjt:  VQLDEETRELIKLTSKMCVSEVSSTLY

TrEMBL top hitse value%identityAlignment
A0A5D3D4M6 Pentatricopeptide repeat-containing protein1.8e-25085.74Show/hide
Query:  MLRSLRTSMAT-AARRFSAEAYMAAVENTALEGGSGT------GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHAL
        M RS R+S+AT AARRFS EA +AA ENT++EGG+GT      GGGRDTLGRRLMSL FPKRSAVIAIRKWQEEGHT+RKYELN IVRELRKLKRYKHAL
Subjt:  MLRSLRTSMAT-AARRFSAEAYMAAVENTALEGGSGT------GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHAL

Query:  EICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQP
        E+CEWMTLQKD+KLLPGDYAV LDLIAKIRGLNSAEKFFEDLP+K+R QS CTALLH YVQ NLS+KAEALMEKMSECGFLKSPLSFNHMLSL++SNKQ 
Subjt:  EICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQP

Query:  EKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTN
        EKVPALI+ LKKNTKPDVVTYNLLLNV TLQND EAAENIFLEMKKTK++PDW++FSTL NLY KKQLTEKAA+TLKEMEKMA KRNR++FSSLLSLYTN
Subjt:  EKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTN

Query:  LGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLT
        LGDK+ V RIWKK+KSSFRKMSDSEY CM+SSLVKL++LEEAEKLYTEWESVSGT DTR+ N++LAAYINKNQMEQAESFY+RM LKGIVPSYTTWELLT
Subjt:  LGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLT

Query:  WGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETR
        WGYLKENQMEKV+HFFKNA+GSVKKWNADERLVK VCKKL EQGNIEG EQLL++LRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQL++ETR
Subjt:  WGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETR

Query:  ELIKLTSKMCVSEVSSTLY
        EL++LTSKMCVSEVSSTLY
Subjt:  ELIKLTSKMCVSEVSSTLY

A0A6J1D809 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X15.9e-26288.43Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---
        MLRSLRTS+ATAARRFS EA+MAAVENTA+EGGSG+   GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE   
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---

Query:  ------------------ICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
                          ICEW T QKD+KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLP+KMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
Subjt:  ------------------ICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS

Query:  PLSFNHMLSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA
        PLSFNHMLSL++SNKQ +KVPALIQ+L+KNTKPDVVTYNLLLNV TLQNDVEAAENI LEMKK KIE DWVT STLTNLYSKKQLTEKAASTLKEMEKMA
Subjt:  PLSFNHMLSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA

Query:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDR
        SKRNRITFSSLLSLYTNLGDKDG WRIWKKMK+SFRKMSDSEYTCMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFY+R
Subjt:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDR

Query:  MLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLI
        M LKGIVPSYTTWELLTWGYLKENQMEKV+HFFKNA+GSVKKWNADERLVK+VCKKL E+GNIEGAE+LLIVLRNAGHV+TEIYNSLLRTYAKAGKMPLI
Subjt:  MLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLI

Query:  VAERMEKDNVQLDEETRELIKLTSKMCVSEVSSTLY
        VAERMEKD+V+LDEETRELIKLTSKMCVSEVSST Y
Subjt:  VAERMEKDNVQLDEETRELIKLTSKMCVSEVSSTLY

A0A6J1DB09 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X21.2e-26592.04Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
        MLRSLRTS+ATAARRFS EA+MAAVENTA+EGGSG+   GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT---GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE

Query:  WMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVP
        W T QKD+KLLPGDYAVHLDLIAKIRGLNSAEKFFEDLP+KMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSL++SNKQ +KVP
Subjt:  WMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVP

Query:  ALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
        ALIQ+L+KNTKPDVVTYNLLLNV TLQNDVEAAENI LEMKK KIE DWVT STLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
Subjt:  ALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK

Query:  DGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYL
        DG WRIWKKMK+SFRKMSDSEYTCMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQMEQAESFY+RM LKGIVPSYTTWELLTWGYL
Subjt:  DGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYL

Query:  KENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELIK
        KENQMEKV+HFFKNA+GSVKKWNADERLVK+VCKKL E+GNIEGAE+LLIVLRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERMEKD+V+LDEETRELIK
Subjt:  KENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELIK

Query:  LTSKMCVSEVSSTLY
        LTSKMCVSEVSST Y
Subjt:  LTSKMCVSEVSSTLY

A0A6J1H967 pentatricopeptide repeat-containing protein At4g02820, mitochondrial9.5e-25287.6Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT----GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        MLRS R  +ATAARRFS EA   AVENT LE  SG+    GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKY+LNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGT----GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV
        EWMTLQKD+KLLPGDYAVHLDLI+KIRGL+SAEKFF DLP+KMRGQSA T+LLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSL+++NK+ EKV
Subjt:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV

Query:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+QELKKNTKPDVVTYNLLLNV TLQNDVEAAENIFLEMK  KIEPDWV+FSTL NLYSK+QLTEKAASTLK+MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY
        KDGV RIWKKM SSFRKM+DSEY CMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQ +QAESFYDRM LKGI+PSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI
        LKENQMEKV+ FFKNA+GSVKKWN DERLVK VCK+L EQGN EGAEQLLI+LRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERME+DNVQL+EE+REL+
Subjt:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI

Query:  KLTSKMCVSEVSSTLY
        KLTSKMCVSEVSST Y
Subjt:  KLTSKMCVSEVSSTLY

A0A6J1KUT7 pentatricopeptide repeat-containing protein At4g02820, mitochondrial3.3e-25287.6Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSG----TGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        M RS R  +ATAARRFS EAY AAVENT LE  SG    +GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSG----TGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV
        EWMTLQK++KLLPGDYAVHLDLI+KIRGL+SAEKFF DLP+KMRGQSA T+LLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSL+++NK+ EKV
Subjt:  EWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKV

Query:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+QELKKNTKPDVVTYNLLLNV TLQNDVEAAENIFLEMK  KIEPDWV+FSTL NLYSK+QLTEKAASTLK+MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY
        KDGV RIWKKM SSFRKM+DSEY CMISSLVKLDKLE+AEKLYTEWESVSGTGDTRVPNILLAAYINKNQM+QAESFYDRM LKGI+PSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI
        LKENQMEKV+ FFKNA+GSVKKWNADERLVK VCK+L EQGN EG EQLLI+LRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERME+DNVQL+EE+REL+
Subjt:  LKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELI

Query:  KLTSKMCVSEVSSTLY
        KLT+KMCVSEVSST Y
Subjt:  KLTSKMCVSEVSSTLY

SwissProt top hitse value%identityAlignment
Q3E911 Pentatricopeptide repeat-containing protein At5g274603.9e-6933.95Show/hide
Query:  MATAARRFSAEAYMAAVENTALEGGSGTGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLL
        +A A  RF+    ++ + ++  +G   +        + ++    P+RS    +++  + GH V   EL  I + L +  RY  AL++ EWM  QKDI+  
Subjt:  MATAARRFSAEAYMAAVENTALEGGSGTGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLL

Query:  PGDYAVHLDLIAKIRGLNSAEKFFEDL---PEKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK
          D A+ LDLI K  GL   E++FE L      MR  +SA   LL  YV+N +  +AEALMEK++  GFL +P  FN M+ LY ++ Q EKV  ++  +K
Subjt:  PGDYAVHLDLIAKIRGLNSAEKFFEDL---PEKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK

Query:  KNTKP-DVVTYNLLLNVRTLQNDVEAAENIFLEMKKTK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWR
         N  P +V++YNL +N     + V A E ++ EM   K +E  W +  TL N+Y K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K+GV R
Subjt:  KNTKP-DVVTYNLLLNVRTLQNDVEAAENIFLEMKKTK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWR

Query:  IWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQM
        +W+  KS   ++S   Y C++SSLVK   LEEAE++++EWE+     D RV N+LL AY+   ++ +AES +  +L +G  P+Y TWE+L  G++K   M
Subjt:  IWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQM

Query:  EKVVHFFKNAIGSVKK--WNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNV
        EK +         +++  W     +V  + +   ++  IE A   +  L   G     +Y  LLR +  A +    + E M+ D +
Subjt:  EKVVHFFKNAIGSVKK--WNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNV

Q84JR3 Pentatricopeptide repeat-containing protein At4g21705, mitochondrial2.6e-6531.48Show/hide
Query:  RDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKM
        + TL  ++  L  PK S    ++ W + G  V   EL RIV +LR+ KR+ HALE+ +WM         P ++AVHLDLI ++ G  +AE++FE+L E+ 
Subjt:  RDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKM

Query:  RGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMK
        +      ALL+ YV+    +K+    EKM E GF+ S L++N+++ LY +  Q EKVP +++E+K +N  PD  +Y + +N      D+E       +M+
Subjt:  RGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMK

Query:  KTK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEK
        + + I  DW T++     Y      ++A   LK  E    K++   ++ L++LY  LG K  V R+W   K   ++  + +Y  ++ SLVK+D L EAE+
Subjt:  KTK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEK

Query:  LYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIG---SVKKWNADERLVKEVCKKLA
        + TEW+S     D RVPN ++  YI K+  E+AE+  + +  +G   +  +WEL+   Y ++  +E      K A+G     +KW     LV  V   + 
Subjt:  LYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIG---SVKKWNADERLVKEVCKKLA

Query:  EQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPL-IVAERMEKDNVQLDEETRELIKLTS
        ++G+++  E  +  LRN   V+ ++Y++L++   + G   +  + +RM+ D +++DEET  ++   S
Subjt:  EQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPL-IVAERMEKDNVQLDEETRELIKLTS

Q8LPS6 Pentatricopeptide repeat-containing protein At1g021501.2e-7034.59Show/hide
Query:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKD-IKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQS
        +++  +  P+  A   + +W++ G  + K+EL R+V+ELRK KR   ALE+ +WM  + +  +L   D A+ LDLI K+RG+  AE+FF  LPE  + + 
Subjt:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKD-IKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQS

Query:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK-TK
           +LL+ YV+    +KAEAL+  M + G+   PL FN M++LY++ ++ +KV A++ E+K K+ + D+ +YN+ L+       VE  E ++ +MK    
Subjt:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK-TK

Query:  IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTE
        I P+W TFST+  +Y K   TEKA   L+++E   + RNRI +  LLSLY +LG+K  ++R+W   KS    + +  Y  ++SSLV++  +E AEK+Y E
Subjt:  IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTE

Query:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGS--VKKWNADERLVKEVCKKLAEQGNI
        W  V  + D R+PN+L+ AY+  +Q+E AE  +D M+  G  PS +TWE+L  G+ ++  + + +   +NA  +     W     ++    K   E+ ++
Subjt:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGS--VKKWNADERLVKEVCKKLAEQGNI

Query:  EGAEQLLIVLRNAGHVDTEIYNSLL
           E +L +LR +G ++ + Y +L+
Subjt:  EGAEQLLIVLRNAGHVDTEIYNSLL

Q9SKU6 Pentatricopeptide repeat-containing protein At2g20710, mitochondrial2.6e-6534.14Show/hide
Query:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMR
        DTL RR+     P  S +  +  W ++G+ V+  EL+ I++ LRK  R+ HAL+I +WM+  +  ++  GD A+ LDLIAK+ GL  AEKFFE +P + R
Subjt:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMR

Query:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELKKNT-KPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK
              ALL+ Y    +  KAE + ++M E GFLK  L +N ML+LYV   +   V  L++E++  T KPD+ T N  L+  ++ +DVE  E   +  + 
Subjt:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELKKNT-KPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK

Query:  TK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEK
         + +  DW T++   N Y K  LTEKA   L++ E+M  +++ +  +  L+S Y   G K+ V+R+W   K       ++ Y  +IS+L+K+D +EE EK
Subjt:  TK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEK

Query:  LYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKK-WNADERLVKEVCKKLAEQ
        +  EWE+     D R+P++L+  Y  K  ME+AE   + ++ K  V   +TWE L  GY    +MEK V  +K AI   K  W   + ++      L  Q
Subjt:  LYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKK-WNADERLVKEVCKKLAEQ

Query:  GNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETR
         ++EG  ++L +L   GH+    Y+ LL     AG +   + + M K     + E R
Subjt:  GNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETR

Q9SY07 Pentatricopeptide repeat-containing protein At4g02820, mitochondrial5.6e-18562.88Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTAL------EGGSGTG-----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL
        ++RS R ++A+  R FSA A  AA  +TA         G G G           GGRDTLG RL+SL + KRSAV+ IRKW+EEGH+VRKYELNRIVREL
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTAL------EGGSGTG-----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL

Query:  RKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM
        RK+KRYKHALEICEWM +Q+DIKL  GDYAVHLDLI+KIRGLNSAEKFFED+P++MRG +ACT+LLH YVQN LSDKAEAL EKM ECGFLKS L +NHM
Subjt:  RKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM

Query:  LSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT
        LS+Y+S  Q EKVP LI+ELK  T PD+VTYNL L      NDVE AE ++L+ K+ K+ PDWVT+S LTNLY+K    EKA   LKEMEK+ SK+NR+ 
Subjt:  LSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT

Query:  FSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIV
        ++SL+SL+ NLGDKDGV   WKK+KSSF+KM+D+EY  MIS++VKL + E+A+ LY EWESVSGTGD R+PN++LA Y+N++++   E FY+R++ KGI 
Subjt:  FSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIV

Query:  PSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEK
        PSY+TWE+LTW YLK   MEKV+  F  AI SVKKW  + RLVK  CK+L EQGN++GAE+L+ +L+ AG+V+T++YNSLLRTYAKAG+M LIV ERM K
Subjt:  PSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEK

Query:  DNVQLDEETRELIKLTSKMCVSEVSSTL
        DNV+LDEET+ELI+LTS+M V+E+SST+
Subjt:  DNVQLDEETRELIKLTSKMCVSEVSSTL

Arabidopsis top hitse value%identityAlignment
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.6e-7234.59Show/hide
Query:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKD-IKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQS
        +++  +  P+  A   + +W++ G  + K+EL R+V+ELRK KR   ALE+ +WM  + +  +L   D A+ LDLI K+RG+  AE+FF  LPE  + + 
Subjt:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKD-IKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQS

Query:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK-TK
           +LL+ YV+    +KAEAL+  M + G+   PL FN M++LY++ ++ +KV A++ E+K K+ + D+ +YN+ L+       VE  E ++ +MK    
Subjt:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK-TK

Query:  IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTE
        I P+W TFST+  +Y K   TEKA   L+++E   + RNRI +  LLSLY +LG+K  ++R+W   KS    + +  Y  ++SSLV++  +E AEK+Y E
Subjt:  IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTE

Query:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGS--VKKWNADERLVKEVCKKLAEQGNI
        W  V  + D R+PN+L+ AY+  +Q+E AE  +D M+  G  PS +TWE+L  G+ ++  + + +   +NA  +     W     ++    K   E+ ++
Subjt:  WESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGS--VKKWNADERLVKEVCKKLAEQGNI

Query:  EGAEQLLIVLRNAGHVDTEIYNSLL
           E +L +LR +G ++ + Y +L+
Subjt:  EGAEQLLIVLRNAGHVDTEIYNSLL

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-6632.65Show/hide
Query:  VRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSE
        V K+E+   +++LR    Y  AL++ E M  ++ +     D A+HLDL+AK R + + E +F DLPE  + +    +LL+ Y +  L++KAE L+ KM E
Subjt:  VRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSE

Query:  CGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKT-KIEPDWVTFSTLTNLYSKKQLTEKAAST
             S +S+N +++LY    + EKVPA+IQELK +N  PD  TYN+ +      ND+   E +  EM +  ++ PDW T+S + ++Y    L++KA   
Subjt:  CGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK-KNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKT-KIEPDWVTFSTLTNLYSKKQLTEKAAST

Query:  LKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQME
        L+E+E   ++R+   +  L++LY  LG    V+RIW+ ++ +  K S+  Y  MI  LVKL+ L  AE L+ EW++   T D R+ N+L+ AY  +  ++
Subjt:  LKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQME

Query:  QAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAI----GSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRN-AGHVDTEIYNSL
        +A    ++   +G   +  TWE+    Y+K   M + +     A+    G   KW      V+ +     ++ ++ GAE LL +L+N   ++  EI+  L
Subjt:  QAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAI----GSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRN-AGHVDTEIYNSL

Query:  LRTYAKAGKMPLIVAERMEKDNVQLDEETRELIKLTSK
        +RTYA AGK    +  R++ +NV+++E T++L+   S+
Subjt:  LRTYAKAGKMPLIVAERMEKDNVQLDEETRELIKLTSK

AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-6634.14Show/hide
Query:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMR
        DTL RR+     P  S +  +  W ++G+ V+  EL+ I++ LRK  R+ HAL+I +WM+  +  ++  GD A+ LDLIAK+ GL  AEKFFE +P + R
Subjt:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMR

Query:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELKKNT-KPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK
              ALL+ Y    +  KAE + ++M E GFLK  L +N ML+LYV   +   V  L++E++  T KPD+ T N  L+  ++ +DVE  E   +  + 
Subjt:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELKKNT-KPDVVTYNLLLNVRTLQNDVEAAENIFLEMKK

Query:  TK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEK
         + +  DW T++   N Y K  LTEKA   L++ E+M  +++ +  +  L+S Y   G K+ V+R+W   K       ++ Y  +IS+L+K+D +EE EK
Subjt:  TK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEK

Query:  LYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKK-WNADERLVKEVCKKLAEQ
        +  EWE+     D R+P++L+  Y  K  ME+AE   + ++ K  V   +TWE L  GY    +MEK V  +K AI   K  W   + ++      L  Q
Subjt:  LYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKK-WNADERLVKEVCKKLAEQ

Query:  GNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETR
         ++EG  ++L +L   GH+    Y+ LL     AG +   + + M K     + E R
Subjt:  GNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETR

AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-18662.88Show/hide
Query:  MLRSLRTSMATAARRFSAEAYMAAVENTAL------EGGSGTG-----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL
        ++RS R ++A+  R FSA A  AA  +TA         G G G           GGRDTLG RL+SL + KRSAV+ IRKW+EEGH+VRKYELNRIVREL
Subjt:  MLRSLRTSMATAARRFSAEAYMAAVENTAL------EGGSGTG-----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL

Query:  RKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM
        RK+KRYKHALEICEWM +Q+DIKL  GDYAVHLDLI+KIRGLNSAEKFFED+P++MRG +ACT+LLH YVQN LSDKAEAL EKM ECGFLKS L +NHM
Subjt:  RKLKRYKHALEICEWMTLQKDIKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM

Query:  LSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT
        LS+Y+S  Q EKVP LI+ELK  T PD+VTYNL L      NDVE AE ++L+ K+ K+ PDWVT+S LTNLY+K    EKA   LKEMEK+ SK+NR+ 
Subjt:  LSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNVRTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT

Query:  FSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIV
        ++SL+SL+ NLGDKDGV   WKK+KSSF+KM+D+EY  MIS++VKL + E+A+ LY EWESVSGTGD R+PN++LA Y+N++++   E FY+R++ KGI 
Subjt:  FSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIV

Query:  PSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEK
        PSY+TWE+LTW YLK   MEKV+  F  AI SVKKW  + RLVK  CK+L EQGN++GAE+L+ +L+ AG+V+T++YNSLLRTYAKAG+M LIV ERM K
Subjt:  PSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEK

Query:  DNVQLDEETRELIKLTSKMCVSEVSSTL
        DNV+LDEET+ELI+LTS+M V+E+SST+
Subjt:  DNVQLDEETRELIKLTSKMCVSEVSSTL

AT5G27460.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-7033.95Show/hide
Query:  MATAARRFSAEAYMAAVENTALEGGSGTGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLL
        +A A  RF+    ++ + ++  +G   +        + ++    P+RS    +++  + GH V   EL  I + L +  RY  AL++ EWM  QKDI+  
Subjt:  MATAARRFSAEAYMAAVENTALEGGSGTGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLL

Query:  PGDYAVHLDLIAKIRGLNSAEKFFEDL---PEKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK
          D A+ LDLI K  GL   E++FE L      MR  +SA   LL  YV+N +  +AEALMEK++  GFL +P  FN M+ LY ++ Q EKV  ++  +K
Subjt:  PGDYAVHLDLIAKIRGLNSAEKFFEDL---PEKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELK

Query:  KNTKP-DVVTYNLLLNVRTLQNDVEAAENIFLEMKKTK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWR
         N  P +V++YNL +N     + V A E ++ EM   K +E  W +  TL N+Y K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K+GV R
Subjt:  KNTKP-DVVTYNLLLNVRTLQNDVEAAENIFLEMKKTK-IEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWR

Query:  IWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQM
        +W+  KS   ++S   Y C++SSLVK   LEEAE++++EWE+     D RV N+LL AY+   ++ +AES +  +L +G  P+Y TWE+L  G++K   M
Subjt:  IWKKMKSSFRKMSDSEYTCMISSLVKLDKLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQM

Query:  EKVVHFFKNAIGSVKK--WNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNV
        EK +         +++  W     +V  + +   ++  IE A   +  L   G     +Y  LLR +  A +    + E M+ D +
Subjt:  EKVVHFFKNAIGSVKK--WNADERLVKEVCKKLAEQGNIEGAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCGCTCCCTGCGGACGTCTATGGCTACGGCCGCCCGCCGATTCTCGGCGGAAGCCTACATGGCGGCGGTTGAGAACACAGCACTAGAAGGTGGCTCCGGCACCGG
CGGTGGTCGGGACACGCTTGGGCGTAGACTTATGAGCCTCGCCTTCCCAAAACGCAGCGCCGTGATTGCCATTCGCAAATGGCAAGAAGAGGGCCACACTGTCCGCAAGT
ACGAGCTCAATCGCATCGTCCGGGAACTCCGCAAGCTCAAGCGCTACAAGCACGCACTCGAGATATGTGAATGGATGACATTACAGAAAGATATCAAGCTGCTACCTGGT
GATTATGCAGTTCATCTGGATTTGATTGCAAAAATCCGTGGCCTGAATAGCGCAGAAAAGTTTTTCGAGGATCTCCCTGAAAAAATGAGAGGTCAATCAGCCTGCACAGC
TCTTCTTCATGTGTACGTGCAAAATAATCTATCTGACAAAGCTGAGGCTCTAATGGAGAAAATGTCTGAATGTGGTTTCTTGAAAAGTCCTCTTTCTTTCAACCACATGC
TATCTCTTTACGTCTCAAATAAGCAACCGGAGAAGGTTCCTGCTCTGATTCAAGAATTGAAGAAGAACACTAAACCAGATGTGGTAACATACAATCTTCTTTTGAATGTT
CGTACTTTGCAAAATGATGTTGAAGCTGCAGAAAACATTTTCCTTGAGATGAAGAAGACAAAAATTGAACCGGATTGGGTAACATTCAGCACATTAACCAACTTGTATTC
CAAAAAACAACTAACTGAAAAAGCAGCATCTACTTTGAAGGAGATGGAGAAAATGGCATCTAAAAGAAACAGAATCACATTTTCCTCTCTTCTTAGCTTATATACCAATT
TGGGGGATAAGGATGGAGTTTGGAGGATATGGAAAAAGATGAAGTCATCCTTTCGCAAGATGAGTGATAGTGAGTATACTTGCATGATATCCTCTCTTGTGAAACTCGAC
AAGCTTGAGGAAGCCGAGAAACTCTATACCGAATGGGAATCGGTTTCCGGGACAGGTGATACTCGAGTTCCAAATATATTGCTTGCAGCGTATATCAACAAAAACCAAAT
GGAACAAGCCGAGAGTTTCTACGATCGGATGTTGCTAAAAGGAATTGTTCCTTCTTACACTACTTGGGAGCTCCTCACATGGGGTTATTTGAAAGAGAACCAGATGGAGA
AAGTCGTGCATTTCTTCAAGAATGCTATTGGCAGCGTGAAGAAATGGAATGCGGATGAGAGGTTGGTTAAAGAAGTTTGTAAGAAACTTGCGGAGCAGGGTAATATTGAA
GGGGCAGAGCAGTTATTGATTGTTCTTAGGAATGCTGGTCATGTGGATACTGAAATATACAATTCTCTCTTACGCACTTATGCAAAAGCTGGTAAAATGCCACTCATTGT
TGCTGAAAGAATGGAAAAGGACAATGTTCAGTTGGATGAAGAGACTCGTGAGCTTATAAAATTGACCAGCAAGATGTGTGTGAGTGAAGTTTCAAGCACTTTGTACTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTCCGCTCCCTGCGGACGTCTATGGCTACGGCCGCCCGCCGATTCTCGGCGGAAGCCTACATGGCGGCGGTTGAGAACACAGCACTAGAAGGTGGCTCCGGCACCGG
CGGTGGTCGGGACACGCTTGGGCGTAGACTTATGAGCCTCGCCTTCCCAAAACGCAGCGCCGTGATTGCCATTCGCAAATGGCAAGAAGAGGGCCACACTGTCCGCAAGT
ACGAGCTCAATCGCATCGTCCGGGAACTCCGCAAGCTCAAGCGCTACAAGCACGCACTCGAGATATGTGAATGGATGACATTACAGAAAGATATCAAGCTGCTACCTGGT
GATTATGCAGTTCATCTGGATTTGATTGCAAAAATCCGTGGCCTGAATAGCGCAGAAAAGTTTTTCGAGGATCTCCCTGAAAAAATGAGAGGTCAATCAGCCTGCACAGC
TCTTCTTCATGTGTACGTGCAAAATAATCTATCTGACAAAGCTGAGGCTCTAATGGAGAAAATGTCTGAATGTGGTTTCTTGAAAAGTCCTCTTTCTTTCAACCACATGC
TATCTCTTTACGTCTCAAATAAGCAACCGGAGAAGGTTCCTGCTCTGATTCAAGAATTGAAGAAGAACACTAAACCAGATGTGGTAACATACAATCTTCTTTTGAATGTT
CGTACTTTGCAAAATGATGTTGAAGCTGCAGAAAACATTTTCCTTGAGATGAAGAAGACAAAAATTGAACCGGATTGGGTAACATTCAGCACATTAACCAACTTGTATTC
CAAAAAACAACTAACTGAAAAAGCAGCATCTACTTTGAAGGAGATGGAGAAAATGGCATCTAAAAGAAACAGAATCACATTTTCCTCTCTTCTTAGCTTATATACCAATT
TGGGGGATAAGGATGGAGTTTGGAGGATATGGAAAAAGATGAAGTCATCCTTTCGCAAGATGAGTGATAGTGAGTATACTTGCATGATATCCTCTCTTGTGAAACTCGAC
AAGCTTGAGGAAGCCGAGAAACTCTATACCGAATGGGAATCGGTTTCCGGGACAGGTGATACTCGAGTTCCAAATATATTGCTTGCAGCGTATATCAACAAAAACCAAAT
GGAACAAGCCGAGAGTTTCTACGATCGGATGTTGCTAAAAGGAATTGTTCCTTCTTACACTACTTGGGAGCTCCTCACATGGGGTTATTTGAAAGAGAACCAGATGGAGA
AAGTCGTGCATTTCTTCAAGAATGCTATTGGCAGCGTGAAGAAATGGAATGCGGATGAGAGGTTGGTTAAAGAAGTTTGTAAGAAACTTGCGGAGCAGGGTAATATTGAA
GGGGCAGAGCAGTTATTGATTGTTCTTAGGAATGCTGGTCATGTGGATACTGAAATATACAATTCTCTCTTACGCACTTATGCAAAAGCTGGTAAAATGCCACTCATTGT
TGCTGAAAGAATGGAAAAGGACAATGTTCAGTTGGATGAAGAGACTCGTGAGCTTATAAAATTGACCAGCAAGATGTGTGTGAGTGAAGTTTCAAGCACTTTGTACTAG
Protein sequenceShow/hide protein sequence
MLRSLRTSMATAARRFSAEAYMAAVENTALEGGSGTGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTLQKDIKLLPG
DYAVHLDLIAKIRGLNSAEKFFEDLPEKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLYVSNKQPEKVPALIQELKKNTKPDVVTYNLLLNV
RTLQNDVEAAENIFLEMKKTKIEPDWVTFSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKSSFRKMSDSEYTCMISSLVKLD
KLEEAEKLYTEWESVSGTGDTRVPNILLAAYINKNQMEQAESFYDRMLLKGIVPSYTTWELLTWGYLKENQMEKVVHFFKNAIGSVKKWNADERLVKEVCKKLAEQGNIE
GAEQLLIVLRNAGHVDTEIYNSLLRTYAKAGKMPLIVAERMEKDNVQLDEETRELIKLTSKMCVSEVSSTLY