; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS018397 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS018397
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold342:634725..637816
RNA-Seq ExpressionMS018397
SyntenyMS018397
Gene Ontology termsGO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027261.1 Pentatricopeptide repeat-containing protein, mitochondrial [Cucurbita argyrosperma subsp. argyrosperma]2.1e-25386.88Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        MLRS R  LATAARRFSGEA   AVENT +E  SGSSG GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV
        EWMT QKDMKLLPGDYAVHLDLI+KIRGL+SAEKFF DLPDKMRGQSA T+LLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSL+I+NK+L+KV
Subjt:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV

Query:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+Q+L+KNTKPDVVTYNLLLNVCTLQNDVEAAENI LEMK  KIE DWV+ STL NLYSK+QLTEKAASTLK+MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY
        KDGV RIWKKM +SFRKM+DSEYTCMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQ +QAESFY+RM+LKGIVPSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI
        LKENQMEKVL FFKNAVGSVKKWNADERLVK VCK+LEE+GN EGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERME+D+V+L+EE+REL+
Subjt:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI

Query:  KLTSKMCVSEVSSTFYYETRKTDQTD
        KLTSKMCVSEVSSTFY+E +KT  TD
Subjt:  KLTSKMCVSEVSSTFYYETRKTDQTD

XP_022150265.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X1 [Momordica charantia]3.2e-28995.79Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---
        MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE   
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---

Query:  ------------------ICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
                          ICEW TSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
Subjt:  ------------------ICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS

Query:  PLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA
        PLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA
Subjt:  PLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA

Query:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNR
        SKRNRITFSSLLSLYTNLGDKDG WRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNR
Subjt:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNR

Query:  MSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLI
        MSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLI
Subjt:  MSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLI

Query:  VAERMEKDDVKLDEETRELIKLTSKMCVSEVSSTFYYETRKTDQTD
        VAERMEKDDVKLDEETRELIKLTSKMCVSEVSSTFYYETRKTDQTD
Subjt:  VAERMEKDDVKLDEETRELIKLTSKMCVSEVSSTFYYETRKTDQTD

XP_022150266.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X2 [Momordica charantia]6.2e-29399.62Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
        MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE

Query:  WMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVP
        W TSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVP
Subjt:  WMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVP

Query:  ALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
        ALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
Subjt:  ALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK

Query:  DGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYL
        DG WRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYL
Subjt:  DGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYL

Query:  KENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIK
        KENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIK
Subjt:  KENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIK

Query:  LTSKMCVSEVSSTFYYETRKTDQTD
        LTSKMCVSEVSSTFYYETRKTDQTD
Subjt:  LTSKMCVSEVSSTFYYETRKTDQTD

XP_022960480.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial [Cucurbita moschata]6.2e-25386.31Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        MLRS R  LATAARRFSGEA   AVENT +E  SGSSG GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKY+LNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV
        EWMT QKDMKLLPGDYAVHLDLI+KIRGL+SAEKFF DLPDKMRGQSA T+LLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSLHI+NK+L+KV
Subjt:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV

Query:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+Q+L+KNTKPDVVTYNLLLNVCTLQNDVEAAENI LEMK  KIE DWV+ STL NLYSK+QLTEKAASTLK+MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY
        KDGV RIWKKM +SFRKM+DSEY CMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQ +QAESFY+RM+LKGI+PSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI
        LKENQMEKVL FFKNAVGSVKKWN DERLVK VCK+LEE+GN EGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERME+D+V+L+EE+REL+
Subjt:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI

Query:  KLTSKMCVSEVSSTFYYETRKTDQTD
        KLTSKMCVSEVSSTFY+E +KT++TD
Subjt:  KLTSKMCVSEVSSTFYYETRKTDQTD

XP_023515289.1 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo]1.3e-25386.88Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        MLRS R  LATAARRFSGEA+ AAVENT +E  SGSSG  GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV
        EWMT QKDMKLLPGDYAV LDLI+KIRGL+SAEKFF DLPDKMRGQSA TALLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSLHI+NK+L+KV
Subjt:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV

Query:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+Q+L+KNTKPDVVTYNLLLNVCTLQNDVEAAENI LEMK  KIE DWV+ STL NLYSK+QLTEKAASTLK MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY
        KDGV RIWKKM +SFRKM+DSEY CMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQ +QAESFY+RM+LKGIVPSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI
        LKENQMEKVL FFKNAVGSVKKWNADERLVK VCK+LEE+GN EGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERME+D+V+L+EE+REL+
Subjt:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI

Query:  KLTSKMCVSEVSSTFYYETRKTDQTD
        KLTSKMCVSEVSSTFY+E +KT++TD
Subjt:  KLTSKMCVSEVSSTFYYETRKTDQTD

TrEMBL top hitse value%identityAlignment
A0A5D3D4M6 Pentatricopeptide repeat-containing protein2.0e-24984.69Show/hide
Query:  MLRSLRTSLAT-AARRFSGEAFMAAVENTAIEGGSGS---SGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHAL
        M RS R+SLAT AARRFSGEA +AA ENT++EGG+G+   S  GGGRDTLGRRLMSL FPKRSAVIAIRKWQEEGHT+RKYELN IVRELRKLKRYKHAL
Subjt:  MLRSLRTSLAT-AARRFSGEAFMAAVENTAIEGGSGS---SGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHAL

Query:  EICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
        E+CEWMT QKDMKLLPGDYAV LDLIAKIRGLNSAEKFFEDLPDK+R QS CTALLH YVQ NLS+KAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL
Subjt:  EICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQL

Query:  DKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTN
        +KVPALI+ L+KNTKPDVVTYNLLLNVCTLQND EAAENI LEMKK K++ DW++ STL NLY KKQLTEKAA+TLKEMEKMA KRNR++FSSLLSLYTN
Subjt:  DKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTN

Query:  LGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLT
        LGDK+ V RIWKK+K+SFRKMSDSEY CM+SS+VKL+ELEEAEKLYTEWESVSGT DTR+ N++LAAYIN NQMEQAESFYNRMSLKGIVPSYTTWELLT
Subjt:  LGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLT

Query:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETR
        WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVK VCKKLEE+GNIEG E+LL++LRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERMEKD+V+L++ETR
Subjt:  WGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETR

Query:  ELIKLTSKMCVSEVSSTFYYETRKTDQTD
        EL++LTSKMCVSEVSST Y    KTDQT+
Subjt:  ELIKLTSKMCVSEVSSTFYYETRKTDQTD

A0A6J1D809 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X11.5e-28995.79Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---
        MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE   
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALE---

Query:  ------------------ICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
                          ICEW TSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS
Subjt:  ------------------ICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKS

Query:  PLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA
        PLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA
Subjt:  PLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA

Query:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNR
        SKRNRITFSSLLSLYTNLGDKDG WRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNR
Subjt:  SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNR

Query:  MSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLI
        MSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLI
Subjt:  MSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLI

Query:  VAERMEKDDVKLDEETRELIKLTSKMCVSEVSSTFYYETRKTDQTD
        VAERMEKDDVKLDEETRELIKLTSKMCVSEVSSTFYYETRKTDQTD
Subjt:  VAERMEKDDVKLDEETRELIKLTSKMCVSEVSSTFYYETRKTDQTD

A0A6J1DB09 pentatricopeptide repeat-containing protein At4g02820, mitochondrial isoform X23.0e-29399.62Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
        MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICE

Query:  WMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVP
        W TSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVP
Subjt:  WMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVP

Query:  ALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
        ALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK
Subjt:  ALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDK

Query:  DGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYL
        DG WRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYL
Subjt:  DGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYL

Query:  KENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIK
        KENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIK
Subjt:  KENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIK

Query:  LTSKMCVSEVSSTFYYETRKTDQTD
        LTSKMCVSEVSSTFYYETRKTDQTD
Subjt:  LTSKMCVSEVSSTFYYETRKTDQTD

A0A6J1H967 pentatricopeptide repeat-containing protein At4g02820, mitochondrial3.0e-25386.31Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        MLRS R  LATAARRFSGEA   AVENT +E  SGSSG GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKY+LNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV
        EWMT QKDMKLLPGDYAVHLDLI+KIRGL+SAEKFF DLPDKMRGQSA T+LLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSLHI+NK+L+KV
Subjt:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV

Query:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+Q+L+KNTKPDVVTYNLLLNVCTLQNDVEAAENI LEMK  KIE DWV+ STL NLYSK+QLTEKAASTLK+MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY
        KDGV RIWKKM +SFRKM+DSEY CMISS+VKL +LEEAEKLYTEWESVSGTGDTRVPNILLAAYIN NQ +QAESFY+RM+LKGI+PSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI
        LKENQMEKVL FFKNAVGSVKKWN DERLVK VCK+LEE+GN EGAE+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERME+D+V+L+EE+REL+
Subjt:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI

Query:  KLTSKMCVSEVSSTFYYETRKTDQTD
        KLTSKMCVSEVSSTFY+E +KT++TD
Subjt:  KLTSKMCVSEVSSTFYYETRKTDQTD

A0A6J1KUT7 pentatricopeptide repeat-containing protein At4g02820, mitochondrial1.1e-25285.93Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
        M RS R  LATAARRFSGEA+ AAVENT +E  SGSSG  GGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSG-GGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEIC

Query:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV
        EWMT QK+MKLLPGDYAVHLDLI+KIRGL+SAEKFF DLPDKMRGQSA T+LLHV+VQNNLS+KAEALM KMSE GFLKSPLSFNHMLSLHI+NK+L+KV
Subjt:  EWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKV

Query:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD
        PAL+Q+L+KNTKPDVVTYNLLLNVCTLQNDVEAAENI LEMK  KIE DWV+ STL NLYSK+QLTEKAASTLK+MEKMASKRNRI+FSSLLSLYTNLGD
Subjt:  PALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGD

Query:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY
        KDGV RIWKKM +SFRKM+DSEY CMISS+VKL +LE+AEKLYTEWESVSGTGDTRVPNILLAAYIN NQM+QAESFY+RM+LKGI+PSYTTWELLTWGY
Subjt:  KDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGY

Query:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI
        LKENQMEKVL FFKNAVGSVKKWNADERLVK VCK+LEE+GN EG E+LLI+LRNAGHV+TEIYNSLLRTYAKAGKMPLIVAERME+D+V+L+EE+REL+
Subjt:  LKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELI

Query:  KLTSKMCVSEVSSTFYYETRKTDQTD
        KLT+KMCVSEVSSTFY+E +KT++TD
Subjt:  KLTSKMCVSEVSSTFYYETRKTDQTD

SwissProt top hitse value%identityAlignment
O22714 Pentatricopeptide repeat-containing protein At1g607703.6e-6231.74Show/hide
Query:  VRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSE
        V K+E+   +++LR    Y  AL++ E M  ++ M     D A+HLDL+AK R + + E +F DLP+  + +    +LL+ Y +  L++KAE L+ KM E
Subjt:  VRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSE

Query:  CGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQ-KNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MKIERDWVTLSTLTNLYSKKQLTEKAAST
             S +S+N +++L+    + +KVPA+IQ+L+ +N  PD  TYN+ +      ND+   E ++ EM +  ++  DW T S + ++Y    L++KA   
Subjt:  CGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQ-KNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MKIERDWVTLSTLTNLYSKKQLTEKAAST

Query:  LKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQME
        L+E+E   ++R+   +  L++LY  LG    V+RIW+ ++ +  K S+  Y  MI  +VKL++L  AE L+ EW++   T D R+ N+L+ AY     ++
Subjt:  LKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQME

Query:  QAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRN-AGHVNTEIYNSL
        +A     +   +G   +  TWE+    Y+K   M + L     AV    G   KW      V+ +    E++ ++ GAE LL +L+N   ++  EI+  L
Subjt:  QAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRN-AGHVNTEIYNSL

Query:  LRTYAKAGKMPLIVAERMEKDDVKLDEETRELIKLTSK
        +RTYA AGK    +  R++ ++V+++E T++L+   S+
Subjt:  LRTYAKAGKMPLIVAERMEKDDVKLDEETRELIKLTSK

Q3E911 Pentatricopeptide repeat-containing protein At5g274602.0e-6834.29Show/hide
Query:  LATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDM
        +A A  RF+    ++ + ++  +G   SS     R++L + ++    P+RS    +++  + GH V   EL  I + L +  RY  AL++ EWM +QKD+
Subjt:  LATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDM

Query:  KLLPGDYAVHLDLIAKIRGLNSAEKFFEDL---PDKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQ
        +    D A+ LDLI K  GL   E++FE L      MR  +SA   LL  YV+N +  +AEALMEK++  GFL +P  FN M+ L+ ++ Q +KV  ++ 
Subjt:  KLLPGDYAVHLDLIAKIRGLNSAEKFFEDL---PDKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQ

Query:  DLQKNTKP-DVVTYNLLLNVCTLQNDVEAAENILLEMKKMK-IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDG
         ++ N  P +V++YNL +N C   + V A E +  EM   K +E  W +L TL N+Y K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K+G
Subjt:  DLQKNTKP-DVVTYNLLLNVCTLQNDVEAAENILLEMKKMK-IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDG

Query:  VWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKE
        V R+W+  K+   ++S   Y C++SS+VK  +LEEAE++++EWE+     D RV N+LL AY+ N ++ +AES +  +  +G  P+Y TWE+L  G++K 
Subjt:  VWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKE

Query:  NQMEKVLHFFKNAVGSVKK--WNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKD
          MEK +         +++  W     +V  + +  E+E  IE A   +  L   G  +  +Y  LLR +  A +    + E M+ D
Subjt:  NQMEKVLHFFKNAVGSVKK--WNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKD

Q8LPS6 Pentatricopeptide repeat-containing protein At1g021501.1e-6934.12Show/hide
Query:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQS
        +++  +  P+  A   + +W++ G  + K+EL R+V+ELRK KR   ALE+ +WM ++ +  +L   D A+ LDLI K+RG+  AE+FF  LP+  + + 
Subjt:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQS

Query:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDL-QKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MK
           +LL+ YV+    +KAEAL+  M + G+   PL FN M++L+++ ++ DKV A++ ++ QK+ + D+ +YN+ L+ C     VE  E +  +MK  + 
Subjt:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDL-QKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MK

Query:  IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTE
        I  +W T ST+  +Y K   TEKA   L+++E   + RNRI +  LLSLY +LG+K  ++R+W   K+    + +  Y  ++SS+V++ ++E AEK+Y E
Subjt:  IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTE

Query:  WESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKQVCKKLEEEGNI
        W  V  + D R+PN+L+ AY+ N+Q+E AE  ++ M   G  PS +TWE+L  G+ ++  + + L   +NA  +     W     ++    K  EEE ++
Subjt:  WESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKQVCKKLEEEGNI

Query:  EGAEKLLIVLRNAGHVNTEIYNSLL
           E +L +LR +G +  + Y +L+
Subjt:  EGAEKLLIVLRNAGHVNTEIYNSLL

Q9SKU6 Pentatricopeptide repeat-containing protein At2g20710, mitochondrial8.7e-6433.79Show/hide
Query:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR
        DTL RR+     P  S +  +  W ++G+ V+  EL+ I++ LRK  R+ HAL+I +WM+  +  ++  GD A+ LDLIAK+ GL  AEKFFE +P + R
Subjt:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR

Query:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQKNT-KPDVVTYNLLLNVCTLQNDVEAAENILLEMK-
              ALL+ Y    +  KAE + ++M E GFLK  L +N ML+L++   +   V  L+++++  T KPD+ T N  L+  ++ +DVE  E  L+  + 
Subjt:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQKNT-KPDVVTYNLLLNVCTLQNDVEAAENILLEMK-

Query:  KMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEK
           +  DW T +   N Y K  LTEKA   L++ E+M  +++ +  +  L+S Y   G K+ V+R+W   K       ++ Y  +IS+++K+ ++EE EK
Subjt:  KMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEK

Query:  LYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKK-WNADERLVKQVCKKLEEE
        +  EWE+     D R+P++L+  Y     ME+AE   N +  K  V   +TWE L  GY    +MEK +  +K A+   K  W   + ++      LE +
Subjt:  LYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKK-WNADERLVKQVCKKLEEE

Query:  GNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAG
         ++EG  K+L +L   GH++   Y+ LL     AG
Subjt:  GNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAG

Q9SY07 Pentatricopeptide repeat-containing protein At4g02820, mitochondrial2.3e-18162.81Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTA----IEGGSGSSGGG----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL
        ++RS R +LA+  R FS  A  AA  +TA    ++  SG   GG          GGRDTLG RL+SL + KRSAV+ IRKW+EEGH+VRKYELNRIVREL
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTA----IEGGSGSSGGG----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL

Query:  RKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM
        RK+KRYKHALEICEWM  Q+D+KL  GDYAVHLDLI+KIRGLNSAEKFFED+PD+MRG +ACT+LLH YVQN LSDKAEAL EKM ECGFLKS L +NHM
Subjt:  RKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM

Query:  LSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT
        LS++IS  Q +KVP LI++L+  T PD+VTYNL L      NDVE AE + L+ K+ K+  DWVT S LTNLY+K    EKA   LKEMEK+ SK+NR+ 
Subjt:  LSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT

Query:  FSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIV
        ++SL+SL+ NLGDKDGV   WKK+K+SF+KM+D+EY  MIS+VVKL E E+A+ LY EWESVSGTGD R+PN++LA Y+N +++   E FY R+  KGI 
Subjt:  FSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIV

Query:  PSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEK
        PSY+TWE+LTW YLK   MEKVL  F  A+ SVKKW  + RLVK  CK+LEE+GN++GAEKL+ +L+ AG+VNT++YNSLLRTYAKAG+M LIV ERM K
Subjt:  PSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEK

Query:  DDVKLDEETRELIKLTSKMCVSEVSST
        D+V+LDEET+ELI+LTS+M V+E+SST
Subjt:  DDVKLDEETRELIKLTSKMCVSEVSST

Arabidopsis top hitse value%identityAlignment
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.5e-7134.12Show/hide
Query:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQS
        +++  +  P+  A   + +W++ G  + K+EL R+V+ELRK KR   ALE+ +WM ++ +  +L   D A+ LDLI K+RG+  AE+FF  LP+  + + 
Subjt:  RRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKD-MKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQS

Query:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDL-QKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MK
           +LL+ YV+    +KAEAL+  M + G+   PL FN M++L+++ ++ DKV A++ ++ QK+ + D+ +YN+ L+ C     VE  E +  +MK  + 
Subjt:  ACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDL-QKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MK

Query:  IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTE
        I  +W T ST+  +Y K   TEKA   L+++E   + RNRI +  LLSLY +LG+K  ++R+W   K+    + +  Y  ++SS+V++ ++E AEK+Y E
Subjt:  IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTE

Query:  WESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKQVCKKLEEEGNI
        W  V  + D R+PN+L+ AY+ N+Q+E AE  ++ M   G  PS +TWE+L  G+ ++  + + L   +NA  +     W     ++    K  EEE ++
Subjt:  WESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGS--VKKWNADERLVKQVCKKLEEEGNI

Query:  EGAEKLLIVLRNAGHVNTEIYNSLL
           E +L +LR +G +  + Y +L+
Subjt:  EGAEKLLIVLRNAGHVNTEIYNSLL

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-6331.74Show/hide
Query:  VRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSE
        V K+E+   +++LR    Y  AL++ E M  ++ M     D A+HLDL+AK R + + E +F DLP+  + +    +LL+ Y +  L++KAE L+ KM E
Subjt:  VRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSE

Query:  CGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQ-KNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MKIERDWVTLSTLTNLYSKKQLTEKAAST
             S +S+N +++L+    + +KVPA+IQ+L+ +N  PD  TYN+ +      ND+   E ++ EM +  ++  DW T S + ++Y    L++KA   
Subjt:  CGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQ-KNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKK-MKIERDWVTLSTLTNLYSKKQLTEKAAST

Query:  LKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQME
        L+E+E   ++R+   +  L++LY  LG    V+RIW+ ++ +  K S+  Y  MI  +VKL++L  AE L+ EW++   T D R+ N+L+ AY     ++
Subjt:  LKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQME

Query:  QAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRN-AGHVNTEIYNSL
        +A     +   +G   +  TWE+    Y+K   M + L     AV    G   KW      V+ +    E++ ++ GAE LL +L+N   ++  EI+  L
Subjt:  QAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAV----GSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRN-AGHVNTEIYNSL

Query:  LRTYAKAGKMPLIVAERMEKDDVKLDEETRELIKLTSK
        +RTYA AGK    +  R++ ++V+++E T++L+   S+
Subjt:  LRTYAKAGKMPLIVAERMEKDDVKLDEETRELIKLTSK

AT2G20710.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.2e-6533.79Show/hide
Query:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR
        DTL RR+     P  S +  +  W ++G+ V+  EL+ I++ LRK  R+ HAL+I +WM+  +  ++  GD A+ LDLIAK+ GL  AEKFFE +P + R
Subjt:  DTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMR

Query:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQKNT-KPDVVTYNLLLNVCTLQNDVEAAENILLEMK-
              ALL+ Y    +  KAE + ++M E GFLK  L +N ML+L++   +   V  L+++++  T KPD+ T N  L+  ++ +DVE  E  L+  + 
Subjt:  GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQKNT-KPDVVTYNLLLNVCTLQNDVEAAENILLEMK-

Query:  KMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEK
           +  DW T +   N Y K  LTEKA   L++ E+M  +++ +  +  L+S Y   G K+ V+R+W   K       ++ Y  +IS+++K+ ++EE EK
Subjt:  KMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMA-SKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEK

Query:  LYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKK-WNADERLVKQVCKKLEEE
        +  EWE+     D R+P++L+  Y     ME+AE   N +  K  V   +TWE L  GY    +MEK +  +K A+   K  W   + ++      LE +
Subjt:  LYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKK-WNADERLVKQVCKKLEEE

Query:  GNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAG
         ++EG  K+L +L   GH++   Y+ LL     AG
Subjt:  GNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAG

AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-18262.81Show/hide
Query:  MLRSLRTSLATAARRFSGEAFMAAVENTA----IEGGSGSSGGG----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL
        ++RS R +LA+  R FS  A  AA  +TA    ++  SG   GG          GGRDTLG RL+SL + KRSAV+ IRKW+EEGH+VRKYELNRIVREL
Subjt:  MLRSLRTSLATAARRFSGEAFMAAVENTA----IEGGSGSSGGG----------GGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVREL

Query:  RKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM
        RK+KRYKHALEICEWM  Q+D+KL  GDYAVHLDLI+KIRGLNSAEKFFED+PD+MRG +ACT+LLH YVQN LSDKAEAL EKM ECGFLKS L +NHM
Subjt:  RKLKRYKHALEICEWMTSQKDMKLLPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHM

Query:  LSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT
        LS++IS  Q +KVP LI++L+  T PD+VTYNL L      NDVE AE + L+ K+ K+  DWVT S LTNLY+K    EKA   LKEMEK+ SK+NR+ 
Subjt:  LSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLLLNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRIT

Query:  FSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIV
        ++SL+SL+ NLGDKDGV   WKK+K+SF+KM+D+EY  MIS+VVKL E E+A+ LY EWESVSGTGD R+PN++LA Y+N +++   E FY R+  KGI 
Subjt:  FSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIV

Query:  PSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEK
        PSY+TWE+LTW YLK   MEKVL  F  A+ SVKKW  + RLVK  CK+LEE+GN++GAEKL+ +L+ AG+VNT++YNSLLRTYAKAG+M LIV ERM K
Subjt:  PSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEK

Query:  DDVKLDEETRELIKLTSKMCVSEVSST
        D+V+LDEET+ELI+LTS+M V+E+SST
Subjt:  DDVKLDEETRELIKLTSKMCVSEVSST

AT5G27460.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-6934.29Show/hide
Query:  LATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDM
        +A A  RF+    ++ + ++  +G   SS     R++L + ++    P+RS    +++  + GH V   EL  I + L +  RY  AL++ EWM +QKD+
Subjt:  LATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDM

Query:  KLLPGDYAVHLDLIAKIRGLNSAEKFFEDL---PDKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQ
        +    D A+ LDLI K  GL   E++FE L      MR  +SA   LL  YV+N +  +AEALMEK++  GFL +P  FN M+ L+ ++ Q +KV  ++ 
Subjt:  KLLPGDYAVHLDLIAKIRGLNSAEKFFEDL---PDKMR-GQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQ

Query:  DLQKNTKP-DVVTYNLLLNVCTLQNDVEAAENILLEMKKMK-IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDG
         ++ N  P +V++YNL +N C   + V A E +  EM   K +E  W +L TL N+Y K    EKA   L++ EKM ++ NR+ +  L++LY +LG+K+G
Subjt:  DLQKNTKP-DVVTYNLLLNVCTLQNDVEAAENILLEMKKMK-IERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDG

Query:  VWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKE
        V R+W+  K+   ++S   Y C++SS+VK  +LEEAE++++EWE+     D RV N+LL AY+ N ++ +AES +  +  +G  P+Y TWE+L  G++K 
Subjt:  VWRIWKKMKTSFRKMSDSEYTCMISSVVKLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKE

Query:  NQMEKVLHFFKNAVGSVKK--WNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKD
          MEK +         +++  W     +V  + +  E+E  IE A   +  L   G  +  +Y  LLR +  A +    + E M+ D
Subjt:  NQMEKVLHFFKNAVGSVKK--WNADERLVKQVCKKLEEEGNIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCGCTCCCTGCGGACATCTCTGGCTACGGCCGCCCGCCGATTTTCCGGGGAAGCCTTCATGGCGGCGGTCGAGAACACGGCAATAGAAGGCGGCTCCGGCAGCTC
CGGTGGAGGCGGCGGTCGAGACACGCTCGGGCGAAGACTCATGAGCCTCGCTTTCCCCAAGCGTAGCGCCGTGATTGCCATTCGCAAATGGCAAGAAGAAGGCCACACTG
TCCGCAAATACGAGCTCAATCGCATCGTCCGGGAACTCCGCAAGCTCAAGCGCTACAAGCACGCACTCGAGATATGTGAATGGATGACATCACAGAAAGATATGAAGCTG
CTACCAGGTGATTATGCAGTTCACCTGGATTTGATTGCAAAAATCCGTGGCCTGAATAGCGCAGAGAAGTTTTTCGAGGATCTCCCTGATAAAATGAGAGGCCAATCTGC
CTGTACAGCTCTTCTTCATGTGTATGTGCAAAACAATCTATCAGACAAAGCTGAGGCTTTAATGGAGAAAATGTCTGAATGTGGTTTCTTAAAAAGTCCTCTTTCTTTCA
ACCACATGCTATCCCTTCACATCTCAAACAAGCAACTAGACAAGGTTCCTGCTCTGATTCAAGATTTACAGAAGAACACTAAACCAGATGTGGTAACATACAATCTTTTG
TTGAATGTTTGTACTTTGCAAAATGATGTTGAAGCTGCTGAAAACATTCTCCTCGAGATGAAGAAGATGAAAATCGAACGAGACTGGGTAACATTAAGCACATTAACTAA
TCTGTATTCCAAAAAGCAACTTACTGAAAAAGCAGCATCTACTTTGAAGGAGATGGAGAAAATGGCATCTAAAAGAAATAGAATCACATTTTCCTCTCTTCTTAGCCTAT
ACACCAATTTGGGGGATAAGGATGGAGTTTGGAGGATATGGAAAAAGATGAAGACGTCGTTCCGAAAGATGAGTGATAGTGAGTATACTTGCATGATATCTTCTGTTGTG
AAACTCCACGAGCTCGAGGAGGCCGAGAAACTCTATACTGAATGGGAGTCGGTGTCTGGGACAGGCGATACTCGGGTTCCAAACATACTTCTCGCGGCATATATCAACAA
TAATCAAATGGAACAAGCCGAGAGTTTCTACAATCGGATGTCGCTAAAAGGAATTGTTCCATCTTACACAACTTGGGAGCTCCTCACGTGGGGTTATTTGAAGGAGAATC
AGATGGAGAAAGTGCTACATTTCTTCAAGAATGCTGTCGGCAGTGTGAAGAAATGGAATGCTGATGAGAGATTGGTTAAACAAGTGTGTAAGAAACTTGAGGAGGAGGGT
AACATTGAAGGGGCAGAAAAGTTATTGATTGTTCTTAGGAATGCAGGTCATGTGAATACTGAGATTTACAATTCTCTTTTGCGCACTTATGCAAAAGCTGGTAAAATGCC
ACTCATAGTTGCTGAAAGAATGGAAAAAGATGATGTTAAGTTGGATGAAGAGACCCGTGAGCTTATAAAACTGACCAGCAAGATGTGTGTGAGTGAAGTTTCGAGCACTT
TTTACTATGAAACCCGAAAGACCGACCAAACTGAC
mRNA sequenceShow/hide mRNA sequence
ATGCTCCGCTCCCTGCGGACATCTCTGGCTACGGCCGCCCGCCGATTTTCCGGGGAAGCCTTCATGGCGGCGGTCGAGAACACGGCAATAGAAGGCGGCTCCGGCAGCTC
CGGTGGAGGCGGCGGTCGAGACACGCTCGGGCGAAGACTCATGAGCCTCGCTTTCCCCAAGCGTAGCGCCGTGATTGCCATTCGCAAATGGCAAGAAGAAGGCCACACTG
TCCGCAAATACGAGCTCAATCGCATCGTCCGGGAACTCCGCAAGCTCAAGCGCTACAAGCACGCACTCGAGATATGTGAATGGATGACATCACAGAAAGATATGAAGCTG
CTACCAGGTGATTATGCAGTTCACCTGGATTTGATTGCAAAAATCCGTGGCCTGAATAGCGCAGAGAAGTTTTTCGAGGATCTCCCTGATAAAATGAGAGGCCAATCTGC
CTGTACAGCTCTTCTTCATGTGTATGTGCAAAACAATCTATCAGACAAAGCTGAGGCTTTAATGGAGAAAATGTCTGAATGTGGTTTCTTAAAAAGTCCTCTTTCTTTCA
ACCACATGCTATCCCTTCACATCTCAAACAAGCAACTAGACAAGGTTCCTGCTCTGATTCAAGATTTACAGAAGAACACTAAACCAGATGTGGTAACATACAATCTTTTG
TTGAATGTTTGTACTTTGCAAAATGATGTTGAAGCTGCTGAAAACATTCTCCTCGAGATGAAGAAGATGAAAATCGAACGAGACTGGGTAACATTAAGCACATTAACTAA
TCTGTATTCCAAAAAGCAACTTACTGAAAAAGCAGCATCTACTTTGAAGGAGATGGAGAAAATGGCATCTAAAAGAAATAGAATCACATTTTCCTCTCTTCTTAGCCTAT
ACACCAATTTGGGGGATAAGGATGGAGTTTGGAGGATATGGAAAAAGATGAAGACGTCGTTCCGAAAGATGAGTGATAGTGAGTATACTTGCATGATATCTTCTGTTGTG
AAACTCCACGAGCTCGAGGAGGCCGAGAAACTCTATACTGAATGGGAGTCGGTGTCTGGGACAGGCGATACTCGGGTTCCAAACATACTTCTCGCGGCATATATCAACAA
TAATCAAATGGAACAAGCCGAGAGTTTCTACAATCGGATGTCGCTAAAAGGAATTGTTCCATCTTACACAACTTGGGAGCTCCTCACGTGGGGTTATTTGAAGGAGAATC
AGATGGAGAAAGTGCTACATTTCTTCAAGAATGCTGTCGGCAGTGTGAAGAAATGGAATGCTGATGAGAGATTGGTTAAACAAGTGTGTAAGAAACTTGAGGAGGAGGGT
AACATTGAAGGGGCAGAAAAGTTATTGATTGTTCTTAGGAATGCAGGTCATGTGAATACTGAGATTTACAATTCTCTTTTGCGCACTTATGCAAAAGCTGGTAAAATGCC
ACTCATAGTTGCTGAAAGAATGGAAAAAGATGATGTTAAGTTGGATGAAGAGACCCGTGAGCTTATAAAACTGACCAGCAAGATGTGTGTGAGTGAAGTTTCGAGCACTT
TTTACTATGAAACCCGAAAGACCGACCAAACTGAC
Protein sequenceShow/hide protein sequence
MLRSLRTSLATAARRFSGEAFMAAVENTAIEGGSGSSGGGGGRDTLGRRLMSLAFPKRSAVIAIRKWQEEGHTVRKYELNRIVRELRKLKRYKHALEICEWMTSQKDMKL
LPGDYAVHLDLIAKIRGLNSAEKFFEDLPDKMRGQSACTALLHVYVQNNLSDKAEALMEKMSECGFLKSPLSFNHMLSLHISNKQLDKVPALIQDLQKNTKPDVVTYNLL
LNVCTLQNDVEAAENILLEMKKMKIERDWVTLSTLTNLYSKKQLTEKAASTLKEMEKMASKRNRITFSSLLSLYTNLGDKDGVWRIWKKMKTSFRKMSDSEYTCMISSVV
KLHELEEAEKLYTEWESVSGTGDTRVPNILLAAYINNNQMEQAESFYNRMSLKGIVPSYTTWELLTWGYLKENQMEKVLHFFKNAVGSVKKWNADERLVKQVCKKLEEEG
NIEGAEKLLIVLRNAGHVNTEIYNSLLRTYAKAGKMPLIVAERMEKDDVKLDEETRELIKLTSKMCVSEVSSTFYYETRKTDQTD