; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016347 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016347
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr03:4348095..4350324
RNA-Seq ExpressionHG10016347
SyntenyHG10016347
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022950690.1 pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita moschata]1.4e-25680.69Show/hide
Query:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY
        D S  +  PP  + CHS          L+ +   L+ S L  +  + +R  +LL  +E       V  +  IN+PN FCINRVIKAYS+S+ PL+AVFVY
Subjt:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY

Query:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM
        F+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNGVD VMVLRNSLIHMYGCCGHIELGRKVFDEM + D VSWNSIVTAYARVGDLHTA DM
Subjt:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM

Query:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR
        FDAMPERNVVSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMY KCQRVS+ARR
Subjt:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR

Query:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP
        +FDRM++RNLVTWNAMVLGHCLHG+PEDGLKLFEEMAAKLRE NGE GSGKKFKQDEG+RKV+ DQITFIGVLCACARAGLL+DA NYFDEMINVFLV+P
Subjt:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP

Query:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK
        NFAHYWCLANVYVAAGLIQQAVEILRNMPED EDFSSE VVW NLLATCRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMK
Subjt:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK

Query:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML
        EKRLGT PGCRLVDLKEIVHRLKLGNLLQEGM+ETNTVMHKLASE+ +L
Subjt:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML

XP_022950691.1 pentatricopeptide repeat-containing protein At3g51320 isoform X2 [Cucurbita moschata]1.4e-25680.69Show/hide
Query:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY
        D S  +  PP  + CHS          L+ +   L+ S L  +  + +R  +LL  +E       V  +  IN+PN FCINRVIKAYS+S+ PL+AVFVY
Subjt:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY

Query:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM
        F+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNGVD VMVLRNSLIHMYGCCGHIELGRKVFDEM + D VSWNSIVTAYARVGDLHTA DM
Subjt:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM

Query:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR
        FDAMPERNVVSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMY KCQRVS+ARR
Subjt:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR

Query:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP
        +FDRM++RNLVTWNAMVLGHCLHG+PEDGLKLFEEMAAKLRE NGE GSGKKFKQDEG+RKV+ DQITFIGVLCACARAGLL+DA NYFDEMINVFLV+P
Subjt:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP

Query:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK
        NFAHYWCLANVYVAAGLIQQAVEILRNMPED EDFSSE VVW NLLATCRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMK
Subjt:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK

Query:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML
        EKRLGT PGCRLVDLKEIVHRLKLGNLLQEGM+ETNTVMHKLASE+ +L
Subjt:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML

XP_023544620.1 pentatricopeptide repeat-containing protein At3g51320 isoform X1 [Cucurbita pepo subsp. pepo]5.6e-25880.69Show/hide
Query:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY
        D S  +  PP  + CHS          L+ +   L+ S L  +  + +R  +LL  +E       V  +  IN+PN FCINRVIKAYS+S+VPL+AVFVY
Subjt:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY

Query:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM
        F+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNGVD VMVLRNSLIHMY CCGHIELGRKVFDEMS+ D VSWNSIVTAYARVGDLHTA DM
Subjt:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM

Query:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR
        FDAMPERNVVSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMY KCQRVS+ARR
Subjt:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR

Query:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP
        +FDRM++RNLVTWNAMVLGHCLHG+PEDGLKLFEEMAAKLRE NGE GSGKKFKQDEG+RKV+PDQITFIGVLCACARAGLL+DA NYFDEMIN+FLV+P
Subjt:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP

Query:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK
        NFAHYWCLANVYVAAGLIQQAVEILRNMPED EDFSSE VVW NLLATCRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMK
Subjt:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK

Query:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML
        EKRLGT PGCRLVDLKEIVHRLKLGNLLQEGM+ETN+VMHKLASE+ +L
Subjt:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML

XP_023544621.1 pentatricopeptide repeat-containing protein At3g51320 isoform X2 [Cucurbita pepo subsp. pepo]5.6e-25880.69Show/hide
Query:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY
        D S  +  PP  + CHS          L+ +   L+ S L  +  + +R  +LL  +E       V  +  IN+PN FCINRVIKAYS+S+VPL+AVFVY
Subjt:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY

Query:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM
        F+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNGVD VMVLRNSLIHMY CCGHIELGRKVFDEMS+ D VSWNSIVTAYARVGDLHTA DM
Subjt:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM

Query:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR
        FDAMPERNVVSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMY KCQRVS+ARR
Subjt:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR

Query:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP
        +FDRM++RNLVTWNAMVLGHCLHG+PEDGLKLFEEMAAKLRE NGE GSGKKFKQDEG+RKV+PDQITFIGVLCACARAGLL+DA NYFDEMIN+FLV+P
Subjt:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP

Query:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK
        NFAHYWCLANVYVAAGLIQQAVEILRNMPED EDFSSE VVW NLLATCRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMK
Subjt:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK

Query:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML
        EKRLGT PGCRLVDLKEIVHRLKLGNLLQEGM+ETN+VMHKLASE+ +L
Subjt:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML

XP_038882774.1 pentatricopeptide repeat-containing protein At3g51320 [Benincasa hispida]5.9e-26892.9Show/hide
Query:  YINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVF
        YI++PNTFCINRVIKAYS+STVPL+AVF+YFEWLGNGFRPDSYTFL+LFSACA+FGC ASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVF
Subjt:  YINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVF

Query:  DEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFM
        DEMS+WD VSWNSIVTAYARVGDLH+A DMFD MPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNV GACGRSARLNEGRSVHGFM
Subjt:  DEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFM

Query:  YRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFI
        YR  M FCVFIDTALVDMYSKCQ+VSIARRVFDRMLSRNLVTWNAMVLGHCLHG+PEDGLKLFEEMAAKLREINGETGSGK+FKQ EGK+KV+PDQITFI
Subjt:  YRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFI

Query:  GVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEP
        GVLCACARAGLL+DAKNYFDEMINVFLV+PNFAHYWCLANVYVAAGLIQ+AVEILRNMPEDNEDFSSESVVWINLL TCRFVGDVSLGEQIAKYLID+EP
Subjt:  GVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEP

Query:  KNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML
        KNDSY RLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRL+DLKEIVHRLKLGNLLQEGM+ETNTVMHKLASE+ +L
Subjt:  KNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML

TrEMBL top hitse value%identityAlignment
A0A0A0KMJ6 Uncharacterized protein1.2e-25387.19Show/hide
Query:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHM
        DI      F H++    +PNTFC+NRVIKAYS+STVPL+AVFVYFEWLGNG RPDSYTFLSLFSACA+FGCGASGRKCHGQAFKNGVDSVMVL NSLIHM
Subjt:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHM

Query:  YGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGR
        YGCC HIELGRKVFDEMS+ D VSWNSIVTAYARVGDL+TA DMFD MPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVN+GIRGNNTTMVNVL AC R
Subjt:  YGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGR

Query:  SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQD
        SARLNEGRSVHGFMYR SMKFCVFI+TALVDMYSKC RVS+ARRVFDR++ RNLVTWNAM+LGH LHG+P+DGL+LFEEM  +LREIN ETG+GKKFKQD
Subjt:  SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQD

Query:  EGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVS
        EGKRKV+PDQITFIGVLCACARAGLLKDA+NYFDEMINVFLV+PNF HYWCLANVYVA GLI+QAVEILRNMPEDNEDFSSESVVWI+LL TCRFVGDVS
Subjt:  EGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVS

Query:  LGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMH
        LGEQIAKYLID+EPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTM GCRLVDLKEIVH LKLGN LQE M+ETNTV+H
Subjt:  LGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMH

A0A1S4DTP8 pentatricopeptide repeat-containing protein At3g513203.1e-25488.43Show/hide
Query:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHM
        DI      F H++    +PNTFC+NRVIKAYS+STVPL+AVFVYFEWLGNG RPDSYTFLSLFSACA+FGCGASGRKCHGQAFKNGVDSVMVL NSLIHM
Subjt:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHM

Query:  YGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGR
        YGCCGHIELGRKVFDEMS+ D VSWNSIVTAYARVGD++TA DMFD MPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVN+GIRGNNTTMVNVLGAC R
Subjt:  YGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGR

Query:  SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQD
        SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRV IARRVFDRM+SRNLVTWNAMVLGH LHG+P+DGLKLFEEMAA+LRE+  ETG+GKKFKQD
Subjt:  SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQD

Query:  EGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVS
        EGKRKV+PDQITFIGVLCACARAGLLKDAKNYFDEMI VFLV+PNFAHYWCLANVYVA GLI+QAVEILRNMP   EDFSSESVVWI+LL TCRFVGDVS
Subjt:  EGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVS

Query:  LGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMH
        LGEQIAKYLID+EPKNDSYYRLLLN+YAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVH LKLGN LQE M+ETNTV+H
Subjt:  LGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMH

A0A5D3CKW6 Pentatricopeptide repeat-containing protein4.0e-25488.43Show/hide
Query:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHM
        DI      F H++    +PNTFC+NRVIKAYS+STVPL+AVFVYFEWLGNG RPDSYTFLSLFSACA+FGCGASGRKCHGQAFKNGVDSVMVL NSLIHM
Subjt:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHM

Query:  YGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGR
        YGCCGHIELGRKVFDEMS+ D VSWNSIVTAYARVGD++TA DMFD MPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVN+GIRGNNTTMVNVLGAC R
Subjt:  YGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGR

Query:  SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQD
        SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRV IARRVFDRM+SRNLVTWNAMVLGH LHG+P+DGLKLFEEMAA+LRE+  ETG+GKKFKQD
Subjt:  SARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQD

Query:  EGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVS
        EGKRKV+PDQITFIGVLCACARAGLLKDAKNYFDEMI VFLV+PNFAHYWCLANVYVA GLI+QAVEILRNMP   EDFSSESVVWI+LL TCRFVGDVS
Subjt:  EGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVS

Query:  LGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMH
        LGEQIAKYLID+EPKNDSYYRLLLN+YAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVH LKLGN LQE M+ETNTV+H
Subjt:  LGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMH

A0A6J1GFI6 pentatricopeptide repeat-containing protein At3g51320 isoform X26.6e-25780.69Show/hide
Query:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY
        D S  +  PP  + CHS          L+ +   L+ S L  +  + +R  +LL  +E       V  +  IN+PN FCINRVIKAYS+S+ PL+AVFVY
Subjt:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY

Query:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM
        F+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNGVD VMVLRNSLIHMYGCCGHIELGRKVFDEM + D VSWNSIVTAYARVGDLHTA DM
Subjt:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM

Query:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR
        FDAMPERNVVSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMY KCQRVS+ARR
Subjt:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR

Query:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP
        +FDRM++RNLVTWNAMVLGHCLHG+PEDGLKLFEEMAAKLRE NGE GSGKKFKQDEG+RKV+ DQITFIGVLCACARAGLL+DA NYFDEMINVFLV+P
Subjt:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP

Query:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK
        NFAHYWCLANVYVAAGLIQQAVEILRNMPED EDFSSE VVW NLLATCRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMK
Subjt:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK

Query:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML
        EKRLGT PGCRLVDLKEIVHRLKLGNLLQEGM+ETNTVMHKLASE+ +L
Subjt:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML

A0A6J1GGG4 pentatricopeptide repeat-containing protein At3g51320 isoform X16.6e-25780.69Show/hide
Query:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY
        D S  +  PP  + CHS          L+ +   L+ S L  +  + +R  +LL  +E       V  +  IN+PN FCINRVIKAYS+S+ PL+AVFVY
Subjt:  DNSFVSHTPP--SLCHS--------NPLIHL---LLHSALCQNQIFRSRPQILLDITEATRFFNHVRAY--INIPNTFCINRVIKAYSISTVPLQAVFVY

Query:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM
        F+WLG+GFRPD+YTFLSLF ACA+ GCG+SGRKCHGQAFKNGVD VMVLRNSLIHMYGCCGHIELGRKVFDEM + D VSWNSIVTAYARVGDLHTA DM
Subjt:  FEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDM

Query:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR
        FDAMPERNVVSWNLMISEYLRGG+PGCAMKLFRNM+ IGIRGN+TTMVN+LGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMY KCQRVS+ARR
Subjt:  FDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARR

Query:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP
        +FDRM++RNLVTWNAMVLGHCLHG+PEDGLKLFEEMAAKLRE NGE GSGKKFKQDEG+RKV+ DQITFIGVLCACARAGLL+DA NYFDEMINVFLV+P
Subjt:  VFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKP

Query:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK
        NFAHYWCLANVYVAAGLIQQAVEILRNMPED EDFSSE VVW NLLATCRF GDVSLGEQIA YLID+EPKN+SYYRLLLNIYAVAGRWEDVSRIK+LMK
Subjt:  NFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMK

Query:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML
        EKRLGT PGCRLVDLKEIVHRLKLGNLLQEGM+ETNTVMHKLASE+ +L
Subjt:  EKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASELMML

SwissProt top hitse value%identityAlignment
Q0WVU0 Pentatricopeptide repeat-containing protein At3g513204.9e-14048.8Show/hide
Query:  SNPLIHLL-LHSALCQNQIFRSRPQILLDITEATRFFNH---VRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANF
        SN + HL  +H+ L  +  F      +  +  ++RF +    V  Y +I   +C N V KAY +S+ P QA+  YF+ L  GF PDSYTF+SL S     
Subjt:  SNPLIHLL-LHSALCQNQIFRSRPQILLDITEATRFFNH---VRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANF

Query:  GCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNP
         C  SG+ CHGQA K+G D V+ ++NSL+HMY CCG ++L +K+F E+   D VSWNSI+    R GD+  A  +FD MP++N++SWN+MIS YL   NP
Subjt:  GCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNP

Query:  GCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGH
        G ++ LFR MV  G +GN +T+V +L ACGRSARL EGRSVH  + RT +   V IDTAL+DMY KC+ V +ARR+FD +  RN VTWN M+L HCLHG 
Subjt:  GCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGH

Query:  PEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEIL
        PE GL+LFE M      ING                + PD++TF+GVLC CARAGL+   ++Y+  M++ F +KPNF H WC+AN+Y +AG  ++A E L
Subjt:  PEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEIL

Query:  RNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLG
        +N+P+  ED + ES  W NLL++ RF G+ +LGE IAK LI+ +P N  YY LL+NIY+V GRWEDV+R++ ++KE+++G +PGC LVDLKEIVH L+LG
Subjt:  RNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLG

Q9CA54 Pentatricopeptide repeat-containing protein At1g746305.9e-7731.81Show/hide
Query:  LDITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGF-RPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLI
        + I++A  +   +      P+ F  N +++ YS S  P  +V V+ E +  GF  PDS++F  +  A  NF    +G + H QA K+G++S + +  +LI
Subjt:  LDITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGF-RPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLI

Query:  HMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKL-----------------------
         MYG CG +E  RKVFDEM   + V+WN+++TA  R  D+  AR++FD M  RN  SWN+M++ Y++ G    A ++                       
Subjt:  HMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKL-----------------------

Query:  --------FRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRML-SRNLVTWNAMVLGHCL
                FR +   G+  N  ++  VL AC +S     G+ +HGF+ +    + V ++ AL+DMYS+C  V +AR VF+ M   R +V+W +M+ G  +
Subjt:  --------FRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRML-SRNLVTWNAMVLGHCL

Query:  HGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAV
        HG  E+ ++LF EM A                       V PD I+FI +L AC+ AGL+++ ++YF EM  V+ ++P   HY C+ ++Y  +G +Q+A 
Subjt:  HGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAV

Query:  EILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRL
        + +  MP         ++VW  LL  C   G++ L EQ+ + L +L+P N     LL N YA AG+W+DV+ I+  M  +R+       LV++ + +++ 
Subjt:  EILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRL

Query:  KLG
          G
Subjt:  KLG

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205402.2e-7632.31Show/hide
Query:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFR-PDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIH
        D+  ATR FN V    + PN F  N +I+AY+ +++    + +Y + L   F  PD +TF  +F +CA+ G    G++ HG   K G    +V  N+LI 
Subjt:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFR-PDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIH

Query:  MYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACG
        MY     +    KVFDEM   D +SWNS+++ YAR+G +  A+ +F  M ++ +VSW  MIS Y   G    AM  FR M   GI  +  ++++VL +C 
Subjt:  MYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACG

Query:  RSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQ
        +   L  G+ +H +  R        +  AL++MYSKC  +S A ++F +M  +++++W+ M+ G+  HG+    ++ F EM                   
Subjt:  RSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQ

Query:  DEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDV
           + KV P+ ITF+G+L AC+  G+ ++   YFD M   + ++P   HY CL +V   AG +++AVEI + MP        +S +W +LL++CR  G++
Subjt:  DEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDV

Query:  SLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLAS
         +      +L++LEP++   Y LL NIYA  G+WEDVSR++ +++ + +   PG  L+++  IV     G+  +    E + V+    S
Subjt:  SLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLAS

Q9SJG6 Pentatricopeptide repeat-containing protein At2g42920, chloroplastic3.0e-7332.37Show/hide
Query:  INIPNTFCINRVIKAYSISTVPLQAVFVYFEWL--GNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKV
        IN  N F  N +I+ +S S+ P  A+ ++ + L      +P   T+ S+F A    G    GR+ HG   K G++    +RN+++HMY  CG +    ++
Subjt:  INIPNTFCINRVIKAYSISTVPLQAVFVYFEWL--GNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKV

Query:  FDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGF
        F  M  +D V+WNS++  +A+ G +  A+++FD MP+RN VSWN MIS ++R G    A+ +FR M    ++ +  TMV++L AC       +GR +H +
Subjt:  FDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGF

Query:  MYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITF
        + R   +    + TAL+DMY KC  +     VF+    + L  WN+M+LG   +G  E  + LF E+                      +  + PD ++F
Subjt:  MYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITF

Query:  IGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLE
        IGVL ACA +G +  A  +F  M   ++++P+  HY  + NV   AGL+++A  +++NMP +      ++V+W +LL+ CR +G+V + ++ AK L  L+
Subjt:  IGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLE

Query:  PKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVH
        P     Y LL N YA  G +E+    +LLMKE+++    GC  +++   VH
Subjt:  PKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVH

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic1.2e-7734.04Show/hide
Query:  PNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMS
        P+ F     I   SI+ +  QA  +Y + L +   P+ +TF SL  +C+      SG+  H    K G+     +   L+ +Y   G +   +KVFD M 
Subjt:  PNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMS

Query:  SWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIG-IRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRT
            VS  +++T YA+ G++  AR +FD+M ER++VSWN+MI  Y + G P  A+ LF+ ++  G  + +  T+V  L AC +   L  GR +H F+  +
Subjt:  SWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIG-IRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRT

Query:  SMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVL
         ++  V + T L+DMYSKC  +  A  VF+    +++V WNAM+ G+ +HG+ +D L+LF EM        G TG             + P  ITFIG L
Subjt:  SMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVL

Query:  CACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKND
         ACA AGL+ +    F+ M   + +KP   HY CL ++   AG +++A E ++NM  D     ++SV+W ++L +C+  GD  LG++IA+YLI L  KN 
Subjt:  CACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKND

Query:  SYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASEL
          Y LL NIYA  G +E V++++ LMKEK +   PG   ++++  VH  + G+      +E  T++ K++  +
Subjt:  SYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASEL

Arabidopsis top hitse value%identityAlignment
AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.2e-7831.81Show/hide
Query:  LDITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGF-RPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLI
        + I++A  +   +      P+ F  N +++ YS S  P  +V V+ E +  GF  PDS++F  +  A  NF    +G + H QA K+G++S + +  +LI
Subjt:  LDITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGF-RPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLI

Query:  HMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKL-----------------------
         MYG CG +E  RKVFDEM   + V+WN+++TA  R  D+  AR++FD M  RN  SWN+M++ Y++ G    A ++                       
Subjt:  HMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKL-----------------------

Query:  --------FRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRML-SRNLVTWNAMVLGHCL
                FR +   G+  N  ++  VL AC +S     G+ +HGF+ +    + V ++ AL+DMYS+C  V +AR VF+ M   R +V+W +M+ G  +
Subjt:  --------FRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRML-SRNLVTWNAMVLGHCL

Query:  HGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAV
        HG  E+ ++LF EM A                       V PD I+FI +L AC+ AGL+++ ++YF EM  V+ ++P   HY C+ ++Y  +G +Q+A 
Subjt:  HGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAV

Query:  EILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRL
        + +  MP         ++VW  LL  C   G++ L EQ+ + L +L+P N     LL N YA AG+W+DV+ I+  M  +R+       LV++ + +++ 
Subjt:  EILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRL

Query:  KLG
          G
Subjt:  KLG

AT2G20540.1 mitochondrial editing factor 211.6e-7732.31Show/hide
Query:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFR-PDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIH
        D+  ATR FN V    + PN F  N +I+AY+ +++    + +Y + L   F  PD +TF  +F +CA+ G    G++ HG   K G    +V  N+LI 
Subjt:  DITEATRFFNHVRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFR-PDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIH

Query:  MYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACG
        MY     +    KVFDEM   D +SWNS+++ YAR+G +  A+ +F  M ++ +VSW  MIS Y   G    AM  FR M   GI  +  ++++VL +C 
Subjt:  MYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACG

Query:  RSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQ
        +   L  G+ +H +  R        +  AL++MYSKC  +S A ++F +M  +++++W+ M+ G+  HG+    ++ F EM                   
Subjt:  RSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQ

Query:  DEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDV
           + KV P+ ITF+G+L AC+  G+ ++   YFD M   + ++P   HY CL +V   AG +++AVEI + MP        +S +W +LL++CR  G++
Subjt:  DEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDV

Query:  SLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLAS
         +      +L++LEP++   Y LL NIYA  G+WEDVSR++ +++ + +   PG  L+++  IV     G+  +    E + V+    S
Subjt:  SLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLAS

AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.1e-7432.37Show/hide
Query:  INIPNTFCINRVIKAYSISTVPLQAVFVYFEWL--GNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKV
        IN  N F  N +I+ +S S+ P  A+ ++ + L      +P   T+ S+F A    G    GR+ HG   K G++    +RN+++HMY  CG +    ++
Subjt:  INIPNTFCINRVIKAYSISTVPLQAVFVYFEWL--GNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKV

Query:  FDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGF
        F  M  +D V+WNS++  +A+ G +  A+++FD MP+RN VSWN MIS ++R G    A+ +FR M    ++ +  TMV++L AC       +GR +H +
Subjt:  FDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGF

Query:  MYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITF
        + R   +    + TAL+DMY KC  +     VF+    + L  WN+M+LG   +G  E  + LF E+                      +  + PD ++F
Subjt:  MYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITF

Query:  IGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLE
        IGVL ACA +G +  A  +F  M   ++++P+  HY  + NV   AGL+++A  +++NMP +      ++V+W +LL+ CR +G+V + ++ AK L  L+
Subjt:  IGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLE

Query:  PKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVH
        P     Y LL N YA  G +E+    +LLMKE+++    GC  +++   VH
Subjt:  PKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVH

AT3G51320.1 Pentatricopeptide repeat (PPR) superfamily protein3.5e-14148.8Show/hide
Query:  SNPLIHLL-LHSALCQNQIFRSRPQILLDITEATRFFNH---VRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANF
        SN + HL  +H+ L  +  F      +  +  ++RF +    V  Y +I   +C N V KAY +S+ P QA+  YF+ L  GF PDSYTF+SL S     
Subjt:  SNPLIHLL-LHSALCQNQIFRSRPQILLDITEATRFFNH---VRAYINIPNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANF

Query:  GCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNP
         C  SG+ CHGQA K+G D V+ ++NSL+HMY CCG ++L +K+F E+   D VSWNSI+    R GD+  A  +FD MP++N++SWN+MIS YL   NP
Subjt:  GCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNP

Query:  GCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGH
        G ++ LFR MV  G +GN +T+V +L ACGRSARL EGRSVH  + RT +   V IDTAL+DMY KC+ V +ARR+FD +  RN VTWN M+L HCLHG 
Subjt:  GCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGH

Query:  PEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEIL
        PE GL+LFE M      ING                + PD++TF+GVLC CARAGL+   ++Y+  M++ F +KPNF H WC+AN+Y +AG  ++A E L
Subjt:  PEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEIL

Query:  RNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLG
        +N+P+  ED + ES  W NLL++ RF G+ +LGE IAK LI+ +P N  YY LL+NIY+V GRWEDV+R++ ++KE+++G +PGC LVDLKEIVH L+LG
Subjt:  RNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLG

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.4e-7934.04Show/hide
Query:  PNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMS
        P+ F     I   SI+ +  QA  +Y + L +   P+ +TF SL  +C+      SG+  H    K G+     +   L+ +Y   G +   +KVFD M 
Subjt:  PNTFCINRVIKAYSISTVPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMS

Query:  SWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIG-IRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRT
            VS  +++T YA+ G++  AR +FD+M ER++VSWN+MI  Y + G P  A+ LF+ ++  G  + +  T+V  L AC +   L  GR +H F+  +
Subjt:  SWDFVSWNSIVTAYARVGDLHTARDMFDAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIG-IRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRT

Query:  SMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVL
         ++  V + T L+DMYSKC  +  A  VF+    +++V WNAM+ G+ +HG+ +D L+LF EM        G TG             + P  ITFIG L
Subjt:  SMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLVTWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVL

Query:  CACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKND
         ACA AGL+ +    F+ M   + +KP   HY CL ++   AG +++A E ++NM  D     ++SV+W ++L +C+  GD  LG++IA+YLI L  KN 
Subjt:  CACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQAVEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKND

Query:  SYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASEL
          Y LL NIYA  G +E V++++ LMKEK +   PG   ++++  VH  + G+      +E  T++ K++  +
Subjt:  SYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEGMRETNTVMHKLASEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAACTAAGTGGAAGAAGAACCAGAAGAAGAAGAAGAAGAATTCAGTGAAAGTTAATCAATGTCATACCGCAAATGAATGGCAAGGATTTCCACTCGACAACTC
TTTCGTTTCACACACGCCCCCCTCCCTTTGCCATTCAAATCCGTTGATCCATCTTCTTCTCCATTCTGCTCTTTGCCAGAACCAGATCTTTCGCTCGAGACCACAAATCC
TCCTAGACATAACCGAAGCCACTCGCTTCTTCAATCATGTCAGAGCGTATATCAACATTCCCAATACCTTCTGTATCAATAGAGTAATTAAGGCTTATTCTATTAGCACA
GTTCCTCTACAGGCTGTATTTGTGTATTTTGAATGGCTTGGTAATGGGTTTCGGCCAGATTCGTACACTTTTCTTTCACTTTTTTCGGCTTGTGCGAATTTTGGCTGTGG
AGCTTCTGGGCGTAAGTGCCATGGACAAGCTTTCAAGAATGGGGTTGACTCTGTGATGGTTTTGAGAAATAGTTTGATTCATATGTATGGCTGTTGTGGCCATATTGAGC
TCGGTCGGAAGGTGTTCGACGAAATGTCGAGCTGGGATTTCGTATCTTGGAATTCAATTGTTACTGCTTATGCAAGAGTTGGAGATTTGCATACTGCCCGTGACATGTTC
GATGCAATGCCTGAGAGAAATGTTGTGTCTTGGAATTTGATGATTAGTGAGTATTTGAGAGGTGGGAATCCGGGCTGTGCAATGAAGTTGTTTAGGAATATGGTGAATAT
AGGAATCAGAGGGAACAATACAACAATGGTAAATGTTCTTGGTGCTTGCGGTCGATCGGCCAGGCTGAATGAAGGAAGATCGGTTCATGGTTTTATGTACCGTACTTCAA
TGAAGTTTTGCGTATTTATCGACACGGCATTGGTTGACATGTATAGCAAATGCCAGAGAGTGTCTATTGCACGTAGAGTTTTTGACAGGATGCTGAGTCGAAATTTGGTT
ACCTGGAATGCAATGGTTTTGGGGCATTGTCTACATGGCCATCCTGAAGATGGACTTAAGCTGTTTGAGGAAATGGCCGCCAAATTAAGAGAAATAAATGGGGAAACTGG
CAGTGGCAAGAAATTCAAGCAAGATGAAGGTAAGCGAAAAGTTTACCCAGACCAAATTACATTTATTGGCGTTCTATGTGCCTGTGCCCGAGCAGGACTGCTGAAAGATG
CAAAGAATTACTTCGACGAGATGATCAATGTGTTTCTTGTGAAGCCAAATTTCGCCCACTACTGGTGTTTAGCCAATGTTTACGTTGCAGCAGGATTGATACAACAGGCT
GTGGAAATACTGAGGAACATGCCTGAGGATAACGAAGACTTTTCATCAGAATCGGTCGTATGGATTAACTTGCTCGCTACATGTCGTTTCGTGGGGGATGTTTCTTTAGG
AGAACAGATAGCAAAATATTTGATTGACTTGGAACCTAAGAATGACTCATACTATAGATTGCTTCTGAATATTTATGCTGTAGCAGGGAGATGGGAGGATGTGTCTAGAA
TCAAATTATTAATGAAAGAAAAAAGACTCGGAACAATGCCGGGTTGTAGACTAGTAGACCTGAAAGAGATTGTTCACAGATTAAAACTGGGAAATCTTCTACAAGAGGGG
ATGAGGGAGACGAACACTGTGATGCATAAACTTGCTAGTGAACTCATGATGTTGTGGCTATGCGGCAGATTCATGTTCAAAGCTCTTTTCAACCTAGCTGCATATGAAGT
GAAGGTGGAGCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAACTAAGTGGAAGAAGAACCAGAAGAAGAAGAAGAAGAATTCAGTGAAAGTTAATCAATGTCATACCGCAAATGAATGGCAAGGATTTCCACTCGACAACTC
TTTCGTTTCACACACGCCCCCCTCCCTTTGCCATTCAAATCCGTTGATCCATCTTCTTCTCCATTCTGCTCTTTGCCAGAACCAGATCTTTCGCTCGAGACCACAAATCC
TCCTAGACATAACCGAAGCCACTCGCTTCTTCAATCATGTCAGAGCGTATATCAACATTCCCAATACCTTCTGTATCAATAGAGTAATTAAGGCTTATTCTATTAGCACA
GTTCCTCTACAGGCTGTATTTGTGTATTTTGAATGGCTTGGTAATGGGTTTCGGCCAGATTCGTACACTTTTCTTTCACTTTTTTCGGCTTGTGCGAATTTTGGCTGTGG
AGCTTCTGGGCGTAAGTGCCATGGACAAGCTTTCAAGAATGGGGTTGACTCTGTGATGGTTTTGAGAAATAGTTTGATTCATATGTATGGCTGTTGTGGCCATATTGAGC
TCGGTCGGAAGGTGTTCGACGAAATGTCGAGCTGGGATTTCGTATCTTGGAATTCAATTGTTACTGCTTATGCAAGAGTTGGAGATTTGCATACTGCCCGTGACATGTTC
GATGCAATGCCTGAGAGAAATGTTGTGTCTTGGAATTTGATGATTAGTGAGTATTTGAGAGGTGGGAATCCGGGCTGTGCAATGAAGTTGTTTAGGAATATGGTGAATAT
AGGAATCAGAGGGAACAATACAACAATGGTAAATGTTCTTGGTGCTTGCGGTCGATCGGCCAGGCTGAATGAAGGAAGATCGGTTCATGGTTTTATGTACCGTACTTCAA
TGAAGTTTTGCGTATTTATCGACACGGCATTGGTTGACATGTATAGCAAATGCCAGAGAGTGTCTATTGCACGTAGAGTTTTTGACAGGATGCTGAGTCGAAATTTGGTT
ACCTGGAATGCAATGGTTTTGGGGCATTGTCTACATGGCCATCCTGAAGATGGACTTAAGCTGTTTGAGGAAATGGCCGCCAAATTAAGAGAAATAAATGGGGAAACTGG
CAGTGGCAAGAAATTCAAGCAAGATGAAGGTAAGCGAAAAGTTTACCCAGACCAAATTACATTTATTGGCGTTCTATGTGCCTGTGCCCGAGCAGGACTGCTGAAAGATG
CAAAGAATTACTTCGACGAGATGATCAATGTGTTTCTTGTGAAGCCAAATTTCGCCCACTACTGGTGTTTAGCCAATGTTTACGTTGCAGCAGGATTGATACAACAGGCT
GTGGAAATACTGAGGAACATGCCTGAGGATAACGAAGACTTTTCATCAGAATCGGTCGTATGGATTAACTTGCTCGCTACATGTCGTTTCGTGGGGGATGTTTCTTTAGG
AGAACAGATAGCAAAATATTTGATTGACTTGGAACCTAAGAATGACTCATACTATAGATTGCTTCTGAATATTTATGCTGTAGCAGGGAGATGGGAGGATGTGTCTAGAA
TCAAATTATTAATGAAAGAAAAAAGACTCGGAACAATGCCGGGTTGTAGACTAGTAGACCTGAAAGAGATTGTTCACAGATTAAAACTGGGAAATCTTCTACAAGAGGGG
ATGAGGGAGACGAACACTGTGATGCATAAACTTGCTAGTGAACTCATGATGTTGTGGCTATGCGGCAGATTCATGTTCAAAGCTCTTTTCAACCTAGCTGCATATGAAGT
GAAGGTGGAGCAATAA
Protein sequenceShow/hide protein sequence
MAETKWKKNQKKKKKNSVKVNQCHTANEWQGFPLDNSFVSHTPPSLCHSNPLIHLLLHSALCQNQIFRSRPQILLDITEATRFFNHVRAYINIPNTFCINRVIKAYSIST
VPLQAVFVYFEWLGNGFRPDSYTFLSLFSACANFGCGASGRKCHGQAFKNGVDSVMVLRNSLIHMYGCCGHIELGRKVFDEMSSWDFVSWNSIVTAYARVGDLHTARDMF
DAMPERNVVSWNLMISEYLRGGNPGCAMKLFRNMVNIGIRGNNTTMVNVLGACGRSARLNEGRSVHGFMYRTSMKFCVFIDTALVDMYSKCQRVSIARRVFDRMLSRNLV
TWNAMVLGHCLHGHPEDGLKLFEEMAAKLREINGETGSGKKFKQDEGKRKVYPDQITFIGVLCACARAGLLKDAKNYFDEMINVFLVKPNFAHYWCLANVYVAAGLIQQA
VEILRNMPEDNEDFSSESVVWINLLATCRFVGDVSLGEQIAKYLIDLEPKNDSYYRLLLNIYAVAGRWEDVSRIKLLMKEKRLGTMPGCRLVDLKEIVHRLKLGNLLQEG
MRETNTVMHKLASELMMLWLCGRFMFKALFNLAAYEVKVEQ