; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007030 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007030
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr6:48202623..48217305
RNA-Seq ExpressionLag0007030
SyntenyLag0007030
Gene Ontology termsGO:0006644 - phospholipid metabolic process (biological process)
GO:0009451 - RNA modification (biological process)
GO:0015986 - ATP synthesis coupled proton transport (biological process)
GO:0050482 - arachidonic acid secretion (biological process)
GO:0000276 - mitochondrial proton-transporting ATP synthase complex, coupling factor F(o) (cellular component)
GO:0015078 - proton transmembrane transporter activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0004623 - phospholipase A2 activity (molecular function)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR036444 - Phospholipase A2 domain superfamily
IPR033113 - Phospholipase A2, histidine active site
IPR032867 - DYW domain
IPR029004 - Ribosomal L28e/Mak16
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR006808 - ATP synthase, F0 complex, subunit G, mitochondrial
IPR002885 - Pentatricopeptide repeat


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0061500.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.3e-23189.06Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        M+SSI S AT NSI TTNFKS IIPIH PAVKSFR T L TNALSNSR RKAAR YD QS+TNTLS+SQNQTSDSV+  PS VDL+ALC+EGKV+DALEY
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGA VDYGVFTALLNS  NLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYS+CGCMK+AR+VFDKMP++D RTWNLMIKGYGENGEGD+GLALFEQ
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MKNVGLQPNSETFLVVLAACAMAEAVEEG+FYFN M NE+GINPEI HYLGVVDVLGKSGHLIEAEEFIE M INPTAKIWDALRNYA+LHGNMELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
        EELMFSLDPS  AT TKP LPP  KQS+TNMLEEKDRVREFR AMPYKEEGEGKLKGLNGQM+EAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL

Query:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

KAG6586480.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]7.3e-29582.45Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        MASSIPSQ T NSIITTNF SRIIP HSPAVKSFRPT  C NALSNS NRK AR YDGQ++TNTLSKSQN+TSDSV  LPS VDLVALCE G+VMD LE+
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGAN+DYG+ TALLNS  N KLLEAGRRVD LLKGTKF GDV LNN LIEMYS CGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGD GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MK+VGLQPNSETFLVVLAACAMAEAVEEG+FYFNSMENE+GI PEI HYLGVVDVLGKSGHLIEAEEFI+ + INPTAKIWDALR YA+LHGN+ELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
        EEL+FS DPSMA T  KPPLPPP KQSATNMLEEKDRVREFRCAMPYKEEGE GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG

Query:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDYCL----GVGAPLSPTSGERKAAISVAFSPSQSPPSPSLLLFF
        LISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCS  +       GVGAPLSPTSG+RK                      
Subjt:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDYCL----GVGAPLSPTSGERKAAISVAFSPSQSPPSPSLLLFF

Query:  EVELLFCVDICEFFSFILDLMASKLHLLQSKVCQASQFAFKNGSSYYKQLLEQNKQYIQEPASVEKCNLLSKQLLYTRLASIPVRCESFWKELDCVKNLW
                   EF SF+LDLMASKLH LQSK C+ASQ+A K+GSSYYKQLLEQNKQYIQEPA+VEKC+LLS+QL YTRLASIP R ESFWKELDCVKNLW
Subjt:  EVELLFCVDICEFFSFILDLMASKLHLLQSKVCQASQFAFKNGSSYYKQLLEQNKQYIQEPASVEKCNLLSKQLLYTRLASIPVRCESFWKELDCVKNLW

Query:  KNRQELKVEDVGIAALFGLECFAWFCAGEIVGRGFTFT
        KNRQELKVED GIAALFGLECFAWFCAGEIVGRGFTFT
Subjt:  KNRQELKVEDVGIAALFGLECFAWFCAGEIVGRGFTFT

KAG7021339.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]3.3e-29582.6Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        MASSIPSQ T NSIITTNF SRIIP HSPAVKSFRPT  C NALSNS NRK AR YDGQ++TNTLSKSQN+TSDSV  LPS VDLVALCE G+VMD LE+
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGAN+DYG+ TALLNS  N KLLEAGRRVD LLKGTKF GDV LNNKLIEMYS CGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGD GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MK+VGLQPNSETFLVVLAACAMAEAVEEG+FYFNSMENE+GI PEI HYLGVVDVLGKSGHLIEAEEFI+ + INPTAKIWDALR YA+LHGN+ELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
        EEL+FS DPSMA T  KPPLPPP KQSATNMLEEKDRVREFRCAMPYKEEGE GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG

Query:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDYCL----GVGAPLSPTSGERKAAISVAFSPSQSPPSPSLLLFF
        LISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCS  +       GVGAPLSPTSG+RK                      
Subjt:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDYCL----GVGAPLSPTSGERKAAISVAFSPSQSPPSPSLLLFF

Query:  EVELLFCVDICEFFSFILDLMASKLHLLQSKVCQASQFAFKNGSSYYKQLLEQNKQYIQEPASVEKCNLLSKQLLYTRLASIPVRCESFWKELDCVKNLW
                   EF SF+LDLMASKLH LQSK C+ASQ+A K+GSSYYKQLLEQNKQYIQEPA+VEKC+LLS+QL YTRLASIP R ESFWKELDCVKNLW
Subjt:  EVELLFCVDICEFFSFILDLMASKLHLLQSKVCQASQFAFKNGSSYYKQLLEQNKQYIQEPASVEKCNLLSKQLLYTRLASIPVRCESFWKELDCVKNLW

Query:  KNRQELKVEDVGIAALFGLECFAWFCAGEIVGRGFTFT
        KNRQELKVED GIAALFGLECFAWFCAGEIVGRGFTFT
Subjt:  KNRQELKVEDVGIAALFGLECFAWFCAGEIVGRGFTFT

TYK10773.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.1e-22988.18Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        M+ SI S AT NSI TTNFKS IIPIH PAVKSFR T L TNALSNSR RKAAR YD QS+TNTLS+SQNQTSDSV+  PS VDL+ALCEEGKV+DALEY
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGA VDYGVFTALLNS  NLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYS+CGCMK+AR+VFDKMP++D RTWNLMIKGYGENGEGD+GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MKNVGLQPNSETFLV+LAACAMAEAVEEG+FYFN M NE+GI+PEI HYLGV+DVLGKSGHLIEAEEFIE M INPTAKIWDALRNYA+LHGNMELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
        EELMFSLDPS  AT TKP LPP  KQS+TNMLEEKDRVREFR AMPYKEEGEGKLKGLNGQM+EAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL

Query:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

XP_008459269.1 PREDICTED: pentatricopeptide repeat-containing protein At2g15690-like [Cucumis melo]2.2e-23088.4Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        M+SSI S AT NSI TTNFKS IIPIH PAVKSFR T L TNALSNSR RKAAR YD QS+TNTLS+SQNQTSDSV+  PS VDL+ALCEEGKV+DALEY
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGA VDYGVFTALLNS  NLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYS+CGCMK+AR+VFDKMP++D RTWNLMIKGYGENGEGD+GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MKNVGLQPNSETFLV+LAACAMAEAVEEG+FYFN M NE+GI+PEI HYLGV+DVLGKSGHLIEAEEFIE M INPTAKIWDALRNYA+LHGNMELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
        EELMFSLDPS  AT TKP LPP  KQS+TNMLEEKDRVREFR AMPYKEEGEGKLKGLNGQM+EAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL

Query:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

TrEMBL top hitse value%identityAlignment
A0A1S3C9A6 pentatricopeptide repeat-containing protein At2g15690-like1.0e-23088.4Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        M+SSI S AT NSI TTNFKS IIPIH PAVKSFR T L TNALSNSR RKAAR YD QS+TNTLS+SQNQTSDSV+  PS VDL+ALCEEGKV+DALEY
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGA VDYGVFTALLNS  NLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYS+CGCMK+AR+VFDKMP++D RTWNLMIKGYGENGEGD+GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MKNVGLQPNSETFLV+LAACAMAEAVEEG+FYFN M NE+GI+PEI HYLGV+DVLGKSGHLIEAEEFIE M INPTAKIWDALRNYA+LHGNMELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
        EELMFSLDPS  AT TKP LPP  KQS+TNMLEEKDRVREFR AMPYKEEGEGKLKGLNGQM+EAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL

Query:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

A0A5A7V2K9 Pentatricopeptide repeat-containing protein1.6e-23189.06Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        M+SSI S AT NSI TTNFKS IIPIH PAVKSFR T L TNALSNSR RKAAR YD QS+TNTLS+SQNQTSDSV+  PS VDL+ALC+EGKV+DALEY
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGA VDYGVFTALLNS  NLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYS+CGCMK+AR+VFDKMP++D RTWNLMIKGYGENGEGD+GLALFEQ
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MKNVGLQPNSETFLVVLAACAMAEAVEEG+FYFN M NE+GINPEI HYLGVVDVLGKSGHLIEAEEFIE M INPTAKIWDALRNYA+LHGNMELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
        EELMFSLDPS  AT TKP LPP  KQS+TNMLEEKDRVREFR AMPYKEEGEGKLKGLNGQM+EAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL

Query:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

A0A5D3CFT5 Pentatricopeptide repeat-containing protein5.2e-23088.18Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        M+ SI S AT NSI TTNFKS IIPIH PAVKSFR T L TNALSNSR RKAAR YD QS+TNTLS+SQNQTSDSV+  PS VDL+ALCEEGKV+DALEY
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGA VDYGVFTALLNS  NLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYS+CGCMK+AR+VFDKMP++D RTWNLMIKGYGENGEGD+GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MKNVGLQPNSETFLV+LAACAMAEAVEEG+FYFN M NE+GI+PEI HYLGV+DVLGKSGHLIEAEEFIE M INPTAKIWDALRNYA+LHGNMELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
        EELMFSLDPS  AT TKP LPP  KQS+TNMLEEKDRVREFR AMPYKEEGEGKLKGLNGQM+EAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGL

Query:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  ISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

A0A6J1FC52 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isoform X14.4e-22988.21Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        MASSIPSQ T NSIITTNF SRIIP HSPAVKSFRPT  C N+LSNS NRK AR YDGQ++TNTLSKSQN+TSDSV  LPS VDLVALCE G+VMD LE+
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGAN+DYG+ TALLNS  N KLLEAGRRVD LLKGTKF GDV LNN LIEMYS CGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGD GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MK+VGLQPNSETFLVVLAACAMAEAVEEG+FYFNSMENE+GI PEI HYLGVVDVLGKSGHLIEAEEFI+ + INPTAKIWDALR YA+LHGN+ELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
        EEL+FS DPSMA TA KPPL PP KQSATNMLEEKDRVREFRCAMPYKEEGE GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG

Query:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        LISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

A0A6J1FHU4 pentatricopeptide repeat-containing protein At2g15690, mitochondrial-like isoform X24.4e-22988.21Show/hide
Query:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY
        MASSIPSQ T NSIITTNF SRIIP HSPAVKSFRPT  C N+LSNS NRK AR YDGQ++TNTLSKSQN+TSDSV  LPS VDLVALCE G+VMD LE+
Subjt:  MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEY

Query:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ
        IGQGAN+DYG+ TALLNS  N KLLEAGRRVD LLKGTKF GDV LNN LIEMYS CGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGD GLALFE+
Subjt:  IGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQ

Query:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA
        MK+VGLQPNSETFLVVLAACAMAEAVEEG+FYFNSMENE+GI PEI HYLGVVDVLGKSGHLIEAEEFI+ + INPTAKIWDALR YA+LHGN+ELEDRA
Subjt:  MKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRA

Query:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
        EEL+FS DPSMA TA KPPL PP KQSATNMLEEKDRVREFRCAMPYKEEGE GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG
Subjt:  EELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGE-GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYG

Query:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        LISTPARTTLRIIKNLRICGDCHNAIKIMS+IVGRELIVRDNKRFHHFKDGKCSCGDY
Subjt:  LISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

SwissProt top hitse value%identityAlignment
Q9LTF4 Putative pentatricopeptide repeat-containing protein At5g526301.6e-7140.48Show/hide
Query:  DYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQ
        DY  F+++++   N  LLE GR++ GL   + F     + + L+ +YS CG  + A +VF+++P +++  WN M+K Y ++      + LF++MK  G++
Subjt:  DYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQ

Query:  PNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSL
        PN  TFL VL AC+ A  V+EG +YF+ M+ E  I P   HY  +VD+LG++G L EA E I NM I+PT  +W AL     +H N EL   A + +F L
Subjt:  PNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSL

Query:  DP-------------------SMAATATKPPLP-PPWKQSATNMLEEKDRVREFRCAMPYKEEGE---GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQ
         P                     AA A K        K++  + +EE+++V  F       E+ +    KL  L  +M +AGY+ DT YVL ++D + K 
Subjt:  DP-------------------SMAATATKPPLP-PPWKQSATNMLEEKDRVREFRCAMPYKEEGE---GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQ

Query:  QALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        Q ++YHSERLAIA+GLI+ PA   +R++KNLR+CGDCHNAIK MS    R +IVRDN RFH F+DGKCSC DY
Subjt:  QALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233303.3e-7237.72Show/hide
Query:  EEGKVMDALEYIGQ--GANVDYG--VFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGY
        + G+  +AL    Q   A V  G   F++++ +  +L  L  G+++ G +    F  ++ + + L++MYS CG +K AR++FD+M   D  +W  +I G+
Subjt:  EEGKVMDALEYIGQ--GANVDYG--VFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGY

Query:  GENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALR
          +G G   ++LFE+MK  G++PN   F+ VL AC+    V+E   YFNSM   +G+N E+ HY  V D+LG++G L EA  FI  M + PT  +W  L 
Subjt:  GENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALR

Query:  NYAQLHGNMELEDRAEELMFSLD--------------------PSMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQM
        +   +H N+EL ++  E +F++D                      MA    +       K+ A + +E K++   F     + P  ++    LK +  QM
Subjt:  NYAQLHGNMELEDRAEELMFSLD--------------------PSMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQM

Query:  REAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
         + GYV DT  VLHD+DEE K++ L  HSERLA+A+G+I+T   TT+R+ KN+RIC DCH AIK +S+I  RE+IVRDN RFHHF  G CSCGDY
Subjt:  REAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Q9SUH6 Pentatricopeptide repeat-containing protein At4g307007.0e-7540.7Show/hide
Query:  TALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSET
        T +L++   L  L  G+ V  L++ T F   + ++  LI MY+ CG + +ARR+FD M  ++  TWN MI GYG +G+G   L +F +M N G+ P   T
Subjt:  TALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSET

Query:  FLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDP---
        FL VL AC+ A  V+EG   FNSM + +G  P + HY  +VD+LG++GHL  A +FIE MSI P + +W+ L    ++H +  L     E +F LDP   
Subjt:  FLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDP---

Query:  --------------------SMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQA
                            ++  TA K  L    K     ++E  +    F     + P  +E   KL+ L G+MREAGY P+T   LHD++EE ++  
Subjt:  --------------------SMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQA

Query:  LQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ++ HSERLAIA+GLI+T   T +RIIKNLR+C DCH   K++S+I  R ++VRD  RFHHFKDG CSCGDY
Subjt:  LQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Q9SY02 Pentatricopeptide repeat-containing protein At4g027502.1e-7137.04Show/hide
Query:  QGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMK
        +G  ++   F++ L++  ++  LE G+++ G L    +     + N L+ MY  CG +++A  +F +M  +DI +WN MI GY  +G G+  L  FE MK
Subjt:  QGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMK

Query:  NVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEE
          GL+P+  T + VL+AC+    V++G  YF +M  ++G+ P   HY  +VD+LG++G L +A   ++NM   P A IW  L   +++HGN EL + A +
Subjt:  NVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEE

Query:  LMFSLDPSMAA----TATKPPLPPPW----------------KQSATNMLEEKDRVREFRCA---MPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDID
         +F+++P  +      +        W                K    + +E +++   F       P K+E    L+ L+ +M++AGYV  T  VLHD++
Subjt:  LMFSLDPSMAA----TATKPPLPPPW----------------KQSATNMLEEKDRVREFRCA---MPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDID

Query:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        EE K++ ++YHSERLA+AYG++   +   +R+IKNLR+C DCHNAIK M+RI GR +I+RDN RFHHFKDG CSCGDY
Subjt:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Q9ZQE5 Pentatricopeptide repeat-containing protein At2g15690, mitochondrial1.5e-11754.34Show/hide
Query:  SKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEYIGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARR
        ++S NQ ++ V   PSV +++ LC+     DA+E + +GA  D   F  L  S  NLK LE  ++V      +KFRGD +LNN +I M+  C  + DA+R
Subjt:  SKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEYIGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARR

Query:  VFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEA
        VFD M D+D+ +W+LM+  Y +NG GD  L LFE+M   GL+PN ETFL V  ACA    +EE   +F+SM+NEHGI+P+  HYLGV+ VLGK GHL+EA
Subjt:  VFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEA

Query:  EEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREA
        E++I ++   PTA  W+A+RNYA+LHG+++LED  EELM  +DPS  A   K P PPP     TNM+  K R+ EFR    YK+E + ++    G +   
Subjt:  EEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREA

Query:  GYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
         YVPDTR+VLHDID+EAK+QAL YHSERLAIAYG+I TP R TL IIKNLR+CGDCHN IKIMS+I+GR LIVRDNKRFHHFKDGKCSCGDY
Subjt:  GYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

Arabidopsis top hitse value%identityAlignment
AT2G15690.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-11854.34Show/hide
Query:  SKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEYIGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARR
        ++S NQ ++ V   PSV +++ LC+     DA+E + +GA  D   F  L  S  NLK LE  ++V      +KFRGD +LNN +I M+  C  + DA+R
Subjt:  SKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEYIGQGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARR

Query:  VFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEA
        VFD M D+D+ +W+LM+  Y +NG GD  L LFE+M   GL+PN ETFL V  ACA    +EE   +F+SM+NEHGI+P+  HYLGV+ VLGK GHL+EA
Subjt:  VFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEA

Query:  EEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREA
        E++I ++   PTA  W+A+RNYA+LHG+++LED  EELM  +DPS  A   K P PPP     TNM+  K R+ EFR    YK+E + ++    G +   
Subjt:  EEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDPSMAATATKPPLPPPWKQSATNMLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREA

Query:  GYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
         YVPDTR+VLHDID+EAK+QAL YHSERLAIAYG+I TP R TL IIKNLR+CGDCHN IKIMS+I+GR LIVRDNKRFHHFKDGKCSCGDY
Subjt:  GYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-7337.72Show/hide
Query:  EEGKVMDALEYIGQ--GANVDYG--VFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGY
        + G+  +AL    Q   A V  G   F++++ +  +L  L  G+++ G +    F  ++ + + L++MYS CG +K AR++FD+M   D  +W  +I G+
Subjt:  EEGKVMDALEYIGQ--GANVDYG--VFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGY

Query:  GENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALR
          +G G   ++LFE+MK  G++PN   F+ VL AC+    V+E   YFNSM   +G+N E+ HY  V D+LG++G L EA  FI  M + PT  +W  L 
Subjt:  GENGEGDHGLALFEQMKNVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALR

Query:  NYAQLHGNMELEDRAEELMFSLD--------------------PSMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQM
        +   +H N+EL ++  E +F++D                      MA    +       K+ A + +E K++   F     + P  ++    LK +  QM
Subjt:  NYAQLHGNMELEDRAEELMFSLD--------------------PSMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQM

Query:  REAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
         + GYV DT  VLHD+DEE K++ L  HSERLA+A+G+I+T   TT+R+ KN+RIC DCH AIK +S+I  RE+IVRDN RFHHF  G CSCGDY
Subjt:  REAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-7237.04Show/hide
Query:  QGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMK
        +G  ++   F++ L++  ++  LE G+++ G L    +     + N L+ MY  CG +++A  +F +M  +DI +WN MI GY  +G G+  L  FE MK
Subjt:  QGANVDYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMK

Query:  NVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEE
          GL+P+  T + VL+AC+    V++G  YF +M  ++G+ P   HY  +VD+LG++G L +A   ++NM   P A IW  L   +++HGN EL + A +
Subjt:  NVGLQPNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEE

Query:  LMFSLDPSMAA----TATKPPLPPPW----------------KQSATNMLEEKDRVREFRCA---MPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDID
         +F+++P  +      +        W                K    + +E +++   F       P K+E    L+ L+ +M++AGYV  T  VLHD++
Subjt:  LMFSLDPSMAA----TATKPPLPPPW----------------KQSATNMLEEKDRVREFRCA---MPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDID

Query:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        EE K++ ++YHSERLA+AYG++   +   +R+IKNLR+C DCHNAIK M+RI GR +I+RDN RFHHFKDG CSCGDY
Subjt:  EEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein5.0e-7640.7Show/hide
Query:  TALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSET
        T +L++   L  L  G+ V  L++ T F   + ++  LI MY+ CG + +ARR+FD M  ++  TWN MI GYG +G+G   L +F +M N G+ P   T
Subjt:  TALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSET

Query:  FLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDP---
        FL VL AC+ A  V+EG   FNSM + +G  P + HY  +VD+LG++GHL  A +FIE MSI P + +W+ L    ++H +  L     E +F LDP   
Subjt:  FLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDP---

Query:  --------------------SMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQA
                            ++  TA K  L    K     ++E  +    F     + P  +E   KL+ L G+MREAGY P+T   LHD++EE ++  
Subjt:  --------------------SMAATATKPPLPPPWKQSATNMLEEKDRVREFRC---AMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQA

Query:  LQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        ++ HSERLAIA+GLI+T   T +RIIKNLR+C DCH   K++S+I  R ++VRD  RFHHFKDG CSCGDY
Subjt:  LQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY

AT5G52630.1 mitochondrial RNAediting factor 11.2e-7240.48Show/hide
Query:  DYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQ
        DY  F+++++   N  LLE GR++ GL   + F     + + L+ +YS CG  + A +VF+++P +++  WN M+K Y ++      + LF++MK  G++
Subjt:  DYGVFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQ

Query:  PNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSL
        PN  TFL VL AC+ A  V+EG +YF+ M+ E  I P   HY  +VD+LG++G L EA E I NM I+PT  +W AL     +H N EL   A + +F L
Subjt:  PNSETFLVVLAACAMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSL

Query:  DP-------------------SMAATATKPPLP-PPWKQSATNMLEEKDRVREFRCAMPYKEEGE---GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQ
         P                     AA A K        K++  + +EE+++V  F       E+ +    KL  L  +M +AGY+ DT YVL ++D + K 
Subjt:  DP-------------------SMAATATKPPLP-PPWKQSATNMLEEKDRVREFRCAMPYKEEGE---GKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQ

Query:  QALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY
        Q ++YHSERLAIA+GLI+ PA   +R++KNLR+CGDCHNAIK MS    R +IVRDN RFH F+DGKCSC DY
Subjt:  QALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRDNKRFHHFKDGKCSCGDY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTCGATTCCTTCTCAGGCCACTCGAAACTCCATTATCACTACCAATTTCAAATCTCGAATCATCCCAATTCACTCTCCTGCTGTCAAATCCTTCCGCCCGAC
GTCCTTGTGCACTAATGCTCTCTCCAATTCCCGCAATCGGAAGGCAGCTCGCTCGTACGACGGCCAAAGCTCGACGAACACTCTCTCGAAATCTCAGAACCAGACGAGCG
ATTCAGTTTTTGGACTTCCGTCGGTGGTGGATTTAGTGGCTCTATGCGAAGAGGGTAAGGTAATGGATGCTCTGGAATATATTGGCCAAGGTGCTAATGTTGATTATGGT
GTTTTTACTGCTTTGTTGAATTCTAGTGGGAATTTGAAGTTGCTTGAGGCTGGAAGAAGGGTGGATGGGCTTTTGAAGGGGACGAAGTTTCGTGGGGATGTGGAATTGAA
CAATAAATTGATTGAAATGTACTCGAGTTGTGGCTGTATGAAAGATGCACGTAGGGTGTTTGATAAAATGCCTGACAGGGATATTAGGACTTGGAATTTGATGATCAAGG
GGTATGGAGAGAATGGTGAAGGAGATCATGGGTTGGCCTTGTTTGAGCAAATGAAGAATGTGGGATTGCAGCCAAATTCAGAAACATTTCTGGTGGTTTTAGCAGCTTGT
GCCATGGCTGAAGCTGTGGAGGAAGGTATATTTTACTTCAACTCAATGGAAAATGAACATGGAATCAATCCTGAGATTGGACATTATTTGGGAGTTGTTGATGTTCTTGG
GAAATCTGGACATTTGATTGAAGCAGAGGAGTTCATTGAGAACATGTCCATCAATCCCACTGCCAAAATCTGGGATGCCCTTAGAAACTACGCTCAACTCCATGGAAACA
TGGAGCTTGAAGATCGAGCCGAGGAGTTGATGTTCTCTCTCGACCCGTCCATGGCAGCCACAGCGACCAAGCCACCGCTCCCTCCGCCATGGAAGCAATCGGCAACCAAC
ATGTTGGAGGAGAAGGATAGAGTGAGGGAGTTTAGATGTGCAATGCCTTACAAAGAAGAGGGTGAAGGGAAGCTAAAAGGATTGAATGGACAAATGAGAGAAGCAGGCTA
TGTGCCAGATACAAGATATGTGCTACATGACATTGATGAAGAGGCTAAACAGCAGGCTTTGCAATACCATAGCGAGCGGTTGGCCATTGCTTATGGATTGATCAGTACAC
CGGCGAGAACGACGCTGAGGATCATCAAGAATCTTCGAATCTGCGGCGACTGCCACAATGCAATCAAGATCATGTCAAGAATTGTTGGAAGAGAGTTGATTGTTAGAGAT
AATAAACGTTTTCATCACTTTAAAGATGGAAAATGCTCCTGTGGGGATTACTGTCTCGGAGTCGGAGCTCCACTTTCTCCAACTAGCGGCGAGAGAAAAGCAGCCATTTC
AGTCGCATTTTCACCTTCTCAATCTCCTCCGTCACCGTCTCTTTTGCTATTTTTCGAAGTGGAATTATTGTTTTGTGTTGATATTTGCGAGTTTTTTAGCTTTATTTTGG
ATTTGATGGCATCCAAGTTGCATCTGTTGCAATCCAAGGTTTGTCAAGCTTCTCAATTTGCTTTCAAGAACGGTTCTTCCTATTATAAGCAGTTATTGGAGCAGAACAAG
CAATACATCCAAGAACCAGCTTCTGTGGAAAAATGCAACTTGCTTTCAAAGCAATTGTTGTATACTCGGCTTGCTAGCATCCCAGTCCGCTGTGAATCATTCTGGAAGGA
ACTCGATTGTGTGAAAAATTTGTGGAAGAACAGACAGGAGCTGAAAGTTGAAGATGTTGGCATCGCTGCCCTTTTCGGGCTGGAGTGCTTTGCATGGTTTTGTGCCGGTG
AGATCGTAGGAAGGGGCTTTACATTCACAGATGTGGCTTTTGCTGCCTACAACTTCCCAGCATTTCTCGTTGAACTTTTTAACCAAATGATCTGCAGGGTCAAGCGGGCT
GGAGAAACACGACATAGAAGAACTTCGCCGAAGGAATCTGTTCGTGGCCTGAAGAAGACGACTGAAAATGGCCCGCATCACCAGCATTTCCCTTCGCAAACATGTCTCAT
CAGCGATTATCCTCGCCCTTGTTCTCCTTACCGTCGTCTCCGAGTGTTCCAACAACGAATCTCGGCTGTTGGGATTCGATACGGAAAGTTCTGTGGAGTAGGATGGACGG
GTTGTGCTGGTGAAAAGCCTTGCGATGATCTTGATGCCTGTTGCAAAGTTCATGACGAATGCGTTGAAAGAAAAGGTTTAACCAATATTAAATGTCACGAGAAGTTCAAG
AGTTGTATTAAGAAAGTGCAGAAATCTGGGAAGGTTGGGTTCTCACAGGAGTGCCCTTATTCCACAGCTGTTCCTACAATGGTACAAGGCATGGATTTGGCCATCTTGTT
CAGCCAGTTTGTTGGGGCTGAAGAAAGCGGCAGACGGAACCGGCGTATTCTCTTCTGTTTCTGCATCTCTCTTCGACTTGTTGCAGCACAGATGGCGAACGTTCCGGGGC
AGTTGGTCTGGGAAATCGTCAAGAAGAACAGCTCATTCCTCGTTAAGGAGTTTGGCAGAGGCAATGCTGGTATTCAATTCAGCAAAGAGCCTAACAATCTCTACAACCTC
AACTCCTACAAGCATTCCGGCTTGGCAAACCGCAAGACTGTAACCGTTCAGGCAGGAGGCAAGGATTTGTCAGTATTGCTCGCGACAACAAAGACAAAGAAGCAGAACAA
ACCCGCGAGCTTGCTCCACAAATCACTCATGAGGAAGGAATTTCCTCGTATGGCCAAGGCTGTGACTAATCAGGTGGCTGACAATTACTACCGCCCGGACTTAAAGAAGG
CTGCCCTTGCCAAGCTAAGCGCAGTTCACAGGAGCCTCAAGGTTGCCAAGTCTGGTGTGAAGAAGAGGAACAGGCAGGCTGTTAGGACCCGCGGTAGGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTCGATTCCTTCTCAGGCCACTCGAAACTCCATTATCACTACCAATTTCAAATCTCGAATCATCCCAATTCACTCTCCTGCTGTCAAATCCTTCCGCCCGAC
GTCCTTGTGCACTAATGCTCTCTCCAATTCCCGCAATCGGAAGGCAGCTCGCTCGTACGACGGCCAAAGCTCGACGAACACTCTCTCGAAATCTCAGAACCAGACGAGCG
ATTCAGTTTTTGGACTTCCGTCGGTGGTGGATTTAGTGGCTCTATGCGAAGAGGGTAAGGTAATGGATGCTCTGGAATATATTGGCCAAGGTGCTAATGTTGATTATGGT
GTTTTTACTGCTTTGTTGAATTCTAGTGGGAATTTGAAGTTGCTTGAGGCTGGAAGAAGGGTGGATGGGCTTTTGAAGGGGACGAAGTTTCGTGGGGATGTGGAATTGAA
CAATAAATTGATTGAAATGTACTCGAGTTGTGGCTGTATGAAAGATGCACGTAGGGTGTTTGATAAAATGCCTGACAGGGATATTAGGACTTGGAATTTGATGATCAAGG
GGTATGGAGAGAATGGTGAAGGAGATCATGGGTTGGCCTTGTTTGAGCAAATGAAGAATGTGGGATTGCAGCCAAATTCAGAAACATTTCTGGTGGTTTTAGCAGCTTGT
GCCATGGCTGAAGCTGTGGAGGAAGGTATATTTTACTTCAACTCAATGGAAAATGAACATGGAATCAATCCTGAGATTGGACATTATTTGGGAGTTGTTGATGTTCTTGG
GAAATCTGGACATTTGATTGAAGCAGAGGAGTTCATTGAGAACATGTCCATCAATCCCACTGCCAAAATCTGGGATGCCCTTAGAAACTACGCTCAACTCCATGGAAACA
TGGAGCTTGAAGATCGAGCCGAGGAGTTGATGTTCTCTCTCGACCCGTCCATGGCAGCCACAGCGACCAAGCCACCGCTCCCTCCGCCATGGAAGCAATCGGCAACCAAC
ATGTTGGAGGAGAAGGATAGAGTGAGGGAGTTTAGATGTGCAATGCCTTACAAAGAAGAGGGTGAAGGGAAGCTAAAAGGATTGAATGGACAAATGAGAGAAGCAGGCTA
TGTGCCAGATACAAGATATGTGCTACATGACATTGATGAAGAGGCTAAACAGCAGGCTTTGCAATACCATAGCGAGCGGTTGGCCATTGCTTATGGATTGATCAGTACAC
CGGCGAGAACGACGCTGAGGATCATCAAGAATCTTCGAATCTGCGGCGACTGCCACAATGCAATCAAGATCATGTCAAGAATTGTTGGAAGAGAGTTGATTGTTAGAGAT
AATAAACGTTTTCATCACTTTAAAGATGGAAAATGCTCCTGTGGGGATTACTGTCTCGGAGTCGGAGCTCCACTTTCTCCAACTAGCGGCGAGAGAAAAGCAGCCATTTC
AGTCGCATTTTCACCTTCTCAATCTCCTCCGTCACCGTCTCTTTTGCTATTTTTCGAAGTGGAATTATTGTTTTGTGTTGATATTTGCGAGTTTTTTAGCTTTATTTTGG
ATTTGATGGCATCCAAGTTGCATCTGTTGCAATCCAAGGTTTGTCAAGCTTCTCAATTTGCTTTCAAGAACGGTTCTTCCTATTATAAGCAGTTATTGGAGCAGAACAAG
CAATACATCCAAGAACCAGCTTCTGTGGAAAAATGCAACTTGCTTTCAAAGCAATTGTTGTATACTCGGCTTGCTAGCATCCCAGTCCGCTGTGAATCATTCTGGAAGGA
ACTCGATTGTGTGAAAAATTTGTGGAAGAACAGACAGGAGCTGAAAGTTGAAGATGTTGGCATCGCTGCCCTTTTCGGGCTGGAGTGCTTTGCATGGTTTTGTGCCGGTG
AGATCGTAGGAAGGGGCTTTACATTCACAGATGTGGCTTTTGCTGCCTACAACTTCCCAGCATTTCTCGTTGAACTTTTTAACCAAATGATCTGCAGGGTCAAGCGGGCT
GGAGAAACACGACATAGAAGAACTTCGCCGAAGGAATCTGTTCGTGGCCTGAAGAAGACGACTGAAAATGGCCCGCATCACCAGCATTTCCCTTCGCAAACATGTCTCAT
CAGCGATTATCCTCGCCCTTGTTCTCCTTACCGTCGTCTCCGAGTGTTCCAACAACGAATCTCGGCTGTTGGGATTCGATACGGAAAGTTCTGTGGAGTAGGATGGACGG
GTTGTGCTGGTGAAAAGCCTTGCGATGATCTTGATGCCTGTTGCAAAGTTCATGACGAATGCGTTGAAAGAAAAGGTTTAACCAATATTAAATGTCACGAGAAGTTCAAG
AGTTGTATTAAGAAAGTGCAGAAATCTGGGAAGGTTGGGTTCTCACAGGAGTGCCCTTATTCCACAGCTGTTCCTACAATGGTACAAGGCATGGATTTGGCCATCTTGTT
CAGCCAGTTTGTTGGGGCTGAAGAAAGCGGCAGACGGAACCGGCGTATTCTCTTCTGTTTCTGCATCTCTCTTCGACTTGTTGCAGCACAGATGGCGAACGTTCCGGGGC
AGTTGGTCTGGGAAATCGTCAAGAAGAACAGCTCATTCCTCGTTAAGGAGTTTGGCAGAGGCAATGCTGGTATTCAATTCAGCAAAGAGCCTAACAATCTCTACAACCTC
AACTCCTACAAGCATTCCGGCTTGGCAAACCGCAAGACTGTAACCGTTCAGGCAGGAGGCAAGGATTTGTCAGTATTGCTCGCGACAACAAAGACAAAGAAGCAGAACAA
ACCCGCGAGCTTGCTCCACAAATCACTCATGAGGAAGGAATTTCCTCGTATGGCCAAGGCTGTGACTAATCAGGTGGCTGACAATTACTACCGCCCGGACTTAAAGAAGG
CTGCCCTTGCCAAGCTAAGCGCAGTTCACAGGAGCCTCAAGGTTGCCAAGTCTGGTGTGAAGAAGAGGAACAGGCAGGCTGTTAGGACCCGCGGTAGGAAGTGA
Protein sequenceShow/hide protein sequence
MASSIPSQATRNSIITTNFKSRIIPIHSPAVKSFRPTSLCTNALSNSRNRKAARSYDGQSSTNTLSKSQNQTSDSVFGLPSVVDLVALCEEGKVMDALEYIGQGANVDYG
VFTALLNSSGNLKLLEAGRRVDGLLKGTKFRGDVELNNKLIEMYSSCGCMKDARRVFDKMPDRDIRTWNLMIKGYGENGEGDHGLALFEQMKNVGLQPNSETFLVVLAAC
AMAEAVEEGIFYFNSMENEHGINPEIGHYLGVVDVLGKSGHLIEAEEFIENMSINPTAKIWDALRNYAQLHGNMELEDRAEELMFSLDPSMAATATKPPLPPPWKQSATN
MLEEKDRVREFRCAMPYKEEGEGKLKGLNGQMREAGYVPDTRYVLHDIDEEAKQQALQYHSERLAIAYGLISTPARTTLRIIKNLRICGDCHNAIKIMSRIVGRELIVRD
NKRFHHFKDGKCSCGDYCLGVGAPLSPTSGERKAAISVAFSPSQSPPSPSLLLFFEVELLFCVDICEFFSFILDLMASKLHLLQSKVCQASQFAFKNGSSYYKQLLEQNK
QYIQEPASVEKCNLLSKQLLYTRLASIPVRCESFWKELDCVKNLWKNRQELKVEDVGIAALFGLECFAWFCAGEIVGRGFTFTDVAFAAYNFPAFLVELFNQMICRVKRA
GETRHRRTSPKESVRGLKKTTENGPHHQHFPSQTCLISDYPRPCSPYRRLRVFQQRISAVGIRYGKFCGVGWTGCAGEKPCDDLDACCKVHDECVERKGLTNIKCHEKFK
SCIKKVQKSGKVGFSQECPYSTAVPTMVQGMDLAILFSQFVGAEESGRRNRRILFCFCISLRLVAAQMANVPGQLVWEIVKKNSSFLVKEFGRGNAGIQFSKEPNNLYNL
NSYKHSGLANRKTVTVQAGGKDLSVLLATTKTKKQNKPASLLHKSLMRKEFPRMAKAVTNQVADNYYRPDLKKAALAKLSAVHRSLKVAKSGVKKRNRQAVRTRGRK