; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG07G003760 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG07G003760
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCG_Chr07:4156101..4158588
RNA-Seq ExpressionClCG07G003760
SyntenyClCG07G003760
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044646 - Pentatricopeptide repeat-containing protein EMB1417-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464896.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucumis melo]8.2e-14092.19Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
        MLV HGSSTGFDALVPKI CIYYHNK  FR   V CVH QAAQP TSF T ERR+VKKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKR+FSRMISLYEHHDLQD
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD

Query:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD
        KIIEIFADMEELGVKPDEDTVRR+GRAFQKLGQEENRK+VYKRY CQWKYIHFKGERVRVR+D WDEDD
Subjt:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD

XP_011654973.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucumis sativus]2.6e-13891.79Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
        MLVFHG+STGFDAL+PKI CIYYHNK TF  + V CVH QAAQPLTSF T ERR+VKKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKR+FSRMISLYEHHDLQD
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD

Query:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDED
        KIIEIFADMEELGVKPDEDTVRRV  AFQKLGQE+NRK+VYKRY CQWKYIHFKGERVRVRRD WDED
Subjt:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDED

XP_023541595.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita pepo subsp. pepo]8.8e-13489.63Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIV-KKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE
        ML + GSSTGFDALVPK  CIY +NKL FRA  V CVHKQAAQ LTS  TAERRIV KKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIV-KKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE

Query:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQ
        LNKWIAWETEFPLIAA+KALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISKRLFSRMISLY+HHDLQ
Subjt:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQ

Query:  DKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD
        DKIIEIFADMEELGV+PDEDTVRRV  AF+KLGQEEN K+VYKRYGC+WKYIHFKGERVRVRRD WDEDD
Subjt:  DKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD

XP_031740922.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 [Cucumis sativus]4.6e-13590.67Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
        MLVFHG+STGFDAL+PKI CIYYHNK TF  + V CVH QAAQPLTSF T ERR+VKKVGKE HHLWKKRDSAGSGQKALNL   VSQCPNEKEAVYGEL
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKR+FSRMISLYEHHDLQD
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD

Query:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDED
        KIIEIFADMEELGVKPDEDTVRRV  AFQKLGQE+NRK+VYKRY CQWKYIHFKGERVRVRRD WDED
Subjt:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDED

XP_038892676.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Benincasa hispida]9.4e-13691.45Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
        MLV+HGSSTGFDALVPKI CIY++NKLTFRA  V CVHKQ          AERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
        NKWIAWETEFPLIAAAKALRILRKR+QWK VIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD

Query:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD
        KIIEIFADMEELGVKPDEDTVRRVGRAF KLGQEEN+K+VYKRYGCQWKYIHFKGERVRVRRD WDEDD
Subjt:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD

TrEMBL top hitse value%identityAlignment
A0A0A0KSA5 Uncharacterized protein1.3e-13891.79Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
        MLVFHG+STGFDAL+PKI CIYYHNK TF  + V CVH QAAQPLTSF T ERR+VKKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKR+FSRMISLYEHHDLQD
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD

Query:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDED
        KIIEIFADMEELGVKPDEDTVRRV  AFQKLGQE+NRK+VYKRY CQWKYIHFKGERVRVRRD WDED
Subjt:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDED

A0A1S3CP50 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X14.0e-14092.19Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
        MLV HGSSTGFDALVPKI CIYYHNK  FR   V CVH QAAQP TSF T ERR+VKKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKR+FSRMISLYEHHDLQD
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD

Query:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD
        KIIEIFADMEELGVKPDEDTVRR+GRAFQKLGQEENRK+VYKRY CQWKYIHFKGERVRVR+D WDEDD
Subjt:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD

A0A5A7UD21 Pentatricopeptide repeat-containing protein4.0e-14092.19Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
        MLV HGSSTGFDALVPKI CIYYHNK  FR   V CVH QAAQP TSF T ERR+VKKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKR+FSRMISLYEHHDLQD
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQD

Query:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD
        KIIEIFADMEELGVKPDEDTVRR+GRAFQKLGQEENRK+VYKRY CQWKYIHFKGERVRVR+D WDEDD
Subjt:  KIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD

A0A6J1FWG3 pentatricopeptide repeat-containing protein At4g18975, chloroplastic1.6e-13389.26Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIV-KKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE
        ML + GSSTGFDALVPK  CIY +NKL FRA  V CVHKQAAQ LT   TAERRIV KKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIV-KKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE

Query:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQ
        LNKWIAWETEFPLIAA+KALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISKRLFSRMISLY+HHDLQ
Subjt:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQ

Query:  DKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD
        DKIIEIFADMEELGV+PDEDTVRRV  AF+KLGQEEN K+VYKRYGC+WKYIHFKGERVRVRRD WDEDD
Subjt:  DKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD

A0A6J1K6G2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic1.0e-13288.89Show/hide
Query:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIV-KKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE
        ML + GSSTGFDALVPK  CIY +NKL FRA  V CVHKQAAQ LTS  TAERRIV KKVGKE HHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE
Subjt:  MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIV-KKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGE

Query:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQ
        LNKWIAWETEFPLIAA+KALRILRKR+QWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISKRLFSRMI+LY+HHDLQ
Subjt:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQ

Query:  DKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD
        DKIIEIFADMEELGV+PDEDTVRRV  AF+KLGQEEN K VYKRYGC+WKYIHFK ERVRVRRD WDEDD
Subjt:  DKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDD

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic4.6e-9376.53Show/hide
Query:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT
        T   + +KKVGK+EHHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ+AKWMLSKGQGATMGT
Subjt:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT

Query:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK
        YD LLLAFDMD+R DEAESLWNMILHTHTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++L QEENRK++ +RY  ++K
Subjt:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK

Query:  YIHFKGERVRVRR
        YI+F GERVRV+R
Subjt:  YIHFKGERVRVRR

Q8LG95 Pentatricopeptide repeat-containing protein At4g211903.2e-4645.81Show/hide
Query:  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDE
        +WK R   G+  KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL    +WK++IQV KWMLSKGQG TMGTY +LL A   D R+DE
Subjt:  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDE

Query:  AESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRY-GCQWKYIHFKGERVRVRRDE
        AE LWN +   H     ++ F++MIS+Y   D+  K+ E+FADMEELGVKP+   V  VG+ F KL  ++  + + K+Y   QW++ + KG RV+V+  +
Subjt:  AESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRY-GCQWKYIHFKGERVRVRRDE

Query:  WDE
         +E
Subjt:  WDE

Q9M8W9 Pentatricopeptide repeat-containing protein At3g04130, mitochondrial8.8e-0424.86Show/hide
Query:  KVGKEEHHLWKKRDSAGSGQKA-----LNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDT
        K  + E  LW  ++  G G +        ++R   Q   E   VY  L++  A  +    I     +  L  + +++  ++VA  M   G       Y+ 
Subjt:  KVGKEEHHLWKKRDSAGSGQKA-----LNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDT

Query:  LLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGV-KPDEDTVRRVGRAFQKLG
        L+       R++EAE ++ + +     SI+   ++ MI++Y HHD +DK IE+  +ME   +  PD  T + + R+  K G
Subjt:  LLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGV-KPDEDTVRRVGRAFQKLG

Arabidopsis top hitse value%identityAlignment
AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein3.2e-9476.53Show/hide
Query:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT
        T   + +KKVGK+EHHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ+AKWMLSKGQGATMGT
Subjt:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT

Query:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK
        YD LLLAFDMD+R DEAESLWNMILHTHTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++L QEENRK++ +RY  ++K
Subjt:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK

Query:  YIHFKGERVRVRR
        YI+F GERVRV+R
Subjt:  YIHFKGERVRVRR

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein3.2e-9476.53Show/hide
Query:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT
        T   + +KKVGK+EHHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ+AKWMLSKGQGATMGT
Subjt:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT

Query:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK
        YD LLLAFDMD+R DEAESLWNMILHTHTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++L QEENRK++ +RY  ++K
Subjt:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK

Query:  YIHFKGERVRVRR
        YI+F GERVRV+R
Subjt:  YIHFKGERVRVRR

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein3.2e-9476.53Show/hide
Query:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT
        T   + +KKVGK+EHHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ+AKWMLSKGQGATMGT
Subjt:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT

Query:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK
        YD LLLAFDMD+R DEAESLWNMILHTHTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++L QEENRK++ +RY  ++K
Subjt:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK

Query:  YIHFKGERVRVRR
        YI+F GERVRV+R
Subjt:  YIHFKGERVRVRR

AT4G18975.4 Pentatricopeptide repeat (PPR) superfamily protein3.2e-9476.53Show/hide
Query:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT
        T   + +KKVGK+EHHLWKK DSAGSGQKALNLVR++S  PNEKEAVYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ+AKWMLSKGQGATMGT
Subjt:  TAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGT

Query:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK
        YD LLLAFDMD+R DEAESLWNMILHTHTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++L QEENRK++ +RY  ++K
Subjt:  YDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRYGCQWK

Query:  YIHFKGERVRVRR
        YI+F GERVRV+R
Subjt:  YIHFKGERVRVRR

AT4G21190.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-4745.81Show/hide
Query:  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDE
        +WK R   G+  KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL    +WK++IQV KWMLSKGQG TMGTY +LL A   D R+DE
Subjt:  LWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDE

Query:  AESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRY-GCQWKYIHFKGERVRVRRDE
        AE LWN +   H     ++ F++MIS+Y   D+  K+ E+FADMEELGVKP+   V  VG+ F KL  ++  + + K+Y   QW++ + KG RV+V+  +
Subjt:  AESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKIVYKRY-GCQWKYIHFKGERVRVRRDE

Query:  WDE
         +E
Subjt:  WDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTTTTCCATGGAAGCTCAACTGGGTTCGATGCCCTCGTGCCGAAGATCTATTGCATTTACTATCACAACAAATTGACATTTAGAGCTACCATTGTCAATTGTGT
CCACAAGCAAGCTGCACAACCGCTTACTAGTTTCATCACAGCTGAGAGACGTATTGTTAAGAAGGTTGGGAAGGAGGAACACCATTTATGGAAGAAAAGAGATTCTGCTG
GCTCTGGGCAAAAGGCTCTTAATCTTGTTAGAATTGTTTCCCAATGTCCTAATGAGAAAGAAGCTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACAGAGTTT
CCATTGATTGCAGCTGCTAAAGCTTTAAGAATACTGAGGAAGAGAAATCAATGGAAGCGTGTCATTCAAGTGGCAAAGTGGATGTTAAGCAAGGGTCAAGGAGCCACAAT
GGGAACATATGACACCCTTCTACTGGCATTTGATATGGACAAGAGGGTGGACGAAGCCGAATCCTTATGGAACATGATTTTGCATACACATACACGGTCCATCTCTAAGA
GATTGTTTTCTAGGATGATCTCTTTGTATGAACACCATGACTTGCAAGATAAGATTATCGAGATATTTGCAGACATGGAAGAGTTGGGTGTAAAACCAGATGAAGATACC
GTCAGAAGAGTCGGCCGTGCCTTTCAAAAACTAGGACAAGAAGAAAACCGGAAAATCGTCTATAAAAGATATGGATGCCAATGGAAATACATACACTTCAAGGGTGAGAG
AGTTAGAGTGCGAAGAGATGAATGGGATGAAGATGATGTATGA
mRNA sequenceShow/hide mRNA sequence
GGTAAGGTGTGAGCTTTCATGGGTGGCTTTGATTCATATCTGTAATTTGTGTTGCGCCCTTCTTTATCCTAAATGAATTAACCCCTTTGACCAGAGAGAGCCCAAACTCA
AACAGAGCGCTGAAATTTGAAATTTTTGATGTTGGTTTTCCATGGAAGCTCAACTGGGTTCGATGCCCTCGTGCCGAAGATCTATTGCATTTACTATCACAACAAATTGA
CATTTAGAGCTACCATTGTCAATTGTGTCCACAAGCAAGCTGCACAACCGCTTACTAGTTTCATCACAGCTGAGAGACGTATTGTTAAGAAGGTTGGGAAGGAGGAACAC
CATTTATGGAAGAAAAGAGATTCTGCTGGCTCTGGGCAAAAGGCTCTTAATCTTGTTAGAATTGTTTCCCAATGTCCTAATGAGAAAGAAGCTGTATATGGAGAATTGAA
TAAGTGGATAGCTTGGGAGACAGAGTTTCCATTGATTGCAGCTGCTAAAGCTTTAAGAATACTGAGGAAGAGAAATCAATGGAAGCGTGTCATTCAAGTGGCAAAGTGGA
TGTTAAGCAAGGGTCAAGGAGCCACAATGGGAACATATGACACCCTTCTACTGGCATTTGATATGGACAAGAGGGTGGACGAAGCCGAATCCTTATGGAACATGATTTTG
CATACACATACACGGTCCATCTCTAAGAGATTGTTTTCTAGGATGATCTCTTTGTATGAACACCATGACTTGCAAGATAAGATTATCGAGATATTTGCAGACATGGAAGA
GTTGGGTGTAAAACCAGATGAAGATACCGTCAGAAGAGTCGGCCGTGCCTTTCAAAAACTAGGACAAGAAGAAAACCGGAAAATCGTCTATAAAAGATATGGATGCCAAT
GGAAATACATACACTTCAAGGGTGAGAGAGTTAGAGTGCGAAGAGATGAATGGGATGAAGATGATGTATGATTGAAGAGCAGAGATGTCTGAACATTAGCATGAACAAGT
TCATTGGATCATAAGGTATCTATCTACTGATCGCCGCTAATTATATGCACTTAAGCACGCTGAAACCTAGATTTGCAGTCATAACCTAGTTTTTCTCCTCTGCTCTGATT
GAATATTTTAGAACTGCTAACATAGATGATGCGAACATTGTAAGATATATGAATAAGATATGAAAAAAAAAATATATGTAGTGTATATGTTAACCGAAGTTGAAACTTGT
TTTCTTGAATCTTGATTCTATTTTAGGCTTGAGGTATGAAAAGAATAGCAGTAGGATTGTTTTCATTGGAACAAAC
Protein sequenceShow/hide protein sequence
MLVFHGSSTGFDALVPKIYCIYYHNKLTFRATIVNCVHKQAAQPLTSFITAERRIVKKVGKEEHHLWKKRDSAGSGQKALNLVRIVSQCPNEKEAVYGELNKWIAWETEF
PLIAAAKALRILRKRNQWKRVIQVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHTHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDT
VRRVGRAFQKLGQEENRKIVYKRYGCQWKYIHFKGERVRVRRDEWDEDDV