; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014850 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014850
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr02:20806303..20808337
RNA-Seq ExpressionHG10014850
SyntenyHG10014850
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044646 - Pentatricopeptide repeat-containing protein EMB1417-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008464896.1 PREDICTED: pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucumis melo]1.7e-13285.56Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL
        MLV HGSSTGF AL+PKID IYYHNK  FR ASV  VH QAAQP TS TT ERR+VKKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGEL
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISK
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK

Query:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        R+FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRR+GRAFQKLGQEENRKMVYKRY CQWKYIHFKGERVRVR+DGWD+
Subjt:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

XP_011654973.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucumis sativus]4.3e-13184.86Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL
        MLV+HG+STGF ALMPKID IYYHNK  F  +SV  VH QAAQPLTS TT ERR+VKKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGEL
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISK
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK

Query:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        R+FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRV  AFQKLGQE+NRKMVYKRY CQWKYIHFKGERVRVRRDGWD+
Subjt:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

XP_023541595.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Cucurbita pepo subsp. pepo]1.1e-12683.51Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIV-KKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGE
        ML Y GSSTGF AL+PK   IY +NKL FRAASV  VHKQAAQ LTS TT ERRIV KKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGE
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIV-KKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGE

Query:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS
        LNKWIAWETEFPLIAA+KALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS
Subjt:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS

Query:  KRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        KRLFSRMISLY+HHDLQDKIIEIFADMEELGV+PDEDTVRRV  AF+KLGQEEN K+VYKRYGC+WKYIHFKGERVRVRRDGWD+
Subjt:  KRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

XP_031740922.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X2 [Cucumis sativus]5.8e-12883.8Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL
        MLV+HG+STGF ALMPKID IYYHNK  F  +SV  VH QAAQPLTS TT ERR+VKKVGKE HHLWKKRDSAG GQKALNL   VSQCPNEKE VYGEL
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISK
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK

Query:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        R+FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRV  AFQKLGQE+NRKMVYKRY CQWKYIHFKGERVRVRRDGWD+
Subjt:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

XP_038892676.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic [Benincasa hispida]1.5e-12884.86Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL
        MLVYHGSSTGF AL+PKID IY++NKL FRAASV  VHKQA          ERRIVKKVGKEEHHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGEL
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK
        NKWIAWETEFPLIAAAKALRILRKR+QWK VIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISK
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK

Query:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAF KLGQEEN+KMVYKRYGCQWKYIHFKGERVRVRRDGWD+
Subjt:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

TrEMBL top hitse value%identityAlignment
A0A0A0KSA5 Uncharacterized protein2.1e-13184.86Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL
        MLV+HG+STGF ALMPKID IYYHNK  F  +SV  VH QAAQPLTS TT ERR+VKKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGEL
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISK
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK

Query:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        R+FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRV  AFQKLGQE+NRKMVYKRY CQWKYIHFKGERVRVRRDGWD+
Subjt:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

A0A1S3CP50 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X18.4e-13385.56Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL
        MLV HGSSTGF AL+PKID IYYHNK  FR ASV  VH QAAQP TS TT ERR+VKKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGEL
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISK
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK

Query:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        R+FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRR+GRAFQKLGQEENRKMVYKRY CQWKYIHFKGERVRVR+DGWD+
Subjt:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

A0A5A7UD21 Pentatricopeptide repeat-containing protein8.4e-13385.56Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL
        MLV HGSSTGF AL+PKID IYYHNK  FR ASV  VH QAAQP TS TT ERR+VKKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGEL
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGEL

Query:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK
        NKWIAWETEFPLIAAAKALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILH HTRSISK
Subjt:  NKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISK

Query:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        R+FSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRR+GRAFQKLGQEENRKMVYKRY CQWKYIHFKGERVRVR+DGWD+
Subjt:  RLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

A0A6J1FWG3 pentatricopeptide repeat-containing protein At4g18975, chloroplastic6.9e-12783.51Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIV-KKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGE
        ML Y GSSTGF AL+PK   IY +NKL FRAASV  VHKQAAQ LT  TT ERRIV KKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGE
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIV-KKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGE

Query:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS
        LNKWIAWETEFPLIAA+KALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS
Subjt:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS

Query:  KRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        KRLFSRMISLY+HHDLQDKIIEIFADMEELGV+PDEDTVRRV  AF+KLGQEEN KMVYKRYGC+WKYIHFKGERVRVRRDGWD+
Subjt:  KRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

A0A6J1K6G2 pentatricopeptide repeat-containing protein At4g18975, chloroplastic1.3e-12582.81Show/hide
Query:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIV-KKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGE
        ML Y GSSTGF AL+PK   IY +NKL FRAASV  VHKQAAQ LTS TT ERRIV KKVGKE HHLWKKRDSAG GQKALNLVRIVSQCPNEKE VYGE
Subjt:  MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIV-KKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGE

Query:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS
        LNKWIAWETEFPLIAA+KALRILRKR+QWKRVIQ                 VAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS
Subjt:  LNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSIS

Query:  KRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD
        KRLFSRMI+LY+HHDLQDKIIEIFADMEELGV+PDEDTVRRV  AF+KLGQEEN K VYKRYGC+WKYIHFK ERVRVRRDGWD+
Subjt:  KRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDD

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic4.2e-8969.4Show/hide
Query:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF
        + TV  + +KKVGK+EHHLWKK DSAG GQKALNLVR++S  PNEKE VYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ              
Subjt:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF

Query:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK
           +AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH HTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++
Subjt:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK

Query:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR
        L QEENRK++ +RY  ++KYI+F GERVRV+R
Subjt:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR

Q8LG95 Pentatricopeptide repeat-containing protein At4g211906.0e-4342.99Show/hide
Query:  LWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMG
        +WK R   G   KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL    +WK++IQ                 V KWMLSKGQG TMG
Subjt:  LWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMG

Query:  TYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRY-GCQ
        TY +LL A   D R+DEAE LWN +   H     ++ F++MIS+Y   D+  K+ E+FADMEELGVKP+   V  VG+ F KL  ++  + + K+Y   Q
Subjt:  TYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRY-GCQ

Query:  WKYIHFKGERVRVR
        W++ + KG RV+V+
Subjt:  WKYIHFKGERVRVR

Arabidopsis top hitse value%identityAlignment
AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-9069.4Show/hide
Query:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF
        + TV  + +KKVGK+EHHLWKK DSAG GQKALNLVR++S  PNEKE VYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ              
Subjt:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF

Query:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK
           +AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH HTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++
Subjt:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK

Query:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR
        L QEENRK++ +RY  ++KYI+F GERVRV+R
Subjt:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein3.0e-9069.4Show/hide
Query:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF
        + TV  + +KKVGK+EHHLWKK DSAG GQKALNLVR++S  PNEKE VYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ              
Subjt:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF

Query:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK
           +AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH HTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++
Subjt:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK

Query:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR
        L QEENRK++ +RY  ++KYI+F GERVRV+R
Subjt:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein3.0e-9069.4Show/hide
Query:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF
        + TV  + +KKVGK+EHHLWKK DSAG GQKALNLVR++S  PNEKE VYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ              
Subjt:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF

Query:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK
           +AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH HTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++
Subjt:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK

Query:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR
        L QEENRK++ +RY  ++KYI+F GERVRV+R
Subjt:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR

AT4G18975.4 Pentatricopeptide repeat (PPR) superfamily protein3.0e-9069.4Show/hide
Query:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF
        + TV  + +KKVGK+EHHLWKK DSAG GQKALNLVR++S  PNEKE VYG LNKW+AWE EFP+IAAAKAL+ILRKR+QW RVIQ              
Subjt:  LTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAF

Query:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK
           +AKWMLSKGQGATMGTYD LLLAFDMD+R DEAESLWNMILH HTRSI +RLF+RMI+LY HHDL DK+IE+FADMEEL V PDED+ RRV RAF++
Subjt:  GELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQK

Query:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR
        L QEENRK++ +RY  ++KYI+F GERVRV+R
Subjt:  LGQEENRKMVYKRYGCQWKYIHFKGERVRVRR

AT4G21190.1 Pentatricopeptide repeat (PPR) superfamily protein4.2e-4442.99Show/hide
Query:  LWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMG
        +WK R   G   KA  ++  +    N KE VYG L+ +IAWE EFPL+   KAL IL    +WK++IQ                 V KWMLSKGQG TMG
Subjt:  LWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEFPLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMG

Query:  TYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRY-GCQ
        TY +LL A   D R+DEAE LWN +   H     ++ F++MIS+Y   D+  K+ E+FADMEELGVKP+   V  VG+ F KL  ++  + + K+Y   Q
Subjt:  TYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKIIEIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRY-GCQ

Query:  WKYIHFKGERVRVR
        W++ + KG RV+V+
Subjt:  WKYIHFKGERVRVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGTTTACCATGGAAGCTCAACTGGGTTCTATGCCCTCATGCCGAAGATAGATTATATTTACTATCACAACAAATTGAAATTTAGAGCTGCCAGTGTCAATTATGT
CCACAAGCAAGCTGCACAGCCGCTTACTAGTTTAACCACAGTTGAGAGACGTATTGTTAAGAAGGTTGGGAAGGAGGAACACCATTTATGGAAGAAAAGAGATTCTGCTG
GCTGTGGGCAAAAGGCTCTTAATCTTGTTAGAATTGTTTCCCAATGTCCTAATGAGAAAGAAGTTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACAGAGTTT
CCATTGATTGCAGCTGCTAAAGCTTTAAGAATATTGAGGAAGAGAAATCAATGGAAGCGTGTCATTCAAGTACGAATAAGTTCGGGTTTACTTGCTGTCTTTTTGGCCTT
TGGTGAATTGGTGGCAAAGTGGATGTTAAGCAAGGGTCAAGGAGCCACAATGGGAACATATGACACCCTTCTACTGGCATTTGATATGGACAAGAGGGTAGATGAGGCCG
AATCCTTATGGAACATGATTTTGCATGCACATACACGTTCCATCTCAAAGCGATTGTTTTCTAGGATGATCTCTTTGTATGAACATCATGACTTGCAAGATAAGATTATC
GAGATATTTGCAGACATGGAAGAGTTGGGTGTAAAACCAGATGAAGATACCGTCAGAAGAGTCGGCCGTGCCTTTCAAAAACTAGGTCAAGAGGAAAACCGGAAAATGGT
CTATAAAAGATACGGCTGCCAATGGAAATACATACACTTCAAGGGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGATGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGTTTACCATGGAAGCTCAACTGGGTTCTATGCCCTCATGCCGAAGATAGATTATATTTACTATCACAACAAATTGAAATTTAGAGCTGCCAGTGTCAATTATGT
CCACAAGCAAGCTGCACAGCCGCTTACTAGTTTAACCACAGTTGAGAGACGTATTGTTAAGAAGGTTGGGAAGGAGGAACACCATTTATGGAAGAAAAGAGATTCTGCTG
GCTGTGGGCAAAAGGCTCTTAATCTTGTTAGAATTGTTTCCCAATGTCCTAATGAGAAAGAAGTTGTATATGGAGAATTGAATAAGTGGATAGCTTGGGAGACAGAGTTT
CCATTGATTGCAGCTGCTAAAGCTTTAAGAATATTGAGGAAGAGAAATCAATGGAAGCGTGTCATTCAAGTACGAATAAGTTCGGGTTTACTTGCTGTCTTTTTGGCCTT
TGGTGAATTGGTGGCAAAGTGGATGTTAAGCAAGGGTCAAGGAGCCACAATGGGAACATATGACACCCTTCTACTGGCATTTGATATGGACAAGAGGGTAGATGAGGCCG
AATCCTTATGGAACATGATTTTGCATGCACATACACGTTCCATCTCAAAGCGATTGTTTTCTAGGATGATCTCTTTGTATGAACATCATGACTTGCAAGATAAGATTATC
GAGATATTTGCAGACATGGAAGAGTTGGGTGTAAAACCAGATGAAGATACCGTCAGAAGAGTCGGCCGTGCCTTTCAAAAACTAGGTCAAGAGGAAAACCGGAAAATGGT
CTATAAAAGATACGGCTGCCAATGGAAATACATACACTTCAAGGGTGAGAGGGTTAGAGTGAGAAGAGATGGATGGGATGATGGATGA
Protein sequenceShow/hide protein sequence
MLVYHGSSTGFYALMPKIDYIYYHNKLKFRAASVNYVHKQAAQPLTSLTTVERRIVKKVGKEEHHLWKKRDSAGCGQKALNLVRIVSQCPNEKEVVYGELNKWIAWETEF
PLIAAAKALRILRKRNQWKRVIQVRISSGLLAVFLAFGELVAKWMLSKGQGATMGTYDTLLLAFDMDKRVDEAESLWNMILHAHTRSISKRLFSRMISLYEHHDLQDKII
EIFADMEELGVKPDEDTVRRVGRAFQKLGQEENRKMVYKRYGCQWKYIHFKGERVRVRRDGWDDG