CuGenDBv2

Spg002702 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg002702
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold6:1116842..1121427
RNA-Seq Expression: Spg002702
Synteny: Spg002702
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR044646 - Pentatricopeptide repeat-containing protein EMB1417-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG7013270.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.5e-147; identity: 86.69%)
Query:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
        +A  +M TLIYSFPVISK IES+KFS  ASSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Subjt:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL

Query:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA
        KTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLFE+FADMEELGVQP+MA
Subjt:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA

Query:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTLTDSSLEDDEMSEDTDEI
        IVTM+GNVFQ+LGM DKYEKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNNGS ELDKKEHSST +LLEE ++TS+    DSSLEDDEMSED  E 
Subjt:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTLTDSSLEDDEMSEDTDEI

Query:  LEDENMS-KESNFVHDFMGFGQL
        LEDE+M  KES F HDFMGFGQL
Subjt:  LEDENMS-KESNFVHDFMGFGQL

XP_022140817.1 pentatricopeptide repeat-containing protein At4g21190 [Momordica charantia] (e value: 1.9e-147; identity: 86.75%)
Query:  KMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE
        +M TLIYSFPVISK+IES+KFS SASS+VVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE
Subjt:  KMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE

Query:  NQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTM
         QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQ+LESMPR+FFHKMISLYYD+GMHDKLFEVFADMEELGVQPN  IVTM
Subjt:  NQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTM

Query:  IGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEDEN
        IGNVFQELGMFDKYEKLKKKYPP KWEYRY+KGKRVRIRAKYLNEYG SNNGSSELD+K+ SS KLLEEAE  S+    DSSLED+EM ED DEILEDE+
Subjt:  IGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEDEN

Query:  MSKESNFVHDFMGFGQL
          +  +F ++FMG+G+L
Subjt:  MSKESNFVHDFMGFGQL

XP_022945794.1 pentatricopeptide repeat-containing protein At4g21190 [Cucurbita moschata] (e value: 9.4e-147; identity: 86.69%)
Query:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
        +A  +M TLIYSFPVISK IES+KFS  ASSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Subjt:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL

Query:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA
        KTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALA DGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLFE+FADMEELGVQP+MA
Subjt:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA

Query:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTLTDSSLEDDEMSEDTDEI
        IVT +GNVFQ+LGM DKYEKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNNGS ELDKKEHSST +LLEE E+TS+    DSSLEDDEMSED DE 
Subjt:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTLTDSSLEDDEMSEDTDEI

Query:  LEDENMS-KESNFVHDFMGFGQL
        LEDE+M  KES F HDFMGFGQL
Subjt:  LEDENMS-KESNFVHDFMGFGQL

XP_038888232.1 pentatricopeptide repeat-containing protein At4g21190 isoform X1 [Benincasa hispida] (e value: 3.8e-148; identity: 86.92%)
Query:  ALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALK
        A  +MLTLIYSFPVISK+IES+ FS  ASSSVVCAAKGPRPRYPRVWKT+KRIGT+SKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL+
Subjt:  ALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALK

Query:  TLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAI
        TLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDR MHDKLFEVFADMEELGVQPNM I
Subjt:  TLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAI

Query:  VTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDD-EMSEDTDEIL
        VTM+GNVF ELGM DKYEKL KKYPPPKWEYRYIKGKRVRIR+KYL E G  NN  S+ DK EHSSTKLLEEAE+TSEDT    +LEDD EMSED +EI 
Subjt:  VTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDD-EMSEDTDEIL

Query:  EDENMSKESNFVHDFMGFGQL
        +DE MSKE NF HDFMGFGQL
Subjt:  EDENMSKESNFVHDFMGFGQL

XP_038888233.1 pentatricopeptide repeat-containing protein At4g21190 isoform X2 [Benincasa hispida] (e value: 8.5e-148; identity: 87.7%)
Query:  MLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLEN
        MLTLIYSFPVISK+IES+ FS  ASSSVVCAAKGPRPRYPRVWKT+KRIGT+SKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL+TLEN
Subjt:  MLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLEN

Query:  QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMI
        QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDR MHDKLFEVFADMEELGVQPNM IVTM+
Subjt:  QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMI

Query:  GNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDD-EMSEDTDEILEDEN
        GNVF ELGM DKYEKL KKYPPPKWEYRYIKGKRVRIR+KYL E G  NN  S+ DK EHSSTKLLEEAE+TSEDT    +LEDD EMSED +EI +DE 
Subjt:  GNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDD-EMSEDTDEILEDEN

Query:  MSKESNFVHDFMGFGQL
        MSKE NF HDFMGFGQL
Subjt:  MSKESNFVHDFMGFGQL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L6Q9 Uncharacterized protein (e value: 1.7e-146; identity: 85.09%)
Query:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
        +A  +MLTL+Y+FPV SK+IES+ FS   SSSVVCAAKGPRPRYPRVWKT+KRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Subjt:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL

Query:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA
        KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLES+PRIFFHKMISLYYD+ MHDKLFEVFADMEELGVQPNMA
Subjt:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA

Query:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDE-MSEDTDEI
        IVT +GNVFQELGM DKY+KL KKYPPPKWEYRYIKGKRV+IRAKYL+E G SNNG SE  K EHSST  ++EAE+TSE    DSSLEDDE MSED DEI
Subjt:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDE-MSEDTDEI

Query:  LEDENMSKESNFVHDFMGFGQL
        LEDE+M  +SNF HDFMG GQL
Subjt:  LEDENMSKESNFVHDFMGFGQL

A0A1S4DTK8 pentatricopeptide repeat-containing protein At4g21190 isoform X1 (e value: 5.5e-145; identity: 82.28%)
Query:  SFDHSLNFFLGIALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWEL
        S   +L  F  I   +MLTL+Y+FPV SK+IES+ FS   SSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIAWEL
Subjt:  SFDHSLNFFLGIALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWEL

Query:  EFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFAD
        EFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLFSQ+LESMPRIFFHKMISLYYDR MHDKLFEVFAD
Subjt:  EFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFAD

Query:  MEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLED
        MEELGVQPNMAIVT +GN+FQELGM DKYEKL KKYPPPKWEYRYIKGKRV+IR KYL+E G S NG SE +K EHSST  L+EAE+TSE    DSSLED
Subjt:  MEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLED

Query:  D-EMSEDTDEILEDENMSKESNFVHDFMGFGQL
        D E+ +D DEILEDE+M  +SNF HDFMG GQL
Subjt:  D-EMSEDTDEILEDENMSKESNFVHDFMGFGQL

A0A6J1CI57 pentatricopeptide repeat-containing protein At4g21190 (e value: 9.1e-148; identity: 86.75%)
Query:  KMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE
        +M TLIYSFPVISK+IES+KFS SASS+VVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE
Subjt:  KMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLE

Query:  NQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTM
         QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQ+LESMPR+FFHKMISLYYD+GMHDKLFEVFADMEELGVQPN  IVTM
Subjt:  NQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTM

Query:  IGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEDEN
        IGNVFQELGMFDKYEKLKKKYPP KWEYRY+KGKRVRIRAKYLNEYG SNNGSSELD+K+ SS KLLEEAE  S+    DSSLED+EM ED DEILEDE+
Subjt:  IGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEDEN

Query:  MSKESNFVHDFMGFGQL
          +  +F ++FMG+G+L
Subjt:  MSKESNFVHDFMGFGQL

A0A6J1G1Y3 pentatricopeptide repeat-containing protein At4g21190 (e value: 4.5e-147; identity: 86.69%)
Query:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
        +A  +M TLIYSFPVISK IES+KFS  ASSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL
Subjt:  IALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL

Query:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA
        KTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALA DGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLFE+FADMEELGVQP+MA
Subjt:  KTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMA

Query:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTLTDSSLEDDEMSEDTDEI
        IVT +GNVFQ+LGM DKYEKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNNGS ELDKKEHSST +LLEE E+TS+    DSSLEDDEMSED DE 
Subjt:  IVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTLTDSSLEDDEMSEDTDEI

Query:  LEDENMS-KESNFVHDFMGFGQL
        LEDE+M  KES F HDFMGFGQL
Subjt:  LEDENMS-KESNFVHDFMGFGQL

A0A6J1HWS4 pentatricopeptide repeat-containing protein At4g21190 (e value: 3.2e-145; identity: 83.53%)
Query:  SSFDHSL---NFFLGIALLKMLTLIYSFPVISKKIESIKFS--RSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDS
        SS  HSL   +    +A  +M TLIYSFPVISK IES+KFS   SA SSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDS
Subjt:  SSFDHSL---NFFLGIALLKMLTLIYSFPVISKKIESIKFS--RSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDS

Query:  FIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKL
        FIAWELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKL
Subjt:  FIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKL

Query:  FEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTL
        FE+FADMEELGVQP+MAIVTM+G+VFQ+LGM DK EKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNNGS ELDKKE SST +LLEE E TS+   
Subjt:  FEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSST-KLLEEAEMTSEDTL

Query:  TDSSLEDDEMSEDTDEILEDENM-SKESNFVHDFMGFGQL
         DSSLEDDEMSED DE+LEDE+M  KES F HDFMGFGQL
Subjt:  TDSSLEDDEMSEDTDEILEDENM-SKESNFVHDFMGFGQL

SwissProt top hits (e value, % identity, alignment)
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic (e value: 5.3e-44; identity: 45.81%)
Query:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

Q8LG95 Pentatricopeptide repeat-containing protein At4g21190 (e value: 6.2e-101; identity: 65.73%)
Query:  MLTLIYSFPVI---SKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT
        ML+L YS P +   +++  +  F++  ++ VVCAA+GPRPR PRVWKTRKRIGTISKA K++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  
Subjt:  MLTLIYSFPVI---SKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT

Query:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV
        LE+++EWK+IIQ+TKWMLSKGQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY R MH KLFEVFADMEELGV+PN+AIV
Subjt:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV

Query:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNG-SSELDKKEHSSTKLLEEAEMTSEDTLTDSSL
        +M+G VF +L M DKYEKL KKYPPP+WE+RYIKG+RV+++AK LNE      G SS+ DK ++      E+ E  SE+   +  L
Subjt:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNG-SSELDKKEHSSTKLLEEAEMTSEDTLTDSSL

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g02860 (e value: 4.1e-04; identity: 21.88%)
Query:  LVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESM
        L+ C K   ++ +E     +   A    +  +T    L         K  +++   M+  G   ++ +Y +L++A A DG LDEA EL N++  +   + 
Subjt:  LVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESM

Query:  PRIFFHKMISLYYDR-GMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKL
        P +F +  +   ++R G  +    +F +M   G +PN+        ++   G F +  K+
Subjt:  PRIFFHKMISLYYDR-GMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKL

Arabidopsis top hits (e value, % identity, alignment)
AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.8e-45; identity: 45.81%)
Query:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.9e-44; identity: 41.09%)
Query:  FFLGIALLKMLTLIYSFPV----ISKKIESIKFSRSASSSVVC-AAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEF
        FF  I+ L+ L L+    V      K+ E+ K  R   ++V     K    +   +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EF
Subjt:  FFLGIALLKMLTLIYSFPV----ISKKIESIKFSRSASSSVVC-AAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEF

Query:  PLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADME
        P+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DEAE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADME
Subjt:  PLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADME

Query:  ELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNE
        EL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y +E
Subjt:  ELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNE

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.8e-45; identity: 45.81%)
Query:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

AT4G18975.4 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.8e-45; identity: 45.81%)
Query:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

AT4G21190.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.4e-102; identity: 65.73%)
Query:  MLTLIYSFPVI---SKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT
        ML+L YS P +   +++  +  F++  ++ VVCAA+GPRPR PRVWKTRKRIGTISKA K++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  
Subjt:  MLTLIYSFPVI---SKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT

Query:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV
        LE+++EWK+IIQ+TKWMLSKGQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY R MH KLFEVFADMEELGV+PN+AIV
Subjt:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV

Query:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNG-SSELDKKEHSSTKLLEEAEMTSEDTLTDSSL
        +M+G VF +L M DKYEKL KKYPPP+WE+RYIKG+RV+++AK LNE      G SS+ DK ++      E+ E  SE+   +  L
Subjt:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNG-SSELDKKEHSSTKLLEEAEMTSEDTLTDSSL


Sequences
CDS sequence
ATGAAGGAAGTAATGGCGGCGGAGAAAGAGAGAGAGAGAGTTACAGGACAAAGCGGAGGAGCAGCAGAAGTTTGCGCCGGCGGGGAATCGAACGGCCAGAGCAGTTTCGA
TCATTCTCTCAACTTTTTTCTTGGTATTGCGCTATTGAAGATGCTTACTTTGATTTACTCTTTTCCAGTCATATCCAAAAAGATAGAATCTATCAAATTTTCACGGAGTG
CAAGCAGTTCAGTGGTATGCGCGGCAAAAGGTCCACGGCCGAGATATCCTCGGGTTTGGAAAACCAGAAAGAGAATTGGAACCATATCCAAGGCAGAAAAGCTTGTTGAT
TGTGTCAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGAGCTCTTGATTCCTTCATTGCCTGGGAACTAGAGTTTCCTCTTATTACTGTAAAGAAGGCCCTGAAGAC
CTTGGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGTTAAGTAAAGGCCAAGGAAGAACAATGGGAAGTTATTTCACGTTGTTAAATGCCTTAG
CTGAAGATGGAAGACTTGATGAGGCTGAAGAGCTTTGGAACAAATTGTTTTCTCAGCATCTGGAGAGCATGCCTCGCATTTTCTTTCATAAAATGATATCCCTCTACTAT
GACCGGGGTATGCACGACAAGTTATTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAACCAAATATGGCGATTGTCACTATGATTGGAAATGTTTTCCAAGAGTT
GGGTATGTTCGATAAATATGAAAAATTGAAGAAGAAATATCCCCCACCAAAATGGGAATATCGTTACATCAAAGGAAAGCGTGTAAGAATACGAGCAAAGTATCTGAATG
AATATGGTACTTCCAACAATGGTTCAAGTGAGCTTGACAAAAAGGAGCATAGTTCAACAAAACTGTTGGAGGAAGCTGAAATGACTTCCGAAGATACTCTCACAGATTCC
AGTCTTGAAGATGATGAAATGAGCGAAGATACAGATGAAATTTTGGAAGATGAAAATATGTCCAAGGAATCCAATTTTGTGCACGATTTCATGGGGTTTGGGCAATTGTA
A
mRNA sequence
ATGAAGGAAGTAATGGCGGCGGAGAAAGAGAGAGAGAGAGTTACAGGACAAAGCGGAGGAGCAGCAGAAGTTTGCGCCGGCGGGGAATCGAACGGCCAGAGCAGTTTCGA
TCATTCTCTCAACTTTTTTCTTGGTATTGCGCTATTGAAGATGCTTACTTTGATTTACTCTTTTCCAGTCATATCCAAAAAGATAGAATCTATCAAATTTTCACGGAGTG
CAAGCAGTTCAGTGGTATGCGCGGCAAAAGGTCCACGGCCGAGATATCCTCGGGTTTGGAAAACCAGAAAGAGAATTGGAACCATATCCAAGGCAGAAAAGCTTGTTGAT
TGTGTCAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGAGCTCTTGATTCCTTCATTGCCTGGGAACTAGAGTTTCCTCTTATTACTGTAAAGAAGGCCCTGAAGAC
CTTGGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGTTAAGTAAAGGCCAAGGAAGAACAATGGGAAGTTATTTCACGTTGTTAAATGCCTTAG
CTGAAGATGGAAGACTTGATGAGGCTGAAGAGCTTTGGAACAAATTGTTTTCTCAGCATCTGGAGAGCATGCCTCGCATTTTCTTTCATAAAATGATATCCCTCTACTAT
GACCGGGGTATGCACGACAAGTTATTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAACCAAATATGGCGATTGTCACTATGATTGGAAATGTTTTCCAAGAGTT
GGGTATGTTCGATAAATATGAAAAATTGAAGAAGAAATATCCCCCACCAAAATGGGAATATCGTTACATCAAAGGAAAGCGTGTAAGAATACGAGCAAAGTATCTGAATG
AATATGGTACTTCCAACAATGGTTCAAGTGAGCTTGACAAAAAGGAGCATAGTTCAACAAAACTGTTGGAGGAAGCTGAAATGACTTCCGAAGATACTCTCACAGATTCC
AGTCTTGAAGATGATGAAATGAGCGAAGATACAGATGAAATTTTGGAAGATGAAAATATGTCCAAGGAATCCAATTTTGTGCACGATTTCATGGGGTTTGGGCAATTGTA
A
Protein sequence
MKEVMAAEKERERVTGQSGGAAEVCAGGESNGQSSFDHSLNFFLGIALLKMLTLIYSFPVISKKIESIKFSRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAEKLVD
CVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY
DRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNGSSELDKKEHSSTKLLEEAEMTSEDTLTDS
SLEDDEMSEDTDEILEDENMSKESNFVHDFMGFGQL