; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020066 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020066
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr5:47807717..47810162
RNA-Seq ExpressionLag0020066
SyntenyLag0020066
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044646 - Pentatricopeptide repeat-containing protein EMB1417-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7013270.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]2.8e-15086.65Show/hide
Query:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M HS+VPASLSS+ ASNRM TLIYSFPV ISK IESVKF   ASSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLFE+
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLTDS
        FADMEELGVQP+MAIVTM+GNVFQ+LGM DKYEKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNN S ELDKKEHSST +LLEE ++TS+    DS
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLTDS

Query:  SLEDDEMSEDTDEILEEENMS-KESNFEHDFMGFGQL
        SLEDDEMSED  E LE+E+M  KES FEHDFMGFGQL
Subjt:  SLEDDEMSEDTDEILEEENMS-KESNFEHDFMGFGQL

XP_022140817.1 pentatricopeptide repeat-containing protein At4g21190 [Momordica charantia]1.4e-14985.97Show/hide
Query:  MMHSIVPASL-SSSASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M HS V ASL SSSASNRM TLIYSFPV ISK+IESVKF  SASS+VVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASL-SSSASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQ+LESMPR+FFHKMISLYYD+GMHDKLFEV
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS
        FADMEELGVQPN  IVTMIGNVFQELGMFDKYEKLKKKYPP KWEYRY+KGKRVRIRAKYLNEYG SNN SSELD+K+ SS KLLEEAE  S+    DSS
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS

Query:  LEDDEMSEDTDEILEEENMSKESNFEHDFMGFGQL
        LED+EM ED DEILE+E+  +  +FE++FMG+G+L
Subjt:  LEDDEMSEDTDEILEEENMSKESNFEHDFMGFGQL

XP_022945794.1 pentatricopeptide repeat-containing protein At4g21190 [Cucurbita moschata]4.8e-15086.65Show/hide
Query:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M HS+VPASLSS+ ASNRM TLIYSFPV ISK IESVKF   ASSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALA DGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLFE+
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLTDS
        FADMEELGVQP+MAIVT +GNVFQ+LGM DKYEKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNN S ELDKKEHSST +LLEE E+TS+    DS
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLTDS

Query:  SLEDDEMSEDTDEILEEENMS-KESNFEHDFMGFGQL
        SLEDDEMSED DE LE+E+M  KES FEHDFMGFGQL
Subjt:  SLEDDEMSEDTDEILEEENMS-KESNFEHDFMGFGQL

XP_023541679.1 pentatricopeptide repeat-containing protein At4g21190 [Cucurbita pepo subsp. pepo]8.3e-15086.43Show/hide
Query:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFP--RSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSF
        M HS+VPASLSS+ ASNRM TLIYSFPV ISK IESVKF    SASSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSF
Subjt:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFP--RSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSF

Query:  IAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLF
        IAWELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLF
Subjt:  IAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLF

Query:  EVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLT
        E+FADMEELGVQP+MAIVTM+GNVFQ+LGM DKYEKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNN S + DKKEHSST +LLEE E+TS+    
Subjt:  EVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLT

Query:  DSSLEDDEMSEDTDEILEEENM-SKESNFEHDFMGFGQL
        DSSLEDDEMSED DE LE+E+M  KES FEHDFMGFGQL
Subjt:  DSSLEDDEMSEDTDEILEEENM-SKESNFEHDFMGFGQL

XP_038888232.1 pentatricopeptide repeat-containing protein At4g21190 isoform X1 [Benincasa hispida]3.7e-15086.31Show/hide
Query:  MMHSIVPASL-SSSASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M HS+  ASL SSSAS RMLTLIYSFPV ISK+IESV F   ASSSVVCAAKGPRPRYPRVWKT+KRIGT+SKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASL-SSSASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKAL+TLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDR MHDKLFEV
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS
        FADMEELGVQPNM IVTM+GNVF ELGM DKYEKL KKYPPPKWEYRYIKGKRVRIR+KYL E G  NN  S+ DK EHSSTKLLEEAE+TSEDT    +
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS

Query:  LEDD-EMSEDTDEILEEENMSKESNFEHDFMGFGQL
        LEDD EMSED +EI ++E MSKE NFEHDFMGFGQL
Subjt:  LEDD-EMSEDTDEILEEENMSKESNFEHDFMGFGQL

TrEMBL top hitse value%identityAlignment
A0A0A0L6Q9 Uncharacterized protein1.6e-14683.93Show/hide
Query:  MMHSIVPASLSS-SASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M HS+  A+LSS  AS RMLTL+Y+FPV  SK+IESV F    SSSVVCAAKGPRPRYPRVWKT+KRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASLSS-SASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLES+PRIFFHKMISLYYD+ MHDKLFEV
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS
        FADMEELGVQPNMAIVT +GNVFQELGM DKY+KL KKYPPPKWEYRYIKGKRV+IRAKYL+E G SNN  SE  K EHSST  ++EAE+TSE    DSS
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS

Query:  LEDDE-MSEDTDEILEEENMSKESNFEHDFMGFGQL
        LEDDE MSED DEILE+E+M  +SNFEHDFMG GQL
Subjt:  LEDDE-MSEDTDEILEEENMSKESNFEHDFMGFGQL

A0A5D3DHA7 Pentatricopeptide repeat-containing protein8.6e-14582.74Show/hide
Query:  MMHSIVPASLSS-SASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M H +  A+L+S SAS RMLTL+Y+FPV  SK+IESV F    SSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASLSS-SASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLFSQ+LESMPRIFFHKMISLYYDR MHDKLFEV
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS
        FADMEELGVQPNMAIVT +GN+FQELGM DKYEKL KKYPPPKWEYRYIKGKRV+IR KYL+E G S N  SE +K EHSST  L+EAE+TSE    DSS
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS

Query:  LEDD-EMSEDTDEILEEENMSKESNFEHDFMGFGQL
        LEDD E+ +D DEILE+E+M  +SNFEHDFMG GQL
Subjt:  LEDD-EMSEDTDEILEEENMSKESNFEHDFMGFGQL

A0A6J1CI57 pentatricopeptide repeat-containing protein At4g211906.8e-15085.97Show/hide
Query:  MMHSIVPASL-SSSASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M HS V ASL SSSASNRM TLIYSFPV ISK+IESVKF  SASS+VVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASL-SSSASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQ+LESMPR+FFHKMISLYYD+GMHDKLFEV
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS
        FADMEELGVQPN  IVTMIGNVFQELGMFDKYEKLKKKYPP KWEYRY+KGKRVRIRAKYLNEYG SNN SSELD+K+ SS KLLEEAE  S+    DSS
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSS

Query:  LEDDEMSEDTDEILEEENMSKESNFEHDFMGFGQL
        LED+EM ED DEILE+E+  +  +FE++FMG+G+L
Subjt:  LEDDEMSEDTDEILEEENMSKESNFEHDFMGFGQL

A0A6J1G1Y3 pentatricopeptide repeat-containing protein At4g211902.3e-15086.65Show/hide
Query:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA
        M HS+VPASLSS+ ASNRM TLIYSFPV ISK IESVKF   ASSSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSFIA
Subjt:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIA

Query:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV
        WELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALA DGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLFE+
Subjt:  WELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEV

Query:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLTDS
        FADMEELGVQP+MAIVT +GNVFQ+LGM DKYEKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNN S ELDKKEHSST +LLEE E+TS+    DS
Subjt:  FADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLTDS

Query:  SLEDDEMSEDTDEILEEENMS-KESNFEHDFMGFGQL
        SLEDDEMSED DE LE+E+M  KES FEHDFMGFGQL
Subjt:  SLEDDEMSEDTDEILEEENMS-KESNFEHDFMGFGQL

A0A6J1HWS4 pentatricopeptide repeat-containing protein At4g211904.9e-14885.84Show/hide
Query:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFP--RSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSF
        M HS+VPASLSS+ ASNRM TLIYSFPV ISK IESVKF    SA SSVVCAAKGPRPRYPRVWKTRKRIGTISKA KLVDCVKGLSNVKEEVYGALDSF
Subjt:  MMHSIVPASLSSS-ASNRMLTLIYSFPVIISKKIESVKFP--RSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSF

Query:  IAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLF
        IAWELEFPLITVKKALKTLE QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYY+R MHDKLF
Subjt:  IAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLF

Query:  EVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLT
        E+FADMEELGVQP+MAIVTM+G+VFQ+LGM DK EKLKKKYPPPKWEYRYI+GKRV+IRAK L+E G+SNN S ELDKKE SST +LLEE E TS+    
Subjt:  EVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSST-KLLEEAEMTSEDTLT

Query:  DSSLEDDEMSEDTDEILEEENM-SKESNFEHDFMGFGQL
        DSSLEDDEMSED DE+LE+E+M  KES FEHDFMGFGQL
Subjt:  DSSLEDDEMSEDTDEILEEENM-SKESNFEHDFMGFGQL

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic6.3e-4445.81Show/hide
Query:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

Q8LG95 Pentatricopeptide repeat-containing protein At4g211902.1e-10062.75Show/hide
Query:  MLTLIYSFPVIISKKIESVK--FPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT
        ML+L YS P ++ +  ES    F +  ++ VVCAA+GPRPR PRVWKTRKRIGTISKA K++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  
Subjt:  MLTLIYSFPVIISKKIESVK--FPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT

Query:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV
        LE+++EWK+IIQ+TKWMLSKGQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY R MH KLFEVFADMEELGV+PN+AIV
Subjt:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV

Query:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEE
        +M+G VF +L M DKYEKL KKYPPP+WE+RYIKG+RV+++AK LNE                       E  ++S++   D+ +E +E  ED +++ EE
Subjt:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEE

Query:  ENMSKE
        E   KE
Subjt:  ENMSKE

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028602.9e-0421.88Show/hide
Query:  LVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESM
        L+ C K   ++ +E     +   A    +  +T    L         K  +++   M+  G   ++ +Y +L++A A DG LDEA EL N++  +   + 
Subjt:  LVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESM

Query:  PRIFFHKMISLYYDR-GMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKL
        P +F +  +   ++R G  +    +F +M   G +PN+        ++   G F +  K+
Subjt:  PRIFFHKMISLYYDR-GMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKL

Arabidopsis top hitse value%identityAlignment
AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein4.5e-4545.81Show/hide
Query:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein4.5e-4545.81Show/hide
Query:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein4.5e-4545.81Show/hide
Query:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

AT4G18975.4 Pentatricopeptide repeat (PPR) superfamily protein4.5e-4545.81Show/hide
Query:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY
        AE LWN +   H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      +   F+EL   +  + + ++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKY

Query:  LNE
         +E
Subjt:  LNE

AT4G21190.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-10162.75Show/hide
Query:  MLTLIYSFPVIISKKIESVK--FPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT
        ML+L YS P ++ +  ES    F +  ++ VVCAA+GPRPR PRVWKTRKRIGTISKA K++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  
Subjt:  MLTLIYSFPVIISKKIESVK--FPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT

Query:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV
        LE+++EWK+IIQ+TKWMLSKGQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY R MH KLFEVFADMEELGV+PN+AIV
Subjt:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIV

Query:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEE
        +M+G VF +L M DKYEKL KKYPPP+WE+RYIKG+RV+++AK LNE                       E  ++S++   D+ +E +E  ED +++ EE
Subjt:  TMIGNVFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEE

Query:  ENMSKE
        E   KE
Subjt:  ENMSKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCACTCTATAGTTCCAGCATCCCTCTCGTCATCGGCTTCCAACCGGATGCTTACTTTGATTTACTCTTTTCCAGTCATCATATCCAAAAAGATAGAATCTGTCAA
ATTTCCACGGAGTGCAAGCAGTTCAGTGGTATGCGCGGCAAAAGGTCCACGGCCGAGATATCCTCGGGTTTGGAAAACCAGAAAGAGAATTGGAACCATATCCAAGGCAG
GAAAGCTTGTTGATTGTGTCAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGGGCTCTTGATTCCTTCATTGCCTGGGAACTTGAGTTTCCTCTTATTACTGTAAAG
AAGGCCCTGAAGACCTTGGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGTTAAGTAAAGGCCAAGGAAGAACAATGGGAAGTTATTTCACGTT
GTTAAATGCCTTAGCTGAAGATGGAAGACTTGATGAGGCTGAAGAGCTTTGGAACAAATTGTTTTCTCAGCATCTGGAGAGCATGCCTCGCATATTCTTTCATAAAATGA
TATCCCTCTACTATGACCGGGGTATGCACGACAAGTTATTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAACCAAATATGGCGATTGTCACTATGATTGGAAAT
GTCTTCCAAGAGTTGGGTATGTTCGATAAATATGAAAAATTGAAGAAGAAATATCCCCCACCAAAATGGGAATATCGTTACATCAAAGGGAAGCGTGTAAGAATACGAGC
AAAGTATCTGAACGAATATGGTACTTCCAACAATGATTCAAGTGAGCTTGACAAAAAGGAGCATAGTTCAACAAAACTGTTGGAGGAAGCTGAAATGACTTCAGAAGATA
CTCTCACAGATTCCAGTCTTGAAGATGATGAAATGAGCGAAGATACAGATGAAATTTTGGAAGAGGAAAATATGTCCAAGGAATCCAATTTTGAGCACGATTTCATGGGG
TTTGGGCAATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGCACTCTATAGTTCCAGCATCCCTCTCGTCATCGGCTTCCAACCGGATGCTTACTTTGATTTACTCTTTTCCAGTCATCATATCCAAAAAGATAGAATCTGTCAA
ATTTCCACGGAGTGCAAGCAGTTCAGTGGTATGCGCGGCAAAAGGTCCACGGCCGAGATATCCTCGGGTTTGGAAAACCAGAAAGAGAATTGGAACCATATCCAAGGCAG
GAAAGCTTGTTGATTGTGTCAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGGGCTCTTGATTCCTTCATTGCCTGGGAACTTGAGTTTCCTCTTATTACTGTAAAG
AAGGCCCTGAAGACCTTGGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGTTAAGTAAAGGCCAAGGAAGAACAATGGGAAGTTATTTCACGTT
GTTAAATGCCTTAGCTGAAGATGGAAGACTTGATGAGGCTGAAGAGCTTTGGAACAAATTGTTTTCTCAGCATCTGGAGAGCATGCCTCGCATATTCTTTCATAAAATGA
TATCCCTCTACTATGACCGGGGTATGCACGACAAGTTATTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAACCAAATATGGCGATTGTCACTATGATTGGAAAT
GTCTTCCAAGAGTTGGGTATGTTCGATAAATATGAAAAATTGAAGAAGAAATATCCCCCACCAAAATGGGAATATCGTTACATCAAAGGGAAGCGTGTAAGAATACGAGC
AAAGTATCTGAACGAATATGGTACTTCCAACAATGATTCAAGTGAGCTTGACAAAAAGGAGCATAGTTCAACAAAACTGTTGGAGGAAGCTGAAATGACTTCAGAAGATA
CTCTCACAGATTCCAGTCTTGAAGATGATGAAATGAGCGAAGATACAGATGAAATTTTGGAAGAGGAAAATATGTCCAAGGAATCCAATTTTGAGCACGATTTCATGGGG
TTTGGGCAATTGTAA
Protein sequenceShow/hide protein sequence
MMHSIVPASLSSSASNRMLTLIYSFPVIISKKIESVKFPRSASSSVVCAAKGPRPRYPRVWKTRKRIGTISKAGKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVK
KALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFSQHLESMPRIFFHKMISLYYDRGMHDKLFEVFADMEELGVQPNMAIVTMIGN
VFQELGMFDKYEKLKKKYPPPKWEYRYIKGKRVRIRAKYLNEYGTSNNDSSELDKKEHSSTKLLEEAEMTSEDTLTDSSLEDDEMSEDTDEILEEENMSKESNFEHDFMG
FGQL