; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012277 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012277
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr01:19680574..19682917
RNA-Seq ExpressionHG10012277
SyntenyHG10012277
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044646 - Pentatricopeptide repeat-containing protein EMB1417-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK23004.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]3.5e-16187.92Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        MTH +AQA+L+S SASYRMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLF Q+LESMPRIFFHKMISLYYDRAMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNMAIVT VGN+F ELGM DKYEKLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNS NGLSE +K+EHSST  L+EAEITSEDSSLEDDE
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        E+ +DP  ILEDE +  +SNFEHDFMG GQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

XP_004140747.2 pentatricopeptide repeat-containing protein At4g21190 [Cucumis sativus]7.6e-16489.43Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        MTHS+AQA+LSS  ASYRMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLF QHLES+PRIFFHKMISLYYD+AMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNMAIVT VGNVF ELGM DKY+KLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNSNNGLSEH K+EHSST  ++EAEITSEDSSLEDDE
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        +MSEDP  ILEDE +  +SNFEHDFMG GQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

XP_008439301.1 PREDICTED: pentatricopeptide repeat-containing protein At4g21190 isoform X2 [Cucumis melo]6.0e-16187.92Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        MTH +AQA+L+S SASYRMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLF Q+LESMPRIFFHKMISLYYDRAMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNMAIVT VGN+F ELGM DKYEKLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNS NGLSE +K+EHSST  L+EAEITSEDSSLEDDE
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        E+ +DP  ILEDE +  +SNFEHDFMG GQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

XP_038888232.1 pentatricopeptide repeat-containing protein At4g21190 isoform X1 [Benincasa hispida]3.4e-17293.35Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        M HS+AQASLSSSSASYRMLTLIYSFPVISKRIESV FSWCASSSVVCAAKGPRPRYPRVWKTKKRIGT+SKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKAL+TLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLF QHLESMPRIFFHKMISLYYDRAMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNM IVTMVGNVFHELGM DKYEKLMKKYPPPKWEYRY+KGKRVRIRSKYLYENGN NN LS+HDK+EHSSTKLLEEAEITSED++LEDD 
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        EMSEDP  I +DEC+SKE NFEHDFMGFGQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

XP_038888233.1 pentatricopeptide repeat-containing protein At4g21190 isoform X2 [Benincasa hispida]3.4e-16493.61Show/hide
Query:  MLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLEN
        MLTLIYSFPVISKRIESV FSWCASSSVVCAAKGPRPRYPRVWKTKKRIGT+SKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKAL+TLEN
Subjt:  MLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLEN

Query:  QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMV
        QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLF QHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNM IVTMV
Subjt:  QREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMV

Query:  GNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDECISKE
        GNVFHELGM DKYEKLMKKYPPPKWEYRY+KGKRVRIRSKYLYENGN NN LS+HDK+EHSSTKLLEEAEITSED++LEDD EMSEDP  I +DEC+SKE
Subjt:  GNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDECISKE

Query:  SNFEHDFMGFGQL
         NFEHDFMGFGQL
Subjt:  SNFEHDFMGFGQL

TrEMBL top hitse value%identityAlignment
A0A0A0L6Q9 Uncharacterized protein3.7e-16489.43Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        MTHS+AQA+LSS  ASYRMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLF QHLES+PRIFFHKMISLYYD+AMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNMAIVT VGNVF ELGM DKY+KLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNSNNGLSEH K+EHSST  ++EAEITSEDSSLEDDE
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        +MSEDP  ILEDE +  +SNFEHDFMG GQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

A0A1S3AY30 pentatricopeptide repeat-containing protein At4g21190 isoform X22.9e-16187.92Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        MTH +AQA+L+S SASYRMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLF Q+LESMPRIFFHKMISLYYDRAMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNMAIVT VGN+F ELGM DKYEKLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNS NGLSE +K+EHSST  L+EAEITSEDSSLEDDE
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        E+ +DP  ILEDE +  +SNFEHDFMG GQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

A0A1S4DTK8 pentatricopeptide repeat-containing protein At4g21190 isoform X11.1e-15588.57Show/hide
Query:  YRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTL
        +RMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTL
Subjt:  YRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTL

Query:  ENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVT
        ENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLF Q+LESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVT
Subjt:  ENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVT

Query:  MVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDECIS
         VGN+F ELGM DKYEKLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNS NGLSE +K+EHSST  L+EAEITSEDSSLEDDEE+ +DP  ILEDE + 
Subjt:  MVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDECIS

Query:  KESNFEHDFMGFGQL
         +SNFEHDFMG GQL
Subjt:  KESNFEHDFMGFGQL

A0A5A7SS06 Pentatricopeptide repeat-containing protein2.9e-16187.92Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        MTH +AQA+L+S SASYRMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLF Q+LESMPRIFFHKMISLYYDRAMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNMAIVT VGN+F ELGM DKYEKLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNS NGLSE +K+EHSST  L+EAEITSEDSSLEDDE
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        E+ +DP  ILEDE +  +SNFEHDFMG GQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

A0A5D3DHA7 Pentatricopeptide repeat-containing protein1.7e-16187.92Show/hide
Query:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
        MTH +AQA+L+S SASYRMLTL+Y+FPV SKRIESV FSWC SSSVVCAAKGPRPRYPRVWKT+KRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW
Subjt:  MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAW

Query:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF
        ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNAL EDGRLDEAEELWNKLF Q+LESMPRIFFHKMISLYYDRAMHDKLFEVF
Subjt:  ELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVF

Query:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE
        ADMEELGVQPNMAIVT VGN+F ELGM DKYEKLMKKYPPPKWEYRY+KGKRV+IR+KYL ENGNS NGLSE +K+EHSST  L+EAEITSEDSSLEDDE
Subjt:  ADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDE

Query:  EMSEDPVGILEDECISKESNFEHDFMGFGQL
        E+ +DP  ILEDE +  +SNFEHDFMG GQL
Subjt:  EMSEDPVGILEDECISKESNFEHDFMGFGQL

SwissProt top hitse value%identityAlignment
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic2.2e-4446.08Show/hide
Query:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY
        AE LWN + + H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      V   F EL   +  + ++++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY

Query:  LYEN
          E+
Subjt:  LYEN

Q8LG95 Pentatricopeptide repeat-containing protein At4g211903.0e-10263.43Show/hide
Query:  MLTLIYSFP--VISKRIESVK-FSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT
        ML+L YS P  ++  R  S K F+   ++ VVCAA+GPRPR PRVWKT+KRIGTISKAAK++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  
Subjt:  MLTLIYSFP--VISKRIESVK-FSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT

Query:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIV
        LE+++EWK+IIQ+TKWMLSKGQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY R MH KLFEVFADMEELGV+PN+AIV
Subjt:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIV

Query:  TMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGL-SEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDEC
        +MVG VF +L M DKYEKLMKKYPPP+WE+RY+KG+RV++++K L E      GL S+ DK+++     +E  E   ED  L ++EE  ++ +G  + + 
Subjt:  TMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGL-SEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDEC

Query:  ISKESNFEH
         S+E + +H
Subjt:  ISKESNFEH

Arabidopsis top hitse value%identityAlignment
AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-4546.08Show/hide
Query:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY
        AE LWN + + H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      V   F EL   +  + ++++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY

Query:  LYEN
          E+
Subjt:  LYEN

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein1.5e-4546.08Show/hide
Query:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY
        AE LWN + + H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      V   F EL   +  + ++++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY

Query:  LYEN
          E+
Subjt:  LYEN

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein1.5e-4546.08Show/hide
Query:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY
        AE LWN + + H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      V   F EL   +  + ++++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY

Query:  LYEN
          E+
Subjt:  LYEN

AT4G18975.4 Pentatricopeptide repeat (PPR) superfamily protein1.5e-4546.08Show/hide
Query:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE
        +WK     G+  KA  LV  + GL N KE VYGAL+ ++AWE+EFP+I   KAL+ L  + +W R+IQL KWMLSKGQG TMG+Y  LL A   D R DE
Subjt:  VWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDE

Query:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY
        AE LWN + + H  S+PR  F +MI+LY    +HDK+ EVFADMEEL V P+      V   F EL   +  + ++++Y   +++Y Y  G+RVR++ +Y
Subjt:  AEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKY

Query:  LYEN
          E+
Subjt:  LYEN

AT4G21190.1 Pentatricopeptide repeat (PPR) superfamily protein2.1e-10363.43Show/hide
Query:  MLTLIYSFP--VISKRIESVK-FSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT
        ML+L YS P  ++  R  S K F+   ++ VVCAA+GPRPR PRVWKT+KRIGTISKAAK++ C+KGLSNVKEEVYGALDSFIAWELEFPL+ VKKAL  
Subjt:  MLTLIYSFP--VISKRIESVK-FSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVKKALKT

Query:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIV
        LE+++EWK+IIQ+TKWMLSKGQGRTMG+YF+LLNALAED RLDEAEELWNKLF +HLE  PR FF+KMIS+YY R MH KLFEVFADMEELGV+PN+AIV
Subjt:  LENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIV

Query:  TMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGL-SEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDEC
        +MVG VF +L M DKYEKLMKKYPPP+WE+RY+KG+RV++++K L E      GL S+ DK+++     +E  E   ED  L ++EE  ++ +G  + + 
Subjt:  TMVGNVFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGL-SEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDEC

Query:  ISKESNFEH
         S+E + +H
Subjt:  ISKESNFEH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCACTCTATAGCCCAAGCATCGCTTTCGTCCTCCTCGGCTTCCTACCGGATGCTTACTTTGATTTACTCTTTTCCAGTCATATCCAAAAGGATAGAATCTGTTAA
ATTTTCCTGGTGTGCAAGCAGTTCAGTTGTATGCGCTGCAAAGGGTCCACGGCCGAGATATCCTCGGGTCTGGAAAACCAAAAAGAGAATTGGGACGATATCCAAGGCAG
CAAAGCTTGTTGATTGTGTCAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGAGCTCTTGATTCCTTCATTGCATGGGAACTAGAGTTTCCTCTTATTACTGTAAAG
AAAGCCTTGAAGACCTTAGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGTTAAGTAAAGGCCAAGGAAGAACAATGGGAAGCTATTTCACGTT
ATTAAATGCCTTAGCTGAAGATGGAAGACTTGATGAAGCAGAAGAGCTTTGGAACAAATTGTTTTATCAGCATTTGGAGAGCATGCCTCGCATATTCTTTCATAAAATGA
TATCCCTGTACTACGATCGGGCAATGCACGACAAGTTATTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAACCAAATATGGCAATTGTCACCATGGTTGGAAAT
GTCTTCCATGAGTTGGGTATGTTCGATAAATATGAAAAGCTGATGAAGAAATATCCCCCACCAAAATGGGAATATCGTTACGTCAAAGGAAAGCGCGTAAGAATACGATC
AAAGTATCTGTATGAAAATGGTAATTCCAACAATGGTTTAAGTGAGCATGACAAATTGGAACATAGTTCAACGAAGCTGTTGGAGGAAGCTGAAATAACTTCCGAGGATT
CCAGTCTTGAAGATGATGAGGAAATGAGCGAAGATCCAGTTGGAATTCTGGAAGATGAATGTATTTCAAAAGAATCCAATTTTGAGCATGATTTCATGGGGTTTGGGCAA
TTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGACTCACTCTATAGCCCAAGCATCGCTTTCGTCCTCCTCGGCTTCCTACCGGATGCTTACTTTGATTTACTCTTTTCCAGTCATATCCAAAAGGATAGAATCTGTTAA
ATTTTCCTGGTGTGCAAGCAGTTCAGTTGTATGCGCTGCAAAGGGTCCACGGCCGAGATATCCTCGGGTCTGGAAAACCAAAAAGAGAATTGGGACGATATCCAAGGCAG
CAAAGCTTGTTGATTGTGTCAAGGGACTGTCTAACGTCAAAGAGGAAGTCTATGGAGCTCTTGATTCCTTCATTGCATGGGAACTAGAGTTTCCTCTTATTACTGTAAAG
AAAGCCTTGAAGACCTTAGAGAACCAAAGAGAATGGAAGAGGATAATTCAGTTGACGAAATGGATGTTAAGTAAAGGCCAAGGAAGAACAATGGGAAGCTATTTCACGTT
ATTAAATGCCTTAGCTGAAGATGGAAGACTTGATGAAGCAGAAGAGCTTTGGAACAAATTGTTTTATCAGCATTTGGAGAGCATGCCTCGCATATTCTTTCATAAAATGA
TATCCCTGTACTACGATCGGGCAATGCACGACAAGTTATTTGAGGTATTTGCTGATATGGAGGAACTTGGAGTTCAACCAAATATGGCAATTGTCACCATGGTTGGAAAT
GTCTTCCATGAGTTGGGTATGTTCGATAAATATGAAAAGCTGATGAAGAAATATCCCCCACCAAAATGGGAATATCGTTACGTCAAAGGAAAGCGCGTAAGAATACGATC
AAAGTATCTGTATGAAAATGGTAATTCCAACAATGGTTTAAGTGAGCATGACAAATTGGAACATAGTTCAACGAAGCTGTTGGAGGAAGCTGAAATAACTTCCGAGGATT
CCAGTCTTGAAGATGATGAGGAAATGAGCGAAGATCCAGTTGGAATTCTGGAAGATGAATGTATTTCAAAAGAATCCAATTTTGAGCATGATTTCATGGGGTTTGGGCAA
TTGTAA
Protein sequenceShow/hide protein sequence
MTHSIAQASLSSSSASYRMLTLIYSFPVISKRIESVKFSWCASSSVVCAAKGPRPRYPRVWKTKKRIGTISKAAKLVDCVKGLSNVKEEVYGALDSFIAWELEFPLITVK
KALKTLENQREWKRIIQLTKWMLSKGQGRTMGSYFTLLNALAEDGRLDEAEELWNKLFYQHLESMPRIFFHKMISLYYDRAMHDKLFEVFADMEELGVQPNMAIVTMVGN
VFHELGMFDKYEKLMKKYPPPKWEYRYVKGKRVRIRSKYLYENGNSNNGLSEHDKLEHSSTKLLEEAEITSEDSSLEDDEEMSEDPVGILEDECISKESNFEHDFMGFGQ
L