CuGenDBv2

HG10020231 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10020231
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr04:30062166..30063944
RNA-Seq Expression: HG10020231
Synteny: HG10020231
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
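
The genome location above uses a `chromosome:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not say so), the string can be split into its parts and the spanned length computed. The names `parse_location` and `span_length` are illustrative and are not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr04:30062166..30063944' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end = parse_location("Chr04:30062166..30063944")
    print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 1779 bases on Chr04; with a half-open convention the figure would be one lower.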


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7017366.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.2e-246 | % identity: 76.39
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNIAKLI+KSGLKPFKTTPSLLSNLDS VTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASI-GGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVV
        RS+ ERIV+SI GGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+  EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVV
Subjt:  RSTAERIVASI-GGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVV

Query:  DGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYV
        DGLC+KGEV RAKALMDELV KGFKPNV TYNTLLNAYIER ++ CVNEILSLM KDGVDYNATTYTILIE +SRS KI EAEK+FDEMLK+GIEPDVYV
Subjt:  DGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYV

Query:  YTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFE
        YTSIINWNCN GNM+RAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGFE
Subjt:  YTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFE

Query:  IDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA--------------------------------------------------------------
        IDVFTYNIIASGFCRSNR++EA+ LLLTMEERGVAPNA                                                              
Subjt:  IDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA--------------------------------------------------------------

Query:  --------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                                  AL+LF+EMP+RGLNRN++TYTA+ISGLSKDGR+DEAFKLYDEMKAAGIEPDDRIY SLTGSLH+ GS
Subjt:  --------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

XP_022934560.1 pentatricopeptide repeat-containing protein At2g32630 isoform X1 [Cucurbita moschata] | e value: 5.3e-249 | % identity: 76.86
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNI KLI+KSGLKPFKTTPSLLSNLDSRVTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        RS+ ERIV+SIGGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+G EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKALMDELV KGFKPNV TYNTLLNAYIER ++ CVNEILSLM KDGVDY+ATTYTILIE YSRS KI EAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWNCN GNMKRAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGF+I
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------
        DVFTYNIIASGFCRSNR++EAR LLLTMEERGVAPNA                                                               
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------

Query:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                                 AL+LF+EMP+RGLNRN++TYTA+ISGLSKDGR+DEAFKLYDEMKAAGIEPDDRIY SLTGSLH+ GS
Subjt:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

XP_022984086.1 pentatricopeptide repeat-containing protein At2g32630 isoform X1 [Cucurbita maxima] | e value: 1.8e-249 | % identity: 77.2
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNIAKLI+KSGLKPFKTTPSLLSNLDSRVTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        R T ERIV+SIGGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+  EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKALMDELV KGFKPNVFTYNTLLNAYIER ++ CVNEILSLMEKDGVDYNATTYTILIE YSRS KI EAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWN N GNMKRAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGFEI
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------
        DVFTYNIIASGFCRSNR++EA+ LLLTMEERGVAPNA                                                               
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------

Query:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                                 AL+LF+EMP+RGLNRN++TYTA+ISGLSKDGR+DEAFKLYDEMKAAGIEPDDRIY SLTGSLH+ GS
Subjt:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

XP_023528780.1 pentatricopeptide repeat-containing protein At2g32630 [Cucurbita pepo subsp. pepo] | e value: 3.6e-250 | % identity: 77.2
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNIAKLI+KSGLKPFKTTPSLLSNLDSRVTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        RS+ ERIV+SIGGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+  EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKALMDELV KGFKPNVFTYNTLL AYIER ++ CVNEILSLMEKDGVDYNATTYTILIE YSRS KI EAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWNCN GNMKRAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGFEI
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------
        DVFTYNIIASGFCRSNR++EA+ LLLTMEERGVAPNA                                                               
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------

Query:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                                 AL+LF+EMP+RGLNRN++TYTA+ISGLSKDGR+DEAFKLYDEMKAAGIEPDDRIY SLTGSLH+ GS
Subjt:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

XP_038904125.1 pentatricopeptide repeat-containing protein At2g32630 [Benincasa hispida] | e value: 1.3e-252 | % identity: 78.06
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLS+PN+PTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNF+VNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        RST ERIV+SIG E NEPKFVDKFCDMLFRVYVDNRMFDSALEVF YARKSGLEIEERSCFVFLLALKRSGNVELSLEFL QMVDSGVEISVYSLT+VVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLCKK EVVRAKALMDEL CKGFKPN+FTYNTLLNAYIERND+GCVNEILSLMEKDGVDYNA+TYTILIE YSR+LKI EAE+LF++MLKKG+EPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWNCN  NMKRAFALFD+MTERG+VPNAYTYGAL+NG CKAG+MEAAEMLVNDMQSKGID+N VIFNTLIDGYCKKGMIDEALRLQDIMQQKGFE 
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------
        DVFTYNIIASGFCR NRQ+EARRLLLTMEERGVAPNA                                                               
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------

Query:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLH
                                 AL+LF+EM ++GLNRNV+TYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIY SLTGSLH
Subjt:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLH
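
In the alignments above, the middle line of each block follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the function below tallies those symbols for one midline segment. The name `summarize_midline` is hypothetical, and the sample string is only the first 20 columns of the first GenBank hit. The % identity values reported on this page cover each full alignment, so a per-segment tally like this is not expected to reproduce them.

```python
def summarize_midline(midline: str) -> dict:
    """Count identities and positives in one BLAST-style midline segment.

    Letters are identical residues, '+' is a conservative substitution,
    and anything else (spaces) is a mismatch or gap column.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "columns": len(midline),                  # includes any trailing spaces
        "identities": identities,
        "positives": identities + conservative,   # BLAST "positives" include identities
    }


if __name__ == "__main__":
    # First 20 midline columns of the KAG7017366.1 alignment shown above.
    segment = "M+NQA+ATNIAKLI+KSGLK"
    stats = summarize_midline(segment)
    print(stats, round(100 * stats["identities"] / stats["columns"], 2))
```

Summing the per-segment counts over every block of an alignment, and dividing by the total number of aligned columns, is one way to approximate the reported identity figure when the complete alignment is available.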

TrEMBL top hits (columns: e value, % identity, alignment)
A0A6J1CU39 pentatricopeptide repeat-containing protein At2g32630 | e value: 1.0e-237 | % identity: 74.16
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        MSNQAIATNI KLI+KSGLKPFKTTPSLLSNLD+++T LVLSNPNVPTQSCLSFFNFLR NPS KPDLRAHLIL+ RLY ARKFAVMKNVLNFI ND NL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        RST ERIVASIG E NEPKFVDK CDMLFRVYVDNRMFDSAL VF YA+ +GLEIEERSCFVFLLALKRS NVELSLE LR+MVDSGVEI+VYSLTIV+D
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KG+V RAK LMDELV KG KPNVFTYNTLLNAY ER D+  VNEILSLMEKDGVDYNA TYTILIE YSRS KI EAEKLFDEMLKK IEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWNCN GNMKRAFALFDEMTERGLVPN+YTYGAL+NGACKAG MEAAE+LVNDMQS+GIDVNQV+FNTLIDGYCKKGMIDEALRLQ+IMQQKG EI
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------
        DVFTYNIIASGFCRSNR+EEARRLLLTMEERGVAPNA                                                               
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------

Query:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                                 AL+LF+EMP++GL+RNVVTY AMISGLSKDGRA+EAF+LYDEMKAAGI+PDDRIY SLTGSLH+ GS
Subjt:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

A0A6J1F254 pentatricopeptide repeat-containing protein At2g32630 isoform X2 | e value: 2.1e-235 | % identity: 82.65
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNI KLI+KSGLKPFKTTPSLLSNLDSRVTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        RS+ ERIV+SIGGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+G EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKALMDELV KGFKPNV TYNTLLNAYIER ++ CVNEILSLM KDGVDY+ATTYTILIE YSRS KI EAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWNCN GNMKRAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGF+I
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKA
        DVFTYNIIASGFCRSNR++EAR LLLTMEERGVAPNA                  A +L  EM  +G   NVVTY   I G  K G+ +EA+KL DEM+ 
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKA

Query:  AGIEPDDRIYFSL
         G+  D   Y SL
Subjt:  AGIEPDDRIYFSL

A0A6J1F818 pentatricopeptide repeat-containing protein At2g32630 isoform X1 | e value: 2.5e-249 | % identity: 76.86
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNI KLI+KSGLKPFKTTPSLLSNLDSRVTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        RS+ ERIV+SIGGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+G EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKALMDELV KGFKPNV TYNTLLNAYIER ++ CVNEILSLM KDGVDY+ATTYTILIE YSRS KI EAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWNCN GNMKRAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGF+I
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------
        DVFTYNIIASGFCRSNR++EAR LLLTMEERGVAPNA                                                               
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------

Query:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                                 AL+LF+EMP+RGLNRN++TYTA+ISGLSKDGR+DEAFKLYDEMKAAGIEPDDRIY SLTGSLH+ GS
Subjt:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

A0A6J1J171 pentatricopeptide repeat-containing protein At2g32630 isoform X1 | e value: 8.7e-250 | % identity: 77.2
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNIAKLI+KSGLKPFKTTPSLLSNLDSRVTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        R T ERIV+SIGGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+  EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKALMDELV KGFKPNVFTYNTLLNAYIER ++ CVNEILSLMEKDGVDYNATTYTILIE YSRS KI EAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWN N GNMKRAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGFEI
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------
        DVFTYNIIASGFCRSNR++EA+ LLLTMEERGVAPNA                                                               
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA---------------------------------------------------------------

Query:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                                 AL+LF+EMP+RGLNRN++TYTA+ISGLSKDGR+DEAFKLYDEMKAAGIEPDDRIY SLTGSLH+ GS
Subjt:  -------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

A0A6J1J9J7 pentatricopeptide repeat-containing protein At2g32630 isoform X2 | e value: 7.2e-236 | % identity: 83.04
Query:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
        M+NQA+ATNIAKLI+KSGLKPFKTTPSLLSNLDSRVTQLVLSNP+VPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL
Subjt:  MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNL

Query:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
        R T ERIV+SIGGE +EPKFVDKFCDMLFRVYVDN MFDSALEVF YARK+  EIEERSC V LLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD
Subjt:  RSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVD

Query:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY
        GLC+KGEV RAKALMDELV KGFKPNVFTYNTLLNAYIER ++ CVNEILSLMEKDGVDYNATTYTILIE YSRS KI EAEK+FDEMLK+GIEPDVYVY
Subjt:  GLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVY

Query:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI
        TSIINWN N GNMKRAFALFDEMTERGLVPNAYTYGAL+NGACKAG+MEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGM+DEALRLQDIMQQKGFEI
Subjt:  TSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEI

Query:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKA
        DVFTYNIIASGFCRSNR++EA+ LLLTMEERGVAPNA                  A +L  EM  +G   NVVTY   I G  K G+ +EA+KL DEM+ 
Subjt:  DVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNA------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKA

Query:  AGIEPDDRIYFSL
         G+  D   Y SL
Subjt:  AGIEPDDRIYFSL

SwissProt top hits (columns: e value, % identity, alignment)
O04491 Putative pentatricopeptide repeat-containing protein At1g09680 | e value: 3.5e-54 | % identity: 26.64
Query:  FKTTPSLLSNLDS----RVTQLVLSNP-NVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECN
        F   PS+   L S     V  L+  NP ++P +S  +FF F+   P  +  +  + +L   L     F   ++++  +V+     S +   ++ +  E  
Subjt:  FKTTPSLLSNLDS----RVTQLVLSNP-NVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECN

Query:  EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMD
                 D L   Y D      A++ F  +RK   ++  R C   L  + +         F  +++D+G  ++VY   I+++  CK+G +  A+ + D
Subjt:  EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMD

Query:  ELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRA
        E+  +  +P V ++NTL+N Y +  ++     +   MEK     +  TY+ LI +  +  K+  A  LFDEM K+G+ P+  ++T++I+ +   G +   
Subjt:  ELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRA

Query:  FALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSN
           + +M  +GL P+   Y  LVNG CK G + AA  +V+ M  +G+  +++ + TLIDG+C+ G ++ AL ++  M Q G E+D   ++ +  G C+  
Subjt:  FALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSN

Query:  RQEEARRLLLTMEERGVAPN------------------AALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGS
        R  +A R L  M   G+ P+                     +L  EM   G   +VVTY  +++GL K G+   A  L D M   G+ PDD  Y +L   
Subjt:  RQEEARRLLLTMEERGVAPN------------------AALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGS

Query:  LHK
         H+
Subjt:  LHK

O04504 Pentatricopeptide repeat-containing protein At1g09820 | e value: 7.8e-54 | % identity: 29.05
Query:  CLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIG---GECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHY
        CL ++++L +N      L     L+  L  A++++ +++ L     DG +R+ ++  V SI      C+         DML   Y +N  F+   E F  
Subjt:  CLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIG---GECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHY

Query:  ARKSGLEIEERSCFVFLLAL---KRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIG
        +   G ++   SC   ++AL    RS +VE      ++M+   ++ +V++  +V++ LCK G++ +A+ +M+++   G  PNV +YNTL++ Y +    G
Subjt:  ARKSGLEIEERSCFVFLLAL---KRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIG

Query:  CV---NEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNG
         +   + +L  M ++ V  N TT+ ILI+ + +   +  + K+F EML + ++P+V  Y S+IN  CN G +  A ++ D+M   G+ PN  TY AL+NG
Subjt:  CV---NEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNG

Query:  ACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQL
         CK   ++ A  +   ++ +G      ++N LID YCK G ID+   L++ M+++G   DV TYN + +G CR+   E                 AA +L
Subjt:  ACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQL

Query:  FDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIY
        FD++  +GL  ++VT+  ++ G  + G + +A  L  EM   G++P    Y
Subjt:  FDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIY

Q8S8P6 Pentatricopeptide repeat-containing protein At2g32630 | e value: 4.8e-144 | % identity: 46.96
Query:  SNQAIATNI-AKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ-KPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGN
        S+Q  A  I A L+ KS +   ++ PSLL NL+S VT+LVLS P +PTQSC+ FF  LR+  S  KPDL A + L  RLY  R+F  M+++LN +VNDG 
Subjt:  SNQAIATNI-AKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ-KPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGN

Query:  LRSTAERI-VASIGGECNEPK--FVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLT
         +   E +  A +  + +E K  F +KF D++FRVYVDN MF+  L VF Y  K GL I+ERSC VFL+A K+   ++L LE  R+MVDSGV+I+VYSLT
Subjt:  LRSTAERI-VASIGGECNEPK--FVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLT

Query:  IVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPD
        IVV+GLC++GEV ++K L+ E   KG KP  +TYNT++NAY+++ D   V  +L +M+KDGV YN  TYT+L+E   ++ K+ +AEKLFDEM ++GIE D
Subjt:  IVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPD

Query:  VYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQK
        V+VYTS+I+WNC  GNMKRAF LFDE+TE+GL P++YTYGAL++G CK G+M AAE+L+N+MQSKG+++ QV+FNTLIDGYC+KGM+DEA  + D+M+QK
Subjt:  VYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQK

Query:  GFEIDVFTYNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPNA------------------------
        GF+ DVFT N IAS F                                   C+    EEA+RL + M  +GV PNA                        
Subjt:  GFEIDVFTYNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPNA------------------------

Query:  -----------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLH
                                     A++LF EM  +GL++N VTYT MISGLSK G++DEAF LYDEMK  G   D+++Y +L GS+H
Subjt:  -----------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLH

Q9LN69 Putative pentatricopeptide repeat-containing protein At1g19290 | e value: 1.9e-55 | % identity: 29.75
Query:  VLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFD
        +L    +  ++CL  FN   +    +PD +A+  ++  L RAR +   K+ L  +V    L  +   +   +     E  F     DM+ +VY +  +  
Subjt:  VLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFD

Query:  SALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCK-GFKPNVFTYNTLLNAYI
        +AL VF      G      SC   L  L R G   ++L    QM+   V   V++ +IVV+  C+ G V +A     E     G + NV TYN+L+N Y 
Subjt:  SALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCK-GFKPNVFTYNTLLNAYI

Query:  ERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGAL
           D+  +  +L LM + GV  N  TYT LI+ Y +   + EAE +F+ + +K +  D ++Y  +++  C  G ++ A  + D M E G+  N     +L
Subjt:  ERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGAL

Query:  VNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAP---
        +NG CK+GQ+  AE + + M    +  +   +NTL+DGYC+ G +DEAL+L D M QK     V TYNI+  G+ R     +   L   M +RGV     
Subjt:  VNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAP---

Query:  ---------------NAALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                       N A++L++ +  RGL  + +T   MISGL K  + +EA ++ D +     +P  + Y +L+   +KVG+
Subjt:  ---------------NAALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

Q9ZUU7 Pentatricopeptide repeat-containing protein At2g28050 | e value: 1.5e-65 | % identity: 33.11
Query:  SNQAIATNIAKLILKSGL--KPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ---KPDLRAHLILICRLYRARKFAVMKNVLNFIVN
        + Q    +I KL+L S    +   +  + LS+L+    + +LS+P++ +  C+S FNF+ +NPS    +PDLR HL L  R+   R+F+  K +L  +  
Subjt:  SNQAIATNIAKLILKSGL--KPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ---KPDLRAHLILICRLYRARKFAVMKNVLNFIVN

Query:  DGNLRSTAERIVASIGGECN-EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVE-ISVYS
        D  LR     IV+S+  EC  E K V +F + +  VY DN  F   +EVF Y + + ++I+E++C + LL LKR   +EL+ +F   MV+SG++ ++VYS
Subjt:  DGNLRSTAERIVASIGGECN-EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVE-ISVYS

Query:  LTIVVDGLCKKGEVVRAKALMDEL-VCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGI
        LT+VV  LC  GE+ RA+ L++E+ + KG K N+ T+ +++   ++R D   ++ +L LMEK+ V  +  +Y +LI+ ++   K+ EAE+L   M  K +
Subjt:  LTIVVDGLCKKGEVVRAKALMDEL-VCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGI

Query:  EPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIM
          + Y+Y  I+N    +G +++   L+ EM+ RG+ PN  TY  L+NG CKAG++  A   +N+++    ++++ +++TL +   + GMID++L +   M
Subjt:  EPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIM

Query:  QQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQ
         + GF         +A      NR +EA+ L+  + + G+ P +  Q
Subjt:  QQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQ

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.5e-55 | % identity: 26.64
Query:  FKTTPSLLSNLDS----RVTQLVLSNP-NVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECN
        F   PS+   L S     V  L+  NP ++P +S  +FF F+   P  +  +  + +L   L     F   ++++  +V+     S +   ++ +  E  
Subjt:  FKTTPSLLSNLDS----RVTQLVLSNP-NVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECN

Query:  EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMD
                 D L   Y D      A++ F  +RK   ++  R C   L  + +         F  +++D+G  ++VY   I+++  CK+G +  A+ + D
Subjt:  EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMD

Query:  ELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRA
        E+  +  +P V ++NTL+N Y +  ++     +   MEK     +  TY+ LI +  +  K+  A  LFDEM K+G+ P+  ++T++I+ +   G +   
Subjt:  ELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRA

Query:  FALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSN
           + +M  +GL P+   Y  LVNG CK G + AA  +V+ M  +G+  +++ + TLIDG+C+ G ++ AL ++  M Q G E+D   ++ +  G C+  
Subjt:  FALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSN

Query:  RQEEARRLLLTMEERGVAPN------------------AALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGS
        R  +A R L  M   G+ P+                     +L  EM   G   +VVTY  +++GL K G+   A  L D M   G+ PDD  Y +L   
Subjt:  RQEEARRLLLTMEERGVAPN------------------AALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGS

Query:  LHK
         H+
Subjt:  LHK

AT1G09820.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e value: 5.5e-55 | % identity: 29.05
Query:  CLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIG---GECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHY
        CL ++++L +N      L     L+  L  A++++ +++ L     DG +R+ ++  V SI      C+         DML   Y +N  F+   E F  
Subjt:  CLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIG---GECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHY

Query:  ARKSGLEIEERSCFVFLLAL---KRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIG
        +   G ++   SC   ++AL    RS +VE      ++M+   ++ +V++  +V++ LCK G++ +A+ +M+++   G  PNV +YNTL++ Y +    G
Subjt:  ARKSGLEIEERSCFVFLLAL---KRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIG

Query:  CV---NEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNG
         +   + +L  M ++ V  N TT+ ILI+ + +   +  + K+F EML + ++P+V  Y S+IN  CN G +  A ++ D+M   G+ PN  TY AL+NG
Subjt:  CV---NEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNG

Query:  ACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQL
         CK   ++ A  +   ++ +G      ++N LID YCK G ID+   L++ M+++G   DV TYN + +G CR+   E                 AA +L
Subjt:  ACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQL

Query:  FDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIY
        FD++  +GL  ++VT+  ++ G  + G + +A  L  EM   G++P    Y
Subjt:  FDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIY

AT1G19290.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.3e-56; % identity: 29.75)
Query:  VLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFD
        +L    +  ++CL  FN   +    +PD +A+  ++  L RAR +   K+ L  +V    L  +   +   +     E  F     DM+ +VY +  +  
Subjt:  VLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVASIGGECNEPKFVDKFCDMLFRVYVDNRMFD

Query:  SALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCK-GFKPNVFTYNTLLNAYI
        +AL VF      G      SC   L  L R G   ++L    QM+   V   V++ +IVV+  C+ G V +A     E     G + NV TYN+L+N Y 
Subjt:  SALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVCK-GFKPNVFTYNTLLNAYI

Query:  ERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGAL
           D+  +  +L LM + GV  N  TYT LI+ Y +   + EAE +F+ + +K +  D ++Y  +++  C  G ++ A  + D M E G+  N     +L
Subjt:  ERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGAL

Query:  VNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAP---
        +NG CK+GQ+  AE + + M    +  +   +NTL+DGYC+ G +DEAL+L D M QK     V TYNI+  G+ R     +   L   M +RGV     
Subjt:  VNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAP---

Query:  ---------------NAALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
                       N A++L++ +  RGL  + +T   MISGL K  + +EA ++ D +     +P  + Y +L+   +KVG+
Subjt:  ---------------NAALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS

AT2G28050.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.1e-66; % identity: 33.11)
Query:  SNQAIATNIAKLILKSGL--KPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ---KPDLRAHLILICRLYRARKFAVMKNVLNFIVN
        + Q    +I KL+L S    +   +  + LS+L+    + +LS+P++ +  C+S FNF+ +NPS    +PDLR HL L  R+   R+F+  K +L  +  
Subjt:  SNQAIATNIAKLILKSGL--KPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ---KPDLRAHLILICRLYRARKFAVMKNVLNFIVN

Query:  DGNLRSTAERIVASIGGECN-EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVE-ISVYS
        D  LR     IV+S+  EC  E K V +F + +  VY DN  F   +EVF Y + + ++I+E++C + LL LKR   +EL+ +F   MV+SG++ ++VYS
Subjt:  DGNLRSTAERIVASIGGECN-EPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVE-ISVYS

Query:  LTIVVDGLCKKGEVVRAKALMDEL-VCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGI
        LT+VV  LC  GE+ RA+ L++E+ + KG K N+ T+ +++   ++R D   ++ +L LMEK+ V  +  +Y +LI+ ++   K+ EAE+L   M  K +
Subjt:  LTIVVDGLCKKGEVVRAKALMDEL-VCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGI

Query:  EPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIM
          + Y+Y  I+N    +G +++   L+ EM+ RG+ PN  TY  L+NG CKAG++  A   +N+++    ++++ +++TL +   + GMID++L +   M
Subjt:  EPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIM

Query:  QQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQ
         + GF         +A      NR +EA+ L+  + + G+ P +  Q
Subjt:  QQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQ

AT2G32630.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 3.4e-145; % identity: 46.96)
Query:  SNQAIATNI-AKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ-KPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGN
        S+Q  A  I A L+ KS +   ++ PSLL NL+S VT+LVLS P +PTQSC+ FF  LR+  S  KPDL A + L  RLY  R+F  M+++LN +VNDG 
Subjt:  SNQAIATNI-AKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQ-KPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGN

Query:  LRSTAERI-VASIGGECNEPK--FVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLT
         +   E +  A +  + +E K  F +KF D++FRVYVDN MF+  L VF Y  K GL I+ERSC VFL+A K+   ++L LE  R+MVDSGV+I+VYSLT
Subjt:  LRSTAERI-VASIGGECNEPK--FVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLT

Query:  IVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPD
        IVV+GLC++GEV ++K L+ E   KG KP  +TYNT++NAY+++ D   V  +L +M+KDGV YN  TYT+L+E   ++ K+ +AEKLFDEM ++GIE D
Subjt:  IVVDGLCKKGEVVRAKALMDELVCKGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPD

Query:  VYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQK
        V+VYTS+I+WNC  GNMKRAF LFDE+TE+GL P++YTYGAL++G CK G+M AAE+L+N+MQSKG+++ QV+FNTLIDGYC+KGM+DEA  + D+M+QK
Subjt:  VYVYTSIINWNCNYGNMKRAFALFDEMTERGLVPNAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQK

Query:  GFEIDVFTYNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPNA------------------------
        GF+ DVFT N IAS F                                   C+    EEA+RL + M  +GV PNA                        
Subjt:  GFEIDVFTYNIIASGF-----------------------------------CRSNRQEEARRLLLTMEERGVAPNA------------------------

Query:  -----------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLH
                                     A++LF EM  +GL++N VTYT MISGLSK G++DEAF LYDEMK  G   D+++Y +L GS+H
Subjt:  -----------------------------ALQLFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLH


Sequences
CDS sequence
ATGTCGAATCAGGCGATTGCCACGAACATCGCGAAGCTAATTCTGAAATCTGGCCTTAAACCCTTCAAAACGACGCCATCGCTGCTTTCCAATCTTGATTCTCGGGTAAC
GCAACTGGTTTTGTCCAACCCAAATGTTCCTACTCAGTCGTGTTTGAGTTTTTTCAACTTTCTCCGACAAAACCCTTCTCAGAAACCCGATCTTCGGGCACATTTAATCC
TCATCTGTAGGTTGTATCGAGCTCGGAAGTTCGCGGTAATGAAAAATGTGTTGAACTTCATCGTTAATGATGGAAATCTTCGGAGCACTGCCGAGCGGATTGTTGCTTCG
ATTGGAGGTGAGTGTAATGAGCCGAAATTTGTTGATAAATTTTGTGATATGTTGTTTAGGGTATACGTGGATAACAGAATGTTTGATTCGGCTTTGGAGGTTTTTCATTA
TGCGAGAAAGAGTGGGTTAGAGATTGAGGAGAGATCCTGTTTTGTGTTTTTACTTGCTTTAAAGAGGTCTGGTAATGTAGAATTATCTCTAGAATTCTTGCGCCAAATGG
TCGATTCGGGTGTGGAAATAAGTGTTTATTCGTTGACGATTGTGGTTGATGGGTTGTGCAAGAAAGGGGAGGTTGTAAGGGCTAAAGCTTTGATGGATGAACTTGTTTGC
AAAGGATTTAAGCCCAATGTTTTCACATATAACACTCTTTTGAATGCTTATATTGAAAGGAATGATATAGGATGTGTAAATGAGATTCTTAGTTTGATGGAGAAGGATGG
CGTGGATTATAATGCAACAACATATACAATTTTGATTGAATCGTATTCAAGAAGCTTGAAAATTTTGGAAGCAGAGAAGCTGTTTGATGAAATGCTTAAGAAAGGCATAG
AGCCTGATGTGTATGTTTACACCTCCATTATTAATTGGAATTGTAATTATGGGAACATGAAGAGGGCCTTTGCTCTGTTTGATGAAATGACTGAGAGAGGGCTTGTTCCA
AATGCTTACACTTATGGGGCCCTTGTAAATGGTGCCTGCAAGGCAGGGCAGATGGAGGCAGCTGAGATGCTGGTAAATGACATGCAAAGCAAAGGAATTGATGTAAATCA
AGTGATATTCAATACGTTGATAGATGGGTACTGCAAAAAAGGGATGATTGACGAAGCTCTAAGGCTGCAGGATATCATGCAGCAAAAAGGGTTCGAGATTGATGTGTTTA
CTTATAACATAATTGCCAGTGGTTTTTGTAGATCGAACCGGCAAGAGGAAGCAAGGAGATTATTGCTCACAATGGAAGAAAGGGGAGTGGCTCCAAATGCAGCGCTTCAA
CTGTTCGATGAAATGCCAAAACGAGGGCTAAATAGGAATGTGGTAACTTACACTGCAATGATCTCTGGGTTGTCCAAGGATGGCAGAGCTGATGAAGCTTTTAAATTATA
CGATGAAATGAAAGCAGCAGGCATTGAACCTGATGATAGAATATATTTTTCCTTGACAGGGAGCCTTCATAAAGTAGGATCTTAG
mRNA sequence
ATGTCGAATCAGGCGATTGCCACGAACATCGCGAAGCTAATTCTGAAATCTGGCCTTAAACCCTTCAAAACGACGCCATCGCTGCTTTCCAATCTTGATTCTCGGGTAAC
GCAACTGGTTTTGTCCAACCCAAATGTTCCTACTCAGTCGTGTTTGAGTTTTTTCAACTTTCTCCGACAAAACCCTTCTCAGAAACCCGATCTTCGGGCACATTTAATCC
TCATCTGTAGGTTGTATCGAGCTCGGAAGTTCGCGGTAATGAAAAATGTGTTGAACTTCATCGTTAATGATGGAAATCTTCGGAGCACTGCCGAGCGGATTGTTGCTTCG
ATTGGAGGTGAGTGTAATGAGCCGAAATTTGTTGATAAATTTTGTGATATGTTGTTTAGGGTATACGTGGATAACAGAATGTTTGATTCGGCTTTGGAGGTTTTTCATTA
TGCGAGAAAGAGTGGGTTAGAGATTGAGGAGAGATCCTGTTTTGTGTTTTTACTTGCTTTAAAGAGGTCTGGTAATGTAGAATTATCTCTAGAATTCTTGCGCCAAATGG
TCGATTCGGGTGTGGAAATAAGTGTTTATTCGTTGACGATTGTGGTTGATGGGTTGTGCAAGAAAGGGGAGGTTGTAAGGGCTAAAGCTTTGATGGATGAACTTGTTTGC
AAAGGATTTAAGCCCAATGTTTTCACATATAACACTCTTTTGAATGCTTATATTGAAAGGAATGATATAGGATGTGTAAATGAGATTCTTAGTTTGATGGAGAAGGATGG
CGTGGATTATAATGCAACAACATATACAATTTTGATTGAATCGTATTCAAGAAGCTTGAAAATTTTGGAAGCAGAGAAGCTGTTTGATGAAATGCTTAAGAAAGGCATAG
AGCCTGATGTGTATGTTTACACCTCCATTATTAATTGGAATTGTAATTATGGGAACATGAAGAGGGCCTTTGCTCTGTTTGATGAAATGACTGAGAGAGGGCTTGTTCCA
AATGCTTACACTTATGGGGCCCTTGTAAATGGTGCCTGCAAGGCAGGGCAGATGGAGGCAGCTGAGATGCTGGTAAATGACATGCAAAGCAAAGGAATTGATGTAAATCA
AGTGATATTCAATACGTTGATAGATGGGTACTGCAAAAAAGGGATGATTGACGAAGCTCTAAGGCTGCAGGATATCATGCAGCAAAAAGGGTTCGAGATTGATGTGTTTA
CTTATAACATAATTGCCAGTGGTTTTTGTAGATCGAACCGGCAAGAGGAAGCAAGGAGATTATTGCTCACAATGGAAGAAAGGGGAGTGGCTCCAAATGCAGCGCTTCAA
CTGTTCGATGAAATGCCAAAACGAGGGCTAAATAGGAATGTGGTAACTTACACTGCAATGATCTCTGGGTTGTCCAAGGATGGCAGAGCTGATGAAGCTTTTAAATTATA
CGATGAAATGAAAGCAGCAGGCATTGAACCTGATGATAGAATATATTTTTCCTTGACAGGGAGCCTTCATAAAGTAGGATCTTAG
Protein sequence
MSNQAIATNIAKLILKSGLKPFKTTPSLLSNLDSRVTQLVLSNPNVPTQSCLSFFNFLRQNPSQKPDLRAHLILICRLYRARKFAVMKNVLNFIVNDGNLRSTAERIVAS
IGGECNEPKFVDKFCDMLFRVYVDNRMFDSALEVFHYARKSGLEIEERSCFVFLLALKRSGNVELSLEFLRQMVDSGVEISVYSLTIVVDGLCKKGEVVRAKALMDELVC
KGFKPNVFTYNTLLNAYIERNDIGCVNEILSLMEKDGVDYNATTYTILIESYSRSLKILEAEKLFDEMLKKGIEPDVYVYTSIINWNCNYGNMKRAFALFDEMTERGLVP
NAYTYGALVNGACKAGQMEAAEMLVNDMQSKGIDVNQVIFNTLIDGYCKKGMIDEALRLQDIMQQKGFEIDVFTYNIIASGFCRSNRQEEARRLLLTMEERGVAPNAALQ
LFDEMPKRGLNRNVVTYTAMISGLSKDGRADEAFKLYDEMKAAGIEPDDRIYFSLTGSLHKVGS
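
The protein sequence above should correspond to a direct translation of the CDS sequence. As a minimal sketch of how that relationship could be checked, the Python snippet below translates a CDS string using the standard genetic code (NCBI translation table 1). The function name `translate` and the constant names are illustrative choices, not part of CuGenDB, and the use of the standard nuclear code for this gene is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), written in the usual
# compact form: codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon).

    Whitespace and line breaks are ignored, so a sequence pasted as
    wrapped lines can be passed in directly. Codons containing
    characters other than A, C, G, T are reported as 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


if __name__ == "__main__":
    # First and last codons of the CDS listed above.
    print(translate("ATGTCGAATCAGGCGATTGCC"))
    print(translate("GGATCTTAG"))
```

Passing the full CDS (with its line breaks) to `translate` and dropping the trailing `*` would give a string that can be compared against the listed protein sequence.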