; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg16776 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg16776
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCarg_Chr16:1159199..1163754
RNA-Seq ExpressionCarg16776
SyntenyCarg16776
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576841.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.5e-28998.24Show/hide
Query:  INMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAA
        INMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAA
Subjt:  INMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAA

Query:  MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAG
        MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAG
Subjt:  MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQR
        RLEDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQR
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQR

Query:  NCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFC
        NCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFC
Subjt:  NCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFC

Query:  IIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPD
        IIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPD
Subjt:  IIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPD

Query:  RKAREMLKSVTV
        RKAREMLKS  +
Subjt:  RKAREMLKSVTV

KAG7014862.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+00100Show/hide
Query:  MVLGGDRDQISITPKFINMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLC
        MVLGGDRDQISITPKFINMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLC
Subjt:  MVLGGDRDQISITPKFINMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLC

Query:  NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQK
        NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQK
Subjt:  NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQK

Query:  NQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPD
        NQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPD
Subjt:  NQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPD

Query:  ITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSA
        ITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSA
Subjt:  ITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSA

Query:  GLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYK
        GLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYK
Subjt:  GLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYK

Query:  EMERAGCTPDRKAREMLKSVTVPSKILIEELIYTKKVDHNARPAEFQLDNPKI
        EMERAGCTPDRKAREMLKSVTVPSKILIEELIYTKKVDHNARPAEFQLDNPKI
Subjt:  EMERAGCTPDRKAREMLKSVTVPSKILIEELIYTKKVDHNARPAEFQLDNPKI

XP_022922450.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X1 [Cucurbita moschata]1.2e-28698.04Show/hide
Query:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
        MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
Subjt:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS

Query:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL
        LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC
        EDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ NC
Subjt:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC

Query:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
        KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
Subjt:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII

Query:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
        ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
Subjt:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK

Query:  AREMLKSVTV
        AREMLKSVTV
Subjt:  AREMLKSVTV

XP_022922460.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X2 [Cucurbita moschata]1.2e-28698.04Show/hide
Query:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
        MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
Subjt:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS

Query:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL
        LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC
        EDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ NC
Subjt:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC

Query:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
        KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
Subjt:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII

Query:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
        ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
Subjt:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK

Query:  AREMLKSVTV
        AREMLKSVTV
Subjt:  AREMLKSVTV

XP_023551942.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo]2.9e-28596.68Show/hide
Query:  INMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAA
        INMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDA+LKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAA
Subjt:  INMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAA

Query:  MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAG
        MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAG
Subjt:  MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQR
        RLEDTW IINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ 
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQR

Query:  NCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFC
        NCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWD+IKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQF+DAEKCISALKSAGLLPSASNFC
Subjt:  NCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFC

Query:  IIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPD
        IIANAFARQGLCEETVKVL+LMEAEGIEPNLVMLNVLINAFAVAGRHLEA+AIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEME AGCTPD
Subjt:  IIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPD

Query:  RKAREMLKSVTV
        RKAREMLKSVTV
Subjt:  RKAREMLKSVTV

TrEMBL top hitse value%identityAlignment
A0A6J1E3F2 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X15.8e-28798.04Show/hide
Query:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
        MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
Subjt:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS

Query:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL
        LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC
        EDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ NC
Subjt:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC

Query:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
        KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
Subjt:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII

Query:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
        ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
Subjt:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK

Query:  AREMLKSVTV
        AREMLKSVTV
Subjt:  AREMLKSVTV

A0A6J1E471 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X32.0e-26391.96Show/hide
Query:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
        MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDG                               SARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
Subjt:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS

Query:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL
        LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC
        EDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ NC
Subjt:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC

Query:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
        KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
Subjt:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII

Query:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
        ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
Subjt:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK

Query:  AREMLKSVTV
        AREMLKSVTV
Subjt:  AREMLKSVTV

A0A6J1E8S9 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X25.8e-28798.04Show/hide
Query:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
        MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
Subjt:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS

Query:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL
        LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC
        EDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ NC
Subjt:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC

Query:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
        KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
Subjt:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII

Query:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
        ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
Subjt:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK

Query:  AREMLKSVTV
        AREMLKSVTV
Subjt:  AREMLKSVTV

A0A6J1J2J8 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X13.1e-28096.08Show/hide
Query:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
        MQSIFLPKTFIVYSFGGVYSGTL H+SSSKCDGRYMFGDAVL+LFRKNGLKKVSKAALYDNYTISAR HGCKDQEELSGDLCN LIRDYCKVGNVDAAMS
Subjt:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS

Query:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL
        LLS+MEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC
        EDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ NC
Subjt:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC

Query:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
        KSGNL TALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
Subjt:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII

Query:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
        ANAFARQGLCEETVKVL+LMEAEGIEPNLVMLNVLINAFAVAGRHLEA+AIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEME AGCTPDRK
Subjt:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK

Query:  AREMLKSVTV
        AREMLKSVTV
Subjt:  AREMLKSVTV

A0A6J1J5P1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X23.1e-28096.08Show/hide
Query:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS
        MQSIFLPKTFIVYSFGGVYSGTL H+SSSKCDGRYMFGDAVL+LFRKNGLKKVSKAALYDNYTISAR HGCKDQEELSGDLCN LIRDYCKVGNVDAAMS
Subjt:  MQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMS

Query:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL
        LLS+MEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR PRTTVCNALLRGFLRKGLLDLASD LVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC
        EDTWSIINEMKRKGFELNSFVYSK      NNGMWKKAVGIVDEIRKSGI VDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQ NC
Subjt:  EDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNC

Query:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
        KSGNL TALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII
Subjt:  KSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCII

Query:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK
        ANAFARQGLCEETVKVL+LMEAEGIEPNLVMLNVLINAFAVAGRHLEA+AIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEME AGCTPDRK
Subjt:  ANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRK

Query:  AREMLKSVTV
        AREMLKSVTV
Subjt:  AREMLKSVTV

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic1.8e-8840.42Show/hide
Query:  ELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR-TPRTTVCNALLRGFLRKGLLDLASDALVL
        E    L + LI  + +    DAA+ LL+  +A+GL     + T LI A G  GR  EA+ LF E    G   PRT   NALL+G++R   L  A   L  
Subjt:  ELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR-TPRTTVCNALLRGFLRKGLLDLASDALVL

Query:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEAL
        MS   +  ++ TY +L+D +  AGR E    ++ EM+  G + +S+V+S      ++ G W+KA  ++ E++ SG+  D+H YN +IDTFGKY  L  A+
Subjt:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEAL

Query:  EVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQ
        + F KM+++G+ PD+ TWN+LI  +CK G    A ELF +M+E    P    +  +I+ LGEQ  W+ ++  L  MK +G   + + Y  LVD+YG+ G+
Subjt:  EVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQ

Query:  FRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAY
        +++A  CI A+K+ GL PS + +  + NA+A++GL +  + V++ M+A+G+E ++++LN LINAF    R +EA ++   + E G+ PDVITYTTLMKA 
Subjt:  FRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAY

Query:  IRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS
        IR ++F KVP IY+EM  +GC PDRKAR ML+S
Subjt:  IRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS

Q84ZD2 Pentatricopeptide repeat-containing protein CRP1 homolog, chloroplastic2.2e-8639.72Show/hide
Query:  ELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR-TPRTTVCNALLRGFLRKGLLDLASDALVL
        E    L + LI  + +    DAA+ LL+  +A+GL     + T LI + G+  R  EA+ LF E    G   PRT   NALL+G+++ G L  A   L  
Subjt:  ELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGR-TPRTTVCNALLRGFLRKGLLDLASDALVL

Query:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEAL
        MS   +  ++ TY +L+D +  AGR E    ++ EM+  G + +S+V+S      ++ G W+KA  ++ E+  SG+  D+H YN +IDTFGKY  L  A+
Subjt:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEAL

Query:  EVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQ
        + F +M+++G+ PD+ TWN+LI  +CK G    A+ELF +M+E         +  +I+ LGE+ +W+ ++  L  MK +G   + + Y  LVD+YG+ G+
Subjt:  EVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQ

Query:  FRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAY
        F++A  CI A+K+ GL PS + +  + NA+A++GL +  + V++ M A+G+E + V+LN LINAF    R  EA ++   + E G+ PDVITYTTLMKA 
Subjt:  FRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAY

Query:  IRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS
        IR ++F KVP IY+EM  +GC PDRKAR ML+S
Subjt:  IRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS

Q8L844 Pentatricopeptide repeat-containing protein At5g42310, chloroplastic2.3e-9138.54Show/hide
Query:  VLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEM
        +  L R N +  V    LY            +D+ EL   L N +I  + K G+   A+ LL   +A GL A  A+   +I A  + GRTLEA+ LF+E+
Subjt:  VLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEM

Query:  ISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVG
           G  PRT   NALL+G+++ G L  A   +  M    +  ++ TY +L+D + NAGR E    ++ EM+    + NSFV+S      ++ G W+K   
Subjt:  ISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVG

Query:  IVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKW
        ++ E++  G+  D+  YN +IDTFGK+  L  A+  F +M  +G+ PD  TWN+LI  +CK G    A E+F  M+ +G  P    +  +I+S G+Q +W
Subjt:  IVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKW

Query:  DIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFA
        D +K+ L  MK +G   + + +  LVD+YG+ G+F DA +C+  +KS GL PS++ +  + NA+A++GL E+ V   ++M ++G++P+L+ LN LINAF 
Subjt:  DIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFA

Query:  VAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS
           R  EA A+  ++ E G+ PDV+TYTTLMKA IR  KF KVP +Y+EM  +GC PDRKAR ML+S
Subjt:  VAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS

Q9FFZ2 Putative pentatricopeptide repeat-containing protein At5g363001.2e-7140Show/hide
Query:  LSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISF--GRTPRTTVCNALLRGFLRKGLLDLASDALVL
        +S  + N  IR +C+ G  + AMSLL+ + ++G      SY   IE   ++ RTLEAD LF E++ F    +    + NAL+  +LRK            
Subjt:  LSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISF--GRTPRTTVCNALLRGFLRKGLLDLASDALVL

Query:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEAL
                                  E +W ++NEMK++ F LNSFVY K      +NGMWKKA+GIV+EIR+ G+ +D  IYNS+IDTFGKYG+L E L
Subjt:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEAL

Query:  EVFKKMQQDG-VMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYG
        +V +K+Q+     P+I TWNSLI+ +C  G +  ALELFT +                                                          
Subjt:  EVFKKMQQDG-VMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYG

Query:  QFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVE-VGISPDVITYTTLMK
         F D  + +  LKS G+ PSA+ FC +ANA+A+QGLC++TVKVL++ME EGIEPNL+MLNVLINAF  AG+H+EA++IYHHI E V I PDV+TY+TLMK
Subjt:  QFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVE-VGISPDVITYTTLMK

Query:  AYIRAKKFHKVPEIY
        A+ RAKK+  V   Y
Subjt:  AYIRAKKFHKVPEIY

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397107.8e-4726.74Show/hide
Query:  DLC-------NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLE-ADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDA
        DLC       + +++ Y ++  +D A+S++   +A G    V SY  +++A     R +  A+ +F+EM+    +P     N L+RGF   G +D+A   
Subjt:  DLC-------NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLE-ADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDA

Query:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLP
           M       N  TY  L+D +    +++D + ++  M  KG E N   Y+         G  K+   ++ E+ + G  +D+  YN++I  + K G   
Subjt:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLP

Query:  EALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQ
        +AL +  +M + G+ P + T+ SLI   CK+GN+  A+E    M+ +G+ P+ + + TL+    ++G  +   + L  M   G   S + Y  L++ +  
Subjt:  EALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQ

Query:  YGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLM
         G+  DA   +  +K  GL P   ++  + + F R    +E ++V + M  +GI+P+ +  + LI  F    R  EA  +Y  ++ VG+ PD  TYT L+
Subjt:  YGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLM

Query:  KAYIRAKKFHKVPEIYKEMERAGCTPDRKAREML------KSVTVPSKILIEELIYTKKV
         AY       K  +++ EM   G  PD     +L      +S T  +K L+ +L Y + V
Subjt:  KAYIRAKKFHKVPEIYKEMERAGCTPDRKAREML------KSVTVPSKILIEELIYTKKV

Arabidopsis top hitse value%identityAlignment
AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein5.9e-4225.87Show/hide
Query:  QEELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALV
        Q  L   +   +I    K G V +A ++ + ++  G    V SYT LI A  N GR  EA  +F++M   G  P     N +L  F              
Subjt:  QEELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALV

Query:  LMSDLDIQKNQETYEILLDYHANAGRLEDTW----SIINEMKRKGFELNSFVYS------KNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQ
                                G++   W    S++ +MK  G   +++ Y+      K   + ++A  + +E++ +G   DK  YN+++D +GK  +
Subjt:  LMSDLDIQKNQETYEILLDYHANAGRLEDTW----SIINEMKRKGFELNSFVYS------KNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQ

Query:  LPEALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIY
          EA++V  +M  +G  P I T+NSLI    + G L  A+EL   M E+G  PD   + TL+S     GK +      + M+  G K +   +   + +Y
Subjt:  LPEALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIY

Query:  GQYGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTT
        G  G+F +  K    +   GL P    +  +   F + G+  E   V + M+  G  P     N LI+A++  G   +AM +Y  +++ G++PD+ TY T
Subjt:  GQYGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTT

Query:  LMKAYIRAKKFHKVPEIYKEMERAGCTPD
        ++ A  R   + +  ++  EME   C P+
Subjt:  LMKAYIRAKKFHKVPEIYKEMERAGCTPD

AT5G36300.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-5536.34Show/hide
Query:  MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISF--GRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHAN
        MSLL+ + ++G      SY   IE   ++ RTLEAD LF E++ F    +    + NAL+  +LRK                                  
Subjt:  MSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISF--GRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHAN

Query:  AGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLI
            E +W ++NEMK++ F LNSFVY K      +NGMWKKA+GIV+EIR+ G+ +D  IYNS+IDTFGKYG+L E L                      
Subjt:  AGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLI

Query:  QRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASN
                                                                                  Q+G F D  + +  LKS G+ PSA+ 
Subjt:  QRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASN

Query:  FCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVE-VGISPDVITYTTLMKAYIRAKKFHKV
        FC +ANA+A+QGLC++TVKVL++ME EGIEPNL+MLNVLINAF  AG+H+EA++IYHHI E V I PDV+TY+TLMKA+ RAKK+  V
Subjt:  FCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVE-VGISPDVITYTTLMKAYIRAKKFHKV

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.5e-4826.74Show/hide
Query:  DLC-------NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLE-ADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDA
        DLC       + +++ Y ++  +D A+S++   +A G    V SY  +++A     R +  A+ +F+EM+    +P     N L+RGF   G +D+A   
Subjt:  DLC-------NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLE-ADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDA

Query:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLP
           M       N  TY  L+D +    +++D + ++  M  KG E N   Y+         G  K+   ++ E+ + G  +D+  YN++I  + K G   
Subjt:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLP

Query:  EALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQ
        +AL +  +M + G+ P + T+ SLI   CK+GN+  A+E    M+ +G+ P+ + + TL+    ++G  +   + L  M   G   S + Y  L++ +  
Subjt:  EALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQ

Query:  YGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLM
         G+  DA   +  +K  GL P   ++  + + F R    +E ++V + M  +GI+P+ +  + LI  F    R  EA  +Y  ++ VG+ PD  TYT L+
Subjt:  YGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLM

Query:  KAYIRAKKFHKVPEIYKEMERAGCTPDRKAREML------KSVTVPSKILIEELIYTKKV
         AY       K  +++ EM   G  PD     +L      +S T  +K L+ +L Y + V
Subjt:  KAYIRAKKFHKVPEIYKEMERAGCTPDRKAREML------KSVTVPSKILIEELIYTKKV

AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.6e-9238.54Show/hide
Query:  VLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEM
        +  L R N +  V    LY            +D+ EL   L N +I  + K G+   A+ LL   +A GL A  A+   +I A  + GRTLEA+ LF+E+
Subjt:  VLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEM

Query:  ISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVG
           G  PRT   NALL+G+++ G L  A   +  M    +  ++ TY +L+D + NAGR E    ++ EM+    + NSFV+S      ++ G W+K   
Subjt:  ISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYS------KNNGMWKKAVG

Query:  IVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKW
        ++ E++  G+  D+  YN +IDTFGK+  L  A+  F +M  +G+ PD  TWN+LI  +CK G    A E+F  M+ +G  P    +  +I+S G+Q +W
Subjt:  IVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKW

Query:  DIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFA
        D +K+ L  MK +G   + + +  LVD+YG+ G+F DA +C+  +KS GL PS++ +  + NA+A++GL E+ V   ++M ++G++P+L+ LN LINAF 
Subjt:  DIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFA

Query:  VAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS
           R  EA A+  ++ E G+ PDV+TYTTLMKA IR  KF KVP +Y+EM  +GC PDRKAR ML+S
Subjt:  VAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRKAREMLKS

AT5G55840.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-4026.44Show/hide
Query:  NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQK
        N ++  YCK G   AA+ LL HM++ G+ A V +Y  LI       R  +  +L ++M      P     N L+ GF  +G + +AS  L  M    +  
Subjt:  NCLIRDYCKVGNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQK

Query:  NQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQ
        N  T+  L+D H + G  ++   +   M+ KG   +   Y         N  +  A G    ++++G+ V +  Y  +ID   K G L EA+ +  +M +
Subjt:  NQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSK------NNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQ

Query:  DGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCI
        DG+ PDI T+++LI   CK G   TA E+   +   G+ P+  I+ TLI +    G      +  ++M L GH      + +LV    + G+  +AE+ +
Subjt:  DGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQGMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCI

Query:  SALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHK
          + S G+LP+  +F  + N +   G   +   V   M   G  P       L+      G   EA      +  V  + D + Y TL+ A  ++    K
Subjt:  SALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPNLVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHK

Query:  VPEIYKEMERAGCTPD
           ++ EM +    PD
Subjt:  VPEIYKEMERAGCTPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTTGGAGGAGACCGAGATCAAATTAGCATTACTCCGAAATTCATAAATATGCAATCTATTTTCTTGCCAAAAACATTCATCGTATACTCATTTGGTGGAGTATA
TTCTGGCACTTTGTTGCATCGAAGTTCTAGTAAATGTGATGGAAGGTATATGTTTGGCGATGCCGTATTAAAGTTGTTTAGGAAGAATGGCCTAAAAAAGGTCAGTAAAG
CAGCATTATATGATAATTATACTATTAGCGCTAGGTGGCATGGATGTAAAGATCAAGAGGAACTATCTGGTGATTTGTGCAACTGTTTGATTCGTGACTATTGTAAGGTA
GGGAATGTTGATGCTGCTATGTCTCTTCTTTCTCATATGGAGGCTGTAGGTCTCCATGCCTCTGTAGCATCTTACACATATTTGATTGAAGCTCATGGAAATGTAGGCAG
GACTTTGGAAGCTGATATCTTATTTCAAGAAATGATTAGTTTTGGTCGTACGCCAAGAACAACGGTCTGCAATGCACTACTAAGAGGGTTCTTGAGAAAAGGCCTTTTAG
ATCTTGCATCTGATGCTCTTGTGTTAATGAGTGATTTAGATATTCAAAAAAATCAAGAAACGTATGAAATTCTCTTGGATTATCATGCCAATGCTGGACGGCTGGAAGAT
ACTTGGTCTATTATTAATGAGATGAAACGAAAAGGTTTTGAGCTGAACTCATTTGTGTATAGTAAGAACAATGGCATGTGGAAGAAAGCAGTGGGGATCGTTGATGAGAT
AAGAAAATCAGGGATTTATGTGGACAAACACATTTACAACAGTATCATAGATACATTTGGGAAATATGGTCAATTACCCGAGGCCTTAGAAGTGTTCAAAAAAATGCAGC
AGGATGGTGTAATGCCTGATATAACAACCTGGAATTCGCTGATACAACGGAACTGTAAATCTGGGAACCTTGCTACTGCCCTGGAGTTATTCACTGACATGCAAGAACAG
GGGATGCATCCAGATCCTAAGATTTTCATTACTCTAATTAGCTCGTTGGGTGAGCAGGGAAAGTGGGATATTATAAAGAAGAATCTTGATAGTATGAAGCTCAGAGGGCA
TAAGAATAGTGGCCTAGTTTATGAAATTTTAGTTGATATTTATGGGCAGTATGGTCAATTTCGGGATGCTGAGAAATGTATATCTGCTCTCAAGTCTGCAGGTCTTCTAC
CATCCGCTAGCAATTTTTGCATTATAGCAAATGCTTTTGCTCGACAGGGGTTGTGTGAAGAGACAGTAAAAGTGCTTCAGCTAATGGAGGCAGAAGGAATCGAACCAAAT
CTTGTAATGCTGAATGTACTGATCAATGCGTTTGCTGTTGCTGGCAGACATTTGGAGGCGATGGCCATTTATCATCATATAGTTGAAGTTGGTATCAGTCCTGATGTTAT
AACCTACACCACCCTTATGAAGGCGTATATTCGTGCAAAGAAGTTTCATAAGGTCCCTGAAATATATAAAGAAATGGAACGTGCTGGTTGCACGCCAGATAGGAAGGCCA
GAGAGATGTTGAAGTCCGTAACAGTGCCATCTAAAATCTTGATCGAGGAATTGATCTACACCAAGAAAGTAGATCATAATGCCAGACCGGCAGAATTTCAACTTGACAAT
CCAAAGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTGCTTGGAGGAGACCGAGATCAAATTAGCATTACTCCGAAATTCATAAATATGCAATCTATTTTCTTGCCAAAAACATTCATCGTATACTCATTTGGTGGAGTATA
TTCTGGCACTTTGTTGCATCGAAGTTCTAGTAAATGTGATGGAAGGTATATGTTTGGCGATGCCGTATTAAAGTTGTTTAGGAAGAATGGCCTAAAAAAGGTCAGTAAAG
CAGCATTATATGATAATTATACTATTAGCGCTAGGTGGCATGGATGTAAAGATCAAGAGGAACTATCTGGTGATTTGTGCAACTGTTTGATTCGTGACTATTGTAAGGTA
GGGAATGTTGATGCTGCTATGTCTCTTCTTTCTCATATGGAGGCTGTAGGTCTCCATGCCTCTGTAGCATCTTACACATATTTGATTGAAGCTCATGGAAATGTAGGCAG
GACTTTGGAAGCTGATATCTTATTTCAAGAAATGATTAGTTTTGGTCGTACGCCAAGAACAACGGTCTGCAATGCACTACTAAGAGGGTTCTTGAGAAAAGGCCTTTTAG
ATCTTGCATCTGATGCTCTTGTGTTAATGAGTGATTTAGATATTCAAAAAAATCAAGAAACGTATGAAATTCTCTTGGATTATCATGCCAATGCTGGACGGCTGGAAGAT
ACTTGGTCTATTATTAATGAGATGAAACGAAAAGGTTTTGAGCTGAACTCATTTGTGTATAGTAAGAACAATGGCATGTGGAAGAAAGCAGTGGGGATCGTTGATGAGAT
AAGAAAATCAGGGATTTATGTGGACAAACACATTTACAACAGTATCATAGATACATTTGGGAAATATGGTCAATTACCCGAGGCCTTAGAAGTGTTCAAAAAAATGCAGC
AGGATGGTGTAATGCCTGATATAACAACCTGGAATTCGCTGATACAACGGAACTGTAAATCTGGGAACCTTGCTACTGCCCTGGAGTTATTCACTGACATGCAAGAACAG
GGGATGCATCCAGATCCTAAGATTTTCATTACTCTAATTAGCTCGTTGGGTGAGCAGGGAAAGTGGGATATTATAAAGAAGAATCTTGATAGTATGAAGCTCAGAGGGCA
TAAGAATAGTGGCCTAGTTTATGAAATTTTAGTTGATATTTATGGGCAGTATGGTCAATTTCGGGATGCTGAGAAATGTATATCTGCTCTCAAGTCTGCAGGTCTTCTAC
CATCCGCTAGCAATTTTTGCATTATAGCAAATGCTTTTGCTCGACAGGGGTTGTGTGAAGAGACAGTAAAAGTGCTTCAGCTAATGGAGGCAGAAGGAATCGAACCAAAT
CTTGTAATGCTGAATGTACTGATCAATGCGTTTGCTGTTGCTGGCAGACATTTGGAGGCGATGGCCATTTATCATCATATAGTTGAAGTTGGTATCAGTCCTGATGTTAT
AACCTACACCACCCTTATGAAGGCGTATATTCGTGCAAAGAAGTTTCATAAGGTCCCTGAAATATATAAAGAAATGGAACGTGCTGGTTGCACGCCAGATAGGAAGGCCA
GAGAGATGTTGAAGTCCGTAACAGTGCCATCTAAAATCTTGATCGAGGAATTGATCTACACCAAGAAAGTAGATCATAATGCCAGACCGGCAGAATTTCAACTTGACAAT
CCAAAGATTTAG
Protein sequenceShow/hide protein sequence
MVLGGDRDQISITPKFINMQSIFLPKTFIVYSFGGVYSGTLLHRSSSKCDGRYMFGDAVLKLFRKNGLKKVSKAALYDNYTISARWHGCKDQEELSGDLCNCLIRDYCKV
GNVDAAMSLLSHMEAVGLHASVASYTYLIEAHGNVGRTLEADILFQEMISFGRTPRTTVCNALLRGFLRKGLLDLASDALVLMSDLDIQKNQETYEILLDYHANAGRLED
TWSIINEMKRKGFELNSFVYSKNNGMWKKAVGIVDEIRKSGIYVDKHIYNSIIDTFGKYGQLPEALEVFKKMQQDGVMPDITTWNSLIQRNCKSGNLATALELFTDMQEQ
GMHPDPKIFITLISSLGEQGKWDIIKKNLDSMKLRGHKNSGLVYEILVDIYGQYGQFRDAEKCISALKSAGLLPSASNFCIIANAFARQGLCEETVKVLQLMEAEGIEPN
LVMLNVLINAFAVAGRHLEAMAIYHHIVEVGISPDVITYTTLMKAYIRAKKFHKVPEIYKEMERAGCTPDRKAREMLKSVTVPSKILIEELIYTKKVDHNARPAEFQLDN
PKI