; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001895 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001895
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr11:1471783..1474098
RNA-Seq ExpressionHG10001895
SyntenyHG10001895
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023551938.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo]5.0e-27290.93Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        + MQSIFLPKT IV SFGGV+S  LLH  SSKCDG+YMF DA LKLFR N LKK SKAAL DN  IS RWHGCKD+EELS DLCNCLIRDYCKVGNVD+A
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLL+HME+VG HASVASY YLIEA GN+GRTLEADI+FQEMISFG KPRT VCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTW IINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQL EALEVFK+MQQDGVMP+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCK+GNLATALELFTDMQEQGMHPDPKIF+TLISSLGEQGKWDVIK+NLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF++QGLCEETVKVL+LMEAEGIEPNLV+LNVLINAFAVAGRH EALAIYHHI+EVGISPDVITYTTLMKA+IRAKKF KVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRH
        RKAREMLKSVTVVLEQRH
Subjt:  RKAREMLKSVTVVLEQRH

XP_023551942.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X2 [Cucurbita pepo subsp. pepo]5.0e-27290.93Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        + MQSIFLPKT IV SFGGV+S  LLH  SSKCDG+YMF DA LKLFR N LKK SKAAL DN  IS RWHGCKD+EELS DLCNCLIRDYCKVGNVD+A
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLL+HME+VG HASVASY YLIEA GN+GRTLEADI+FQEMISFG KPRT VCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTW IINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQL EALEVFK+MQQDGVMP+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCK+GNLATALELFTDMQEQGMHPDPKIF+TLISSLGEQGKWDVIK+NLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF++QGLCEETVKVL+LMEAEGIEPNLV+LNVLINAFAVAGRH EALAIYHHI+EVGISPDVITYTTLMKA+IRAKKF KVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRH
        RKAREMLKSVTVVLEQRH
Subjt:  RKAREMLKSVTVVLEQRH

XP_038901715.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X1 [Benincasa hispida]1.5e-27994.02Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        MKMQS FLPK++IVSSFGGVFSDLLL+P S KC G+YMFDDAALKLFRNNALKK SK ALDDN IIS+RWHGC+DE ELSSDLCNCLIRDYCKVGNVDSA
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLLAHMESVG HASVASY YLIEALGNLGRTLEADI+FQEM+SFGCKPRTIVCNALLRGFLRKGLLDLAS VLVLMSDLDI+KNQETYEILLDYHANAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTWSIINEMK+KGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDK IYNSI+DTFGKYGQLSEALEVFKRMQQDGVMP+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHK+SGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF+QQGL EETVKVLQLMEA+GIEPNLV+LNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKF+KVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRH
        RKAREMLKSVTVVLEQRH
Subjt:  RKAREMLKSVTVVLEQRH

XP_038901745.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X2 [Benincasa hispida]1.5e-27994.02Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        MKMQS FLPK++IVSSFGGVFSDLLL+P S KC G+YMFDDAALKLFRNNALKK SK ALDDN IIS+RWHGC+DE ELSSDLCNCLIRDYCKVGNVDSA
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLLAHMESVG HASVASY YLIEALGNLGRTLEADI+FQEM+SFGCKPRTIVCNALLRGFLRKGLLDLAS VLVLMSDLDI+KNQETYEILLDYHANAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTWSIINEMK+KGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDK IYNSI+DTFGKYGQLSEALEVFKRMQQDGVMP+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHK+SGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF+QQGL EETVKVLQLMEA+GIEPNLV+LNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKF+KVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRH
        RKAREMLKSVTVVLEQRH
Subjt:  RKAREMLKSVTVVLEQRH

XP_038901793.1 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X3 [Benincasa hispida]5.0e-28093.83Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        MKMQS FLPK++IVSSFGGVFSDLLL+P S KC G+YMFDDAALKLFRNNALKK SK ALDDN IIS+RWHGC+DE ELSSDLCNCLIRDYCKVGNVDSA
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLLAHMESVG HASVASY YLIEALGNLGRTLEADI+FQEM+SFGCKPRTIVCNALLRGFLRKGLLDLAS VLVLMSDLDI+KNQETYEILLDYHANAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTWSIINEMK+KGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDK IYNSI+DTFGKYGQLSEALEVFKRMQQDGVMP+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHK+SGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF+QQGL EETVKVLQLMEA+GIEPNLV+LNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKF+KVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRHF
        RKAREMLKSVTVVLEQRH+
Subjt:  RKAREMLKSVTVVLEQRHF

TrEMBL top hitse value%identityAlignment
A0A0A0KZ91 Uncharacterized protein2.3e-27090.17Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        MKM SIFLPK  IVSSFGGVFSD LL   SSKCDGKYMFD   +KLFRNN+L  ASKA +DDNCIIS+RWHGC DEEELSS+ CN LIRDYCKVG+VDSA
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLLAHMESVG HA++ SY YLIEALGN+GRTLEADIIFQEMISFGCKPRT+VCNALLRGFLRKGLLDLAS V VLMSDLDI+KNQETYEILLDYH NAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGIS+DKHIYNSIIDTFGKYG LSEALEVFKRMQQDGV+P+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCK+GNLATALELFTDMQEQGMHPDPKIF+TLIS LGEQGKWDVI QNLDSMKLRGHKNS LVYEILVDIYGQYGQFQDAEKCISALKSAGLL S SNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF+QQGLCEETVKVLQLMEAEGIEPNLV+LNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKF+KVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRHF
        RKAREMLKSVT +LEQRH+
Subjt:  RKAREMLKSVTVVLEQRHF

A0A1S4E032 pentatricopeptide repeat-containing protein At5g42310, mitochondrial-like1.9e-26991.19Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        MKMQSIFLPK  IVSSFGG FSD LLH  SSK DG Y FD A LK FRNN L  ASKAA+DDNCI+S+RWHGC DEEELSS+ CN LI DYCKVGNVDSA
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLLAHMESVG HA++ASY YLIEALGN+GRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLAS VLVLMSDLDI+KNQETYEILLDYH NAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGIS+DKHIYNSIIDTFGKYGQLSEALEVFKRMQQD V+P+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCKAGNLATALELFTDMQEQGMHPDPKIF+TLIS L EQGKWDVIKQNLDSMKLRGHKNS LVYEILVDIYGQYGQFQD EKCISALKSAGLLPS+SNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF+QQGLCEETVKVLQLMEAEGIEPNLV+LNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRHFSQP
        RKAREMLKSVT VLEQRH SQP
Subjt:  RKAREMLKSVTVVLEQRHFSQP

A0A5A7TTI3 Pentatricopeptide repeat-containing protein1.1e-26991.19Show/hide
Query:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA
        MKMQSIFLPK  IVSSFGG FSD LLH  SSK DGKY FD A LK FRNN L  ASKAA+DDNCIIS+RWHGC DEEELSS+ CN LI DYCKVGNVDSA
Subjt:  MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSA

Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG
        MSLLAHMESVG HA++ASY YLIEALGN+GRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLAS VLVLMSDLDI+KNQETYEILLDYH NAG
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAG

Query:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW
        RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGIS+DKHIYNSIIDTFGKYGQLSEALEVFKRMQQD V+P+ITTWNSLIQW
Subjt:  RLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQW

Query:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC
        NCKAGNLATALELFTDMQEQGMHPDPKIF+TLIS L EQGKWDVIKQNLDSMKLRGHKNS LVYEILVDIYGQYGQFQD EKCISALKSAGLLPS+SNFC
Subjt:  NCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFC

Query:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD
        IIANAF+QQGLCEETVKVLQLMEAEGIEPNLV+LNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKF+KVPEIYKEME+AGCTPD
Subjt:  IIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPD

Query:  RKAREMLKSVTVVLEQRHFSQP
        RK REMLKSVT VLEQRH SQP
Subjt:  RKAREMLKSVTVVLEQRHFSQP

A0A6J1E3F2 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X11.6e-27190.89Show/hide
Query:  MQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSAMS
        MQSIFLPKT IV SFGGV+S  LLH  SSKCDG+YMF DA LKLFR N LKK SKAAL DN  IS RWHGCKD+EELS DLCNCLIRDYCKVGNVD+AMS
Subjt:  MQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSAMS

Query:  LLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRL
        LL+HME+VG HASVASY YLIEA GN+GRTLEADI+FQEMISFG  PRT VCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQWNC
        EDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQL EALEVFK+MQQDGVMP+ITTWNSLIQWNC
Subjt:  EDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQWNC

Query:  KAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFCII
        K+GNLATALELFTDMQEQGMHPDPKIF+TLISSLGEQGKWD+IK+NLDSMKLRGHKNSGLVYEILVDIYGQYGQF+DAEKCISALKSAGLLPSASNFCII
Subjt:  KAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFCII

Query:  ANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPDRK
        ANAF++QGLCEETVKVLQLMEAEGIEPNLV+LNVLINAFAVAGRH EA+AIYHHI+EVGISPDVITYTTLMKA+IRAKKF KVPEIYKEME AGCTPDRK
Subjt:  ANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPDRK

Query:  AREMLKSVTVVLEQRH
        AREMLKSVTVVLEQRH
Subjt:  AREMLKSVTVVLEQRH

A0A6J1E8S9 pentatricopeptide repeat-containing protein At5g42310, chloroplastic-like isoform X21.6e-27190.89Show/hide
Query:  MQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSAMS
        MQSIFLPKT IV SFGGV+S  LLH  SSKCDG+YMF DA LKLFR N LKK SKAAL DN  IS RWHGCKD+EELS DLCNCLIRDYCKVGNVD+AMS
Subjt:  MQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSAMS

Query:  LLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRL
        LL+HME+VG HASVASY YLIEA GN+GRTLEADI+FQEMISFG  PRT VCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRL
Subjt:  LLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRL

Query:  EDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQWNC
        EDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQL EALEVFK+MQQDGVMP+ITTWNSLIQWNC
Subjt:  EDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQWNC

Query:  KAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFCII
        K+GNLATALELFTDMQEQGMHPDPKIF+TLISSLGEQGKWD+IK+NLDSMKLRGHKNSGLVYEILVDIYGQYGQF+DAEKCISALKSAGLLPSASNFCII
Subjt:  KAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFCII

Query:  ANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPDRK
        ANAF++QGLCEETVKVLQLMEAEGIEPNLV+LNVLINAFAVAGRH EA+AIYHHI+EVGISPDVITYTTLMKA+IRAKKF KVPEIYKEME AGCTPDRK
Subjt:  ANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPDRK

Query:  AREMLKSVTVVLEQRH
        AREMLKSVTVVLEQRH
Subjt:  AREMLKSVTVVLEQRH

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic2.6e-9039.72Show/hide
Query:  ELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFG-CKPRTIVCNALLRGFLRKGLLDLASDVLVL
        E  + L + LI  + +    D+A+ LLA  +++G      +   LI ALG  GR  EA+ +F E    G  KPRT   NALL+G++R   L  A  VL  
Subjt:  ELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFG-CKPRTIVCNALLRGFLRKGLLDLASDVLVL

Query:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEAL
        MS   +  ++ TY +L+D +  AGR E    ++ EM+  G + +S+V+S+++  +++ G W+KA  ++ E++ SG+  D+H YN +IDTFGKY  L  A+
Subjt:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEAL

Query:  EVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQ
        + F +M+++G+ P++ TWN+LI  +CK G    A ELF +M+E    P    +  +I+ LGEQ  W+ ++  L  MK +G   + + Y  LVD+YG+ G+
Subjt:  EVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQ

Query:  FQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAF
        +++A  CI A+K+ GL PS + +  + NA++Q+GL +  + V++ M+A+G+E ++++LN LINAF    R  EA ++   + E G+ PDVITYTTLMKA 
Subjt:  FQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAF

Query:  IRAKKFSKVPEIYKEMENAGCTPDRKAREMLKS
        IR ++F KVP IY+EM  +GC PDRKAR ML+S
Subjt:  IRAKKFSKVPEIYKEMENAGCTPDRKAREMLKS

Q84ZD2 Pentatricopeptide repeat-containing protein CRP1 homolog, chloroplastic5.9e-9039.07Show/hide
Query:  AALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFG-CKPRTIVCNA
        AAL D  +   R    + +  L SD    LI  + +    D+A+ LLA  +++G      +   LI +LG+  R  EA+ +F E    G  KPRT   NA
Subjt:  AALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFG-CKPRTIVCNA

Query:  LLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDK
        LL+G+++ G L  A  VL  MS   +  ++ TY +L+D +  AGR E    ++ EM+  G + +S+V+S+++  +++ G W+KA  ++ E+  SG+  D+
Subjt:  LLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDK

Query:  HIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRG
        H YN +IDTFGKY  L  A++ F RM+++G+ P++ TWN+LI  +CK G    A+ELF +M+E         +  +I+ LGE+ +W+ ++  L  MK +G
Subjt:  HIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRG

Query:  HKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHH
           + + Y  LVD+YG+ G+F++A  CI A+K+ GL PS + +  + NA++Q+GL +  + V++ M A+G+E + V+LN LINAF    R +EA ++   
Subjt:  HKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHH

Query:  IIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPDRKAREMLKS
        + E G+ PDVITYTTLMKA IR ++F KVP IY+EM  +GC PDRKAR ML+S
Subjt:  IIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPDRKAREMLKS

Q8L844 Pentatricopeptide repeat-containing protein At5g42310, chloroplastic1.0e-9439.28Show/hide
Query:  KDEEELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDV
        +D+ EL   L N +I  + K G+   A+ LL   ++ G  A  A+   +I AL + GRTLEA+ +F+E+   G KPRT   NALL+G+++ G L  A  +
Subjt:  KDEEELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDV

Query:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLS
        +  M    +  ++ TY +L+D + NAGR E    ++ EM+    + NSFV+S+++  +++ G W+K   ++ E++  G+  D+  YN +IDTFGK+  L 
Subjt:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLS

Query:  EALEVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQ
         A+  F RM  +G+ P+  TWN+LI  +CK G    A E+F  M+ +G  P    +  +I+S G+Q +WD +K+ L  MK +G   + + +  LVD+YG+
Subjt:  EALEVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQ

Query:  YGQFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLM
         G+F DA +C+  +KS GL PS++ +  + NA++Q+GL E+ V   ++M ++G++P+L+ LN LINAF    R +EA A+  ++ E G+ PDV+TYTTLM
Subjt:  YGQFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLM

Query:  KAFIRAKKFSKVPEIYKEMENAGCTPDRKAREMLKSVTVVLEQ
        KA IR  KF KVP +Y+EM  +GC PDRKAR ML+S    ++Q
Subjt:  KAFIRAKKFSKVPEIYKEMENAGCTPDRKAREMLKSVTVVLEQ

Q9FFZ2 Putative pentatricopeptide repeat-containing protein At5g363001.6e-7942.17Show/hide
Query:  LSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIV--CNALLRGFLRKGLLDLASDVLVL
        +S  + N  IR +C+ G  + AMSLLA + S+G      SY   IE L +L RTLEAD +F E++ F       V   NAL+  +LRK            
Subjt:  LSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIV--CNALLRGFLRKGLLDLASDVLVL

Query:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEAL
                                  E +W ++NEMK++ F LNSFVY K+I IY++NGMWKKA+GIV+EIR+ G+ +D  IYNS+IDTFGKYG+L E L
Subjt:  MSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEAL

Query:  EVFKRMQQDG-VMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYG
        +V +++Q+     PNI TWNSLI+W+C  G +  ALELFT +                                                          
Subjt:  EVFKRMQQDG-VMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYG

Query:  QFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIE-VGISPDVITYTTLMK
         F+D  + +  LKS G+ PSA+ FC +ANA++QQGLC++TVKVL++ME EGIEPNL++LNVLINAF  AG+H EAL+IYHHI E V I PDV+TY+TLMK
Subjt:  QFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIE-VGISPDVITYTTLMK

Query:  AFIRAKKFSKVPEIY
        AF RAKK+  V   Y
Subjt:  AFIRAKKFSKVPEIY

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397105.0e-4927.01Show/hide
Query:  SSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLE-ADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMS
        +S + + +++ Y ++  +D A+S++   ++ G    V SY  +++A     R +  A+ +F+EM+     P     N L+RGF   G +D+A  +   M 
Subjt:  SSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLE-ADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMS

Query:  DLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEV
              N  TY  L+D +    +++D + ++  M  KG E N   Y+ VI      G  K+   ++ E+ + G S+D+  YN++I  + K G   +AL +
Subjt:  DLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEV

Query:  FKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQ
           M + G+ P++ T+ SLI   CKAGN+  A+E    M+ +G+ P+ + + TL+    ++G  +   + L  M   G   S + Y  L++ +   G+ +
Subjt:  FKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQ

Query:  DAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIR
        DA   +  +K  GL P   ++  + + F +    +E ++V + M  +GI+P+ +  + LI  F    R  EA  +Y  ++ VG+ PD  TYT L+ A+  
Subjt:  DAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIR

Query:  AKKFSKVPEIYKEMENAGCTPD
             K  +++ EM   G  PD
Subjt:  AKKFSKVPEIYKEMENAGCTPD

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein5.9e-4525.72Show/hide
Query:  LIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQ
        ++  YC+ G +D    L+  M+  G   +   Y  +I  L  + +  EA+  F EMI  G  P T+V   L+ GF ++G +  AS     M   DI  + 
Subjt:  LIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQ

Query:  ETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDG
         TY  ++      G + +   + +EM  KG E +S  ++++I  Y   G  K A  + + + ++G S +   Y ++ID   K G L  A E+   M + G
Subjt:  ETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDG

Query:  VMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISA
        + PNI T+NS++   CK+GN+  A++L  + +  G++ D   + TL+ +  + G+ D  ++ L  M  +G + + + + +L++ +  +G  +D EK ++ 
Subjt:  VMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISA

Query:  LKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVP
        + + G+ P+A+ F  +   +  +   +    + + M + G+ P+      L+     A    EA  ++  +   G S  V TY+ L+K F++ KKF +  
Subjt:  LKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVP

Query:  EIYKEMENAGCTPDRK
        E++ +M   G   D++
Subjt:  EIYKEMENAGCTPDRK

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein5.9e-4525.72Show/hide
Query:  LIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQ
        ++  YC+ G +D    L+  M+  G   +   Y  +I  L  + +  EA+  F EMI  G  P T+V   L+ GF ++G +  AS     M   DI  + 
Subjt:  LIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQ

Query:  ETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDG
         TY  ++      G + +   + +EM  KG E +S  ++++I  Y   G  K A  + + + ++G S +   Y ++ID   K G L  A E+   M + G
Subjt:  ETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDG

Query:  VMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISA
        + PNI T+NS++   CK+GN+  A++L  + +  G++ D   + TL+ +  + G+ D  ++ L  M  +G + + + + +L++ +  +G  +D EK ++ 
Subjt:  VMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISA

Query:  LKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVP
        + + G+ P+A+ F  +   +  +   +    + + M + G+ P+      L+     A    EA  ++  +   G S  V TY+ L+K F++ KKF +  
Subjt:  LKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVP

Query:  EIYKEMENAGCTPDRK
        E++ +M   G   D++
Subjt:  EIYKEMENAGCTPDRK

AT5G36300.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-6138.4Show/hide
Query:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIV--CNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHAN
        MSLLA + S+G      SY   IE L +L RTLEAD +F E++ F       V   NAL+  +LRK                                  
Subjt:  MSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIV--CNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHAN

Query:  AGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLI
            E +W ++NEMK++ F LNSFVY K+I IY++NGMWKKA+GIV+EIR+ G+ +D  IYNS+IDTFGKYG+L E L                      
Subjt:  AGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLI

Query:  QWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASN
                                                                                  Q+G F+D  + +  LKS G+ PSA+ 
Subjt:  QWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASN

Query:  FCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIE-VGISPDVITYTTLMKAFIRAKKFSKV
        FC +ANA++QQGLC++TVKVL++ME EGIEPNL++LNVLINAF  AG+H EAL+IYHHI E V I PDV+TY+TLMKAF RAKK+  V
Subjt:  FCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIE-VGISPDVITYTTLMKAFIRAKKFSKV

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.6e-5027.01Show/hide
Query:  SSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLE-ADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMS
        +S + + +++ Y ++  +D A+S++   ++ G    V SY  +++A     R +  A+ +F+EM+     P     N L+RGF   G +D+A  +   M 
Subjt:  SSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLE-ADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMS

Query:  DLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEV
              N  TY  L+D +    +++D + ++  M  KG E N   Y+ VI      G  K+   ++ E+ + G S+D+  YN++I  + K G   +AL +
Subjt:  DLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEV

Query:  FKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQ
           M + G+ P++ T+ SLI   CKAGN+  A+E    M+ +G+ P+ + + TL+    ++G  +   + L  M   G   S + Y  L++ +   G+ +
Subjt:  FKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQ

Query:  DAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIR
        DA   +  +K  GL P   ++  + + F +    +E ++V + M  +GI+P+ +  + LI  F    R  EA  +Y  ++ VG+ PD  TYT L+ A+  
Subjt:  DAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIR

Query:  AKKFSKVPEIYKEMENAGCTPD
             K  +++ EM   G  PD
Subjt:  AKKFSKVPEIYKEMENAGCTPD

AT5G42310.1 Pentatricopeptide repeat (PPR-like) superfamily protein7.4e-9639.28Show/hide
Query:  KDEEELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDV
        +D+ EL   L N +I  + K G+   A+ LL   ++ G  A  A+   +I AL + GRTLEA+ +F+E+   G KPRT   NALL+G+++ G L  A  +
Subjt:  KDEEELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESVGRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDV

Query:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLS
        +  M    +  ++ TY +L+D + NAGR E    ++ EM+    + NSFV+S+++  +++ G W+K   ++ E++  G+  D+  YN +IDTFGK+  L 
Subjt:  LVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELNSFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLS

Query:  EALEVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQ
         A+  F RM  +G+ P+  TWN+LI  +CK G    A E+F  M+ +G  P    +  +I+S G+Q +WD +K+ L  MK +G   + + +  LVD+YG+
Subjt:  EALEVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFVTLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQ

Query:  YGQFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLM
         G+F DA +C+  +KS GL PS++ +  + NA++Q+GL E+ V   ++M ++G++P+L+ LN LINAF    R +EA A+  ++ E G+ PDV+TYTTLM
Subjt:  YGQFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINAFAVAGRHSEALAIYHHIIEVGISPDVITYTTLM

Query:  KAFIRAKKFSKVPEIYKEMENAGCTPDRKAREMLKSVTVVLEQ
        KA IR  KF KVP +Y+EM  +GC PDRKAR ML+S    ++Q
Subjt:  KAFIRAKKFSKVPEIYKEMENAGCTPDRKAREMLKSVTVVLEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATGCAATCTATTTTCTTGCCAAAAACAGTCATCGTATCCTCATTTGGTGGAGTATTTTCTGACCTTTTGTTGCATCCATGTTCTAGTAAATGTGATGGAAAGTA
CATGTTTGATGATGCGGCATTAAAGTTGTTTAGGAATAATGCCCTTAAAAAGGCCAGTAAAGCAGCGTTAGATGATAATTGCATTATTAGCACGAGGTGGCATGGATGTA
AAGATGAAGAGGAACTATCGAGTGACTTGTGTAACTGTTTGATTCGTGATTATTGTAAGGTAGGTAATGTTGATTCTGCCATGTCTCTTCTTGCTCATATGGAGTCTGTT
GGTCGTCATGCCTCTGTAGCATCTTACGCATATTTGATTGAAGCTCTTGGAAACTTAGGTAGGACTTTGGAAGCTGATATCATATTTCAAGAAATGATTAGTTTTGGTTG
TAAGCCGAGAACAATTGTCTGCAATGCACTACTAAGAGGGTTTTTGAGAAAAGGCCTTTTAGATCTTGCCTCTGACGTTCTTGTGTTAATGAGTGATTTAGATATTCAAA
AAAATCAAGAAACTTATGAAATTCTCTTGGATTATCATGCCAATGCTGGACGGTTGGAAGATACTTGGTCTATTATTAATGAGATGAAACGAAAAGGTTTTGAGCTGAAC
TCATTTGTGTATAGTAAGGTTATCGTTATATATCAAAACAATGGCATGTGGAAGAAAGCAGTGGGAATTGTTGATGAGATAAGAAAATCAGGGATTTCTGTGGACAAACA
CATTTATAACAGCATCATAGATACATTTGGAAAATATGGTCAATTGTCCGAGGCCTTAGAAGTGTTCAAAAGAATGCAACAGGATGGTGTAATGCCTAATATAACAACTT
GGAATTCACTGATACAATGGAACTGTAAAGCTGGGAACCTTGCTACTGCCCTTGAGTTATTCACGGACATGCAAGAACAGGGAATGCATCCAGATCCTAAGATCTTCGTT
ACTCTAATAAGCTCCTTGGGTGAGCAGGGAAAGTGGGATGTGATAAAGCAGAATCTTGATAGTATGAAGCTCAGAGGGCATAAGAATAGTGGCCTAGTTTATGAAATCTT
GGTAGATATTTATGGGCAGTATGGTCAATTTCAGGATGCTGAGAAGTGTATATCTGCTCTAAAGTCTGCAGGCCTTCTACCATCCGCTAGCAATTTTTGCATTATAGCAA
ATGCTTTTTCTCAACAGGGGTTGTGTGAAGAGACTGTAAAAGTGCTTCAGCTCATGGAGGCAGAAGGAATTGAACCAAATCTTGTAATCCTGAATGTACTGATCAATGCA
TTTGCTGTTGCTGGTAGGCATTCGGAGGCATTAGCAATTTATCATCATATAATTGAAGTTGGTATCAGTCCTGACGTTATAACCTACACCACCCTTATGAAGGCATTTAT
TCGTGCAAAGAAGTTTAGTAAGGTTCCTGAAATATATAAAGAAATGGAAAATGCTGGTTGCACGCCAGATAGGAAAGCCAGAGAGATGTTAAAGTCCGTAACAGTGGTTC
TTGAACAGAGGCATTTTTCTCAGCCAGAGCAGTCCTTGCATGATTGTTCTATAATTCTTCACCCTCTAGGTGATGGTAAATTTAGAATGTCAAGTCATAATTTGAAAAGT
TCATACCTGGAGAGTGGTGCAGAAGGATTCTTCTTGAAAAATATTGCATCATCAGGCAACAAGTCTTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAATGCAATCTATTTTCTTGCCAAAAACAGTCATCGTATCCTCATTTGGTGGAGTATTTTCTGACCTTTTGTTGCATCCATGTTCTAGTAAATGTGATGGAAAGTA
CATGTTTGATGATGCGGCATTAAAGTTGTTTAGGAATAATGCCCTTAAAAAGGCCAGTAAAGCAGCGTTAGATGATAATTGCATTATTAGCACGAGGTGGCATGGATGTA
AAGATGAAGAGGAACTATCGAGTGACTTGTGTAACTGTTTGATTCGTGATTATTGTAAGGTAGGTAATGTTGATTCTGCCATGTCTCTTCTTGCTCATATGGAGTCTGTT
GGTCGTCATGCCTCTGTAGCATCTTACGCATATTTGATTGAAGCTCTTGGAAACTTAGGTAGGACTTTGGAAGCTGATATCATATTTCAAGAAATGATTAGTTTTGGTTG
TAAGCCGAGAACAATTGTCTGCAATGCACTACTAAGAGGGTTTTTGAGAAAAGGCCTTTTAGATCTTGCCTCTGACGTTCTTGTGTTAATGAGTGATTTAGATATTCAAA
AAAATCAAGAAACTTATGAAATTCTCTTGGATTATCATGCCAATGCTGGACGGTTGGAAGATACTTGGTCTATTATTAATGAGATGAAACGAAAAGGTTTTGAGCTGAAC
TCATTTGTGTATAGTAAGGTTATCGTTATATATCAAAACAATGGCATGTGGAAGAAAGCAGTGGGAATTGTTGATGAGATAAGAAAATCAGGGATTTCTGTGGACAAACA
CATTTATAACAGCATCATAGATACATTTGGAAAATATGGTCAATTGTCCGAGGCCTTAGAAGTGTTCAAAAGAATGCAACAGGATGGTGTAATGCCTAATATAACAACTT
GGAATTCACTGATACAATGGAACTGTAAAGCTGGGAACCTTGCTACTGCCCTTGAGTTATTCACGGACATGCAAGAACAGGGAATGCATCCAGATCCTAAGATCTTCGTT
ACTCTAATAAGCTCCTTGGGTGAGCAGGGAAAGTGGGATGTGATAAAGCAGAATCTTGATAGTATGAAGCTCAGAGGGCATAAGAATAGTGGCCTAGTTTATGAAATCTT
GGTAGATATTTATGGGCAGTATGGTCAATTTCAGGATGCTGAGAAGTGTATATCTGCTCTAAAGTCTGCAGGCCTTCTACCATCCGCTAGCAATTTTTGCATTATAGCAA
ATGCTTTTTCTCAACAGGGGTTGTGTGAAGAGACTGTAAAAGTGCTTCAGCTCATGGAGGCAGAAGGAATTGAACCAAATCTTGTAATCCTGAATGTACTGATCAATGCA
TTTGCTGTTGCTGGTAGGCATTCGGAGGCATTAGCAATTTATCATCATATAATTGAAGTTGGTATCAGTCCTGACGTTATAACCTACACCACCCTTATGAAGGCATTTAT
TCGTGCAAAGAAGTTTAGTAAGGTTCCTGAAATATATAAAGAAATGGAAAATGCTGGTTGCACGCCAGATAGGAAAGCCAGAGAGATGTTAAAGTCCGTAACAGTGGTTC
TTGAACAGAGGCATTTTTCTCAGCCAGAGCAGTCCTTGCATGATTGTTCTATAATTCTTCACCCTCTAGGTGATGGTAAATTTAGAATGTCAAGTCATAATTTGAAAAGT
TCATACCTGGAGAGTGGTGCAGAAGGATTCTTCTTGAAAAATATTGCATCATCAGGCAACAAGTCTTACTGA
Protein sequenceShow/hide protein sequence
MKMQSIFLPKTVIVSSFGGVFSDLLLHPCSSKCDGKYMFDDAALKLFRNNALKKASKAALDDNCIISTRWHGCKDEEELSSDLCNCLIRDYCKVGNVDSAMSLLAHMESV
GRHASVASYAYLIEALGNLGRTLEADIIFQEMISFGCKPRTIVCNALLRGFLRKGLLDLASDVLVLMSDLDIQKNQETYEILLDYHANAGRLEDTWSIINEMKRKGFELN
SFVYSKVIVIYQNNGMWKKAVGIVDEIRKSGISVDKHIYNSIIDTFGKYGQLSEALEVFKRMQQDGVMPNITTWNSLIQWNCKAGNLATALELFTDMQEQGMHPDPKIFV
TLISSLGEQGKWDVIKQNLDSMKLRGHKNSGLVYEILVDIYGQYGQFQDAEKCISALKSAGLLPSASNFCIIANAFSQQGLCEETVKVLQLMEAEGIEPNLVILNVLINA
FAVAGRHSEALAIYHHIIEVGISPDVITYTTLMKAFIRAKKFSKVPEIYKEMENAGCTPDRKAREMLKSVTVVLEQRHFSQPEQSLHDCSIILHPLGDGKFRMSSHNLKS
SYLESGAEGFFLKNIASSGNKSY