; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G00520 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G00520
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr7:578702..580735
RNA-Seq ExpressionCSPI07G00520
SyntenyCSPI07G00520
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049682.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]6.1e-29395.35Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRLWISAVHQFFPIN H+Y SKPKFLSTKHQ LSLLKHCSSTNHLFEIHAQILVSG QNDSF TTELLRVAALSPSRNLSYGCSLLFHCHFHSAT+PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGYSSSDSP+EAISLFGEMRRRGV PNNLTFPFLLKACATLATLQEGKQFHAI IKCGLDLDVYVRNTLI+FYGSCKRMSGARKVFDEMTERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NA+ITACVENF FDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCAR VFNCLKQKSVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFANEAIELFTNM SSPIVPN+VTF+GVLCACSHAGLVDKSYHYFN+MERVYGIKPMMIHYG MVDVLGRAGQVKEAYELIMSMPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        P+VWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAE+GMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGF+SRAA DGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQLTNF
Subjt:  YDLLDGLNLHMQLTNF

XP_004148551.1 pentatricopeptide repeat-containing protein At2g36730 isoform X1 [Cucumis sativus]2.4e-30599.61Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLL HCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
        FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLI FYGSCKRMSGARKVFDEMTERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQLTNF
Subjt:  YDLLDGLNLHMQLTNF

XP_008448023.1 PREDICTED: pentatricopeptide repeat-containing protein At2g36730 [Cucumis melo]6.1e-29395.35Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRLWISAVHQFFPIN H+Y SKPKFLSTKHQ LSLLKHCSSTNHLFEIHAQILVSG QNDSF TTELLRVAALSPSRNLSYGCSLLFHCHFHSAT+PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGYSSSDSP+EAISLFGEMRRRGV PNNLTFPFLLKACATLATLQEGKQFHAI IKCGLDLDVYVRNTLI+FYGSCKRMSGARKVFDEMTERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENF FDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLN+QLGTAFVDMYAKSGDVGCAR VFNCLKQKSVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFANEAIELFTNM SSPIVPN+VTF+GVLCACSHAGLVDKSYHYFN+MERVYGIKPMMIHYG MVDVLGRAGQVKEAYELIMSMPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        P+VWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAE+GMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGF+SRAA DGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQLTNF
Subjt:  YDLLDGLNLHMQLTNF

XP_038903330.1 pentatricopeptide repeat-containing protein At2g36730 isoform X3 [Benincasa hispida]7.5e-27590.06Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRL ISA+HQ FP + HNY+S   FLS KHQ LSLL  CSSTNHLFEIHAQILVSGLQND F +TELLR+AALSPSRNLSYG SLLFHCHFHSA +PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGY+SSDSPQEAI +FGEMRRRG+RPNNLTFPFLLKACATLATLQEGKQFHA+AIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEM+ERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENFCFDEAID+FLKMG HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCAR VFNCLKQ+SVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHG+ANEAIELFT+MMSS +VPN+VTFIGVLCACSHA LVDKSYHYFN+MERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMP+EPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        PIVWRTLLSAC+GRDV+GGA+VAEEARKRLLELEPKRGGNVVMVANKFAE+GMWKQAAD RR MKDRGIKKMAGESCIELGGSLRKFFSGFD+ AA DGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQL
        YDLLDGLNLHMQ+
Subjt:  YDLLDGLNLHMQL

XP_038903331.1 pentatricopeptide repeat-containing protein At2g36730 isoform X4 [Benincasa hispida]4.4e-27589.71Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRL ISA+HQ FP + HNY+S   FLS KHQ LSLL  CSSTNHLFEIHAQILVSGLQND F +TELLR+AALSPSRNLSYG SLLFHCHFHSA +PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGY+SSDSPQEAI +FGEMRRRG+RPNNLTFPFLLKACATLATLQEGKQFHA+AIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEM+ERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENFCFDEAID+FLKMG HGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAF+DMYAKSGDVGCAR VFNCLKQ+SVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHG+ANEAIELFT+MMSS +VPN+VTFIGVLCACSHA LVDKSYHYFN+MERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMP+EPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        PIVWRTLLSAC+GRDV+GGA+VAEEARKRLLELEPKRGGNVVMVANKFAE+GMWKQAAD RR MKDRGIKKMAGESCIELGGSLRKFFSGFD+ AA DGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTN
        YDLLDGLNLHMQ+ +
Subjt:  YDLLDGLNLHMQLTN

TrEMBL top hitse value%identityAlignment
A0A0A0K153 Uncharacterized protein1.2e-30599.61Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLL HCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
        FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLI FYGSCKRMSGARKVFDEMTERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQLTNF
Subjt:  YDLLDGLNLHMQLTNF

A0A1S3BI68 pentatricopeptide repeat-containing protein At2g367302.9e-29395.35Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRLWISAVHQFFPIN H+Y SKPKFLSTKHQ LSLLKHCSSTNHLFEIHAQILVSG QNDSF TTELLRVAALSPSRNLSYGCSLLFHCHFHSAT+PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGYSSSDSP+EAISLFGEMRRRGV PNNLTFPFLLKACATLATLQEGKQFHAI IKCGLDLDVYVRNTLI+FYGSCKRMSGARKVFDEMTERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENF FDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLN+QLGTAFVDMYAKSGDVGCAR VFNCLKQKSVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFANEAIELFTNM SSPIVPN+VTF+GVLCACSHAGLVDKSYHYFN+MERVYGIKPMMIHYG MVDVLGRAGQVKEAYELIMSMPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        P+VWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAE+GMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGF+SRAA DGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQLTNF
Subjt:  YDLLDGLNLHMQLTNF

A0A5D3CL98 Pentatricopeptide repeat-containing protein2.9e-29395.35Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRLWISAVHQFFPIN H+Y SKPKFLSTKHQ LSLLKHCSSTNHLFEIHAQILVSG QNDSF TTELLRVAALSPSRNLSYGCSLLFHCHFHSAT+PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGYSSSDSP+EAISLFGEMRRRGV PNNLTFPFLLKACATLATLQEGKQFHAI IKCGLDLDVYVRNTLI+FYGSCKRMSGARKVFDEMTERTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NA+ITACVENF FDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCAR VFNCLKQKSVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFANEAIELFTNM SSPIVPN+VTF+GVLCACSHAGLVDKSYHYFN+MERVYGIKPMMIHYG MVDVLGRAGQVKEAYELIMSMPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        P+VWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAE+GMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGF+SRAA DGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQLTNF
Subjt:  YDLLDGLNLHMQLTNF

A0A6J1F5Z2 pentatricopeptide repeat-containing protein At2g36730 isoform X15.2e-25885.08Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRL I AVHQ FP N HN  S   FLS KHQ LS++K CSS NHLF+IH+QI+VSGLQNDSF TTELLR AALSPSRNLSY  SLLFH + H + +PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGY+SSDSP+EAI +F EMRRRG+RPNNLTFPFL+KACATL TLQEGK+FHA AIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEM+ RTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENFCFD+AI+YFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCAR VFNCLKQ+SVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFA+EAIELFTNMMSS + PN+VTFIGVLCACSHAGLVDK YHYFN+MERVYGIKPMMIHYGSMVDVL RAG+VKEAYE IM MPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        PIVWRTLLSACSGRDV+GGA+V EEARKRLLELEPKRGGNVVMVAN FAE+GMWKQAAD RR MKD GIKKMAGESC+E+GGSL KFFSGFD RA   GI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQ+ NF
Subjt:  YDLLDGLNLHMQLTNF

A0A6J1KVA8 pentatricopeptide repeat-containing protein At2g36730-like isoform X15.8e-25784.88Show/hide
Query:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN
        MVRL ISAVHQ FP N HN  S   FLS KHQ LSL+K CSS NHLF+IH+QI+V GLQNDSF TTELLR AALSPSRNLSY  SLLFH + H + +PWN
Subjt:  MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWN

Query:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW
         IIRGY+SSDSP+EAI +F EMRRRG+RPN+LTFPFL+KACATL TLQEGK+FHA AIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEM+ RTLVSW
Subjt:  FIIRGYSSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSW

Query:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT
        NAVITACVENFCFDEAI+YFL+MGNHGFE DETTMVVILSACAELGNLSLGRWVHSQVV RGMVLNVQLGTA VDMYAKSGDVGCAR VFNCLKQ+SVWT
Subjt:  NAVITACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWT

Query:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD
        WSAMILGLAQHGFANEAIELFTNMMSS + PN+VTFIGVLCACSHAGLVDK YHYFN+MERVY IKPMMIHYGSMVDVL RAG+VKEAYE IM MPVEPD
Subjt:  WSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPD

Query:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI
        PIVWRTLLSACS RDV+GGA+V EEA+KRLLELEPKRGGNVVMVAN FAE+GMWKQAAD RR MKD GIKKMAGESC+E+GGSLRKFFSGFD RA  DGI
Subjt:  PIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGI

Query:  YDLLDGLNLHMQLTNF
        YDLLDGLNLHMQ+ NF
Subjt:  YDLLDGLNLHMQLTNF

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210654.3e-9237.63Show/hide
Query:  FLSTKHQLLSLLKHC---------SSTNHLFEIHAQILVSGLQ-NDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQE
        F  T   LL +++ C         SS   L +IHA  +  G+  +D+     L+      PS   +SY   +            WN +IRGY+   +   
Subjt:  FLSTKHQLLSLLKHC---------SSTNHLFEIHAQILVSGLQ-NDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQE

Query:  AISLFGEMRRRG-VRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCF
        A SL+ EMR  G V P+  T+PFL+KA  T+A ++ G+  H++ I+ G    +YV+N+L++ Y +C  ++ A KVFD+M E+ LV+WN+VI    EN   
Subjt:  AISLFGEMRRRG-VRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCF

Query:  DEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGF
        +EA+  + +M + G +PD  T+V +LSACA++G L+LG+ VH  ++  G+  N+      +D+YA+ G V  A+ +F+ +  K+  +W+++I+GLA +GF
Subjt:  DEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGF

Query:  ANEAIELFTNMMSSP-IVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACS
          EAIELF  M S+  ++P  +TF+G+L ACSH G+V + + YF  M   Y I+P + H+G MVD+L RAGQVK+AYE I SMP++P+ ++WRTLL AC+
Subjt:  ANEAIELFTNMMSSP-IVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACS

Query:  GRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIY
           V+G +++AE AR ++L+LEP   G+ V+++N +A    W      R+ M   G+KK+ G S +E+G  + +F  G  S    D IY
Subjt:  GRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIY

Q8LK93 Pentatricopeptide repeat-containing protein At2g02980, chloroplastic1.6e-9436.08Show/hide
Query:  SAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGY
        S +  F         SK   ++T++ +L L+  C+S   L +I A  + S ++ D  F  +L+     SP+ +       LF        + +N + RGY
Subjt:  SAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGY

Query:  SSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITA
        S   +P E  SLF E+   G+ P+N TFP LLKACA    L+EG+Q H +++K GLD +VYV  TLIN Y  C+ +  AR VFD + E  +V +NA+IT 
Subjt:  SSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITA

Query:  CVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMIL
               +EA+  F +M     +P+E T++ +LS+CA LG+L LG+W+H           V++ TA +DM+AK G +  A  +F  ++ K    WSAMI+
Subjt:  CVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMIL

Query:  GLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRT
          A HG A +++ +F  M S  + P+ +TF+G+L ACSH G V++   YF+ M   +GI P + HYGSMVD+L RAG +++AYE I  +P+ P P++WR 
Subjt:  GLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRT

Query:  LLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLDG
        LL+ACS    +   ++AE+  +R+ EL+   GG+ V+++N +A    W+     R+ MKDR   K+ G S IE+   + +FFSG   ++A   ++  LD 
Subjt:  LLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLDG

Query:  LNLHMQLTNF
        +   ++L+ +
Subjt:  LNLHMQLTNF

Q9CA54 Pentatricopeptide repeat-containing protein At1g746301.9e-9236.07Show/hide
Query:  HQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEMRRRG-VRP
        H  LSLL  C +   L +IH   +  G+  DS+FT +L+   A+S S  L Y   LL  C        +N ++RGYS SD P  ++++F EM R+G V P
Subjt:  HQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEMRRRG-VRP

Query:  NNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITAC----------------------
        ++ +F F++KA     +L+ G Q H  A+K GL+  ++V  TLI  YG C  +  ARKVFDEM +  LV+WNAVITAC                      
Subjt:  NNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITAC----------------------

Query:  ----------------------------------------VENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV
                                                  N  F+E+  YF ++   G  P+E ++  +LSAC++ G+   G+ +H  V   G    V
Subjt:  ----------------------------------------VENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV

Query:  QLGTAFVDMYAKSGDVGCARHVFNCLKQK-SVWTWSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIK
         +  A +DMY++ G+V  AR VF  +++K  + +W++MI GLA HG   EA+ LF  M +  + P+ ++FI +L ACSHAGL+++   YF+ M+RVY I+
Subjt:  QLGTAFVDMYAKSGDVGCARHVFNCLKQK-SVWTWSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIK

Query:  PMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKD
        P + HYG MVD+ GR+G++++AY+ I  MP+ P  IVWRTLL ACS    +G  E+AE+ ++RL EL+P   G++V+++N +A  G WK  A  R++M  
Subjt:  PMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKD

Query:  RGIKKMAGESCIELGGSLRKFFSG
        + IKK    S +E+G ++ KF +G
Subjt:  RGIKKMAGESCIELGGSLRKFFSG

Q9SJG6 Pentatricopeptide repeat-containing protein At2g42920, chloroplastic1.1e-8737.18Show/hide
Query:  CSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEM--RRRGVRPNNLTFPF
        CS+   L +IHA ++ +GL +D+   + +L     SPS  N +Y   L+F    H     WN IIRG+S S  P+ AIS+F +M      V+P  LT+P 
Subjt:  CSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEM--RRRGVRPNNLTFPF

Query:  LLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFY-------------------------------GSCKRMSGARKVFDEMTERTLVSWNAVIT
        + KA   L   ++G+Q H + IK GL+ D ++RNT+++ Y                                 C  +  A+ +FDEM +R  VSWN++I+
Subjt:  LLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFY-------------------------------GSCKRMSGARKVFDEMTERTLVSWNAVIT

Query:  ACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMI
          V N  F +A+D F +M     +PD  TMV +L+ACA LG    GRW+H  +V     LN  + TA +DMY K G +    +VF C  +K +  W++MI
Subjt:  ACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMI

Query:  LGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWR
        LGLA +GF   A++LF+ +  S + P+ V+FIGVL AC+H+G V ++  +F LM+  Y I+P + HY  MV+VLG AG ++EA  LI +MPVE D ++W 
Subjt:  LGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWR

Query:  TLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD
        +LLSAC  R + G  E+A+ A K L +L+P      V+++N +A  G++++A + R  MK+R ++K  G S IE+   + +F S   +      IY LLD
Subjt:  TLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD

Query:  GLN
         LN
Subjt:  GLN

Q9ZQA1 Pentatricopeptide repeat-containing protein At2g367301.1e-15957.91Show/hide
Query:  YSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLF
        +SS   F S KHQ L  LK CSS  HL +IH QI +S LQNDSF  +EL+RV++LS +++L++  +LL H    S    WN + RGYSSSDSP E+I ++
Subjt:  YSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLF

Query:  GEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDY
         EM+RRG++PN LTFPFLLKACA+   L  G+Q     +K G D DVYV N LI+ YG+CK+ S ARKVFDEMTER +VSWN+++TA VEN   +   + 
Subjt:  GEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDY

Query:  FLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGFANEAIE
        F +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+ R + LN +LGTA VDMYAKSG +  AR VF  +  K+VWTWSAMI+GLAQ+GFA EA++
Subjt:  FLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGFANEAIE

Query:  LFTNMM-SSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNG
        LF+ MM  S + PN+VTF+GVLCACSH GLVD  Y YF+ ME+++ IKPMMIHYG+MVD+LGRAG++ EAY+ I  MP EPD +VWRTLLSACS      
Subjt:  LFTNMM-SSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNG

Query:  GAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD
           + E+ +KRL+ELEPKR GN+V+VAN+FAE  MW +AA+ RR MK+  +KK+AGESC+ELGGS  +FFSG+D R+    IY+LLD
Subjt:  GAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD

Arabidopsis top hitse value%identityAlignment
AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-9336.07Show/hide
Query:  HQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEMRRRG-VRP
        H  LSLL  C +   L +IH   +  G+  DS+FT +L+   A+S S  L Y   LL  C        +N ++RGYS SD P  ++++F EM R+G V P
Subjt:  HQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEMRRRG-VRP

Query:  NNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITAC----------------------
        ++ +F F++KA     +L+ G Q H  A+K GL+  ++V  TLI  YG C  +  ARKVFDEM +  LV+WNAVITAC                      
Subjt:  NNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITAC----------------------

Query:  ----------------------------------------VENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV
                                                  N  F+E+  YF ++   G  P+E ++  +LSAC++ G+   G+ +H  V   G    V
Subjt:  ----------------------------------------VENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNV

Query:  QLGTAFVDMYAKSGDVGCARHVFNCLKQK-SVWTWSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIK
         +  A +DMY++ G+V  AR VF  +++K  + +W++MI GLA HG   EA+ LF  M +  + P+ ++FI +L ACSHAGL+++   YF+ M+RVY I+
Subjt:  QLGTAFVDMYAKSGDVGCARHVFNCLKQK-SVWTWSAMILGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIK

Query:  PMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKD
        P + HYG MVD+ GR+G++++AY+ I  MP+ P  IVWRTLL ACS    +G  E+AE+ ++RL EL+P   G++V+++N +A  G WK  A  R++M  
Subjt:  PMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKD

Query:  RGIKKMAGESCIELGGSLRKFFSG
        + IKK    S +E+G ++ KF +G
Subjt:  RGIKKMAGESCIELGGSLRKFFSG

AT2G02980.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-9536.08Show/hide
Query:  SAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGY
        S +  F         SK   ++T++ +L L+  C+S   L +I A  + S ++ D  F  +L+     SP+ +       LF        + +N + RGY
Subjt:  SAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGY

Query:  SSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITA
        S   +P E  SLF E+   G+ P+N TFP LLKACA    L+EG+Q H +++K GLD +VYV  TLIN Y  C+ +  AR VFD + E  +V +NA+IT 
Subjt:  SSSDSPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITA

Query:  CVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMIL
               +EA+  F +M     +P+E T++ +LS+CA LG+L LG+W+H           V++ TA +DM+AK G +  A  +F  ++ K    WSAMI+
Subjt:  CVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMIL

Query:  GLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRT
          A HG A +++ +F  M S  + P+ +TF+G+L ACSH G V++   YF+ M   +GI P + HYGSMVD+L RAG +++AYE I  +P+ P P++WR 
Subjt:  GLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRT

Query:  LLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLDG
        LL+ACS    +   ++AE+  +R+ EL+   GG+ V+++N +A    W+     R+ MKDR   K+ G S IE+   + +FFSG   ++A   ++  LD 
Subjt:  LLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLDG

Query:  LNLHMQLTNF
        +   ++L+ +
Subjt:  LNLHMQLTNF

AT2G36730.1 Pentatricopeptide repeat (PPR) superfamily protein7.7e-16157.91Show/hide
Query:  YSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLF
        +SS   F S KHQ L  LK CSS  HL +IH QI +S LQNDSF  +EL+RV++LS +++L++  +LL H    S    WN + RGYSSSDSP E+I ++
Subjt:  YSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLF

Query:  GEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDY
         EM+RRG++PN LTFPFLLKACA+   L  G+Q     +K G D DVYV N LI+ YG+CK+ S ARKVFDEMTER +VSWN+++TA VEN   +   + 
Subjt:  GEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDY

Query:  FLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGFANEAIE
        F +M    F PDETTMVV+LSAC   GNLSLG+ VHSQV+ R + LN +LGTA VDMYAKSG +  AR VF  +  K+VWTWSAMI+GLAQ+GFA EA++
Subjt:  FLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGFANEAIE

Query:  LFTNMM-SSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNG
        LF+ MM  S + PN+VTF+GVLCACSH GLVD  Y YF+ ME+++ IKPMMIHYG+MVD+LGRAG++ EAY+ I  MP EPD +VWRTLLSACS      
Subjt:  LFTNMM-SSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNG

Query:  GAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD
           + E+ +KRL+ELEPKR GN+V+VAN+FAE  MW +AA+ RR MK+  +KK+AGESC+ELGGS  +FFSG+D R+    IY+LLD
Subjt:  GAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD

AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein7.9e-8937.18Show/hide
Query:  CSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEM--RRRGVRPNNLTFPF
        CS+   L +IHA ++ +GL +D+   + +L     SPS  N +Y   L+F    H     WN IIRG+S S  P+ AIS+F +M      V+P  LT+P 
Subjt:  CSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQEAISLFGEM--RRRGVRPNNLTFPF

Query:  LLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFY-------------------------------GSCKRMSGARKVFDEMTERTLVSWNAVIT
        + KA   L   ++G+Q H + IK GL+ D ++RNT+++ Y                                 C  +  A+ +FDEM +R  VSWN++I+
Subjt:  LLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFY-------------------------------GSCKRMSGARKVFDEMTERTLVSWNAVIT

Query:  ACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMI
          V N  F +A+D F +M     +PD  TMV +L+ACA LG    GRW+H  +V     LN  + TA +DMY K G +    +VF C  +K +  W++MI
Subjt:  ACVENFCFDEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMI

Query:  LGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWR
        LGLA +GF   A++LF+ +  S + P+ V+FIGVL AC+H+G V ++  +F LM+  Y I+P + HY  MV+VLG AG ++EA  LI +MPVE D ++W 
Subjt:  LGLAQHGFANEAIELFTNMMSSPIVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWR

Query:  TLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD
        +LLSAC  R + G  E+A+ A K L +L+P      V+++N +A  G++++A + R  MK+R ++K  G S IE+   + +F S   +      IY LLD
Subjt:  TLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLD

Query:  GLN
         LN
Subjt:  GLN

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-9337.63Show/hide
Query:  FLSTKHQLLSLLKHC---------SSTNHLFEIHAQILVSGLQ-NDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQE
        F  T   LL +++ C         SS   L +IHA  +  G+  +D+     L+      PS   +SY   +            WN +IRGY+   +   
Subjt:  FLSTKHQLLSLLKHC---------SSTNHLFEIHAQILVSGLQ-NDSFFTTELLRVAALSPS-RNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSDSPQE

Query:  AISLFGEMRRRG-VRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCF
        A SL+ EMR  G V P+  T+PFL+KA  T+A ++ G+  H++ I+ G    +YV+N+L++ Y +C  ++ A KVFD+M E+ LV+WN+VI    EN   
Subjt:  AISLFGEMRRRG-VRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCF

Query:  DEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGF
        +EA+  + +M + G +PD  T+V +LSACA++G L+LG+ VH  ++  G+  N+      +D+YA+ G V  A+ +F+ +  K+  +W+++I+GLA +GF
Subjt:  DEAIDYFLKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGF

Query:  ANEAIELFTNMMSSP-IVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACS
          EAIELF  M S+  ++P  +TF+G+L ACSH G+V + + YF  M   Y I+P + H+G MVD+L RAGQVK+AYE I SMP++P+ ++WRTLL AC+
Subjt:  ANEAIELFTNMMSSP-IVPNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACS

Query:  GRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIY
           V+G +++AE AR ++L+LEP   G+ V+++N +A    W      R+ M   G+KK+ G S +E+G  + +F  G  S    D IY
Subjt:  GRDVNGGAEVAEEARKRLLELEPKRGGNVVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGACTCTGGATCTCTGCCGTCCATCAATTTTTTCCTATTAACGTCCACAACTATAGTTCAAAACCCAAATTTCTCTCTACAAAGCATCAACTCCTTTCCCTTCT
CAAGCACTGTTCCTCAACGAATCACCTTTTTGAAATCCATGCACAAATCCTCGTCTCTGGCCTTCAAAATGACTCATTTTTCACCACGGAACTCCTCCGCGTCGCTGCTC
TATCACCCTCCAGAAATCTCAGCTATGGCTGCTCTCTCCTCTTCCATTGCCACTTTCATTCCGCCACTATGCCATGGAATTTTATCATTAGGGGATATTCCTCGAGTGAT
TCTCCGCAAGAGGCCATTTCGTTATTCGGGGAAATGCGAAGACGTGGAGTCAGACCCAATAACCTCACTTTCCCTTTCCTTCTCAAGGCCTGTGCAACACTTGCGACCCT
CCAAGAAGGTAAGCAGTTTCATGCTATTGCCATAAAGTGTGGTTTAGATTTAGATGTATATGTTCGGAATACTCTGATTAATTTCTATGGGTCCTGCAAAAGAATGTCTG
GTGCACGAAAGGTGTTCGATGAAATGACAGAAAGAACTTTAGTTTCATGGAATGCGGTTATTACAGCGTGTGTTGAGAATTTTTGCTTTGATGAAGCGATTGACTACTTT
CTGAAAATGGGAAACCATGGTTTTGAGCCAGATGAGACTACGATGGTGGTGATATTATCAGCTTGTGCGGAGCTTGGCAATTTGAGCTTAGGGAGATGGGTTCATTCTCA
AGTGGTGGGAAGAGGGATGGTTTTGAATGTTCAATTGGGCACTGCCTTCGTCGACATGTATGCAAAATCCGGCGATGTGGGATGTGCTAGGCATGTATTCAATTGTTTGA
AACAGAAAAGTGTATGGACATGGAGTGCAATGATTTTGGGTCTAGCCCAACATGGATTTGCGAATGAAGCTATTGAACTCTTCACAAATATGATGAGCTCCCCTATAGTT
CCCAACCATGTCACCTTCATTGGTGTCCTATGTGCTTGCAGCCATGCTGGATTGGTGGATAAAAGCTACCACTACTTCAACCTTATGGAGAGAGTTTACGGGATAAAGCC
GATGATGATACATTACGGATCAATGGTGGATGTTTTAGGTCGTGCAGGTCAAGTCAAGGAAGCCTATGAGCTCATCATGAGCATGCCTGTGGAGCCTGATCCAATTGTGT
GGAGGACATTGCTGAGTGCGTGCAGTGGTCGTGACGTCAATGGTGGGGCTGAGGTTGCAGAGGAGGCAAGGAAGAGATTGCTTGAGCTCGAGCCAAAGAGAGGTGGGAAT
GTGGTGATGGTTGCGAACAAGTTTGCTGAACTTGGTATGTGGAAACAAGCAGCTGATTACCGGAGAACGATGAAAGATAGGGGAATAAAAAAGATGGCTGGGGAGAGTTG
CATCGAATTAGGTGGCTCTTTGCGTAAATTCTTTTCAGGCTTTGATTCTCGAGCTGCTCCTGATGGCATTTACGATTTGCTTGATGGATTGAATCTGCATATGCAATTGA
CAAACTTTTAA
mRNA sequenceShow/hide mRNA sequence
CCGTTGTTAGTGTCTCTAAAATGTCATTTCTCATTGCTCATTCCTGCAAAGAGTATCTTTGTGTTATGTGCTATGAATTGTCGCGTTCTAGATATGGGATGCATTTTTAG
AGCGTAGTTTAACGCTCAATTCACAACCATGTGCAACCATTGAGCTCGTTGAAATCCATAAGCCGCATGGTTCGACTCTGGATCTCTGCCGTCCATCAATTTTTTCCTAT
TAACGTCCACAACTATAGTTCAAAACCCAAATTTCTCTCTACAAAGCATCAACTCCTTTCCCTTCTCAAGCACTGTTCCTCAACGAATCACCTTTTTGAAATCCATGCAC
AAATCCTCGTCTCTGGCCTTCAAAATGACTCATTTTTCACCACGGAACTCCTCCGCGTCGCTGCTCTATCACCCTCCAGAAATCTCAGCTATGGCTGCTCTCTCCTCTTC
CATTGCCACTTTCATTCCGCCACTATGCCATGGAATTTTATCATTAGGGGATATTCCTCGAGTGATTCTCCGCAAGAGGCCATTTCGTTATTCGGGGAAATGCGAAGACG
TGGAGTCAGACCCAATAACCTCACTTTCCCTTTCCTTCTCAAGGCCTGTGCAACACTTGCGACCCTCCAAGAAGGTAAGCAGTTTCATGCTATTGCCATAAAGTGTGGTT
TAGATTTAGATGTATATGTTCGGAATACTCTGATTAATTTCTATGGGTCCTGCAAAAGAATGTCTGGTGCACGAAAGGTGTTCGATGAAATGACAGAAAGAACTTTAGTT
TCATGGAATGCGGTTATTACAGCGTGTGTTGAGAATTTTTGCTTTGATGAAGCGATTGACTACTTTCTGAAAATGGGAAACCATGGTTTTGAGCCAGATGAGACTACGAT
GGTGGTGATATTATCAGCTTGTGCGGAGCTTGGCAATTTGAGCTTAGGGAGATGGGTTCATTCTCAAGTGGTGGGAAGAGGGATGGTTTTGAATGTTCAATTGGGCACTG
CCTTCGTCGACATGTATGCAAAATCCGGCGATGTGGGATGTGCTAGGCATGTATTCAATTGTTTGAAACAGAAAAGTGTATGGACATGGAGTGCAATGATTTTGGGTCTA
GCCCAACATGGATTTGCGAATGAAGCTATTGAACTCTTCACAAATATGATGAGCTCCCCTATAGTTCCCAACCATGTCACCTTCATTGGTGTCCTATGTGCTTGCAGCCA
TGCTGGATTGGTGGATAAAAGCTACCACTACTTCAACCTTATGGAGAGAGTTTACGGGATAAAGCCGATGATGATACATTACGGATCAATGGTGGATGTTTTAGGTCGTG
CAGGTCAAGTCAAGGAAGCCTATGAGCTCATCATGAGCATGCCTGTGGAGCCTGATCCAATTGTGTGGAGGACATTGCTGAGTGCGTGCAGTGGTCGTGACGTCAATGGT
GGGGCTGAGGTTGCAGAGGAGGCAAGGAAGAGATTGCTTGAGCTCGAGCCAAAGAGAGGTGGGAATGTGGTGATGGTTGCGAACAAGTTTGCTGAACTTGGTATGTGGAA
ACAAGCAGCTGATTACCGGAGAACGATGAAAGATAGGGGAATAAAAAAGATGGCTGGGGAGAGTTGCATCGAATTAGGTGGCTCTTTGCGTAAATTCTTTTCAGGCTTTG
ATTCTCGAGCTGCTCCTGATGGCATTTACGATTTGCTTGATGGATTGAATCTGCATATGCAATTGACAAACTTTTAATGATGATTATATCTAAATGTTCATTTTTGGTTA
TCTAGGTTGCTTGACACAACCATTGATTATTTTGTACCAACTTGTAGGCTTCATGCTGTTCTCTATCTCTTCGCAGGGGTATAGGCTTCATTATTTTTCTTTTAGTCCCT
CATAGAATCTGCTCCTTTTGAGTATTAGGTTCCTAACCGTGTTTGTAACGGTGATGCTAAAAGTAGTGGGTCTAATATAATTAATGCTTAACCATGGTTAGATTAAGCAC
CTTTCATTTT
Protein sequenceShow/hide protein sequence
MVRLWISAVHQFFPINVHNYSSKPKFLSTKHQLLSLLKHCSSTNHLFEIHAQILVSGLQNDSFFTTELLRVAALSPSRNLSYGCSLLFHCHFHSATMPWNFIIRGYSSSD
SPQEAISLFGEMRRRGVRPNNLTFPFLLKACATLATLQEGKQFHAIAIKCGLDLDVYVRNTLINFYGSCKRMSGARKVFDEMTERTLVSWNAVITACVENFCFDEAIDYF
LKMGNHGFEPDETTMVVILSACAELGNLSLGRWVHSQVVGRGMVLNVQLGTAFVDMYAKSGDVGCARHVFNCLKQKSVWTWSAMILGLAQHGFANEAIELFTNMMSSPIV
PNHVTFIGVLCACSHAGLVDKSYHYFNLMERVYGIKPMMIHYGSMVDVLGRAGQVKEAYELIMSMPVEPDPIVWRTLLSACSGRDVNGGAEVAEEARKRLLELEPKRGGN
VVMVANKFAELGMWKQAADYRRTMKDRGIKKMAGESCIELGGSLRKFFSGFDSRAAPDGIYDLLDGLNLHMQLTNF