; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0017580 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0017580
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr12:21909373..21912480
RNA-Seq ExpressionPI0017580
SyntenyPI0017580
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051836.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0097.27Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKSTLRQ+VDLLCSRST T+EAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRD FSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQNLKATFDRMPFRDSVSYN+ IAGF+GNSCPQESL+LFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGS+IVRNFLGNVFIWN LTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMP+QVTMSTIIAAYCQ GRVDEARRV SEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHG+TPTLDHYACMVNLLGRTGRIEQA+SLIKNMAHEPDFLI STLLSI
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
        CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWK VASVRNLMKSKNVKKFAGFSWIEID EVHRFTSEDRTHPESENIYEELNILIGKLQE
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        EGFTPNT LVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

XP_004147314.1 putative pentatricopeptide repeat-containing protein At1g68930 [Cucumis sativus]0.0e+0096.55Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKS LRQ+VDLLCSRST T+EAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTD FLHNQLLHLYAKFGKLRDAQNLFDKMLKRD+FSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQNLKATFDRMPFRDSVSYN+ IAGF+GNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQL DLRYGKQIHGS+IVRNFLGNVFIWNALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQ GRVDEARRV SEFKEKDIVCWTAM+VGYAKN
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSI+NQHGMTPTLDHYACMVNLLGRTGRIEQA++LIKNMAH+PDFLI STLLSI
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
        CSTKGDIVNAE+AARHLFELDPT AVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESE+IYE+LN+LIGKLQE
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

XP_022145099.1 pentatricopeptide repeat-containing protein At4g02750-like [Momordica charantia]0.0e+0085.76Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        M+AK  LRQA+DLLCSR + ++EAYT L+LECVRTNE++QAKRLQSHMEHHLFQP DPFL NQLLHLYAKFGK+RDAQNLFDKML+RDVFSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQNL+ATFDRMPFRDSVSYN+ IAGFAGN CP+ESLELF+RMQ EGF PTEYT VS LNA+AQLLDLR GK+IHGSVIV  FLGN FIWNALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L  KNL+SWNLMISGY KNGQPEKCIGLLH+M++SGHMPDQVTMSTIIAAYCQ   VDEAR+V SEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEH++PDSYTLSSVVSSCAKLASL+HGQAVHGKSILAGL+NNLLVSSALIDMYSKCGF+D+ARSVFN+MPTRNV+SWNAMIVG AQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NGHDKDAL  FENMLQQKFKPDNVTFIG+LSACLH NWIE+GQ YFDSISNQHG+ PT+DHYACMVNLLGR GRI+QA+ LIK+M HEPD LI STLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
         + KGDI NAEMAAR+LFELDP +AVPY+MLSNMYA MGRWKDVASVR LMKSKNVKKFAG+SWIEIDN+VH+FTSEDRTHPE+E IYEELN+LI K QE
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        +GFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGL+KKPNG++PIRIIKNIRIC+DCHEFMKFASRII RQIILRDSNRFHHF+TGKCSC DNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

XP_022960689.1 pentatricopeptide repeat-containing protein At4g02750-like isoform X1 [Cucurbita moschata]0.0e+0085.76Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKS LRQAVDLLCSRST T+EAYTQLVLECVR NEI+QAKRLQSHMEHHLFQP DPFLHNQLLHLYAKFGKLRDAQNLFDKML+RDVFSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQ+L+ATFDRMP+RDSVSYN+ IAG +GNS P+ESLELF+RMQREG  PTEYT VS LNASAQLLDLR GKQIHGSVIV N+LGNVFI NALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD LT KNLVSWNLMISGY KNGQPEKCIGLLH MRLSGHMPDQVT+ST+IAAYCQ GR DEARRV +EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEH EPDSYTLSSVVSSCAKLASL+HGQA+HGKSILAGL+NNLLVSSALIDMYSKCG I+DARSVF++MPTRNV++WNAMIVG AQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NG DKD LELFENMLQ+KFKPDNVTF+G+LSACLH N+IEQGQ +FDSISNQHG+TP+LDHYACMVNLLGR+GRI+QA+ LIK+M HEPDFLI STLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
         +TKGD+ +AEM  RHLFELDPT+AVPYIMLSNMYASMGRWKDVA+VR++MK+KNVKKFAG+SWIEIDNEVH+FTSEDRTHPE+E IYEEL ILI KL+E
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        +GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGLIKK NG+SPIRIIKNIRIC+DCHEFMKFAS  I RQIILRDSNRFHHFS GKCSC DNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

XP_038895252.1 pentatricopeptide repeat-containing protein At2g22070-like [Benincasa hispida]0.0e+0093.38Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKS LRQA+DLLCS+ST T+EAYTQLVLECVR N+INQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKML+RDVFSWNA+LSA+A
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQNL+ATFD+MPFRDSVSYN+ IAGFAGNSCP+ESLELFKRMQREGFEPTEYTIVSILNAS QLLDLR GKQIHGSVIV NFLGNVFI NALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFDC T KNLVSWNLMISGYAKNG+PEKCIGLLH+MRLSGHMPDQVTMSTIIAAYCQ GRVD AR+V SEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDAL LFNEMLLEHIEPDSYTLSSVVSSCAKLA LHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NGHDKDALELFENMLQQKFKPDNVTFIG+LSACLHCNWIEQGQ YFDSISNQHG+TPTLDHYACMVNLLGRTGRI QA+SLIKNMAHEPDFLI STLLSI
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
         STKGD+VNAEMAA+HLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLM SKNVKKFAG+SWIEIDNEV RFTSEDRTHPE+E IYEELN+LIGKLQE
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRII RQIILRDSNRFHHFSTGKCSC DNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

TrEMBL top hitse value%identityAlignment
A0A0A0LUY3 DYW_deaminase domain-containing protein0.0e+0096.55Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKS LRQ+VDLLCSRST T+EAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTD FLHNQLLHLYAKFGKLRDAQNLFDKMLKRD+FSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQNLKATFDRMPFRDSVSYN+ IAGF+GNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQL DLRYGKQIHGS+IVRNFLGNVFIWNALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQ GRVDEARRV SEFKEKDIVCWTAM+VGYAKN
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSI+NQHGMTPTLDHYACMVNLLGRTGRIEQA++LIKNMAH+PDFLI STLLSI
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
        CSTKGDIVNAE+AARHLFELDPT AVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESE+IYE+LN+LIGKLQE
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

A0A5A7UC76 Pentatricopeptide repeat-containing protein0.0e+0097.27Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKSTLRQ+VDLLCSRST T+EAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRD FSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQNLKATFDRMPFRDSVSYN+ IAGF+GNSCPQESL+LFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGS+IVRNFLGNVFIWN LTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMP+QVTMSTIIAAYCQ GRVDEARRV SEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHG+TPTLDHYACMVNLLGRTGRIEQA+SLIKNMAHEPDFLI STLLSI
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
        CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWK VASVRNLMKSKNVKKFAGFSWIEID EVHRFTSEDRTHPESENIYEELNILIGKLQE
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        EGFTPNT LVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

A0A6J1CU81 pentatricopeptide repeat-containing protein At4g02750-like0.0e+0085.76Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        M+AK  LRQA+DLLCSR + ++EAYT L+LECVRTNE++QAKRLQSHMEHHLFQP DPFL NQLLHLYAKFGK+RDAQNLFDKML+RDVFSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQNL+ATFDRMPFRDSVSYN+ IAGFAGN CP+ESLELF+RMQ EGF PTEYT VS LNA+AQLLDLR GK+IHGSVIV  FLGN FIWNALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD L  KNL+SWNLMISGY KNGQPEKCIGLLH+M++SGHMPDQVTMSTIIAAYCQ   VDEAR+V SEFKEKDIVCWTAMLVGYAKN
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEH++PDSYTLSSVVSSCAKLASL+HGQAVHGKSILAGL+NNLLVSSALIDMYSKCGF+D+ARSVFN+MPTRNV+SWNAMIVG AQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NGHDKDAL  FENMLQQKFKPDNVTFIG+LSACLH NWIE+GQ YFDSISNQHG+ PT+DHYACMVNLLGR GRI+QA+ LIK+M HEPD LI STLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
         + KGDI NAEMAAR+LFELDP +AVPY+MLSNMYA MGRWKDVASVR LMKSKNVKKFAG+SWIEIDN+VH+FTSEDRTHPE+E IYEELN+LI K QE
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        +GFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGL+KKPNG++PIRIIKNIRIC+DCHEFMKFASRII RQIILRDSNRFHHF+TGKCSC DNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

A0A6J1HBT0 pentatricopeptide repeat-containing protein At4g02750-like isoform X10.0e+0085.76Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKS LRQAVDLLCSRST T+EAYTQLVLECVR NEI+QAKRLQSHMEHHLFQP DPFLHNQLLHLYAKFGKLRDAQNLFDKML+RDVFSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQ+L+ATFDRMP+RDSVSYN+ IAG +GNS P+ESLELF+RMQREG  PTEYT VS LNASAQLLDLR GKQIHGSVIV N+LGNVFI NALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIEQARWLFD LT KNLVSWNLMISGY KNGQPEKCIGLLH MRLSGHMPDQVT+ST+IAAYCQ GR DEARRV +EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEH EPDSYTLSSVVSSCAKLASL+HGQA+HGKSILAGL+NNLLVSSALIDMYSKCG I+DARSVF++MPTRNV++WNAMIVG AQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NG DKD LELFENMLQ+KFKPDNVTF+G+LSACLH N+IEQGQ +FDSISNQHG+TP+LDHYACMVNLLGR+GRI+QA+ LIK+M HEPDFLI STLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
         +TKGD+ +AEM  RHLFELDPT+AVPYIMLSNMYASMGRWKDVA+VR++MK+KNVKKFAG+SWIEIDNEVH+FTSEDRTHPE+E IYEEL ILI KL+E
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        +GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGLIKK NG+SPIRIIKNIRIC+DCHEFMKFAS  I RQIILRDSNRFHHFS GKCSC DNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

A0A6J1JAW4 pentatricopeptide repeat-containing protein At4g02750-like isoform X10.0e+0085.61Show/hide
Query:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA
        MKAKS LRQAV LLCSRST T+EAYTQLVLECVR NEI+QAKRLQSHMEHHLFQP DPFLHNQLLHLYAKFGKLRDAQNLFDKML+RDVFSWNALLSAYA
Subjt:  MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYA

Query:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY
        KSGSIQ+L+ATFDRMP+RDSVSYN+ IAG +GNS P+ESLELF+RMQREG EPTEYT VS LNASAQLLDLR GKQIHGSVIV N+LGNVFI NALTDMY
Subjt:  KSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMY

Query:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN
        AKCGEIE ARWLFD LT KNLVSWNLMISGY KNGQPEKCIGLLH+MRLSGHMPDQVT+ST+IAAYCQ GR DEARRV +EFK+KDIVCWTAMLVGYAK+
Subjt:  AKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKN

Query:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ
        GREEDALLLFNEMLLEH EPDSYT SSVVSSCAKLASL+HGQA+HGKSILAGL+NNLLVSSALIDMYSKCG IDDARSVF++MPTRNV++WNAMIVG AQ
Subjt:  GREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQ

Query:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI
        NG DKD LELFENMLQ+KFKPDNVTF+G+LSACLH N IEQGQ +FDSISNQHG+TP+LDHYACMVNLLGR+GRI+QA++LIK+M HEPDFLI STLLS+
Subjt:  NGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSI

Query:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE
         +TKGD+  AEMA RHLFELD T+AVPYIMLSNMYASMGRWKDVA+VR++MK+KNVKKFAG+SWIEIDNEVH+FTSEDRTHPE+E IYEEL ILI KL+E
Subjt:  CSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQE

Query:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        +GF PNTNLVLHDVGE+EK KSICFHSEKLAL FGLIKK NG+SPIRIIKNIRIC+DCHEFMKFAS  I RQIILRDSNRFHHFS GKCSC DNW
Subjt:  EGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.4e-13336.84Show/hide
Query:  FQPTDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLKR----DVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRM
        +   +P ++N   LL +     +LR  + +   ++K     D+F+   L + YAK   +   +  FDRMP RD VS+N+ +AG++ N   + +LE+ K M
Subjt:  FQPTDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLKR----DVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRM

Query:  QREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQ
          E  +P+  TIVS+L A + L  +  GK+IHG  +   F   V I  AL DMYAKCG +E AR LFD + ++N+VSWN MI  Y +N  P++ + +  +
Subjt:  QREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQ

Query:  MRLSGHMPDQVT-----------------------------------MSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFN
        M   G  P  V+                                   ++++I+ YC+   VD A  +  + + + +V W AM++G+A+NGR  DAL  F+
Subjt:  MRLSGHMPDQVT-----------------------------------MSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFN

Query:  EMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELF
        +M    ++PD++T  SV+++ A+L+  HH + +HG  + + L+ N+ V++AL+DMY+KCG I  AR +F++M  R+V +WNAMI G   +G  K ALELF
Subjt:  EMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELF

Query:  ENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAE
        E M +   KP+ VTF+ ++SAC H   +E G + F  +   + +  ++DHY  MV+LLGR GR+ +A   I  M  +P   +   +L  C    ++  AE
Subjt:  ENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAE

Query:  MAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVL
         AA  LFEL+P     +++L+N+Y +   W+ V  VR  M  + ++K  G S +EI NEVH F S    HP+S+ IY  L  LI  ++E G+ P+TNLVL
Subjt:  MAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVL

Query:  HDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
          V  D K + +  HSEKLA++FGL+    G + I + KN+R+C DCH   K+ S + GR+I++RD  RFHHF  G CSC D W
Subjt:  HDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g689302.9e-13935.76Show/hide
Query:  YTQLVLECVRTNEINQA---KRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSV
        Y+  + +C+     NQ+   K +  ++   L  P + FL+N ++H YA       A+ +FD++ + ++FSWN LL AY+K+G I  +++TF+++P RD V
Subjt:  YTQLVLECVRTNEINQA---KRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSV

Query:  SYNSAIAGFAGNSCPQESLELFKRMQRE-GFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAK------------------
        ++N  I G++ +     +++ +  M R+     T  T++++L  S+    +  GKQIHG VI   F   + + + L  MYA                   
Subjt:  SYNSAIAGFAGNSCPQESLELFKRMQRE-GFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAK------------------

Query:  -------------CGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAA------------------------
                     CG IE A  LF  + +K+ VSW  MI G A+NG  ++ I    +M++ G   DQ    +++ A                        
Subjt:  -------------CGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAA------------------------

Query:  -----------YCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLN
                   YC+   +  A+ V    K+K++V WTAM+VGY + GR E+A+ +F +M    I+PD YTL   +S+CA ++SL  G   HGK+I +GL 
Subjt:  -----------YCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLN

Query:  NNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHG
        + + VS++L+ +Y KCG IDD+  +FN M  R+ VSW AM+   AQ G   + ++LF+ M+Q   KPD VT  G++SAC     +E+GQ YF  +++++G
Subjt:  NNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHG

Query:  MTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSK
        + P++ HY+CM++L  R+GR+E+A+  I  M   PD +  +TLLS C  KG++   + AA  L ELDP     Y +LS++YAS G+W  VA +R  M+ K
Subjt:  MTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSK

Query:  NVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRI
        NVKK  G SWI+   ++H F+++D + P  + IY +L  L  K+ + G+ P+T+ V HDV E  K K + +HSE+LA+AFGLI  P+G  PIR+ KN+R+
Subjt:  NVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRI

Query:  CNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        C DCH   K  S + GR+I++RD+ RFH F  G CSC D W
Subjt:  CNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202304.0e-13335.77Show/hide
Query:  DPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPF----RDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFE
        D F+   + H+Y + G++ DA+ +FD+M  +DV + +ALL AYA+ G ++ +      M       + VS+N  ++GF  +   +E++ +F+++   GF 
Subjt:  DPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPF----RDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFE

Query:  PTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGH
        P + T+ S+L +      L  G+ IHG VI +  L +  + +A+ DMY K G +     LF+          N  I+G ++NG  +K + +    +    
Subjt:  PTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGH

Query:  MPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAG
                                    +  E ++V WT+++ G A+NG++ +AL LF EM +  ++P+  T+ S++ +C  +A+L HG++ HG ++   
Subjt:  MPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAG

Query:  LNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQ
        L +N+ V SALIDMY+KCG I+ ++ VFN+MPT+N+V WN+++ G + +G  K+ + +FE++++ + KPD ++F  +LSAC      ++G +YF  +S +
Subjt:  LNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQ

Query:  HGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMK
        +G+ P L+HY+CMVNLLGR G++++A  LIK M  EPD  +   LL+ C  + ++  AE+AA  LF L+P +   Y++LSN+YA+ G W +V S+RN M+
Subjt:  HGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMK

Query:  SKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNI
        S  +KK  G SWI++ N V+   + D++HP+ + I E+++ +  ++++ G  PN +  LHDV E E+ + +  HSEKLA+ FGL+  P+G +P+++IKN+
Subjt:  SKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNI

Query:  RICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        RIC DCH  +KF S   GR+I +RD+NRFHHF  G CSC D W
Subjt:  RICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220706.1e-14235.64Show/hide
Query:  FLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTI
        +L N L+++Y+K G    A+ LFD+M  R  FSWN +LSAY+K G + +    FD++P RDSVS+ + I G+       +++ +   M +EG EPT++T+
Subjt:  FLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTI

Query:  VSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFD-------------------------------CLTKKNLVSWNLM
         ++L + A    +  GK++H  ++     GNV + N+L +MYAKCG+   A+++FD                                + ++++V+WN M
Subjt:  VSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFD-------------------------------CLTKKNLVSWNLM

Query:  ISGYAKNGQPEKCIGLLHQM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQYGRVDEARRVLSE----------
        ISG+ + G   + + +  +M R S   PD+ T++++++A                                   Y + G V+ ARR++ +          
Subjt:  ISGYAKNGQPEKCIGLLHQM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQYGRVDEARRVLSE----------

Query:  -----------------------FKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLL
                                K++D+V WTAM+VGY ++G   +A+ LF  M+     P+SYTL++++S  + LASL HG+ +HG ++ +G   ++ 
Subjt:  -----------------------FKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLL

Query:  VSSALIDMYSKCGFIDDARSVFNLMP-TRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTP
        VS+ALI MY+K G I  A   F+L+   R+ VSW +MI+  AQ+GH ++ALELFE ML +  +PD++T++G+ SAC H   + QG++YFD + +   + P
Subjt:  VSSALIDMYSKCGFIDDARSVFNLMP-TRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTP

Query:  TLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVK
        TL HYACMV+L GR G +++A   I+ M  EPD +   +LLS C    +I   ++AA  L  L+P ++  Y  L+N+Y++ G+W++ A +R  MK   VK
Subjt:  TLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVK

Query:  KFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICND
        K  GFSWIE+ ++VH F  ED THPE   IY  +  +  ++++ G+ P+T  VLHD+ E+ K + +  HSEKLA+AFGLI  P+  + +RI+KN+R+CND
Subjt:  KFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICND

Query:  CHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        CH  +KF S+++GR+II+RD+ RFHHF  G CSC D W
Subjt:  CHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

Q9SY02 Pentatricopeptide repeat-containing protein At4g027503.2e-13836.62Show/hide
Query:  AYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSY
        +Y  ++   +R  E   A++L   M        D    N ++  Y +   L  A+ LF+ M +RDV SWN +LS YA++G + + ++ FDRMP ++ VS+
Subjt:  AYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSY

Query:  NSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVS
        N+ ++ +  NS  +E+  LFK   RE +    +    +L    +   +   +Q   S+ VR    +V  WN +   YA+ G+I++AR LFD    +++ +
Subjt:  NSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVS

Query:  WNLMISGYAKNGQPEKCIGLLHQM----------RLSGHMPDQ-----------------VTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVG
        W  M+SGY +N   E+   L  +M           L+G++  +                  T +T+I  Y Q G++ EA+ +  +  ++D V W AM+ G
Subjt:  WNLMISGYAKNGQPEKCIGLLHQM----------RLSGHMPDQ-----------------VTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVG

Query:  YAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIV
        Y+++G   +AL LF +M  E    +  + SS +S+CA + +L  G+ +HG+ +  G      V +AL+ MY KCG I++A  +F  M  +++VSWN MI 
Subjt:  YAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIV

Query:  GCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILST
        G +++G  + AL  FE+M ++  KPD+ T + +LSAC H   +++G++YF +++  +G+ P   HYACMV+LLGR G +E A +L+KNM  EPD  I  T
Subjt:  GCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILST

Query:  LLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIG
        LL      G+   AE AA  +F ++P ++  Y++LSN+YAS GRW DV  +R  M+ K VKK  G+SWIEI N+ H F+  D  HPE + I+  L  L  
Subjt:  LLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIG

Query:  KLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        ++++ G+   T++VLHDV E+EK + + +HSE+LA+A+G+++  +G  PIR+IKN+R+C DCH  +K+ +RI GR IILRD+NRFHHF  G CSC D W
Subjt:  KLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein9.7e-13536.84Show/hide
Query:  FQPTDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLKR----DVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRM
        +   +P ++N   LL +     +LR  + +   ++K     D+F+   L + YAK   +   +  FDRMP RD VS+N+ +AG++ N   + +LE+ K M
Subjt:  FQPTDPFLHN--QLLHLYAKFGKLRDAQNLFDKMLKR----DVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRM

Query:  QREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQ
          E  +P+  TIVS+L A + L  +  GK+IHG  +   F   V I  AL DMYAKCG +E AR LFD + ++N+VSWN MI  Y +N  P++ + +  +
Subjt:  QREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQ

Query:  MRLSGHMPDQVT-----------------------------------MSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFN
        M   G  P  V+                                   ++++I+ YC+   VD A  +  + + + +V W AM++G+A+NGR  DAL  F+
Subjt:  MRLSGHMPDQVT-----------------------------------MSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFN

Query:  EMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELF
        +M    ++PD++T  SV+++ A+L+  HH + +HG  + + L+ N+ V++AL+DMY+KCG I  AR +F++M  R+V +WNAMI G   +G  K ALELF
Subjt:  EMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELF

Query:  ENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAE
        E M +   KP+ VTF+ ++SAC H   +E G + F  +   + +  ++DHY  MV+LLGR GR+ +A   I  M  +P   +   +L  C    ++  AE
Subjt:  ENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAE

Query:  MAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVL
         AA  LFEL+P     +++L+N+Y +   W+ V  VR  M  + ++K  G S +EI NEVH F S    HP+S+ IY  L  LI  ++E G+ P+TNLVL
Subjt:  MAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVL

Query:  HDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
          V  D K + +  HSEKLA++FGL+    G + I + KN+R+C DCH   K+ S + GR+I++RD  RFHHF  G CSC D W
Subjt:  HDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

AT1G20230.1 Pentatricopeptide repeat (PPR) superfamily protein2.8e-13435.77Show/hide
Query:  DPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPF----RDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFE
        D F+   + H+Y + G++ DA+ +FD+M  +DV + +ALL AYA+ G ++ +      M       + VS+N  ++GF  +   +E++ +F+++   GF 
Subjt:  DPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPF----RDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFE

Query:  PTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGH
        P + T+ S+L +      L  G+ IHG VI +  L +  + +A+ DMY K G +     LF+          N  I+G ++NG  +K + +    +    
Subjt:  PTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGH

Query:  MPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAG
                                    +  E ++V WT+++ G A+NG++ +AL LF EM +  ++P+  T+ S++ +C  +A+L HG++ HG ++   
Subjt:  MPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAG

Query:  LNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQ
        L +N+ V SALIDMY+KCG I+ ++ VFN+MPT+N+V WN+++ G + +G  K+ + +FE++++ + KPD ++F  +LSAC      ++G +YF  +S +
Subjt:  LNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQ

Query:  HGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMK
        +G+ P L+HY+CMVNLLGR G++++A  LIK M  EPD  +   LL+ C  + ++  AE+AA  LF L+P +   Y++LSN+YA+ G W +V S+RN M+
Subjt:  HGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMK

Query:  SKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNI
        S  +KK  G SWI++ N V+   + D++HP+ + I E+++ +  ++++ G  PN +  LHDV E E+ + +  HSEKLA+ FGL+  P+G +P+++IKN+
Subjt:  SKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNI

Query:  RICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        RIC DCH  +KF S   GR+I +RD+NRFHHF  G CSC D W
Subjt:  RICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein2.0e-14035.76Show/hide
Query:  YTQLVLECVRTNEINQA---KRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSV
        Y+  + +C+     NQ+   K +  ++   L  P + FL+N ++H YA       A+ +FD++ + ++FSWN LL AY+K+G I  +++TF+++P RD V
Subjt:  YTQLVLECVRTNEINQA---KRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSV

Query:  SYNSAIAGFAGNSCPQESLELFKRMQRE-GFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAK------------------
        ++N  I G++ +     +++ +  M R+     T  T++++L  S+    +  GKQIHG VI   F   + + + L  MYA                   
Subjt:  SYNSAIAGFAGNSCPQESLELFKRMQRE-GFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAK------------------

Query:  -------------CGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAA------------------------
                     CG IE A  LF  + +K+ VSW  MI G A+NG  ++ I    +M++ G   DQ    +++ A                        
Subjt:  -------------CGEIEQARWLFDCLTKKNLVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAA------------------------

Query:  -----------YCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLN
                   YC+   +  A+ V    K+K++V WTAM+VGY + GR E+A+ +F +M    I+PD YTL   +S+CA ++SL  G   HGK+I +GL 
Subjt:  -----------YCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLN

Query:  NNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHG
        + + VS++L+ +Y KCG IDD+  +FN M  R+ VSW AM+   AQ G   + ++LF+ M+Q   KPD VT  G++SAC     +E+GQ YF  +++++G
Subjt:  NNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHG

Query:  MTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSK
        + P++ HY+CM++L  R+GR+E+A+  I  M   PD +  +TLLS C  KG++   + AA  L ELDP     Y +LS++YAS G+W  VA +R  M+ K
Subjt:  MTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSK

Query:  NVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRI
        NVKK  G SWI+   ++H F+++D + P  + IY +L  L  K+ + G+ P+T+ V HDV E  K K + +HSE+LA+AFGLI  P+G  PIR+ KN+R+
Subjt:  NVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRI

Query:  CNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        C DCH   K  S + GR+I++RD+ RFH F  G CSC D W
Subjt:  CNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein4.4e-14335.64Show/hide
Query:  FLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTI
        +L N L+++Y+K G    A+ LFD+M  R  FSWN +LSAY+K G + +    FD++P RDSVS+ + I G+       +++ +   M +EG EPT++T+
Subjt:  FLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTI

Query:  VSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFD-------------------------------CLTKKNLVSWNLM
         ++L + A    +  GK++H  ++     GNV + N+L +MYAKCG+   A+++FD                                + ++++V+WN M
Subjt:  VSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFD-------------------------------CLTKKNLVSWNLM

Query:  ISGYAKNGQPEKCIGLLHQM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQYGRVDEARRVLSE----------
        ISG+ + G   + + +  +M R S   PD+ T++++++A                                   Y + G V+ ARR++ +          
Subjt:  ISGYAKNGQPEKCIGLLHQM-RLSGHMPDQVTMSTIIAA-----------------------------------YCQYGRVDEARRVLSE----------

Query:  -----------------------FKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLL
                                K++D+V WTAM+VGY ++G   +A+ LF  M+     P+SYTL++++S  + LASL HG+ +HG ++ +G   ++ 
Subjt:  -----------------------FKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLL

Query:  VSSALIDMYSKCGFIDDARSVFNLMP-TRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTP
        VS+ALI MY+K G I  A   F+L+   R+ VSW +MI+  AQ+GH ++ALELFE ML +  +PD++T++G+ SAC H   + QG++YFD + +   + P
Subjt:  VSSALIDMYSKCGFIDDARSVFNLMP-TRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTP

Query:  TLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVK
        TL HYACMV+L GR G +++A   I+ M  EPD +   +LLS C    +I   ++AA  L  L+P ++  Y  L+N+Y++ G+W++ A +R  MK   VK
Subjt:  TLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVK

Query:  KFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICND
        K  GFSWIE+ ++VH F  ED THPE   IY  +  +  ++++ G+ P+T  VLHD+ E+ K + +  HSEKLA+AFGLI  P+  + +RI+KN+R+CND
Subjt:  KFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICND

Query:  CHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        CH  +KF S+++GR+II+RD+ RFHHF  G CSC D W
Subjt:  CHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-13936.62Show/hide
Query:  AYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSY
        +Y  ++   +R  E   A++L   M        D    N ++  Y +   L  A+ LF+ M +RDV SWN +LS YA++G + + ++ FDRMP ++ VS+
Subjt:  AYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKATFDRMPFRDSVSY

Query:  NSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVS
        N+ ++ +  NS  +E+  LFK   RE +    +    +L    +   +   +Q   S+ VR    +V  WN +   YA+ G+I++AR LFD    +++ +
Subjt:  NSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKNLVS

Query:  WNLMISGYAKNGQPEKCIGLLHQM----------RLSGHMPDQ-----------------VTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVG
        W  M+SGY +N   E+   L  +M           L+G++  +                  T +T+I  Y Q G++ EA+ +  +  ++D V W AM+ G
Subjt:  WNLMISGYAKNGQPEKCIGLLHQM----------RLSGHMPDQ-----------------VTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVG

Query:  YAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIV
        Y+++G   +AL LF +M  E    +  + SS +S+CA + +L  G+ +HG+ +  G      V +AL+ MY KCG I++A  +F  M  +++VSWN MI 
Subjt:  YAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVSSCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIV

Query:  GCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILST
        G +++G  + AL  FE+M ++  KPD+ T + +LSAC H   +++G++YF +++  +G+ P   HYACMV+LLGR G +E A +L+KNM  EPD  I  T
Subjt:  GCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIEQGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILST

Query:  LLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIG
        LL      G+   AE AA  +F ++P ++  Y++LSN+YAS GRW DV  +R  M+ K VKK  G+SWIEI N+ H F+  D  HPE + I+  L  L  
Subjt:  LLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNLMKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIG

Query:  KLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW
        ++++ G+   T++VLHDV E+EK + + +HSE+LA+A+G+++  +G  PIR+IKN+R+C DCH  +K+ +RI GR IILRD+NRFHHF  G CSC D W
Subjt:  KLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHEFMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCAAAATCCACGTTGCGGCAAGCAGTAGACTTGCTCTGTTCTCGTAGCACTGTCACCGCTGAGGCATACACCCAATTGGTCCTTGAATGTGTTCGTACAAATGA
AATCAACCAAGCTAAGAGACTCCAATCTCACATGGAGCACCATCTTTTCCAACCCACCGACCCTTTTCTCCACAATCAGCTACTTCATTTGTATGCAAAATTTGGAAAGC
TTCGAGATGCCCAAAACTTGTTTGATAAAATGCTTAAAAGGGATGTTTTTTCCTGGAATGCTCTGCTTTCTGCGTATGCTAAATCAGGTTCTATCCAGAATTTGAAGGCC
ACATTTGATCGAATGCCTTTTCGGGATTCAGTTTCGTACAATTCGGCCATTGCAGGTTTTGCTGGAAATAGTTGTCCACAAGAGTCGCTTGAGCTTTTTAAAAGAATGCA
AAGGGAAGGTTTTGAGCCTACGGAATATACGATTGTAAGTATATTGAATGCATCGGCTCAATTGTTGGACCTGAGGTACGGGAAACAGATTCATGGGAGCGTTATTGTGC
GTAACTTTTTGGGAAATGTGTTTATTTGGAATGCTTTAACAGATATGTATGCCAAATGTGGTGAGATTGAGCAGGCAAGGTGGTTGTTTGATTGTCTTACTAAAAAGAAT
TTGGTTTCTTGGAACTTAATGATATCCGGGTATGCAAAGAATGGACAGCCTGAAAAGTGTATTGGTTTGTTACATCAAATGCGGTTGTCTGGACATATGCCCGATCAAGT
TACCATGTCAACTATAATTGCAGCTTACTGTCAATACGGACGTGTAGATGAAGCAAGAAGGGTGCTTAGTGAGTTTAAAGAGAAGGATATTGTTTGCTGGACAGCCATGT
TGGTGGGTTATGCAAAAAATGGCAGAGAAGAGGATGCACTATTGTTGTTTAATGAAATGCTATTAGAACATATTGAACCTGACAGCTACACTTTATCAAGTGTTGTCAGT
TCTTGTGCCAAATTGGCATCGCTACATCATGGTCAGGCAGTCCACGGAAAATCAATTCTTGCTGGTCTTAACAATAATTTGCTTGTCTCTAGCGCACTAATTGATATGTA
TTCTAAATGTGGTTTCATTGATGATGCAAGGTCAGTCTTCAACCTGATGCCAACTAGGAATGTGGTTTCATGGAATGCTATGATTGTTGGTTGTGCACAAAATGGACACG
ATAAAGATGCCCTTGAACTCTTTGAAAATATGTTACAACAGAAATTTAAACCTGATAATGTAACTTTTATCGGCATTTTATCTGCTTGTCTCCATTGTAATTGGATAGAG
CAAGGGCAGGAGTACTTTGATTCTATAAGCAACCAACATGGAATGACACCTACTTTGGATCATTATGCATGTATGGTCAATCTCCTAGGACGTACAGGCCGCATAGAACA
AGCAATTAGTTTAATAAAAAATATGGCCCATGAACCAGATTTCCTCATTTTGTCCACACTTCTATCCATTTGCTCAACAAAGGGTGATATTGTAAATGCAGAAATGGCAG
CTAGGCATCTCTTTGAATTGGATCCTACGAGTGCCGTACCATATATCATGCTCTCAAATATGTATGCTTCTATGGGTAGATGGAAAGATGTAGCTTCAGTTAGGAATCTC
ATGAAAAGCAAGAATGTGAAGAAGTTTGCTGGGTTCAGTTGGATTGAAATTGATAACGAGGTGCACAGATTCACATCTGAAGATCGGACTCATCCAGAATCAGAAAATAT
ATATGAGGAACTGAACATTTTGATAGGGAAACTTCAAGAAGAAGGATTTACCCCTAACACAAATCTGGTTTTGCATGATGTTGGAGAGGACGAAAAGTTCAAATCCATAT
GTTTTCACAGTGAGAAACTTGCCCTTGCCTTTGGTTTGATTAAGAAACCTAATGGAATTAGTCCAATAAGGATCATAAAGAATATTCGAATTTGCAATGATTGCCATGAA
TTTATGAAGTTTGCATCTAGGATTATTGGAAGGCAAATTATATTGAGAGATTCAAATAGGTTTCATCATTTTTCAACTGGGAAGTGCTCCTGCAACGACAATTGGTAA
mRNA sequenceShow/hide mRNA sequence
CAAAATTCCACGGGAACCTCAAGATTTCGACGACGGAGCACGGAGCGTGGCGGCACGAATCCGAGGTGGGCGATGATGGAGCGGGGTGGCTGTATGCGGCGGAGATGGGT
GGCACCTTTAGGAAGAAGATAAGACGACTGAAAAATGAAAGCAAAATCCACGTTGCGGCAAGCAGTAGACTTGCTCTGTTCTCGTAGCACTGTCACCGCTGAGGCATACA
CCCAATTGGTCCTTGAATGTGTTCGTACAAATGAAATCAACCAAGCTAAGAGACTCCAATCTCACATGGAGCACCATCTTTTCCAACCCACCGACCCTTTTCTCCACAAT
CAGCTACTTCATTTGTATGCAAAATTTGGAAAGCTTCGAGATGCCCAAAACTTGTTTGATAAAATGCTTAAAAGGGATGTTTTTTCCTGGAATGCTCTGCTTTCTGCGTA
TGCTAAATCAGGTTCTATCCAGAATTTGAAGGCCACATTTGATCGAATGCCTTTTCGGGATTCAGTTTCGTACAATTCGGCCATTGCAGGTTTTGCTGGAAATAGTTGTC
CACAAGAGTCGCTTGAGCTTTTTAAAAGAATGCAAAGGGAAGGTTTTGAGCCTACGGAATATACGATTGTAAGTATATTGAATGCATCGGCTCAATTGTTGGACCTGAGG
TACGGGAAACAGATTCATGGGAGCGTTATTGTGCGTAACTTTTTGGGAAATGTGTTTATTTGGAATGCTTTAACAGATATGTATGCCAAATGTGGTGAGATTGAGCAGGC
AAGGTGGTTGTTTGATTGTCTTACTAAAAAGAATTTGGTTTCTTGGAACTTAATGATATCCGGGTATGCAAAGAATGGACAGCCTGAAAAGTGTATTGGTTTGTTACATC
AAATGCGGTTGTCTGGACATATGCCCGATCAAGTTACCATGTCAACTATAATTGCAGCTTACTGTCAATACGGACGTGTAGATGAAGCAAGAAGGGTGCTTAGTGAGTTT
AAAGAGAAGGATATTGTTTGCTGGACAGCCATGTTGGTGGGTTATGCAAAAAATGGCAGAGAAGAGGATGCACTATTGTTGTTTAATGAAATGCTATTAGAACATATTGA
ACCTGACAGCTACACTTTATCAAGTGTTGTCAGTTCTTGTGCCAAATTGGCATCGCTACATCATGGTCAGGCAGTCCACGGAAAATCAATTCTTGCTGGTCTTAACAATA
ATTTGCTTGTCTCTAGCGCACTAATTGATATGTATTCTAAATGTGGTTTCATTGATGATGCAAGGTCAGTCTTCAACCTGATGCCAACTAGGAATGTGGTTTCATGGAAT
GCTATGATTGTTGGTTGTGCACAAAATGGACACGATAAAGATGCCCTTGAACTCTTTGAAAATATGTTACAACAGAAATTTAAACCTGATAATGTAACTTTTATCGGCAT
TTTATCTGCTTGTCTCCATTGTAATTGGATAGAGCAAGGGCAGGAGTACTTTGATTCTATAAGCAACCAACATGGAATGACACCTACTTTGGATCATTATGCATGTATGG
TCAATCTCCTAGGACGTACAGGCCGCATAGAACAAGCAATTAGTTTAATAAAAAATATGGCCCATGAACCAGATTTCCTCATTTTGTCCACACTTCTATCCATTTGCTCA
ACAAAGGGTGATATTGTAAATGCAGAAATGGCAGCTAGGCATCTCTTTGAATTGGATCCTACGAGTGCCGTACCATATATCATGCTCTCAAATATGTATGCTTCTATGGG
TAGATGGAAAGATGTAGCTTCAGTTAGGAATCTCATGAAAAGCAAGAATGTGAAGAAGTTTGCTGGGTTCAGTTGGATTGAAATTGATAACGAGGTGCACAGATTCACAT
CTGAAGATCGGACTCATCCAGAATCAGAAAATATATATGAGGAACTGAACATTTTGATAGGGAAACTTCAAGAAGAAGGATTTACCCCTAACACAAATCTGGTTTTGCAT
GATGTTGGAGAGGACGAAAAGTTCAAATCCATATGTTTTCACAGTGAGAAACTTGCCCTTGCCTTTGGTTTGATTAAGAAACCTAATGGAATTAGTCCAATAAGGATCAT
AAAGAATATTCGAATTTGCAATGATTGCCATGAATTTATGAAGTTTGCATCTAGGATTATTGGAAGGCAAATTATATTGAGAGATTCAAATAGGTTTCATCATTTTTCAA
CTGGGAAGTGCTCCTGCAACGACAATTGGTAAGGAATTTGCTGGTGTAGATTGGGGGAAGTCACGAGATACAATGTGAGAAAACAATGATGTGAGGATACTGGGACTATA
ACAAAAAAGAGTAGAGGGTTCAAGGTGAAGGGATTGGTTCCTGAAATTTTCAAAAGGGAAAATGTTTCTGATATATGTCCCCAAGTCCCTCAAAATTCAGAAATAGAAAA
ATGAGTTCTTCATTGCACAATACTCCGTCTTATACAAAAAAATGTATAACATCAATTTCAAACTCGCCAACTCCAAGGTTGGGAGTGGAGAGTTGCTGAAAAATTCAAGA
AGCGTCTTAGCATTCATTAAAAAAGTCTTCAAACGCTGAATAATCAAATAATGTAGACATTGACTCCTGTATCAAAACATAGATATTTAGAAAAATATGTGCGGAAGCGA
TGTCAAATCCCCCAAAGCTCAATCGTAGATTCTTCTTGTCATTC
Protein sequenceShow/hide protein sequence
MKAKSTLRQAVDLLCSRSTVTAEAYTQLVLECVRTNEINQAKRLQSHMEHHLFQPTDPFLHNQLLHLYAKFGKLRDAQNLFDKMLKRDVFSWNALLSAYAKSGSIQNLKA
TFDRMPFRDSVSYNSAIAGFAGNSCPQESLELFKRMQREGFEPTEYTIVSILNASAQLLDLRYGKQIHGSVIVRNFLGNVFIWNALTDMYAKCGEIEQARWLFDCLTKKN
LVSWNLMISGYAKNGQPEKCIGLLHQMRLSGHMPDQVTMSTIIAAYCQYGRVDEARRVLSEFKEKDIVCWTAMLVGYAKNGREEDALLLFNEMLLEHIEPDSYTLSSVVS
SCAKLASLHHGQAVHGKSILAGLNNNLLVSSALIDMYSKCGFIDDARSVFNLMPTRNVVSWNAMIVGCAQNGHDKDALELFENMLQQKFKPDNVTFIGILSACLHCNWIE
QGQEYFDSISNQHGMTPTLDHYACMVNLLGRTGRIEQAISLIKNMAHEPDFLILSTLLSICSTKGDIVNAEMAARHLFELDPTSAVPYIMLSNMYASMGRWKDVASVRNL
MKSKNVKKFAGFSWIEIDNEVHRFTSEDRTHPESENIYEELNILIGKLQEEGFTPNTNLVLHDVGEDEKFKSICFHSEKLALAFGLIKKPNGISPIRIIKNIRICNDCHE
FMKFASRIIGRQIILRDSNRFHHFSTGKCSCNDNW