CuGenDBv2

ClCG05G025610 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G025610
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Pentatricopeptide repeat
Genome location: CG_Chr05:36953152..36955656
RNA-Seq Expression: ClCG05G025610
Synteny: ClCG05G025610
Gene Ontology terms: GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
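
The genome location in this record is written as a sequence name followed by `start..end` coordinates. The sketch below is one possible way to parse such a string and derive the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the names `Location` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# Sequence name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string such as 'CG_Chr05:36953152..36955656'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("CG_Chr05:36953152..36955656")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

If the database instead uses 0-based or half-open coordinates, only the `length` property would need to change.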


Homology
GenBank top hits (e value, % identity, alignment)
XP_008446176.1 PREDICTED: pentatricopeptide repeat-containing protein At1g60770 isoform X1 [Cucumis melo] (e value: 3.7e-227; % identity: 76.44)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        + LR+   +K++AKRST+KYLEEALY+RLFKDGGSEKS+R QLN FIKS KRVFKWEVGDTLKKLRDRKLY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKRG+N+TVSDQAIHLDLVAKARGIAAAE+YFV LPESSKNHL Y SLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KELLTEKAE+LFEKMKELNL +TSM  N LMTLY K GQP+KV +IIQEMKAANV FDSYTY VWMRALAALNDISGVERVIDEMKRD GV GDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYVNAN+FEKAAKAL DLEK N  RDL AFQFLITLYGQIG+L +VY VWRSLRLAFP+TANISYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIANAL+GAYTKEGLLEKAM LKERA KRGA+PNAKTWEIFLDYYLKNG+FKLA DCV KAVS+GKGDGGKWMPS EIIKS MSHFE EKDVDGAE FLE
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
        IVKKTVD+L+SEVFESLIRTYSAAGR SS+MNRRLKMENVEVSEACKKLLNEISIE
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

XP_022151897.1 pentatricopeptide repeat-containing protein At1g60770 [Momordica charantia] (e value: 4.6e-230; % identity: 76.98)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        MALRQ SR KNVAKRS  KYLEEALYVRLFKDG SEKS+R QLN FIK HKRVFKWEVGDTLKKLR RKLYNPALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKRG+N+T+SDQAIHLDLVAKARGIAAAESYFV LPESSKNHLCYGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KEL+T++AEAL EKMKELNL VTSMSYNS+MTLY KTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDG VVGDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYV+A+LFEKA KAL+DLEKRN+ R+LSAFQF+ITLYG++GNL EVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIAN L+GAY +EGLLEKAMELKERAR+RGAKPNAKTWEIFLDY+L+NG+FK A DCV KAVS G+  GGKWMPS EI+K+LMSHFE EKDVDGAE FLE
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
         VKK VDTL+ EVFE+LIRTYSAAGRKSS M+R LKMENVEVSEACKKLL+EISIE
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

XP_022945244.1 pentatricopeptide repeat-containing protein At1g60770 [Cucurbita moschata] (e value: 1.5e-225; % identity: 74.82)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        + LR+LSRTKNVAKRSTKKYLEE LYVRLFKDG SEKS+R QLN F+KS KRVFKWEVGDTLKKLRDRKLY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKR +N+TVSDQAIHLDL+AKARGIAAAES+FV LPESSKNHLCYGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KEL+TEKAEA+ EKMKELNL VTSM YNSLMTLY KTG PEKV AIIQEMKAA VMFD+YTYNVWMRALAALNDISGVERVIDEMK DG  VGDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYV+A++F+KA  ALK+LEKRNA RDLSAFQFLITL+GQ+GNL EVYRVWRSLRLAFPKTANISYLNMIQTL KLKDLPGAEKCFKEW+SGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIANAL+GAY KEGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYLKNG+FKLAADC  KAVSKG+ DGGKW+PS E+I++ MSH+E EKDVDGAE F+E
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
         VKK+VD+L+SEVFESLIRTYSAAGR+S  M+RRLKMENVEVSEACKKLL+EISI+
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

XP_022966912.1 pentatricopeptide repeat-containing protein At1g60770 [Cucurbita maxima] (e value: 7.9e-230; % identity: 75.54)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        + LRQLSRTKNVAKRSTKKYLEE LYVRLFKDGGSEKS+R QLN F+KS KRVFKWEVGDTLKKLRDRKLY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKR +N+TVSDQA HLDL+ KARGIAAAES+FV LPESSKNHLCYGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KEL+TEKAEA+ EKMKELNL VTSM YNSLMTLY KTGQPEKVRAIIQEMKAANV+FD+YTYNVWMRALAA NDISGVERVIDEMKRDG  VGDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYV+A++F+KA  ALK+LEKRNACRDLSAFQFLITL+GQ+GNL EVYRVWRSLRLAFP TANISYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIANAL+GAY KEGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYLKNG+FKLAADCV KAVSKG+ D GKW+PS E+I++ MSH+E EKDVDGAE F+E
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
         VKK+VD+L+SEVFESLIRTYSAAGR+S  M+RRLKMENVEVSEACKKLL+EISIE
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

XP_038893310.1 pentatricopeptide repeat-containing protein At1g60770 isoform X1 [Benincasa hispida] (e value: 3.4e-249; % identity: 81.47)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        MALRQ SRTKN+AKRSTKKYLEEALYVRLFKDGGSEKS+RQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETM KRG+N+TVSDQAIHLDLVAKARG+AAAESYFV LPESSKNHLCYGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KELLTEKAEALFEKMKELNL V SM YNSLMTLY KTGQP+KVR+IIQEMKAANVMFDSYTYNVWMRALAALNDISGVERV+DEMKRDGGVVGDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYV+ANLFEKA KALK+LEKRNACR+LSAFQFLITLYGQIGNLPEVYRVWRSLRLAF KTANISYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIANAL+GAYTKEGLLEKAMELKERAR +GAKPN KTWE+FLDYYLKNGDFKLA DCV KAVSK KGDGGKWMPS EIIKS MSHFE EKDVDGAEGFLE
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
        IVKKTVDTL+SEVFESLIRTYSAAGR+SS+MN RLKMENVEVSEACKKLL+EISIE
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KS91 Uncharacterized protein (e value: 2.0e-218; % identity: 74.19)
Query:  LRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVVVM
        L++   +K++AKRS +KYLEEALY+RLFKDGGSEKS+R QLN FIKSHKRVFKWEVGDTL+KLRDRKLY PALK                          
Subjt:  LRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVVVM

Query:  AICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKE
                                                LSE MAKRG+N+TVSDQAIHLDLVAKARGI AAE+YFV LPESSKNHL Y SLLNCYCKE
Subjt:  AICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKE

Query:  LLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLA
        LLTEKAEALFEK+KELNL VT + YNSLMTLY K G+P+KV  IIQEMKAANV FD YTY VWMRALAALNDISGVERVIDEMKRD GV GDWTTYSNLA
Subjt:  LLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLA

Query:  SIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRI
        SIYVNAN+FEKAAKALKDLEK N  RDL  FQFLITLYGQIG+L EVYRVWRSLRLAFP+TANISYLNMIQTLTKLKDLPGAEKCFKEWESG  TYDIRI
Subjt:  SIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRI

Query:  ANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIV
         NAL+GAYTK GLLEKAM LKERA +RGA+PNAKTWE FL+YYLKNGDFKLA DCV KA+  GKGD GKW+PS EIIKS MSHFE EKDVDGAE FLEIV
Subjt:  ANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIV

Query:  KKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
        KKTVD+L+SEVFESLIRTYSAAGR SS+M+RRLKMENVEVSEACKKLLN+ISIE
Subjt:  KKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

A0A1S3BF94 pentatricopeptide repeat-containing protein At1g60770 isoform X1 (e value: 1.8e-227; % identity: 76.44)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        + LR+   +K++AKRST+KYLEEALY+RLFKDGGSEKS+R QLN FIKS KRVFKWEVGDTLKKLRDRKLY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKRG+N+TVSDQAIHLDLVAKARGIAAAE+YFV LPESSKNHL Y SLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KELLTEKAE+LFEKMKELNL +TSM  N LMTLY K GQP+KV +IIQEMKAANV FDSYTY VWMRALAALNDISGVERVIDEMKRD GV GDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYVNAN+FEKAAKAL DLEK N  RDL AFQFLITLYGQIG+L +VY VWRSLRLAFP+TANISYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIANAL+GAYTKEGLLEKAM LKERA KRGA+PNAKTWEIFLDYYLKNG+FKLA DCV KAVS+GKGDGGKWMPS EIIKS MSHFE EKDVDGAE FLE
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
        IVKKTVD+L+SEVFESLIRTYSAAGR SS+MNRRLKMENVEVSEACKKLLNEISIE
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

A0A6J1DEQ3 pentatricopeptide repeat-containing protein At1g60770 (e value: 2.2e-230; % identity: 76.98)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        MALRQ SR KNVAKRS  KYLEEALYVRLFKDG SEKS+R QLN FIK HKRVFKWEVGDTLKKLR RKLYNPALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKRG+N+T+SDQAIHLDLVAKARGIAAAESYFV LPESSKNHLCYGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KEL+T++AEAL EKMKELNL VTSMSYNS+MTLY KTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDG VVGDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYV+A+LFEKA KAL+DLEKRN+ R+LSAFQF+ITLYG++GNL EVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIAN L+GAY +EGLLEKAMELKERAR+RGAKPNAKTWEIFLDY+L+NG+FK A DCV KAVS G+  GGKWMPS EI+K+LMSHFE EKDVDGAE FLE
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
         VKK VDTL+ EVFE+LIRTYSAAGRKSS M+R LKMENVEVSEACKKLL+EISIE
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

A0A6J1G0G4 pentatricopeptide repeat-containing protein At1g60770 (e value: 7.4e-226; % identity: 74.82)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        + LR+LSRTKNVAKRSTKKYLEE LYVRLFKDG SEKS+R QLN F+KS KRVFKWEVGDTLKKLRDRKLY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKR +N+TVSDQAIHLDL+AKARGIAAAES+FV LPESSKNHLCYGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KEL+TEKAEA+ EKMKELNL VTSM YNSLMTLY KTG PEKV AIIQEMKAA VMFD+YTYNVWMRALAALNDISGVERVIDEMK DG  VGDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYV+A++F+KA  ALK+LEKRNA RDLSAFQFLITL+GQ+GNL EVYRVWRSLRLAFPKTANISYLNMIQTL KLKDLPGAEKCFKEW+SGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIANAL+GAY KEGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYLKNG+FKLAADC  KAVSKG+ DGGKW+PS E+I++ MSH+E EKDVDGAE F+E
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
         VKK+VD+L+SEVFESLIRTYSAAGR+S  M+RRLKMENVEVSEACKKLL+EISI+
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

A0A6J1HP97 pentatricopeptide repeat-containing protein At1g60770 (e value: 3.8e-230; % identity: 75.54)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        + LRQLSRTKNVAKRSTKKYLEE LYVRLFKDGGSEKS+R QLN F+KS KRVFKWEVGDTLKKLRDRKLY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSETMAKR +N+TVSDQA HLDL+ KARGIAAAES+FV LPESSKNHLCYGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KEL+TEKAEA+ EKMKELNL VTSM YNSLMTLY KTGQPEKVRAIIQEMKAANV+FD+YTYNVWMRALAA NDISGVERVIDEMKRDG  VGDWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        LASIYV+A++F+KA  ALK+LEKRNACRDLSAFQFLITL+GQ+GNL EVYRVWRSLRLAFP TANISYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RIANAL+GAY KEGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYLKNG+FKLAADCV KAVSKG+ D GKW+PS E+I++ MSH+E EKDVDGAE F+E
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE
         VKK+VD+L+SEVFESLIRTYSAAGR+S  M+RRLKMENVEVSEACKKLL+EISIE
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEISIE

SwissProt top hits (e value, % identity, alignment)
O22714 Pentatricopeptide repeat-containing protein At1g60770 (e value: 4.1e-189; % identity: 61.73)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        MA+R LSR+++V KRSTKKY+EE LY RLFKDGG+E  +RQQLN F+K  K VFKWEVGDT+KKLR+R LY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSE M +RG+N+TVSDQAIHLDLVAKAR I A E+YFV LPE+SK  L YGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KELLTEKAE L  KMKELN+  +SMSYNSLMTLY KTG+ EKV A+IQE+KA NVM DSYTYNVWMRALAA NDISGVERVI+EM RDG V  DWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        +ASIYV+A L +KA KAL++LE +N  RD +A+QFLITLYG++G L EVYR+WRSLRLA PKT+N++YLNMIQ L KL DLPGAE  FKEW++ CSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RI N L+GAY +EGL++KA ELKE+A +RG K NAKTWEIF+DYY+K+GD   A +C+ KAVS GKGDGGKW+PS E +++LMS+FE +KDV+GAE  LE
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEIS
        I+K   D + +E+FE LIRTY+AAG+    M RRLKMENVEV+EA KKLL+E+S
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEIS

Q8LPS6 Pentatricopeptide repeat-containing protein At1g02150 (e value: 2.7e-63; % identity: 35.96)
Query:  TVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAAN
        + SD AI LDL+ K RGI  AE +F+ LPE+ K+   YGSLLN Y +    EKAEAL   M++    +  + +N +MTLYM   + +KV A++ EMK  +
Subjt:  TVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAAN

Query:  VMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWR
        +  D Y+YN+W+ +  +L  +  +E V  +MK D  +  +WTT+S +A++Y+     EKA  AL+ +E R   R+   + +L++LYG +GN  E+YRVW 
Subjt:  VMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWR

Query:  SLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLA
          +   P   N+ Y  ++ +L ++ D+ GAEK ++EW    S+YD RI N LM AY K   LE A  L +   + G KP++ TWEI    + +      A
Subjt:  SLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLA

Query:  ADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLD
          C+  A S        W P   ++       E E DV   E  LE+++++ D  D
Subjt:  ADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLD

Q93WC5 Pentatricopeptide repeat-containing protein At4g01990, mitochondrial (e value: 2.0e-82; % identity: 41.81)
Query:  ETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVR
        E M ++ +  T SD AI L+L+AK++G+ AAE+YF  L +S KN   YGSLLNCYC E    KA+A FE M +LN +  S+ +N+LM +YM  GQPEKV 
Subjt:  ETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVR

Query:  AIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIG
        A++  MK  ++     TY++W+++  +L D+ GVE+V+DEMK +G  +  W T++NLA+IY+   L+ KA +ALK LE          + FLI LY  I 
Subjt:  AIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIG

Query:  NLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDY
        N  EVYRVW  L+  +P   N SYL M++ L+KL D+ G +K F EWES C TYD+R+AN  + +Y K+ + E+A  +   A K+     +K  ++ + +
Subjt:  NLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDY

Query:  YLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSE
         LKN    LA    E AV         W  SSE+I S   HFE  KDVDGAE F + + K    L SE +  L++TY AAG+    M +RL+ + + V E
Subjt:  YLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSE

Query:  ACKKLLNEI
          + LL++I
Subjt:  ACKKLLNEI

Q9FZ24 Pentatricopeptide repeat-containing protein At1g02370, mitochondrial (e value: 1.3e-89; % identity: 43.14)
Query:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNH-LCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRA
        M KR +  +VSD AI LDL+ K +G+ AAE+YF  L  S+KNH   YG+L+NCYC EL  EKA+A FE M ELN +  S+ +N++M++YM+  QPEKV  
Subjt:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNH-LCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRA

Query:  IIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGN
        ++  MK   +     TY++WM++  +LND+ G+E++IDEM +D      W T+SNLA+IY  A L+EKA  ALK +E++    +  +  FL++LY  I  
Subjt:  IIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGN

Query:  LPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYY
         PEVYRVW SL+ A P+  N+SYL M+Q ++KL DL G +K F EWES C  YD+R+AN  +  Y K  + E+A ++ + A K+   P +K  ++ + + 
Subjt:  LPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYY

Query:  LKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEA
        L+N    LA   +E AVS    +  +W  SSE++     HFE  KDVDGAE F +I+      LDSE    LI+TY+AA + S  M  RL  + +EVSE 
Subjt:  LKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEA

Query:  CKKLLNEI
         + LL  +
Subjt:  CKKLLNEI

Q9SY07 Pentatricopeptide repeat-containing protein At4g02820, mitochondrial (e value: 1.2e-63; % identity: 33.42)
Query:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAI
        + +  +     D A+HLDL++K RG+ +AE +F  +P+  + H    SLL+ Y +  L++KAEALFEKM E   L + + YN ++++Y+  GQ EKV  +
Subjt:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAI

Query:  IQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNL
        I+E+K      D  TYN+W+ A A+ ND+ G E+V  + K +  +  DW TYS L ++Y   +  EKA  ALK++EK  + ++  A+  LI+L+  +G+ 
Subjt:  IQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNL

Query:  PEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYL
          V   W+ ++ +F K  +  YL+MI  + KL +   A+  + EWES   T D RI N ++  Y     +    +  ER  ++G  P+  TWEI    YL
Subjt:  PEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYL

Query:  KNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEAC
        K  D +   DC  KA+   K    KW  +  ++K      E + +V GAE  + +++K    ++++++ SL+RTY+ AG  +  +  R+  +NVE+ E  
Subjt:  KNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEAC

Query:  KKLL
        K+L+
Subjt:  KKLL

Arabidopsis top hits (e value, % identity, alignment)
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.9e-64; % identity: 35.96)
Query:  TVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAAN
        + SD AI LDL+ K RGI  AE +F+ LPE+ K+   YGSLLN Y +    EKAEAL   M++    +  + +N +MTLYM   + +KV A++ EMK  +
Subjt:  TVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAAN

Query:  VMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWR
        +  D Y+YN+W+ +  +L  +  +E V  +MK D  +  +WTT+S +A++Y+     EKA  AL+ +E R   R+   + +L++LYG +GN  E+YRVW 
Subjt:  VMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWR

Query:  SLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLA
          +   P   N+ Y  ++ +L ++ D+ GAEK ++EW    S+YD RI N LM AY K   LE A  L +   + G KP++ TWEI    + +      A
Subjt:  SLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLA

Query:  ADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLD
          C+  A S        W P   ++       E E DV   E  LE+++++ D  D
Subjt:  ADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLD

AT1G02370.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.0e-91; % identity: 43.14)
Query:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNH-LCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRA
        M KR +  +VSD AI LDL+ K +G+ AAE+YF  L  S+KNH   YG+L+NCYC EL  EKA+A FE M ELN +  S+ +N++M++YM+  QPEKV  
Subjt:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNH-LCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRA

Query:  IIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGN
        ++  MK   +     TY++WM++  +LND+ G+E++IDEM +D      W T+SNLA+IY  A L+EKA  ALK +E++    +  +  FL++LY  I  
Subjt:  IIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGN

Query:  LPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYY
         PEVYRVW SL+ A P+  N+SYL M+Q ++KL DL G +K F EWES C  YD+R+AN  +  Y K  + E+A ++ + A K+   P +K  ++ + + 
Subjt:  LPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYY

Query:  LKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEA
        L+N    LA   +E AVS    +  +W  SSE++     HFE  KDVDGAE F +I+      LDSE    LI+TY+AA + S  M  RL  + +EVSE 
Subjt:  LKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEA

Query:  CKKLLNEI
         + LL  +
Subjt:  CKKLLNEI

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.9e-190; % identity: 61.73)
Query:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV
        MA+R LSR+++V KRSTKKY+EE LY RLFKDGG+E  +RQQLN F+K  K VFKWEVGDT+KKLR+R LY PALK                        
Subjt:  MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVV

Query:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC
                                                  LSE M +RG+N+TVSDQAIHLDLVAKAR I A E+YFV LPE+SK  L YGSLLNCYC
Subjt:  VMAICSTCLSCAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYC

Query:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN
        KELLTEKAE L  KMKELN+  +SMSYNSLMTLY KTG+ EKV A+IQE+KA NVM DSYTYNVWMRALAA NDISGVERVI+EM RDG V  DWTTYSN
Subjt:  KELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSN

Query:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI
        +ASIYV+A L +KA KAL++LE +N  RD +A+QFLITLYG++G L EVYR+WRSLRLA PKT+N++YLNMIQ L KL DLPGAE  FKEW++ CSTYDI
Subjt:  LASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDI

Query:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE
        RI N L+GAY +EGL++KA ELKE+A +RG K NAKTWEIF+DYY+K+GD   A +C+ KAVS GKGDGGKW+PS E +++LMS+FE +KDV+GAE  LE
Subjt:  RIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLE

Query:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEIS
        I+K   D + +E+FE LIRTY+AAG+    M RRLKMENVEV+EA KKLL+E+S
Subjt:  IVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLLNEIS

AT4G01990.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.4e-83; %identity: 41.81)
Query:  ETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVR
        E M ++ +  T SD AI L+L+AK++G+ AAE+YF  L +S KN   YGSLLNCYC E    KA+A FE M +LN +  S+ +N+LM +YM  GQPEKV 
Subjt:  ETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVR

Query:  AIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIG
        A++  MK  ++     TY++W+++  +L D+ GVE+V+DEMK +G  +  W T++NLA+IY+   L+ KA +ALK LE          + FLI LY  I 
Subjt:  AIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIG

Query:  NLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDY
        N  EVYRVW  L+  +P   N SYL M++ L+KL D+ G +K F EWES C TYD+R+AN  + +Y K+ + E+A  +   A K+     +K  ++ + +
Subjt:  NLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDY

Query:  YLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSE
         LKN    LA    E AV         W  SSE+I S   HFE  KDVDGAE F + + K    L SE +  L++TY AAG+    M +RL+ + + V E
Subjt:  YLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSE

Query:  ACKKLLNEI
          + LL++I
Subjt:  ACKKLLNEI

AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.5e-65; %identity: 33.42)
Query:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAI
        + +  +     D A+HLDL++K RG+ +AE +F  +P+  + H    SLL+ Y +  L++KAEALFEKM E   L + + YN ++++Y+  GQ EKV  +
Subjt:  MAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNLLVTSMSYNSLMTLYMKTGQPEKVRAI

Query:  IQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNL
        I+E+K      D  TYN+W+ A A+ ND+ G E+V  + K +  +  DW TYS L ++Y   +  EKA  ALK++EK  + ++  A+  LI+L+  +G+ 
Subjt:  IQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDLSAFQFLITLYGQIGNL

Query:  PEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYL
          V   W+ ++ +F K  +  YL+MI  + KL +   A+  + EWES   T D RI N ++  Y     +    +  ER  ++G  P+  TWEI    YL
Subjt:  PEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEIFLDYYL

Query:  KNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEAC
        K  D +   DC  KA+   K    KW  +  ++K      E + +V GAE  + +++K    ++++++ SL+RTY+ AG  +  +  R+  +NVE+ E  
Subjt:  KNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEAC

Query:  KKLL
        K+L+
Subjt:  KKLL
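
The midline between each Query and Subjt row follows the usual BLAST protein convention: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch. As a minimal sketch (the function name `midline_identity` is hypothetical, not part of this database), the identity of a single alignment block can be estimated by counting the letters in its midline. The %identity values listed for each hit are computed over the full alignment, so a single block will not necessarily reproduce them.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity for one alignment block, from its BLAST-style midline.

    Letters in the midline are identical residues; '+' and blanks are not.
    Trailing blanks are often stripped from the midline, so it is padded
    back to the length of the query row before counting.
    """
    if not query:
        raise ValueError("query row is empty")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query row")
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / len(query)


# Final block of the AT4G02820.1 alignment: query "KKLL", midline "K+L+"
print(midline_identity("KKLL", "K+L+"))  # 50.0
```

The denominator here is the length of the query row, gaps included; this is an assumption of the sketch and should be adjusted if a different definition of alignment length is wanted.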


Sequences
CDS sequence
ATGGCGTTACGGCAGTTAAGTCGCACCAAGAATGTGGCGAAGAGGTCGACGAAGAAGTATCTGGAGGAAGCACTGTACGTGAGGCTCTTTAAAGACGGTGGCTCAGAGAA
GAGCCTTCGGCAACAGTTGAATGGTTTCATCAAGAGTCACAAGCGAGTTTTCAAATGGGAGGTTGGAGATACACTCAAAAAGCTTCGCGATAGGAAGCTGTACAATCCTG
CTCTCAAGGTTCGTCTAACCAAACCCTTGAAATTCTGTGGTATTGATGATGAATGGCGAAATATTTTAATCAGAGTTGTTGTTATGGCTATCTGTTCTACTTGTCTTAGC
TGTGCAAAAATAGATAAGTATTCTATGGTTGAATGGGACAGAATCAGTGGACAGAATTGTAGTATAATGTCGGAAGTGGTTCCATTATTTAGGATTCTTTCAGAAACTAT
GGCCAAACGGGGCGTGAACAGGACAGTAAGTGATCAAGCAATACATCTTGATTTAGTAGCCAAGGCTCGAGGAATTGCTGCTGCCGAGAGTTACTTTGTTGGTCTTCCTG
AATCATCAAAGAATCACCTTTGCTATGGCTCTCTTCTCAACTGTTACTGCAAGGAATTATTGACTGAAAAGGCTGAAGCTCTCTTTGAAAAGATGAAGGAACTCAACCTT
CTGGTGACCTCTATGTCATATAATAGCCTTATGACACTATACATGAAGACTGGGCAGCCAGAAAAAGTTCGTGCAATCATACAGGAAATGAAGGCTGCTAACGTAATGTT
TGACTCCTATACATACAATGTGTGGATGAGGGCACTTGCTGCTTTAAATGATATCTCTGGTGTGGAAAGGGTTATTGATGAGATGAAGAGGGATGGTGGAGTTGTGGGAG
ATTGGACAACATATAGCAATTTAGCCTCAATTTATGTCAATGCAAACTTGTTCGAGAAGGCAGCCAAGGCACTGAAGGACTTGGAGAAGAGAAATGCTTGCCGAGATCTC
TCTGCTTTCCAGTTCCTGATTACGTTGTATGGACAAATTGGTAACCTGCCTGAAGTTTATAGAGTTTGGCGCTCATTAAGGTTGGCCTTTCCGAAAACTGCAAATATAAG
CTATCTCAATATGATCCAGACTCTGACGAAATTAAAAGATTTACCTGGCGCAGAGAAATGTTTCAAGGAATGGGAATCAGGGTGCTCGACTTATGATATTAGGATTGCAA
ATGCTCTTATGGGAGCTTATACCAAGGAGGGTTTGCTAGAGAAAGCTATGGAGCTGAAGGAACGAGCCCGAAAAAGAGGAGCTAAACCTAATGCAAAAACTTGGGAAATT
TTCCTGGATTATTATCTCAAAAATGGAGACTTCAAACTGGCAGCTGATTGTGTTGAGAAAGCAGTATCTAAAGGTAAAGGAGATGGTGGGAAATGGATGCCTTCTTCTGA
GATAATTAAATCATTAATGAGCCATTTTGAGCTAGAAAAAGATGTTGATGGAGCAGAAGGTTTTCTTGAAATTGTGAAGAAGACCGTTGACACTTTAGACTCCGAGGTTT
TTGAATCATTGATCAGAACATATTCTGCAGCGGGAAGGAAAAGTTCTACGATGAATCGCAGGTTGAAAATGGAGAATGTTGAGGTCAGTGAGGCTTGCAAGAAGCTACTT
AACGAAATATCTATCGAATGA
mRNA sequence
ATGGCGTTACGGCAGTTAAGTCGCACCAAGAATGTGGCGAAGAGGTCGACGAAGAAGTATCTGGAGGAAGCACTGTACGTGAGGCTCTTTAAAGACGGTGGCTCAGAGAA
GAGCCTTCGGCAACAGTTGAATGGTTTCATCAAGAGTCACAAGCGAGTTTTCAAATGGGAGGTTGGAGATACACTCAAAAAGCTTCGCGATAGGAAGCTGTACAATCCTG
CTCTCAAGGTTCGTCTAACCAAACCCTTGAAATTCTGTGGTATTGATGATGAATGGCGAAATATTTTAATCAGAGTTGTTGTTATGGCTATCTGTTCTACTTGTCTTAGC
TGTGCAAAAATAGATAAGTATTCTATGGTTGAATGGGACAGAATCAGTGGACAGAATTGTAGTATAATGTCGGAAGTGGTTCCATTATTTAGGATTCTTTCAGAAACTAT
GGCCAAACGGGGCGTGAACAGGACAGTAAGTGATCAAGCAATACATCTTGATTTAGTAGCCAAGGCTCGAGGAATTGCTGCTGCCGAGAGTTACTTTGTTGGTCTTCCTG
AATCATCAAAGAATCACCTTTGCTATGGCTCTCTTCTCAACTGTTACTGCAAGGAATTATTGACTGAAAAGGCTGAAGCTCTCTTTGAAAAGATGAAGGAACTCAACCTT
CTGGTGACCTCTATGTCATATAATAGCCTTATGACACTATACATGAAGACTGGGCAGCCAGAAAAAGTTCGTGCAATCATACAGGAAATGAAGGCTGCTAACGTAATGTT
TGACTCCTATACATACAATGTGTGGATGAGGGCACTTGCTGCTTTAAATGATATCTCTGGTGTGGAAAGGGTTATTGATGAGATGAAGAGGGATGGTGGAGTTGTGGGAG
ATTGGACAACATATAGCAATTTAGCCTCAATTTATGTCAATGCAAACTTGTTCGAGAAGGCAGCCAAGGCACTGAAGGACTTGGAGAAGAGAAATGCTTGCCGAGATCTC
TCTGCTTTCCAGTTCCTGATTACGTTGTATGGACAAATTGGTAACCTGCCTGAAGTTTATAGAGTTTGGCGCTCATTAAGGTTGGCCTTTCCGAAAACTGCAAATATAAG
CTATCTCAATATGATCCAGACTCTGACGAAATTAAAAGATTTACCTGGCGCAGAGAAATGTTTCAAGGAATGGGAATCAGGGTGCTCGACTTATGATATTAGGATTGCAA
ATGCTCTTATGGGAGCTTATACCAAGGAGGGTTTGCTAGAGAAAGCTATGGAGCTGAAGGAACGAGCCCGAAAAAGAGGAGCTAAACCTAATGCAAAAACTTGGGAAATT
TTCCTGGATTATTATCTCAAAAATGGAGACTTCAAACTGGCAGCTGATTGTGTTGAGAAAGCAGTATCTAAAGGTAAAGGAGATGGTGGGAAATGGATGCCTTCTTCTGA
GATAATTAAATCATTAATGAGCCATTTTGAGCTAGAAAAAGATGTTGATGGAGCAGAAGGTTTTCTTGAAATTGTGAAGAAGACCGTTGACACTTTAGACTCCGAGGTTT
TTGAATCATTGATCAGAACATATTCTGCAGCGGGAAGGAAAAGTTCTACGATGAATCGCAGGTTGAAAATGGAGAATGTTGAGGTCAGTGAGGCTTGCAAGAAGCTACTT
AACGAAATATCTATCGAATGAGCTTTTGTTGAATCACAACTTTTTATCATTTGGAATTCCAAGCTCAGAGGGAAATGATTTTTTCAGTGGTTCTTTAATTTTAAATAAAG
TTACTTTTCCTTCTGTCCTTGTGGAATACTGACAACAGAGCCACCTCTTGTTGGTTCCTTGCTGAAAGTAAAGTTCATGTGCAATTGTATTGAAATCCTTTCTCATTTAC
ATTATTGTGGTTGGTAGTTGCCAAGCTCCTGGATACAATATACGTTTGTTTCAGTAATCAGAGTAGTAAATGACTGC
Protein sequence
MALRQLSRTKNVAKRSTKKYLEEALYVRLFKDGGSEKSLRQQLNGFIKSHKRVFKWEVGDTLKKLRDRKLYNPALKVRLTKPLKFCGIDDEWRNILIRVVVMAICSTCLS
CAKIDKYSMVEWDRISGQNCSIMSEVVPLFRILSETMAKRGVNRTVSDQAIHLDLVAKARGIAAAESYFVGLPESSKNHLCYGSLLNCYCKELLTEKAEALFEKMKELNL
LVTSMSYNSLMTLYMKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGGVVGDWTTYSNLASIYVNANLFEKAAKALKDLEKRNACRDL
SAFQFLITLYGQIGNLPEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANALMGAYTKEGLLEKAMELKERARKRGAKPNAKTWEI
FLDYYLKNGDFKLAADCVEKAVSKGKGDGGKWMPSSEIIKSLMSHFELEKDVDGAEGFLEIVKKTVDTLDSEVFESLIRTYSAAGRKSSTMNRRLKMENVEVSEACKKLL
NEISIE
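
The protein sequence above corresponds to the CDS translated with the standard genetic code. The following is a minimal sketch of that translation, assuming an unambiguous A/C/G/T sequence whose length is a multiple of three; the name `translate_cds` is hypothetical and not part of this database. It is checked here only against the first and last codons of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into protein, dropping any trailing stop codon."""
    cds = "".join(cds.split()).upper()  # remove line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First 30 nt of the CDS and its in-frame final 21 nt.
print(translate_cds("ATGGCGTTACGGCAGTTAAGTCGCACCAAG"))  # MALRQLSRTK
print(translate_cds("AACGAAATATCTATCGAATGA"))           # NEISIE
```

A base outside A/C/G/T (for example `N`) raises a `KeyError` in this sketch rather than being translated to an unknown residue.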