CuGenDBv2

Spg038370 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg038370
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold1:56913028..56915310
RNA-Seq Expression: Spg038370
Synteny: Spg038370
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
                  IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology
GenBank top hits: e value, % identity, alignment
KAG6589742.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00, % identity: 90.39)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHYSH +VLS+SSIS P  FNSLHFFSSTQ+P +TATQNE P DP  SS AAVPQPVE  AVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDE+LFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERS+KSY+ LFKVI+RRGRYMMAKRYFNAMLNEGIEPT HTYNVMLWGFFLSLRLET KRFYEDMK+RGI+PDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGYYRFKMMEEAEQFFTEMKGKN+VPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLS+PTE+GHYGILIENCCKAG+YD+AVKLLDKLVEKEIIL+PQSTLEMEASAYN +IQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDS LFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGR+DLLM  +CPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNI+ SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRM KGGDR   KK S AA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

XP_022135178.1 pentatricopeptide repeat-containing protein At2g37230 [Momordica charantia] (e value: 0.0e+00, % identity: 92.32)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLS+P   +FRVLS+SSISNP+  N LHFFSSTQE I     NE P+DPSASS AA PQP   AAVNG EQVK+RTPRGKHRNPEKLED+ICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVP+FDHSLVWNVLHAAKNSDHALKFFRWVER+GLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDE+LFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERS+KSYDALFKVI+RRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLET K+FYEDMKSRGISPDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLC+AEKMSEARQIL EMVDKYIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPT+AGHYGILIENCCKA  YDQAVKLLDKLVEKEIILRPQSTL+MEASAYN IIQYLCNHGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQL+KKGIQDEVAFNNLIRGHSKEGNPEL YE+LKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDS LFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVM SML+KGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDF SLLSVLCEKGKTIAALKLLDFGLEREC+I+ SSYEKVLDALLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK
        LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQAD+LSRM KGGDRKGSKK
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK

XP_022921867.1 pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita moschata] (e value: 0.0e+00, % identity: 90.26)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHYSH +VLS+SSIS P  FNSLHFFSSTQ+P +TATQNE P DP  SS AAVPQPVE  AVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDE+LFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERS+KSY+ LFKVI+RRGRYMMAKRYFNAMLNEGIEPT HTYNVMLWGFFLSLRLET KRFYEDMK+RGI+PDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGYYRFKMMEEAEQFFTEMKGKN+VPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKAG+YD+AVKLLDKLVEKEIIL+PQSTLEMEASAYN +IQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSML KGITENLDLVAKILEALFMRGHVEEALGR+DLLM  +CPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNI+ SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRM KGGDR   KK S  A
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

XP_023515807.1 pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita pepo subsp. pepo] (e value: 0.0e+00, % identity: 90.39)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHYSH +VLS+SSIS P  FNSLHFFSS Q+P +TATQNE P DP  SS AAVPQPVE  AVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHAL+FFRWVER+GLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDE+LFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERS+KSY+ LFKVI+RRGRYMMAKRYFNAMLNEGIEPT HTYNVMLWGFFLSLRLET KRFYEDMK+RGI+PDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGYYRFKMMEEAEQFFTEMKG N++PTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAG+YD+AVKLLDKLVEKEIIL+PQSTLEMEASAYN IIQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITEN+DLVAKILEALFMRGHVEEALGR+DLLM C+CPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNI+ SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRM KGGDR   KK S AA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

XP_038880029.1 pentatricopeptide repeat-containing protein At2g37230 [Benincasa hispida] (e value: 0.0e+00, % identity: 92.89)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHIS+SK H SH+RVLS+SSI  PT   SLHFFSSTQEPISTATQNE PN PSASS AAVPQP E+ AVNG EQVKRRTPRGK RNPEKLED+IC+MMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHAL FFRWVERAGLF+HDR+THLKIIEILGRASKLNHARCILLDM NKG+EWDE+LFV+LI+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERSVKSYDALFKVI+RRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLET KRFYEDMKSRGISPDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQC HGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYD+AVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKA+TF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGRR VSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDSALFRSVMESLF DGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEAL MRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCE+GKTIAALKLLDFGLERECNI+FSSYEKVLDALLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAY+ILCKIMEKGGA DW S DDLI+SLNQEGNTKQADILSR  KGGDRK  KK SLAA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

TrEMBL top hits: e value, % identity, alignment
A0A0A0LTL3 PPR_long domain-containing protein (e value: 0.0e+00, % identity: 90)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHIS+SK H++H+RVLS+SSIS PT  NSLHFFSSTQEPISTATQN  PNDPSASS AA+PQ  E+AAVNGV+QVK R PRG+ R+PEKLE IIC+MMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDH+LV+NVLHAAK S+HAL FFRWVERAGLF+HDR+TH KIIEILGRASKLNHARCILLDMPNKGV+WDE+LFVVLI+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERSVKSYDALFK IMRRGRYMMAKRYFNAMLNEGIEP RHTYNVMLWGFFLSLRLET KRFYEDMKSRGISPDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGY RFKMMEEAEQFFTEMKGKNI PTVISYTTMIKGYVSV R DD LRLFEEMKA G KPNDITYSTLLPGLCDAEK+ EAR+ILTEMV ++ APKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLL+ LVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKA+TF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGH+KEGNP+LA+EMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKT LD+MIE+GH PDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGH EEALGRI+LLM+CNCPPDF+SLLSVLCEKGKT +A KLLDFGLERECNI+FSSYEKVLDALLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAY+ILCKIMEKGGAKDWSSCDDLI+SLNQEGNTKQADILSRM KGGDRK SKK SLAA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

A0A5A7T0L7 Pentatricopeptide repeat-containing protein (e value: 0.0e+00, % identity: 90.13)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHIS+SK H++H+RVLS+SSIS PT  NSLHFFSSTQEPIS ATQNE PNDP ASS AA+PQ  E+AAVNGV+QVK R PRG+ RN EKLED+ICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        +REWTTRLQNSIRSLVPQFDH LV+NVLHAAK S+HAL FFRWVERAGLF+HDR+THLKIIEILG ASKLNHARCILLDMPNKGVEWDE+LFVVLIDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIF+KMKELGVERS KSYDALFKVI+RRGRYMMAKRYFNAMLNEG+EPTRHTYNVMLWGFFLSLRLET KRFYEDMKSRGISPDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGY RFKMMEEAEQFFTEMKGKNI PTVISYTTMIKGYVSVGRVDDGLRLFEEMKA G KPNDITYSTLLPGLCDAEK+ EAR+ILTEMV +YIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAM+RLSIPTEAGHYGILIENCCKAGMYD+AVKLLD+LVEKEIIL+PQSTLEMEASAYNLIIQYLCNHGQTGKAE F
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGH+KEGNPE A+EMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKT LD+MIE+GH PDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGH EE LGRI+LLM+CNCPPDFDSLLSVLCEKGKTIAA KLL+FGLERECNI FSSYEKVLDAL+GAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAY+ILCKIMEKGGAKDWSSCDDLI++LNQEGNTKQADILSRM KGGDRK SKK+SLAA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

A0A6J1C431 pentatricopeptide repeat-containing protein At2g37230 (e value: 0.0e+00, % identity: 92.32)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLS+P   +FRVLS+SSISNP+  N LHFFSSTQE I     NE P+DPSASS AA PQP   AAVNG EQVK+RTPRGKHRNPEKLED+ICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVP+FDHSLVWNVLHAAKNSDHALKFFRWVER+GLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDE+LFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERS+KSYDALFKVI+RRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLET K+FYEDMKSRGISPDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLC+AEKMSEARQIL EMVDKYIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPT+AGHYGILIENCCKA  YDQAVKLLDKLVEKEIILRPQSTL+MEASAYN IIQYLCNHGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQL+KKGIQDEVAFNNLIRGHSKEGNPEL YE+LKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDS LFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVM SML+KGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDF SLLSVLCEKGKTIAALKLLDFGLEREC+I+ SSYEKVLDALLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK
        LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQAD+LSRM KGGDRKGSKK
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK

A0A6J1E1L0 pentatricopeptide repeat-containing protein At2g37230-like (e value: 0.0e+00, % identity: 90.26)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHYSH +VLS+SSIS P  FNSLHFFSSTQ+P +TATQNE P DP  SS AAVPQPVE  AVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDE+LFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERS+KSY+ LFKVI+RRGRYMMAKRYFNAMLNEGIEPT HTYNVMLWGFFLSLRLET KRFYEDMK+RGI+PDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGYYRFKMMEEAEQFFTEMKGKN+VPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKAG+YD+AVKLLDKLVEKEIIL+PQSTLEMEASAYN +IQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSML KGITENLDLVAKILEALFMRGHVEEALGR+DLLM  +CPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNI+ SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRM KGGDR   KK S  A
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

A0A6J1JJW0 pentatricopeptide repeat-containing protein At2g37230-like (e value: 0.0e+00, % identity: 90)
Query:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHYSH +V S+SSIS    FNSLHFFSSTQ+PIST TQNE PNDP  SS AAVPQ VE  AVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMP KGVEWDE+LFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYG

Query:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM
        K GIVQEAVKIFQKMKELGVERS+KSYDALFKVI+RRGRYMMAKRYFN MLNEGIEPTRHTYNVMLWGFFLSLRLET KRFYEDMK+RGI+PDVVTYNTM
Subjt:  KTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTM

Query:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGY+RFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND+TYSTLLPGLCDAEKM EA QILTEMVD+YIAPKDN
Subjt:  INGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF
        SIFMRLLSCQC HGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKAG+YD+AVKLLDKLV+KEIIL+PQSTLEMEASAYN IIQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDE+AFNNLIRGHSKEGNPELA+EMLKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKT LD+MIESGHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGR+DLLM C+CPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNI+ SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQAD+LSRM KGGD    K  S AA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA

SwissProt top hits: e value, % identity, alignment
O81908 Pentatricopeptide repeat-containing protein At1g02060, chloroplastic (e value: 3.5e-98, % identity: 32.98)
Query:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---
        KL   + R + +  W+  L++S+ SL P      + V   L   K     L+FF WV   G F H   +   ++E LGRA  LN AR  L  +  +    
Subjt:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---

Query:  VEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNE-GIEPTRHTYNVMLWGFFLSLRLETTKRFYE
        V+  +  F  LI SYG  G+ QE+VK+FQ MK++G+  SV ++++L  ++++RGR  MA   F+ M    G+ P  +T+N ++ GF  +  ++   R ++
Subjt:  VEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNE-GIEPTRHTYNVMLWGFFLSLRLETTKRFYE

Query:  DMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS
        DM+    +PDVVTYNT+I+G  R   ++ A    + M  K  ++ P V+SYTT+++GY     +D+ + +F +M + G+KPN +TY+TL+ GL +A +  
Subjt:  DMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS

Query:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEAS
        E + IL    D +   AP D   F  L+   C  G LDAAM V + M+ + +  ++  Y +LI   C    +D+A  L ++L EKE++L       + A+
Subjt:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEAS

Query:  AYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHY
        AYN + +YLC +G+T +AE  FRQL+K+G+QD  ++  LI GH +EG  + AYE+L +M RR    D E+Y+LLI   L  GE   A  TL  M+ S + 
Subjt:  AYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHY

Query:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERE
        P +  F SV+  L        +  ++  ML+K I +N+DL  +++  LF     E+A   + LL         + LL  LCE  K + A  L+ F LE+ 
Subjt:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERE

Query:  CNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR
          +D  +   V++ L    +   A+S+  +++E G  +  S    L  +L   G  ++   +S+
Subjt:  CNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580 (e value: 8.1e-55, % identity: 25.17)
Query:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVK
        V+   K+   AL+ F  + +   F+H   T+  +IE LG   K      +L+DM  N G    E ++V  + +YG+ G VQEAV +F++M     E +V 
Subjt:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVK

Query:  SYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGKNI
        SY+A+  V++  G +  A + +  M + GI P  +++ + +  F  + R     R   +M S+G   +VV Y T++ G+Y      E  + F +M    +
Subjt:  SYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGKNI

Query:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK
           + ++  +++     G V +  +L +++   GV PN  TY+  + GLC   ++  A +++  ++++   PK + I +  L+   CK+     A   L 
Subjt:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK

Query:  AMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS
         M+   +  ++  Y  LI   CK GM    V+L +++V   +     +    +   Y  +I  LC+ G+T +A   F + L KGI+  V  +N LI+G S
Subjt:  AMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS

Query:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI
         +G    A ++   M  +G+  + +++ +L+      G  +DA   +  MI  G++PD   F  ++       +++ A  +++ MLD G+  ++     +
Subjt:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI

Query:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIME
        L  L      E+ +     ++   C P+   F+ LL  LC   K   AL LL+    +  N D  ++  ++D     G    AY++  K+ E
Subjt:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIME

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial (e value: 6.5e-52, % identity: 28.22)
Query:  HSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGV
        ++LV  +    K SD  +   R VE    F+ +  T+  ++ ++ ++ +   A  +L  M  + ++ D   + ++ID   K G +  A  +F +M+  G 
Subjt:  HSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGV

Query:  ERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM
        +  + +Y+ L       GR+    +    M+   I P   T++V++  F    +L    +  ++M  RGI+P+ +TYN++I+G+ +   +EEA Q    M
Subjt:  ERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM

Query:  KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
          K   P ++++  +I GY    R+DDGL LF EM   GV  N +TY+TL+ G C + K+  A+++  EMV + + P D   +  LL   C +G+L+ A+
Subjt:  KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM

Query:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLI
         +   + +  +  + G Y I+I   C A   D A  L   L        P   ++++A AYN++I  LC      KA+  FR++ ++G   DE+ +N LI
Subjt:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLI

Query:  RGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGE
        R H  + +   A E+++ M   G   D  + K++I + LS GE
Subjt:  RGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGE

Q9LSL9 Pentatricopeptide repeat-containing protein At5g65560 (e value: 3.5e-50; % identity: 21.66)
Query:  SSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRD
        S+   VP PV       V  + R  P  +  +   +   +  +++   W      S++S+V     S V ++     +   AL F  W+ +   ++H   
Subjt:  SSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRD

Query:  THLKIIEIL---GRASKLNHARCIL-------------LDMPNKGVEWDEEL----------FVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDA
        ++  ++ +L   G    +   R ++             LD+  K +  DE            +  L++S  + G+V E  +++ +M E  V  ++ +Y+ 
Subjt:  THLKIIEIL---GRASKLNHARCIL-------------LDMPNKGVEWDEEL----------FVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDA

Query:  LFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGKNIVPTV
        +     + G    A +Y + ++  G++P   TY  ++ G+     L++  + + +M  +G   + V Y  +I+G    + ++EA   F +MK     PTV
Subjt:  LFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGKNIVPTV

Query:  ISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRL
         +YT +IK      R  + L L +EM+  G+KPN  TY+ L+  LC   K  +AR++L +M++K + P +   +  L++  CK G ++ A+ V++ M   
Subjt:  ISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRL

Query:  SIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLIRGHSKEGNP
         +      Y  LI+  CK+ ++ +A+ +L+K++E++++         +   YN +I   C  G    A      +  +G + D+  + ++I    K    
Subjt:  SIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLIRGHSKEGNP

Query:  ELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALF
        E A ++   + ++GV+ +   Y  LI  Y   G+  +A   L+ M+     P+S  F +++  L ADG+++ A+ +   M+  G+   +     ++  L 
Subjt:  ELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALF

Query:  MRGHVEEALGRIDLLMSCNCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDL
          G  + A  R   ++S    PD   + + +   C +G+ + A  ++    E   + D  +Y  ++      G+T  A+ +L ++ + G      +   L
Subjt:  MRGHVEEALGRIDLLMSCNCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDL

Query:  IRSL------NQEGNTKQADILSRM
        I+ L       Q+G+  +   +S M
Subjt:  IRSL------NQEGNTKQADILSRM

Q9ZUU3 Pentatricopeptide repeat-containing protein At2g37230 (e value: 6.9e-296; % identity: 68.21)
Query:  MAHISLSKPHYSHFRV-LSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMM
        MA IS SK + S  RV LS    SN ++F+    FS+ +E  + A  N     P A S     +  +         ++ R  RGK +N EKLED ICRMM
Subjt:  MAHISLSKPHYSHFRV-LSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMM

Query:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSY
         NR WTTRLQNSIR LVP++DHSLV+NVLH AK  +HAL+FFRW ER+GL RHDRDTH+K+I++LG  SKLNHARCILLDMP KGV WDE++FVVLI+SY
Subjt:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSY

Query:  GKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNT
        GK GIVQE+VKIFQKMK+LGVER++KSY++LFKVI+RRGRYMMAKRYFN M++EG+EPTRHTYN+MLWGFFLSLRLET  RF+EDMK+RGISPD  T+NT
Subjt:  GKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNT

Query:  MINGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD
        MING+ RFK M+EAE+ F EMKG  I P+V+SYTTMIKGY++V RVDDGLR+FEEM++ G++PN  TYSTLLPGLCDA KM EA+ IL  M+ K+IAPKD
Subjt:  MINGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD

Query:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAET
        NSIF++LL  Q K GD+ AA  VLKAM  L++P EAGHYG+LIEN CKA  Y++A+KLLD L+EKEIILR Q TLEME SAYN II+YLCN+GQT KAE 
Subjt:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAET

Query:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQ
         FRQL+K+G+QD+ A NNLIRGH+KEGNP+ +YE+LKIM RRGV R++ +Y+LLIKSY+SKGEP DAKT LD+M+E GH PDS+LFRSV+ESLF DGRVQ
Subjt:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQ

Query:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGA
        TASRVM  M+DK  GI +N+DL+AKILEAL MRGHVEEALGRIDLL       D DSLLSVL EKGKTIAALKLLDFGLER+ +++FSSY+KVLDALLGA
Subjt:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGA

Query:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK
        GKTLNAYS+LCKIMEKG + DW S D+LI+SLNQEGNTKQAD+LSRM K G  +G KK
Subjt:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK

Arabidopsis top hits (e value, % identity, alignment)
AT1G02060.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.5e-99; % identity: 32.98)
Query:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---
        KL   + R + +  W+  L++S+ SL P      + V   L   K     L+FF WV   G F H   +   ++E LGRA  LN AR  L  +  +    
Subjt:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---

Query:  VEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNE-GIEPTRHTYNVMLWGFFLSLRLETTKRFYE
        V+  +  F  LI SYG  G+ QE+VK+FQ MK++G+  SV ++++L  ++++RGR  MA   F+ M    G+ P  +T+N ++ GF  +  ++   R ++
Subjt:  VEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNE-GIEPTRHTYNVMLWGFFLSLRLETTKRFYE

Query:  DMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS
        DM+    +PDVVTYNT+I+G  R   ++ A    + M  K  ++ P V+SYTT+++GY     +D+ + +F +M + G+KPN +TY+TL+ GL +A +  
Subjt:  DMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS

Query:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEAS
        E + IL    D +   AP D   F  L+   C  G LDAAM V + M+ + +  ++  Y +LI   C    +D+A  L ++L EKE++L       + A+
Subjt:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEAS

Query:  AYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHY
        AYN + +YLC +G+T +AE  FRQL+K+G+QD  ++  LI GH +EG  + AYE+L +M RR    D E+Y+LLI   L  GE   A  TL  M+ S + 
Subjt:  AYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHY

Query:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERE
        P +  F SV+  L        +  ++  ML+K I +N+DL  +++  LF     E+A   + LL         + LL  LCE  K + A  L+ F LE+ 
Subjt:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERE

Query:  CNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR
          +D  +   V++ L    +   A+S+  +++E G  +  S    L  +L   G  ++   +S+
Subjt:  CNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR

AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 4.6e-53; % identity: 28.22)
Query:  HSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGV
        ++LV  +    K SD  +   R VE    F+ +  T+  ++ ++ ++ +   A  +L  M  + ++ D   + ++ID   K G +  A  +F +M+  G 
Subjt:  HSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGV

Query:  ERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM
        +  + +Y+ L       GR+    +    M+   I P   T++V++  F    +L    +  ++M  RGI+P+ +TYN++I+G+ +   +EEA Q    M
Subjt:  ERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM

Query:  KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
          K   P ++++  +I GY    R+DDGL LF EM   GV  N +TY+TL+ G C + K+  A+++  EMV + + P D   +  LL   C +G+L+ A+
Subjt:  KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM

Query:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLI
         +   + +  +  + G Y I+I   C A   D A  L   L        P   ++++A AYN++I  LC      KA+  FR++ ++G   DE+ +N LI
Subjt:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLI

Query:  RGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGE
        R H  + +   A E+++ M   G   D  + K++I + LS GE
Subjt:  RGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGE

AT1G30290.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.5e-59; % identity: 24.43)
Query:  NEIPNDPSASSV-AAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMANR-EWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWV
        +E+ +D    +V  ++PQ  E A    VE+ + R P         L   + R++  R  W  + +  +R+L+     S V  VL +  +   ALKFF W 
Subjt:  NEIPNDPSASSV-AAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMANR-EWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWV

Query:  ERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAK
        +R   +RHD   +  ++E+L +      +R +L+ M  +G+    E F  ++ SY + G +++A+K+   M+  GVE ++   +    V +R  R   A 
Subjt:  ERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAK

Query:  RYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM-KGKNIVPTVISYTTMIKGYVSVG
        R+   M   GI P   TYN M+ G+    R+E      EDM S+G  PD V+Y T++    + K + E      +M K   +VP  ++Y T+I       
Subjt:  RYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEM-KGKNIVPTVISYTTMIKGYVSVG

Query:  RVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIE
          D+ L   ++ +  G + + + YS ++  LC   +MSEA+ ++ EM+ K   P D   +  +++  C+ G++D A  +L+ M           Y  L+ 
Subjt:  RVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIE

Query:  NCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRG
          C+ G   +A ++++  + +E    P S        Y++I+  L   G+  +A    R+++ KG     V  N L++   ++G    A + ++    +G
Subjt:  NCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRG

Query:  VSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDL
         + +  ++  +I  +    E   A + LD+M     + D   + +++++L   GR+  A+ +M  ML KGI         ++      G V++ +  ++ 
Subjt:  VSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDL

Query:  LMS-CNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADIL
        ++S   C   ++ ++  LC  GK   A  LL   L      D  +   +++  L  G  L+AY + C++  +    D   C+ L + L  +G   +AD L
Subjt:  LMS-CNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADIL

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 5.8e-56; % identity: 25.17)
Query:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVK
        V+   K+   AL+ F  + +   F+H   T+  +IE LG   K      +L+DM  N G    E ++V  + +YG+ G VQEAV +F++M     E +V 
Subjt:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGVERSVK

Query:  SYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGKNI
        SY+A+  V++  G +  A + +  M + GI P  +++ + +  F  + R     R   +M S+G   +VV Y T++ G+Y      E  + F +M    +
Subjt:  SYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGKNI

Query:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK
           + ++  +++     G V +  +L +++   GV PN  TY+  + GLC   ++  A +++  ++++   PK + I +  L+   CK+     A   L 
Subjt:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK

Query:  AMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS
         M+   +  ++  Y  LI   CK GM    V+L +++V   +     +    +   Y  +I  LC+ G+T +A   F + L KGI+  V  +N LI+G S
Subjt:  AMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS

Query:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI
         +G    A ++   M  +G+  + +++ +L+      G  +DA   +  MI  G++PD   F  ++       +++ A  +++ MLD G+  ++     +
Subjt:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI

Query:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIME
        L  L      E+ +     ++   C P+   F+ LL  LC   K   AL LL+    +  N D  ++  ++D     G    AY++  K+ E
Subjt:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIME

AT2G37230.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 4.9e-297; % identity: 68.21)
Query:  MAHISLSKPHYSHFRV-LSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMM
        MA IS SK + S  RV LS    SN ++F+    FS+ +E  + A  N     P A S     +  +         ++ R  RGK +N EKLED ICRMM
Subjt:  MAHISLSKPHYSHFRV-LSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMM

Query:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSY
         NR WTTRLQNSIR LVP++DHSLV+NVLH AK  +HAL+FFRW ER+GL RHDRDTH+K+I++LG  SKLNHARCILLDMP KGV WDE++FVVLI+SY
Subjt:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSY

Query:  GKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNT
        GK GIVQE+VKIFQKMK+LGVER++KSY++LFKVI+RRGRYMMAKRYFN M++EG+EPTRHTYN+MLWGFFLSLRLET  RF+EDMK+RGISPD  T+NT
Subjt:  GKTGIVQEAVKIFQKMKELGVERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNT

Query:  MINGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD
        MING+ RFK M+EAE+ F EMKG  I P+V+SYTTMIKGY++V RVDDGLR+FEEM++ G++PN  TYSTLLPGLCDA KM EA+ IL  M+ K+IAPKD
Subjt:  MINGYYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD

Query:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAET
        NSIF++LL  Q K GD+ AA  VLKAM  L++P EAGHYG+LIEN CKA  Y++A+KLLD L+EKEIILR Q TLEME SAYN II+YLCN+GQT KAE 
Subjt:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAET

Query:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQ
         FRQL+K+G+QD+ A NNLIRGH+KEGNP+ +YE+LKIM RRGV R++ +Y+LLIKSY+SKGEP DAKT LD+M+E GH PDS+LFRSV+ESLF DGRVQ
Subjt:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQ

Query:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGA
        TASRVM  M+DK  GI +N+DL+AKILEAL MRGHVEEALGRIDLL       D DSLLSVL EKGKTIAALKLLDFGLER+ +++FSSY+KVLDALLGA
Subjt:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLCEKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGA

Query:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK
        GKTLNAYS+LCKIMEKG + DW S D+LI+SLNQEGNTKQAD+LSRM K G  +G KK
Subjt:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKK


Sequences
CDS sequence
ATGGCTCACATTTCTTTATCTAAACCTCACTACAGCCATTTCAGGGTTCTTTCCACTTCTTCAATTTCGAACCCAACAGTCTTCAATTCGCTTCATTTCTTCAGCTCCAC
TCAAGAACCGATCTCCACAGCTACTCAAAATGAAATCCCCAATGATCCATCCGCCAGTTCTGTTGCTGCAGTGCCTCAACCCGTGGAAACGGCGGCTGTTAATGGCGTCG
AGCAAGTTAAGCGAAGAACACCTAGAGGTAAGCACCGAAACCCAGAAAAATTAGAGGATATTATTTGTAGAATGATGGCAAATCGTGAATGGACAACACGTTTACAGAAC
TCCATTCGGTCCTTGGTTCCCCAATTTGATCACTCCCTTGTTTGGAATGTGTTGCATGCTGCTAAGAACTCGGACCATGCGCTCAAGTTCTTCCGGTGGGTGGAGCGAGC
CGGCTTATTCCGGCACGATCGTGATACCCATTTGAAAATAATAGAGATTCTAGGTCGGGCTTCAAAGCTTAACCATGCCCGTTGTATTCTTCTTGATATGCCGAACAAGG
GTGTTGAATGGGATGAAGAATTATTCGTTGTATTGATTGATAGTTATGGCAAAACTGGGATAGTTCAGGAGGCTGTGAAAATTTTTCAAAAGATGAAGGAATTGGGTGTT
GAGAGAAGTGTTAAATCTTATGATGCTCTATTTAAGGTGATTATGAGGAGGGGGCGGTATATGATGGCCAAGAGGTACTTTAATGCTATGTTGAATGAAGGAATAGAGCC
AACTCGGCATACCTATAATGTGATGCTTTGGGGATTTTTTCTGTCGTTGAGGCTTGAAACAACCAAGAGATTTTATGAAGACATGAAGAGTAGAGGTATTTCACCTGACG
TTGTTACATATAACACTATGATTAATGGATATTATCGGTTCAAGATGATGGAGGAGGCAGAGCAGTTCTTTACTGAGATGAAGGGGAAGAATATTGTACCAACAGTGATA
AGCTATACTACTATGATAAAAGGTTATGTGTCTGTTGGTCGAGTAGATGATGGATTGAGATTGTTTGAAGAGATGAAGGCTGTTGGTGTCAAGCCAAATGATATTACTTA
TTCAACATTGCTCCCTGGTCTCTGCGATGCAGAGAAAATGTCTGAAGCGCGACAAATTTTGACAGAAATGGTGGACAAGTATATTGCTCCAAAGGACAATTCGATTTTCA
TGAGGCTGTTATCTTGCCAGTGCAAGCATGGTGATTTAGATGCTGCTATGCATGTGCTGAAAGCAATGATTCGATTAAGCATTCCAACAGAGGCTGGGCATTACGGTATT
TTAATTGAGAACTGTTGCAAAGCTGGAATGTATGATCAGGCAGTTAAATTGCTTGACAAGCTTGTAGAAAAAGAAATTATACTGAGGCCACAAAGTACATTGGAAATGGA
GGCTAGTGCATATAACCTTATAATTCAGTATCTGTGTAACCATGGCCAGACTGGAAAAGCTGAAACCTTTTTCCGGCAGTTGTTGAAGAAGGGCATTCAGGATGAGGTTG
CATTTAATAATTTGATCCGTGGCCATTCCAAAGAAGGTAATCCTGAATTGGCATATGAAATGTTGAAAATCATGGGTAGGAGAGGTGTGTCTAGGGATGCAGAGTCTTAT
AAGTTGCTTATCAAGAGTTACTTGAGTAAAGGTGAACCGGCTGATGCTAAAACAACTTTGGACAACATGATCGAAAGTGGGCACTATCCCGATTCAGCGTTGTTTAGGTC
AGTGATGGAAAGTCTATTTGCAGATGGGCGGGTGCAAACTGCGAGCCGAGTGATGAATAGTATGTTGGATAAAGGAATAACAGAAAACTTGGACTTGGTTGCTAAAATCC
TGGAAGCCCTTTTCATGAGAGGTCATGTCGAGGAGGCTTTGGGACGAATTGATTTACTAATGAGCTGCAATTGCCCACCTGATTTTGATAGTCTTTTATCTGTTCTTTGT
GAAAAGGGGAAAACCATTGCTGCTCTCAAGCTTTTAGATTTTGGGTTGGAAAGAGAGTGCAACATAGACTTCTCAAGTTATGAGAAAGTACTTGATGCGCTGTTGGGAGC
AGGGAAGACGCTGAACGCATACTCGATTCTCTGCAAGATAATGGAGAAAGGAGGGGCAAAGGATTGGAGCAGCTGTGATGATCTGATCAGAAGCCTCAATCAGGAAGGGA
ACACAAAACAAGCTGATATTCTCTCAAGAATGAGAAAGGGTGGAGACAGAAAGGGGAGTAAGAAAACTTCTCTTGCTGCTTGA
mRNA sequence
ATGGCTCACATTTCTTTATCTAAACCTCACTACAGCCATTTCAGGGTTCTTTCCACTTCTTCAATTTCGAACCCAACAGTCTTCAATTCGCTTCATTTCTTCAGCTCCAC
TCAAGAACCGATCTCCACAGCTACTCAAAATGAAATCCCCAATGATCCATCCGCCAGTTCTGTTGCTGCAGTGCCTCAACCCGTGGAAACGGCGGCTGTTAATGGCGTCG
AGCAAGTTAAGCGAAGAACACCTAGAGGTAAGCACCGAAACCCAGAAAAATTAGAGGATATTATTTGTAGAATGATGGCAAATCGTGAATGGACAACACGTTTACAGAAC
TCCATTCGGTCCTTGGTTCCCCAATTTGATCACTCCCTTGTTTGGAATGTGTTGCATGCTGCTAAGAACTCGGACCATGCGCTCAAGTTCTTCCGGTGGGTGGAGCGAGC
CGGCTTATTCCGGCACGATCGTGATACCCATTTGAAAATAATAGAGATTCTAGGTCGGGCTTCAAAGCTTAACCATGCCCGTTGTATTCTTCTTGATATGCCGAACAAGG
GTGTTGAATGGGATGAAGAATTATTCGTTGTATTGATTGATAGTTATGGCAAAACTGGGATAGTTCAGGAGGCTGTGAAAATTTTTCAAAAGATGAAGGAATTGGGTGTT
GAGAGAAGTGTTAAATCTTATGATGCTCTATTTAAGGTGATTATGAGGAGGGGGCGGTATATGATGGCCAAGAGGTACTTTAATGCTATGTTGAATGAAGGAATAGAGCC
AACTCGGCATACCTATAATGTGATGCTTTGGGGATTTTTTCTGTCGTTGAGGCTTGAAACAACCAAGAGATTTTATGAAGACATGAAGAGTAGAGGTATTTCACCTGACG
TTGTTACATATAACACTATGATTAATGGATATTATCGGTTCAAGATGATGGAGGAGGCAGAGCAGTTCTTTACTGAGATGAAGGGGAAGAATATTGTACCAACAGTGATA
AGCTATACTACTATGATAAAAGGTTATGTGTCTGTTGGTCGAGTAGATGATGGATTGAGATTGTTTGAAGAGATGAAGGCTGTTGGTGTCAAGCCAAATGATATTACTTA
TTCAACATTGCTCCCTGGTCTCTGCGATGCAGAGAAAATGTCTGAAGCGCGACAAATTTTGACAGAAATGGTGGACAAGTATATTGCTCCAAAGGACAATTCGATTTTCA
TGAGGCTGTTATCTTGCCAGTGCAAGCATGGTGATTTAGATGCTGCTATGCATGTGCTGAAAGCAATGATTCGATTAAGCATTCCAACAGAGGCTGGGCATTACGGTATT
TTAATTGAGAACTGTTGCAAAGCTGGAATGTATGATCAGGCAGTTAAATTGCTTGACAAGCTTGTAGAAAAAGAAATTATACTGAGGCCACAAAGTACATTGGAAATGGA
GGCTAGTGCATATAACCTTATAATTCAGTATCTGTGTAACCATGGCCAGACTGGAAAAGCTGAAACCTTTTTCCGGCAGTTGTTGAAGAAGGGCATTCAGGATGAGGTTG
CATTTAATAATTTGATCCGTGGCCATTCCAAAGAAGGTAATCCTGAATTGGCATATGAAATGTTGAAAATCATGGGTAGGAGAGGTGTGTCTAGGGATGCAGAGTCTTAT
AAGTTGCTTATCAAGAGTTACTTGAGTAAAGGTGAACCGGCTGATGCTAAAACAACTTTGGACAACATGATCGAAAGTGGGCACTATCCCGATTCAGCGTTGTTTAGGTC
AGTGATGGAAAGTCTATTTGCAGATGGGCGGGTGCAAACTGCGAGCCGAGTGATGAATAGTATGTTGGATAAAGGAATAACAGAAAACTTGGACTTGGTTGCTAAAATCC
TGGAAGCCCTTTTCATGAGAGGTCATGTCGAGGAGGCTTTGGGACGAATTGATTTACTAATGAGCTGCAATTGCCCACCTGATTTTGATAGTCTTTTATCTGTTCTTTGT
GAAAAGGGGAAAACCATTGCTGCTCTCAAGCTTTTAGATTTTGGGTTGGAAAGAGAGTGCAACATAGACTTCTCAAGTTATGAGAAAGTACTTGATGCGCTGTTGGGAGC
AGGGAAGACGCTGAACGCATACTCGATTCTCTGCAAGATAATGGAGAAAGGAGGGGCAAAGGATTGGAGCAGCTGTGATGATCTGATCAGAAGCCTCAATCAGGAAGGGA
ACACAAAACAAGCTGATATTCTCTCAAGAATGAGAAAGGGTGGAGACAGAAAGGGGAGTAAGAAAACTTCTCTTGCTGCTTGA
Protein sequence
MAHISLSKPHYSHFRVLSTSSISNPTVFNSLHFFSSTQEPISTATQNEIPNDPSASSVAAVPQPVETAAVNGVEQVKRRTPRGKHRNPEKLEDIICRMMANREWTTRLQN
SIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEELFVVLIDSYGKTGIVQEAVKIFQKMKELGV
ERSVKSYDALFKVIMRRGRYMMAKRYFNAMLNEGIEPTRHTYNVMLWGFFLSLRLETTKRFYEDMKSRGISPDVVTYNTMINGYYRFKMMEEAEQFFTEMKGKNIVPTVI
SYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGI
LIENCCKAGMYDQAVKLLDKLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESY
KLLIKSYLSKGEPADAKTTLDNMIESGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLSVLC
EKGKTIAALKLLDFGLERECNIDFSSYEKVLDALLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMRKGGDRKGSKKTSLAA