CuGenDBv2

Tan0003060 (gene) of Snake gourd v1 genome

Gene ID: Tan0003060
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG09:67890858..67898938
RNA-Seq Expression: Tan0003060
Synteny: Tan0003060
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6589742.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 90)
Query:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHY+H +VLSSS+   P +FNSLHFFSSTQ+P +TA+QNESP DP  S  AAVPQPVEPVAVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDEDLFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERS+KSY+ LFKVILRRGRYMMAKRYFNAMLNEGIEPT HTYN+MLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKN+VPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLS+PTE+GHYGILIENCCKA +YD+AVKLLDKLVEKEI+L+PQSTLE+E SAYNP+IQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDS LFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGR+DLLM  +CPPDFDSLL VLCEKGKTIAALKLLDFGLEREC+IE SSYEKVLD LL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRMIKGGDR   KKAS AA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA

XP_022135178.1 pentatricopeptide repeat-containing protein At2g37230 [Momordica charantia] (e value: 0.0e+00; % identity: 92.33)
Query:  MAHISLSKPHYTHFRVLSS---SNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLS+P   +FRVLSS   SNP+A N LHFFSSTQE I     NESP+DP AS  AA PQP    AVNG EQVK+RTPRGKHRNPEKLED+ICRMMA
Subjt:  MAHISLSKPHYTHFRVLSS---SNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVP+FDHSLVWNVLHAAKNSDHALKFFRWVER+GLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERS+KSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYN+MLWGFFLSLRLETAK+FYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLC+AEKMSEARQIL EMVDKYIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPT+AGHYGILIENCCKAE YDQAVKLLDKLVEKEI+LRPQSTL++E SAYNPIIQYLCNHGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQL+KKGIQDEVAFNNLIRGHSKEGNPEL YE+LKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDS LFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVM SML+KGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDF SLL VLCEKGKTIAALKLLDFGLEREC IE SSYEKVLD LLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKA
        LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQAD+LSRMIKGGDRKGSKKA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKA

XP_022921867.1 pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita moschata] (e value: 0.0e+00; % identity: 89.87)
Query:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHY+H +VLSSS+   P +FNSLHFFSSTQ+P +TA+QNESP DP  S  AAVPQPVEPVAVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDEDLFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERS+KSY+ LFKVILRRGRYMMAKRYFNAMLNEGIEPT HTYN+MLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKN+VPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKA +YD+AVKLLDKLVEKEI+L+PQSTLE+E SAYNP+IQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSML KGITENLDLVAKILEALFMRGHVEEALGR+DLLM  +CPPDFDSLL VLCEKGKTIAALKLLDFGLEREC+IE SSYEKVLD LL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRMIKGGDR   KKAS  A
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA

XP_023515807.1 pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; % identity: 90)
Query:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHY+H +VLSSS+   P +FNSLHFFSS Q+P +TA+QNESP DP  S  AAVPQPVEPVAVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHAL+FFRWVER+GLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDEDLFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERS+KSY+ LFKVILRRGRYMMAKRYFNAMLNEGIEPT HTYN+MLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKG N++PTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKA +YD+AVKLLDKLVEKEI+L+PQSTLE+E SAYNPIIQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSMLDKGITEN+DLVAKILEALFMRGHVEEALGR+DLLM C+CPPDFDSLL VLCEKGKTIAALKLLDFGLEREC+IE SSYEKVLD LL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRMIKGGDR   KKAS AA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA

XP_038880029.1 pentatricopeptide repeat-containing protein At2g37230 [Benincasa hispida] (e value: 0.0e+00; % identity: 91.84)
Query:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHIS+SK H +H+RVLSSS+   P A  SLHFFSSTQEPISTA+QNESPN P AS  AAVPQP E VAVNG EQVKRRTPRGK RNPEKLED+IC+MMA
Subjt:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHAL FFRWVERAGLF+HDR+THLKIIEILGRASKLNHARCILLDM NKG+EWDEDLFV+LI+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYN+MLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQC HGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKA MYD+AVKLLDKLVEKEI+LRPQSTLE+E SAYN IIQYLCNHGQTGKA+TF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGRR VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDSALFRSVMESLF DGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEAL MRGHVEEALGRIDLLMSCNCPPDFDSLL VLCE+GKTIAALKLLDFGLEREC+IEFSSYEKVLD LLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA
        LNAY+ILCKIMEKGGA DW S DDLI+SLNQEGNTKQADILSR +KGGDRK  KK SLAA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA

TrEMBL top hits (e value, % identity, alignment)
A0A6J1C431 pentatricopeptide repeat-containing protein At2g37230 (e value: 0.0e+00; % identity: 92.33)
Query:  MAHISLSKPHYTHFRVLSS---SNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLS+P   +FRVLSS   SNP+A N LHFFSSTQE I     NESP+DP AS  AA PQP    AVNG EQVK+RTPRGKHRNPEKLED+ICRMMA
Subjt:  MAHISLSKPHYTHFRVLSS---SNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVP+FDHSLVWNVLHAAKNSDHALKFFRWVER+GLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERS+KSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYN+MLWGFFLSLRLETAK+FYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLC+AEKMSEARQIL EMVDKYIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPT+AGHYGILIENCCKAE YDQAVKLLDKLVEKEI+LRPQSTL++E SAYNPIIQYLCNHGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQL+KKGIQDEVAFNNLIRGHSKEGNPEL YE+LKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDS LFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVM SML+KGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDF SLL VLCEKGKTIAALKLLDFGLEREC IE SSYEKVLD LLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKA
        LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQAD+LSRMIKGGDRKGSKKA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKA

A0A6J1E1L0 pentatricopeptide repeat-containing protein At2g37230-like (e value: 0.0e+00; % identity: 89.87)
Query:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHY+H +VLSSS+   P +FNSLHFFSSTQ+P +TA+QNESP DP  S  AAVPQPVEPVAVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHS+VWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMPNKGVEWDEDLFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERS+KSY+ LFKVILRRGRYMMAKRYFNAMLNEGIEPT HTYN+MLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKN+VPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND TYSTLLPGLCDAE+MSEARQILTEMVDKYIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKA +YD+AVKLLDKLVEKEI+L+PQSTLE+E SAYNP+IQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELA+E+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSML KGITENLDLVAKILEALFMRGHVEEALGR+DLLM  +CPPDFDSLL VLCEKGKTIAALKLLDFGLEREC+IE SSYEKVLD LL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRMIKGGDR   KKAS  A
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA

A0A6J1IBX7 pentatricopeptide repeat-containing protein At2g37230-like (e value: 0.0e+00; % identity: 90.96)
Query:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHY   RVLSSS+   P A NSL+FFSSTQE ISTA+Q +SPND   S  AA       VAV+   QVKRRTPRGKHRNPEK+EDIICRMMA
Subjt:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHSLVWNVLHA KNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KA IVQEAVKIFQKMKEL VERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLS RLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGV+PN ITYSTLLPGLCDAEKMSEARQILTEM  KYI+PKD+
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLK MIRLSIPTEAGHYGILIENCCKAEMY++AVKLLD LVEKEI+LRPQS+LEIEPSAYNPIIQYLCN+GQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAES+KLLI+SYLSKGEPADAKTALDSMIECGHYP SALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSMLDK ITENLDLVAKILEALFMRGHVEEALGRIDLLMSC+CPPDF+SLL VLCEKGKTIAALKLLDFGLEREC IEFSSYEKVLD LLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEK---GGAKDWSSCDDLIRSLNQEGNTKQADILSRMIK-GGDRKGSKKASLA
        LNAYSILCKIMEK   GGAKDWSSCDDLI+ LNQEGNTKQADILSRMI  GGDRKGSK + +A
Subjt:  LNAYSILCKIMEK---GGAKDWSSCDDLIRSLNQEGNTKQADILSRMIK-GGDRKGSKKASLA

A0A6J1IEG8 pentatricopeptide repeat-containing protein At2g37230-like (e value: 0.0e+00; % identity: 90.83)
Query:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHY   RVLSSS+   P A NSL+FFSSTQE ISTA+QN+SPND  +S  AA       VAV+   QVKR TPRGKHRNPEK+EDIICRMMA
Subjt:  MAHISLSKPHYTHFRVLSSSN---PNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHSLVWNVLHA KNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KA IVQEAVKIFQKMKEL VERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLS RLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING+YRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVG+VDDGLRLFEEMKAVGVKPN ITYSTLLPGLCDAEKMSEARQILTEM  KYI+PKD+
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQCKHGDLDAAMHVLK MIRLSIPTEAGHYGILIENCCKAEMY++AVKLLD LVEKEI+LRPQS+LEIEPSAYNPIIQYLCN+GQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLI GHSKEGNPELAYEMLKIMGRRGVSRDAES+KLLI+SYLSKGEPADAKTALDSMIECGHYP SALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSMLDK ITENLDLVAKILEALFMRGHVEEALGRIDLLMSC+CPPDF+SLL VLCEKGKTIAALKLLDFGLEREC IEFSSYEKVLD LLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEK---GGAKDWSSCDDLIRSLNQEGNTKQADILSRMIK-GGDRKGSKKASLA
        LNAYSILCKIMEK   GGAKDWSSCDDLI+ LNQEGNTKQADILSRMI  GGDRKGSK + +A
Subjt:  LNAYSILCKIMEK---GGAKDWSSCDDLIRSLNQEGNTKQADILSRMIK-GGDRKGSKKASLA

A0A6J1JJW0 pentatricopeptide repeat-containing protein At2g37230-like (e value: 0.0e+00; % identity: 89.61)
Query:  MAHISLSKPHYTHFRVLSSSNPN---AFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA
        MAHISLSKPHY+H +V SSS+ +   +FNSLHFFSSTQ+PIST +QNESPNDP  S  AAVPQ VEPVAVNG +QVKR  PRG  RNPEKLEDIICRMMA
Subjt:  MAHISLSKPHYTHFRVLSSSNPN---AFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMA

Query:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG
        NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLF+HDR TH KIIEILGRASKLNHARCILLDMP KGVEWDEDLFV++IDSYG
Subjt:  NREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYG

Query:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKEL VERS+KSYDALFKVILRRGRYMMAKRYFN MLNEGIEPTRHTYN+MLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN
        ING++RFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVS GRVDDGLRLFEEMKAVGVKPND+TYSTLLPGLCDAEKM EA QILTEMVD+YIAPKDN
Subjt:  INGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF
        SIFMRLLSCQC HGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKA +YD+AVKLLDKLV+KEI+L+PQSTLE+E SAYNPIIQYLC+HGQTGKAETF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETF

Query:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDE+AFNNLIRGHSKEGNPELA+EMLKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE GHYPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGR+DLLM C+CPPDFDSLL VLCEKGKTIAALKLLDFGLEREC+IE SSYEKVLD LL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA
        LNAYSILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQAD+LSRMIKGGD    K AS AA
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA

SwissProt top hits (e value, % identity, alignment)
O81908 Pentatricopeptide repeat-containing protein At1g02060, chloroplastic (e value: 8.6e-97; % identity: 32.83)
Query:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---
        KL   + R + +  W+  L++S+ SL P      + V   L   K     L+FF WV   G F H   +   ++E LGRA  LN AR  L  +  +    
Subjt:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---

Query:  VEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNE-GIEPTRHTYNLMLWGFFLSLRLETAKRFYE
        V+  +  F  LI SYG AG+ QE+VK+FQ MK++ +  SV ++++L  ++L+RGR  MA   F+ M    G+ P  +T+N ++ GF  +  ++ A R ++
Subjt:  VEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNE-GIEPTRHTYNLMLWGFFLSLRLETAKRFYE

Query:  DMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS
        DM+    +PDVVTYNT+I+G  R   ++ A    + M  K  ++ P V+SYTT+++GY     +D+ + +F +M + G+KPN +TY+TL+ GL +A +  
Subjt:  DMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS

Query:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPS
        E + IL    D +   AP D   F  L+   C  G LDAAM V + M+ + +  ++  Y +LI   C    +D+A  L ++L EKE++L       +  +
Subjt:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPS

Query:  AYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHY
        AYNP+ +YLC +G+T +AE  FRQL+K+G+QD  ++  LI GH +EG  + AYE+L +M RR    D E+Y+LLI   L  GE   A   L  M+   + 
Subjt:  AYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHY

Query:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERE
        P +  F SV+  L        +  ++  ML+K I +N+DL  +++  LF     E+A   + LL         + LL  LCE  K + A  L+ F LE+ 
Subjt:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERE

Query:  CSIEFSSYEKVLDTLLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR
          ++  +   V++ L    +   A+S+  +++E G  +  S    L  +L   G  ++   +S+
Subjt:  CSIEFSSYEKVLDTLLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR

P0C894 Putative pentatricopeptide repeat-containing protein At2g02150 (e value: 2.1e-50; % identity: 25.64)
Query:  ALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWD-------------------EDLFVVLIDSYGKAGIVQEAVKIFQKMKE
        A KFF+W      F+H  +++  +  IL  A     A  +L +M     + D                   + LF VLID     G+++EA++ F KMK 
Subjt:  ALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWD-------------------EDLFVVLIDSYGKAGIVQEAVKIFQKMKE

Query:  LDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFF
          V    +S + L     + G+    KR+F  M+  G  PT  TYN+M+        +E A+  +E+MK RG+ PD VTYN+MI+GF +   +++   FF
Subjt:  LDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFF

Query:  TEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLD
         EMK     P VI+Y  +I  +   G++  GL  + EMK  G+KPN ++YSTL+   C    M +A +   +M    + P + + +  L+   CK G+L 
Subjt:  TEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLD

Query:  AAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEP--SAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQ-DEVA
         A  +   M+++ +      Y  LI+  C AE   +A +L  K+           T  + P  ++YN +I          +A     +L  +GI+ D + 
Subjt:  AAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEP--SAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQ-DEVA

Query:  FNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSML-DKGI
        +   I G       E A  ++  M   G+  ++  Y  L+ +Y   G P +    LD M E         F  +++ L  +  V  A    N +  D G+
Subjt:  FNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSML-DKGI

Query:  TENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKI
          N  +   +++ L     VE A    + ++     PD   + SL+    ++G  + AL L D   E    ++  +Y  ++  L    +   A S L ++
Subjt:  TENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKI

Query:  MEKGGAKDWSSCDDLIRSLNQEGNTKQA
        + +G   D   C  +++   + G   +A
Subjt:  MEKGGAKDWSSCDDLIRSLNQEGNTKQA

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580 (e value: 5.3e-54; % identity: 25.17)
Query:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVK
        V+   K+   AL+ F  + +   F+H   T+  +IE LG   K      +L+DM  N G    E ++V  + +YG+ G VQEAV +F++M   D E +V 
Subjt:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVK

Query:  SYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEMKGKNI
        SY+A+  V++  G +  A + +  M + GI P  +++ + +  F  + R   A R   +M S+G   +VV Y T++ GFY      E  + F +M    +
Subjt:  SYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEMKGKNI

Query:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK
           + ++  +++     G V +  +L +++   GV PN  TY+  + GLC   ++  A +++  ++++   PK + I +  L+   CK+     A   L 
Subjt:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK

Query:  AMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS
         M+   +  ++  Y  LI   CK  M   A +++   V    V         +   Y  +I  LC+ G+T +A   F + L KGI+  V  +N LI+G S
Subjt:  AMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS

Query:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI
         +G    A ++   M  +G+  + +++ +L+      G  +DA   +  MI  G++PD   F  ++       +++ A  +++ MLD G+  ++     +
Subjt:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI

Query:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKIME
        L  L      E+ +     ++   C P+   F+ LL  LC   K   AL LL+    +  + +  ++  ++D     G    AY++  K+ E
Subjt:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKIME

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial    2.7e-50    27.99
Query:  HSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDV
        ++LV  +    K SD  +   R VE    F+ +  T+  ++ ++ ++ +   A  +L  M  + ++ D   + ++ID   K G +  A  +F +M+    
Subjt:  HSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDV

Query:  ERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEM
        +  + +Y+ L       GR+    +    M+   I P   T+++++  F    +L  A +  ++M  RGI+P+ +TYN++I+GF +   +EEA Q    M
Subjt:  ERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEM

Query:  KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM
          K   P ++++  +I GY    R+DDGL LF EM   GV  N +TY+TL+ G C + K+  A+++  EMV + + P D   +  LL   C +G+L+ A+
Subjt:  KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAM

Query:  HVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLI
         +   + +  +  + G Y I+I   C A   D A  L   L        P   ++++  AYN +I  LC      KA+  FR++ ++G   DE+ +N LI
Subjt:  HVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFRQLLKKG-IQDEVAFNNLI

Query:  RGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGE
        R H  + +   A E+++ M   G   D  + K++I + LS GE
Subjt:  RGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGE

Q9ZUU3 Pentatricopeptide repeat-containing protein At2g37230    3.0e-299    69
Query:  MAHISLSKPHYTHFRVL----SSSNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMM
        MA IS SK + +  RV      SSN + F+    FS+ +E  + A+ N     P A       +  + +    T  ++ R  RGK +N EKLED ICRMM
Subjt:  MAHISLSKPHYTHFRVL----SSSNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMM

Query:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSY
         NR WTTRLQNSIR LVP++DHSLV+NVLH AK  +HAL+FFRW ER+GL RHDRDTH+K+I++LG  SKLNHARCILLDMP KGV WDED+FVVLI+SY
Subjt:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSY

Query:  GKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT
        GKAGIVQE+VKIFQKMK+L VER++KSY++LFKVILRRGRYMMAKRYFN M++EG+EPTRHTYNLMLWGFFLSLRLETA RF+EDMK+RGISPD  T+NT
Subjt:  GKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT

Query:  MINGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD
        MINGF RFK M+EAE+ F EMKG  I P+V+SYTTMIKGY++V RVDDGLR+FEEM++ G++PN  TYSTLLPGLCDA KM EA+ IL  M+ K+IAPKD
Subjt:  MINGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD

Query:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAET
        NSIF++LL  Q K GD+ AA  VLKAM  L++P EAGHYG+LIEN CKA  Y++A+KLLD L+EKEI+LR Q TLE+EPSAYNPII+YLCN+GQT KAE 
Subjt:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAET

Query:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQ
         FRQL+K+G+QD+ A NNLIRGH+KEGNP+ +YE+LKIM RRGV R++ +Y+LLIKSY+SKGEP DAKTALDSM+E GH PDS+LFRSV+ESLF DGRVQ
Subjt:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQ

Query:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGA
        TASRVM  M+DK  GI +N+DL+AKILEAL MRGHVEEALGRIDLL       D DSLL VL EKGKTIAALKLLDFGLER+ S+EFSSY+KVLD LLGA
Subjt:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGA

Query:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKK
        GKTLNAYS+LCKIMEKG + DW S D+LI+SLNQEGNTKQAD+LSRMIK G  +G KK
Subjt:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKK

Arabidopsis top hits    e value    % identity    Alignment
AT1G02060.1 Tetratricopeptide repeat (TPR)-like superfamily protein    6.1e-98    32.83
Query:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---
        KL   + R + +  W+  L++S+ SL P      + V   L   K     L+FF WV   G F H   +   ++E LGRA  LN AR  L  +  +    
Subjt:  KLEDIICRMMANREWTTRLQNSIRSLVPQ--FDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKG---

Query:  VEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNE-GIEPTRHTYNLMLWGFFLSLRLETAKRFYE
        V+  +  F  LI SYG AG+ QE+VK+FQ MK++ +  SV ++++L  ++L+RGR  MA   F+ M    G+ P  +T+N ++ GF  +  ++ A R ++
Subjt:  VEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNE-GIEPTRHTYNLMLWGFFLSLRLETAKRFYE

Query:  DMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS
        DM+    +PDVVTYNT+I+G  R   ++ A    + M  K  ++ P V+SYTT+++GY     +D+ + +F +M + G+KPN +TY+TL+ GL +A +  
Subjt:  DMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEM--KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMS

Query:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPS
        E + IL    D +   AP D   F  L+   C  G LDAAM V + M+ + +  ++  Y +LI   C    +D+A  L ++L EKE++L       +  +
Subjt:  EARQILTEMVDKY--IAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPS

Query:  AYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHY
        AYNP+ +YLC +G+T +AE  FRQL+K+G+QD  ++  LI GH +EG  + AYE+L +M RR    D E+Y+LLI   L  GE   A   L  M+   + 
Subjt:  AYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHY

Query:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERE
        P +  F SV+  L        +  ++  ML+K I +N+DL  +++  LF     E+A   + LL         + LL  LCE  K + A  L+ F LE+ 
Subjt:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERE

Query:  CSIEFSSYEKVLDTLLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR
          ++  +   V++ L    +   A+S+  +++E G  +  S    L  +L   G  ++   +S+
Subjt:  CSIEFSSYEKVLDTLLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSR

AT1G30290.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.2e-56    24.04
Query:  WTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYGKAG
        W  + +  +R+L+     S V  VL +  +   ALKFF W +R   +RHD   +  ++E+L +      +R +L+ M  +G+    + F  ++ SY +AG
Subjt:  WTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYGKAG

Query:  IVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMING
         +++A+K+   M+   VE ++   +    V +R  R   A R+   M   GI P   TYN M+ G+    R+E A    EDM S+G  PD V+Y T++  
Subjt:  IVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMING

Query:  FYRFKMMEEAEQFFTEM-KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI
          + K + E      +M K   +VP  ++Y T+I         D+ L   ++ +  G + + + YS ++  LC   +MSEA+ ++ EM+ K   P D   
Subjt:  FYRFKMMEEAEQFFTEM-KGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI

Query:  FMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFR
        +  +++  C+ G++D A  +L+ M           Y  L+   C+     +A ++++  + +E    P S        Y+ I+  L   G+  +A    R
Subjt:  FMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFR

Query:  QLLKKG-IQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTA
        +++ KG     V  N L++   ++G    A + ++    +G + +  ++  +I  +    E   A + LD M     + D   + +++++L   GR+  A
Subjt:  QLLKKG-IQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTA

Query:  SRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMS-CNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT
        + +M  ML KGI         ++      G V++ +  ++ ++S   C   ++ ++  LC  GK   A  LL   L      +  +   +++  L  G  
Subjt:  SRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMS-CNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKT

Query:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQAD-ILSRMIKGG
        L+AY + C++  +    D   C+ L + L  +G   +AD ++ R+++ G
Subjt:  LNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQAD-ILSRMIKGG

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein    3.7e-55    25.17
Query:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVK
        V+   K+   AL+ F  + +   F+H   T+  +IE LG   K      +L+DM  N G    E ++V  + +YG+ G VQEAV +F++M   D E +V 
Subjt:  VLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDM-PNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERSVK

Query:  SYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEMKGKNI
        SY+A+  V++  G +  A + +  M + GI P  +++ + +  F  + R   A R   +M S+G   +VV Y T++ GFY      E  + F +M    +
Subjt:  SYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEMKGKNI

Query:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK
           + ++  +++     G V +  +L +++   GV PN  TY+  + GLC   ++  A +++  ++++   PK + I +  L+   CK+     A   L 
Subjt:  VPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSI-FMRLLSCQCKHGDLDAAMHVLK

Query:  AMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS
         M+   +  ++  Y  LI   CK  M   A +++   V    V         +   Y  +I  LC+ G+T +A   F + L KGI+  V  +N LI+G S
Subjt:  AMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEV-AFNNLIRGHS

Query:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI
         +G    A ++   M  +G+  + +++ +L+      G  +DA   +  MI  G++PD   F  ++       +++ A  +++ MLD G+  ++     +
Subjt:  KEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKI

Query:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKIME
        L  L      E+ +     ++   C P+   F+ LL  LC   K   AL LL+    +  + +  ++  ++D     G    AY++  K+ E
Subjt:  LEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKIME

AT2G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.5e-51    25.64
Query:  ALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWD-------------------EDLFVVLIDSYGKAGIVQEAVKIFQKMKE
        A KFF+W      F+H  +++  +  IL  A     A  +L +M     + D                   + LF VLID     G+++EA++ F KMK 
Subjt:  ALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWD-------------------EDLFVVLIDSYGKAGIVQEAVKIFQKMKE

Query:  LDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFF
          V    +S + L     + G+    KR+F  M+  G  PT  TYN+M+        +E A+  +E+MK RG+ PD VTYN+MI+GF +   +++   FF
Subjt:  LDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFF

Query:  TEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLD
         EMK     P VI+Y  +I  +   G++  GL  + EMK  G+KPN ++YSTL+   C    M +A +   +M    + P + + +  L+   CK G+L 
Subjt:  TEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLD

Query:  AAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEP--SAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQ-DEVA
         A  +   M+++ +      Y  LI+  C AE   +A +L  K+           T  + P  ++YN +I          +A     +L  +GI+ D + 
Subjt:  AAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEP--SAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQ-DEVA

Query:  FNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSML-DKGI
        +   I G       E A  ++  M   G+  ++  Y  L+ +Y   G P +    LD M E         F  +++ L  +  V  A    N +  D G+
Subjt:  FNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSML-DKGI

Query:  TENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKI
          N  +   +++ L     VE A    + ++     PD   + SL+    ++G  + AL L D   E    ++  +Y  ++  L    +   A S L ++
Subjt:  TENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPD---FDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKI

Query:  MEKGGAKDWSSCDDLIRSLNQEGNTKQA
        + +G   D   C  +++   + G   +A
Subjt:  MEKGGAKDWSSCDDLIRSLNQEGNTKQA

AT2G37230.1 Tetratricopeptide repeat (TPR)-like superfamily protein    2.1e-300    69
Query:  MAHISLSKPHYTHFRVL----SSSNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMM
        MA IS SK + +  RV      SSN + F+    FS+ +E  + A+ N     P A       +  + +    T  ++ R  RGK +N EKLED ICRMM
Subjt:  MAHISLSKPHYTHFRVL----SSSNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMM

Query:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSY
         NR WTTRLQNSIR LVP++DHSLV+NVLH AK  +HAL+FFRW ER+GL RHDRDTH+K+I++LG  SKLNHARCILLDMP KGV WDED+FVVLI+SY
Subjt:  ANREWTTRLQNSIRSLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSY

Query:  GKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT
        GKAGIVQE+VKIFQKMK+L VER++KSY++LFKVILRRGRYMMAKRYFN M++EG+EPTRHTYNLMLWGFFLSLRLETA RF+EDMK+RGISPD  T+NT
Subjt:  GKAGIVQEAVKIFQKMKELDVERSVKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT

Query:  MINGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD
        MINGF RFK M+EAE+ F EMKG  I P+V+SYTTMIKGY++V RVDDGLR+FEEM++ G++PN  TYSTLLPGLCDA KM EA+ IL  M+ K+IAPKD
Subjt:  MINGFYRFKMMEEAEQFFTEMKGKNIVPTVISYTTMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKD

Query:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAET
        NSIF++LL  Q K GD+ AA  VLKAM  L++P EAGHYG+LIEN CKA  Y++A+KLLD L+EKEI+LR Q TLE+EPSAYNPII+YLCN+GQT KAE 
Subjt:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAET

Query:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQ
         FRQL+K+G+QD+ A NNLIRGH+KEGNP+ +YE+LKIM RRGV R++ +Y+LLIKSY+SKGEP DAKTALDSM+E GH PDS+LFRSV+ESLF DGRVQ
Subjt:  FFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQ

Query:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGA
        TASRVM  M+DK  GI +N+DL+AKILEAL MRGHVEEALGRIDLL       D DSLL VL EKGKTIAALKLLDFGLER+ S+EFSSY+KVLD LLGA
Subjt:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKGKTIAALKLLDFGLERECSIEFSSYEKVLDTLLGA

Query:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKK
        GKTLNAYS+LCKIMEKG + DW S D+LI+SLNQEGNTKQAD+LSRMIK G  +G KK
Subjt:  GKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKK


Sequences
CDS sequence
ATGGCTCACATTTCTTTATCTAAACCTCACTACACCCATTTCCGAGTTCTGTCCAGTTCGAATCCGAACGCCTTCAATTCACTTCATTTCTTCAGCTCCACTCAAGAACC
GATCTCCACGGCTTCTCAAAATGAAAGCCCCAATGATCCACCCGCCAGTTTTCATGCCGCAGTGCCTCAACCCGTGGAACCGGTGGCTGTTAATGGCACCGAGCAAGTTA
AGCGGAGAACCCCTAGAGGTAAGCATCGAAACCCAGAAAAATTAGAGGATATTATTTGTAGAATGATGGCAAATCGTGAATGGACAACACGTTTACAGAACTCCATTCGG
TCGTTGGTTCCCCAATTTGATCACTCCCTTGTTTGGAATGTGTTACACGCTGCTAAGAACTCGGACCATGCGCTCAAATTCTTCCGGTGGGTAGAGCGAGCCGGGTTATT
CCGGCACGATCGCGATACCCATTTGAAAATAATTGAGATTCTGGGTCGGGCTTCAAAGCTTAACCATGCCCGTTGCATTCTTCTTGATATGCCCAATAAGGGCGTCGAAT
GGGATGAAGACTTATTCGTTGTATTGATTGATAGTTATGGTAAAGCTGGGATTGTTCAGGAAGCTGTGAAAATATTTCAAAAGATGAAGGAATTGGATGTTGAGAGGAGC
GTTAAATCTTATGATGCTTTATTTAAGGTGATTTTGAGGAGAGGGAGGTATATGATGGCCAAGAGGTACTTTAATGCTATGTTGAATGAAGGAATAGAGCCTACTCGCCA
TACCTACAACTTGATGCTTTGGGGGTTTTTTTTGTCGTTGAGGCTTGAGACAGCCAAGAGATTTTATGAAGATATGAAGAGTAGAGGTATTTCACCCGATGTCGTTACAT
ATAACACTATGATTAATGGGTTTTATCGGTTCAAGATGATGGAGGAGGCCGAGCAGTTCTTTACTGAGATGAAGGGGAAGAATATTGTACCAACTGTGATTAGCTATACT
ACTATGATAAAAGGTTATGTTTCAGTGGGTCGAGTAGATGATGGATTGAGATTGTTTGAAGAGATGAAGGCTGTTGGTGTGAAGCCAAATGATATTACTTATTCAACTCT
GCTCCCTGGTCTCTGCGATGCAGAGAAAATGTCTGAAGCACGACAAATTTTGACAGAAATGGTGGACAAGTATATTGCTCCAAAGGATAATTCAATTTTCATGAGGTTGT
TATCTTGTCAGTGCAAGCATGGTGATTTAGATGCTGCTATGCATGTGCTGAAAGCAATGATTCGATTAAGCATTCCAACAGAGGCTGGACATTACGGTATTTTGATTGAG
AACTGTTGCAAAGCTGAAATGTACGATCAAGCAGTTAAATTGCTTGACAAACTTGTGGAAAAAGAAATCGTATTGAGGCCACAAAGTACTCTGGAAATTGAGCCTAGTGC
ATATAACCCTATAATTCAGTATCTGTGCAACCATGGGCAGACTGGAAAAGCTGAAACCTTTTTCCGGCAGTTGTTGAAGAAGGGTATTCAGGATGAGGTTGCATTTAATA
ATTTGATTCGTGGCCATTCCAAAGAAGGTAATCCTGAATTGGCATATGAAATGTTGAAAATCATGGGTAGGAGAGGTGTGTCTAGGGATGCAGAATCTTATAAGTTGCTT
ATCAAGAGCTACTTGAGTAAAGGTGAACCAGCTGATGCTAAAACAGCTTTGGACAGCATGATTGAATGTGGGCACTATCCTGATTCGGCGTTGTTTAGATCAGTGATGGA
AAGTCTATTTGCAGATGGGAGGGTGCAGACCGCAAGCCGAGTGATGAATAGCATGTTGGATAAAGGAATAACAGAAAACTTAGACTTGGTTGCTAAAATCCTGGAAGCCC
TTTTCATGAGAGGTCATGTCGAGGAAGCATTGGGACGAATCGATTTGCTAATGAGCTGCAACTGCCCACCTGATTTTGATAGTCTATTATTTGTTCTTTGTGAAAAGGGG
AAGACCATTGCTGCTCTCAAGCTTTTAGATTTTGGGTTGGAAAGAGAATGTAGCATAGAATTCTCAAGTTATGAGAAAGTACTTGATACGCTGTTGGGGGCGGGGAAAAC
GCTGAACGCGTACTCAATTCTATGTAAGATAATGGAGAAAGGAGGGGCCAAGGATTGGAGCAGCTGTGATGATCTGATCAGAAGCCTCAATCAGGAAGGGAACACGAAGC
AAGCTGATATTCTCTCAAGAATGATAAAGGGTGGAGACAGAAAGGGGAGTAAGAAAGCTTCTCTTGCTGCTTGA
mRNA sequence
AAAAAAAAACCAATAAGAAGGATTTCAATTTTTTCAAACCAACAATTTGAAAACCTATTGATTCTAACAACTGATTGCAGAGTTGATCATTCGATTCAACGTGGGCAATA
TCTCGTACGTTGTGGATTATTCATAACCTTTTTGCATATCAAATGTATGAATTTAACTTCGATGGGTTTCATATGATATTTTTTGTTGGGTTGGCAGTTCCATGATCTTG
GTTTTATAGCCAAGATTTGGGCCCACAACTTAGACCCGGTTCCTGCCTGGTTAGGAATTAGGGTGGGCTGTTATTCGGATTTCTGTTCTTCCTTTTCCTCTGGCGAAGCA
AGGGCTTCTTCTTGCTTAAACCCTAAACTTCACTCTGTCACTCTTTCTGTGTCTATAGATCTGTGTTTGTACAAATTTCAAATGCACTGAGATTTGACCTTGTAAGACAC
TTGTCTAAGAAACACCCTTTGGCGATTCGAAGTAATCTTCTCATTCATGGCTCACATTTCTTTATCTAAACCTCACTACACCCATTTCCGAGTTCTGTCCAGTTCGAATC
CGAACGCCTTCAATTCACTTCATTTCTTCAGCTCCACTCAAGAACCGATCTCCACGGCTTCTCAAAATGAAAGCCCCAATGATCCACCCGCCAGTTTTCATGCCGCAGTG
CCTCAACCCGTGGAACCGGTGGCTGTTAATGGCACCGAGCAAGTTAAGCGGAGAACCCCTAGAGGTAAGCATCGAAACCCAGAAAAATTAGAGGATATTATTTGTAGAAT
GATGGCAAATCGTGAATGGACAACACGTTTACAGAACTCCATTCGGTCGTTGGTTCCCCAATTTGATCACTCCCTTGTTTGGAATGTGTTACACGCTGCTAAGAACTCGG
ACCATGCGCTCAAATTCTTCCGGTGGGTAGAGCGAGCCGGGTTATTCCGGCACGATCGCGATACCCATTTGAAAATAATTGAGATTCTGGGTCGGGCTTCAAAGCTTAAC
CATGCCCGTTGCATTCTTCTTGATATGCCCAATAAGGGCGTCGAATGGGATGAAGACTTATTCGTTGTATTGATTGATAGTTATGGTAAAGCTGGGATTGTTCAGGAAGC
TGTGAAAATATTTCAAAAGATGAAGGAATTGGATGTTGAGAGGAGCGTTAAATCTTATGATGCTTTATTTAAGGTGATTTTGAGGAGAGGGAGGTATATGATGGCCAAGA
GGTACTTTAATGCTATGTTGAATGAAGGAATAGAGCCTACTCGCCATACCTACAACTTGATGCTTTGGGGGTTTTTTTTGTCGTTGAGGCTTGAGACAGCCAAGAGATTT
TATGAAGATATGAAGAGTAGAGGTATTTCACCCGATGTCGTTACATATAACACTATGATTAATGGGTTTTATCGGTTCAAGATGATGGAGGAGGCCGAGCAGTTCTTTAC
TGAGATGAAGGGGAAGAATATTGTACCAACTGTGATTAGCTATACTACTATGATAAAAGGTTATGTTTCAGTGGGTCGAGTAGATGATGGATTGAGATTGTTTGAAGAGA
TGAAGGCTGTTGGTGTGAAGCCAAATGATATTACTTATTCAACTCTGCTCCCTGGTCTCTGCGATGCAGAGAAAATGTCTGAAGCACGACAAATTTTGACAGAAATGGTG
GACAAGTATATTGCTCCAAAGGATAATTCAATTTTCATGAGGTTGTTATCTTGTCAGTGCAAGCATGGTGATTTAGATGCTGCTATGCATGTGCTGAAAGCAATGATTCG
ATTAAGCATTCCAACAGAGGCTGGACATTACGGTATTTTGATTGAGAACTGTTGCAAAGCTGAAATGTACGATCAAGCAGTTAAATTGCTTGACAAACTTGTGGAAAAAG
AAATCGTATTGAGGCCACAAAGTACTCTGGAAATTGAGCCTAGTGCATATAACCCTATAATTCAGTATCTGTGCAACCATGGGCAGACTGGAAAAGCTGAAACCTTTTTC
CGGCAGTTGTTGAAGAAGGGTATTCAGGATGAGGTTGCATTTAATAATTTGATTCGTGGCCATTCCAAAGAAGGTAATCCTGAATTGGCATATGAAATGTTGAAAATCAT
GGGTAGGAGAGGTGTGTCTAGGGATGCAGAATCTTATAAGTTGCTTATCAAGAGCTACTTGAGTAAAGGTGAACCAGCTGATGCTAAAACAGCTTTGGACAGCATGATTG
AATGTGGGCACTATCCTGATTCGGCGTTGTTTAGATCAGTGATGGAAAGTCTATTTGCAGATGGGAGGGTGCAGACCGCAAGCCGAGTGATGAATAGCATGTTGGATAAA
GGAATAACAGAAAACTTAGACTTGGTTGCTAAAATCCTGGAAGCCCTTTTCATGAGAGGTCATGTCGAGGAAGCATTGGGACGAATCGATTTGCTAATGAGCTGCAACTG
CCCACCTGATTTTGATAGTCTATTATTTGTTCTTTGTGAAAAGGGGAAGACCATTGCTGCTCTCAAGCTTTTAGATTTTGGGTTGGAAAGAGAATGTAGCATAGAATTCT
CAAGTTATGAGAAAGTACTTGATACGCTGTTGGGGGCGGGGAAAACGCTGAACGCGTACTCAATTCTATGTAAGATAATGGAGAAAGGAGGGGCCAAGGATTGGAGCAGC
TGTGATGATCTGATCAGAAGCCTCAATCAGGAAGGGAACACGAAGCAAGCTGATATTCTCTCAAGAATGATAAAGGGTGGAGACAGAAAGGGGAGTAAGAAAGCTTCTCT
TGCTGCTTGATTATCATCTCCTCCTTCCCTGGTTTGCTTTTTTTCCTCCTCTCTTTTAGATGATATTTCATGAAAGTTTTGAATAACACTTGGGTTGTTATTTGATCCAA
TATACTCAAATTGTAGCTTTAATATTTTTAGTCTTCTTTCCCCCATGATTTTCTTAGTTTGTTTGTTCAGCTGATGAGTAATGAGGGCTTTGATTTTCTTAAGAAGTTTC
CTCGAACTTGGTAATATCTGAAACTTGGTAGGGATGTAATATGCCTCGCTGAGAAAATGTCTAGAAGATTTTGGGATGCAAGGAGTTCTATCTTACTTTCGTTTATAAAT
TTCATATTTTATAGATTTTCGATCTTTGTAAAAAGTTATAAGTAAACATTATATTTGTTTCTGTTATAGAATCACAAGTAACCAAGTTTTAAGTTTTAAGTTTTTCC
Protein sequence
MAHISLSKPHYTHFRVLSSSNPNAFNSLHFFSSTQEPISTASQNESPNDPPASFHAAVPQPVEPVAVNGTEQVKRRTPRGKHRNPEKLEDIICRMMANREWTTRLQNSIR
SLVPQFDHSLVWNVLHAAKNSDHALKFFRWVERAGLFRHDRDTHLKIIEILGRASKLNHARCILLDMPNKGVEWDEDLFVVLIDSYGKAGIVQEAVKIFQKMKELDVERS
VKSYDALFKVILRRGRYMMAKRYFNAMLNEGIEPTRHTYNLMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGFYRFKMMEEAEQFFTEMKGKNIVPTVISYT
TMIKGYVSVGRVDDGLRLFEEMKAVGVKPNDITYSTLLPGLCDAEKMSEARQILTEMVDKYIAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIE
NCCKAEMYDQAVKLLDKLVEKEIVLRPQSTLEIEPSAYNPIIQYLCNHGQTGKAETFFRQLLKKGIQDEVAFNNLIRGHSKEGNPELAYEMLKIMGRRGVSRDAESYKLL
IKSYLSKGEPADAKTALDSMIECGHYPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHVEEALGRIDLLMSCNCPPDFDSLLFVLCEKG
KTIAALKLLDFGLERECSIEFSSYEKVLDTLLGAGKTLNAYSILCKIMEKGGAKDWSSCDDLIRSLNQEGNTKQADILSRMIKGGDRKGSKKASLAA