CuGenDBv2

Tan0011958 (gene) of Snake gourd v1 genome

Gene ID: Tan0011958
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG05:77966915..77971946
RNA-Seq expression: Tan0011958
Synteny: Tan0011958
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6573373.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; identity: 84.07%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        M HL+RS PI     F FKF NFPATQSRLLNTLSSLF+RC SRQ L+QIHARF+LHGFHQN TLSCKLIDCYAN GLLNLS  VF SIIDPNS LYNAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT++GEYERTLLVYREM AKSMH DE+TYPFVLRSCCCLSNV+FG  IHG L+KLGVDSYDTV T L EMY++CIDFEN HQ FDKM VKDL+CWSS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        LI++  QN NGD+ S   GRM++E LV DSLTFINLLRS++G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF K+PE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+  DWGKQTHA+ILRNG DSQVSVHNSLIDMYCECN LDSACKIFN  T+KTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  LIALSLF RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKCGCI+MAQRLFEEERVDDKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCF LYNQ+KCSNS PDQVTFLGLLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLS
         IL NLE +IKEA+  SPEKL+
Subjt:  TILENLEHEIKEARAKSPEKLS

KAG7012542.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 0.0e+00; identity: 84.14%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        M HL+RS PI     F FKF NFPATQSRLLNTLSSLF+RC SRQ L+QIHARF+LHGFHQN TLSCKLIDCYAN GLLNLS  VF SIIDPNS LYNAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT++GEYERTLLVYREM AKSMH DE+TYPFVLRSCCCLSNV+FG  IHG L+KLGVDSYDTV T L EMY++CIDFEN HQ FDKM VKDL+CWSS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        L+S   QN NGD+ SL  GRM++E LV DSLTFINLLRSI+G +SIQLAK+VHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF K+PE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+  DWGKQTHA+ILRNG DSQVSVHNSLIDMYCECN LDSACKIFN  T+KTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  LIALSLF RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKCGCI+MAQRLFEEERVDDKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLGLLT CVN GLVEKGKEFFKEM+E Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
         IL NLE +IKE +  SPEKL  LL
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

XP_022994744.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita maxima] (e value: 0.0e+00; identity: 84.97%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        M HL+RS  I  S +F FKF NFPATQSRLLNTLSSLF+RC SRQ LEQIHARF+LHGFHQN TLSCKLIDCYAN GLLN+S  VF SIIDPNSTLYNAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT++GEYERTLLVYREM AKSMH DE+TYPFVL+SCCCLSNVEFG  IHG L+KLGVDSYDTV T LAEMY +CIDFEN HQ FDKM VKDL+CWSS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        LIS+  QN NGDE SL  GRM++E LV DSLTFINLLRSI+G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF KMPE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+C DWGKQTHA+ILRNG DSQVSVHNSLIDMYCECN L+SACKIFN  T+KTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  LIALSLF  MK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKCGCIEMAQRLFEEERV+DKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLGLLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
         IL NLE +IKEA+  SPEKL  LL
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

XP_023541395.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita pepo subsp. pepo] (e value: 0.0e+00; identity: 84%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        M HL+RS PI  S +F FKF NFPATQSRL NTLSSLF+RC SRQ L+QIHARF+LHGFHQN TLSCKLIDCYAN GLLNLS  VF SIIDPNSTLYNAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT++GEYERTLL+YREM  KSMH DE+TYPFVLRSCCCLS+VEFG  IHG L+KLGVDSYDTV T LAEMY++CIDFEN HQ FDKM VKDL+CWSS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        L+S+  QN NGD+ SL  GRM++E +V DSLTFIN LRS++G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF KMPE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+  DWGKQTHA+ILRNG DSQVSVHNSLIDMYCECN LDSACKIFN  T+KTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  LIALSLF RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKCGCI+MAQRLFEEERVDDKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLY+Q+KCSNS PDQVTFLGLLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
         IL NLE +IKEA+  SPEKL  LL
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

XP_038894029.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Benincasa hispida] (e value: 0.0e+00; identity: 84.83%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        MLHL+RS P+IHS +    F NFPATQSRLLNTLS LF+RC+SRQHL+QIHARF+LHGFHQN TLS KLIDCYANLGLLNLS QVFYSI +PNST+YNAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT+YGE ERTLLVYR+M AKSMH DEETYP VLRSCC  SNV  G KIHG LVKLG DS+D VATAL EMY+ECIDFE+ HQ FDK  VKDLECWSS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
          ++  QN NG+      GRMR EQLV DSLTFINLLR IAGFNSIQLAKIVHCIAIVS LCGDLLVNTAVLSLYSKLGSLVDARKLF+KMPEND VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREGKPTECL LF SMARSGIR+D+FTALPVISS+SQL+  DWGKQTHA+ILRNG DSQVSV+NSLIDMYCECNILDSACKIFN   DKTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKGYVKHG SLIALSLF  MK +GIQ+DFIT+INILPAFVHIG LENVKYLHGYS+KLGLTSLPSLNT LLITYAKCGCIEMAQR+FEEER+DDKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLYNQ+KCSN+KPDQVTFLGLLT CVN GLVEKGKEF KEM ENYGCQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+L+DMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING VTEFRVADQTHPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
        TIL NLE EIKEAR KS EKL N L
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0M0Z6 Uncharacterized protein (e value: 0.0e+00; identity: 82.62%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        MLHL RS PIIHS +F     NFPATQSRLLNTLS LF+RCNS QHL+QIHARFILHGFHQN TLS KLIDCYANLGLLN S QVF S+IDPN TL+NAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT+YGE ERTLLVY++M AKSMH DEETYPFVLRSC   SNV FG  IHG LVKLG D +D VATALAEMY+ECI+FEN HQ FDK  VKDL   SS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        L ++  QN NG+      GRM  EQLVPDS TF NLLR IAG NSIQLAKIVHCIAIVS L GDLLVNTAVLSLYSKL SLVDARKLF+KMPE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREGKPTECLELF SMARSGIR+DLFTALPVISS++QL+CVDWGKQTHAHILRNG DSQVSVHNSLIDMYCEC ILDSACKIFN  TDK+VISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKGYVK+G+SL ALSLF +MK +GIQADF+ MINILPAFVHIGALENVKYLHGYS+KLGLTSLPSLNT LLITYAKCG IEMAQRLFEEE++DDKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLYN++KCSNSKPDQVTFLGLLT CVN GLVEKGKEFFKEM E+YGCQPSQEHYACMVNLLGRAGLI+EAGELV+ MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACK+H  SKLAEFAAE+LI+MEP+NAGNYILLSNIYAAAG+WD VAKMRSFLR+KGLKK PGCSWLEING VTEFRVADQTHPRA DIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
        TIL NLE EIKE R KSP+ L N L
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

A0A5D3DB69 Pentatricopeptide repeat-containing protein (e value: 0.0e+00; identity: 81.52%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        MLHL+RS PIIH+ +      NFPATQSRLLNTLS LFNRCNS QHL+QIHARFILHGFHQN TLS KLIDCYANLGLL  S QVF SIIDPN TL+NAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT+YGE ER LLVY++M AKSMH DEETYPF+ RSC   SNV FG  IHG LVKLG DS+D VATALAEMY++ I FEN HQ FDK  VKDL   SS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        L ++ SQN NG+       RMR EQLVPDSLTF+NLLR IAG NSIQLAKIVHCIAIVS L GDLLV TAVLSLYSKL SLVDAR+LF+KMPE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREGKP ECLELF SMARSGIR+DLFTALPVISS++QL+CVDWGKQTHAHILRNG DSQVSVHNSLIDMYCEC +LDSAC IFN  TDK+VISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKGYVK+G+SL A SLF +MK +GIQADF+TMINILPAFVHIGALENVKYLHGYS+KLGLTSLPSLNT LLITYAKCG IEMAQRLFEEER+DDKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLYN++KCSNSKPDQVTFLGLLT CVN GL+EKGKEFFKEM E+YGC PSQEH+ACMVNLLGRAGLI+EAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACK+H  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+W+ VAKMRSFLR+KGLKKTPGCS LEING+VTEFRVADQTHPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
        TIL NLE EIKE R KS + L N L
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

A0A6J1CE61 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like (e value: 0.0e+00; identity: 84.83%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        MLHL+RS PI     F F+F NFPATQSR LNTLS LF+RC+SRQ LEQIHARFILHG HQN  LSC+LID YANLGLL LSQQVF SIIDP STLY+AI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNL+ +GEYERTLLVYREMFAKSMH DEETYP VLRSCCCLSNVE+G KIHG LVKLGVD YD+ ATALAEMY +CI FEN H  FDKM +KD ECW+S
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        L S+ SQN NGDE     GRMRTEQLV DSLTFINLLRSI G NSIQLAKIVHC+AI SNLCGDLLVNTAVLSLYSKLG LV+ARKLF+KMPE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AY REG P ECLELF SMARSGIRADLFTALPVISS+SQL+CVDWGKQTHAH LRNG D+QVSVHNSLIDMYCE NILDSACKIF+  T+KTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG+SL ALSLF RMK +GIQADFIT+INILPAFVHIGALENVKYLHGYS+KLGLTSLPSLNT LLITYAKCGCIEMAQRLFEEERVDDKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFK+YNQ+KCSNS+PDQVTFLGLLT CVN GLVEKGKE FKEM+ENYGCQPSQEHYACMVNLLGRAGLIN+AG LVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING VTEFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
        TIL NLE EIKEAR KSPEKL  LL
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

A0A6J1GR57 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like (e value: 0.0e+00; identity: 83.45%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        M HL+RS PI     F FKF NFPAT SRLLNTLSSLF+RC SRQ L+QIHARF+LHGFHQN TLSCKLIDCYAN GLLNLS  VF SIIDPNS LYNAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT++GEYERTLLVYREM AKSMH DE+TYPFVLRSCCCLSNV+FG  IHG L+KLGVDSYDTV T L EMY++CIDFEN HQ FDKM VKDL+CWSS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        LI++  QN NGD+ S   GRM++E LV DSLTFINLLRS++G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF K+PE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREG+P ECLELF SMARSGIRADLFT LPVISS+SQL+  DWGKQTHA+ILRNG DSQVSVHNSLIDMYCECN LDSA KIFN  T+KTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  LIALSLF RMK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKCGCI+MAQRLFEEERV+DKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLGLLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
         IL NLE +IKE +  SPEKL  LL
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

A0A6J1K3Q8 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like (e value: 0.0e+00; identity: 84.97%)
Query:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI
        M HL+RS  I  S +F FKF NFPATQSRLLNTLSSLF+RC SRQ LEQIHARF+LHGFHQN TLSCKLIDCYAN GLLN+S  VF SIIDPNSTLYNAI
Subjt:  MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAI

Query:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS
        LRNLT++GEYERTLLVYREM AKSMH DE+TYPFVL+SCCCLSNVEFG  IHG L+KLGVDSYDTV T LAEMY +CIDFEN HQ FDKM VKDL+CWSS
Subjt:  LRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSS

Query:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN
        LIS+  QN NGDE SL  GRM++E LV DSLTFINLLRSI+G +SIQLAKIVHCIAIVSNLCGDLLV+TAVLSLYSKLGSLVDARKLF KMPE D VVWN
Subjt:  LISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWN

Query:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW
        IMI+AYAREG+P ECLELF SMARSGIRADLFTALPVISS+SQL+C DWGKQTHA+ILRNG DSQVSVHNSLIDMYCECN L+SACKIFN  T+KTVISW
Subjt:  IMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISW

Query:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  LIALSLF  MK +GIQADFIT+INI+PAFV IGALENVKYLHGYS+KL LTSLPSLNT LLITYAKCGCIEMAQRLFEEERV+DKDL
Subjt:  SAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK
        IMWNSMISAHANHGDWSQCFKLYNQ+KCSNS PDQVTFLGLLT CVN GLVEKGKEFFKEM+E+Y CQPSQEHYACMVNLLGRAGLINEAGELVR MPIK
Subjt:  IMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIK

Query:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY
        PDARVWGPLLSACKLH  SKLAEFAAE+LIDMEPKNAGNYILLSNIYAAAG+WD VAKMRSFLRDKGLKKTPGCSWLEING+V EFRVAD+THPRAEDIY
Subjt:  PDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIY

Query:  TILENLEHEIKEARAKSPEKLSNLL
         IL NLE +IKEA+  SPEKL  LL
Subjt:  TILENLEHEIKEARAKSPEKLSNLL

SwissProt top hits (e value, % identity, alignment)
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic (e value: 8.7e-115; identity: 32.5%)
Query:  LFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVL
        L  RC+S + L QI      +G +Q      KL+  +   G ++ + +VF  I    + LY+ +L+   K  + ++ L  +  M    +      + ++L
Subjt:  LFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVL

Query:  RSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINL
        + C   + +  G +IHG LVK G        T L  MY +C       + FD+M  +DL  W+++++  SQN            M  E L P  +T +++
Subjt:  RSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINL

Query:  LRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALP
        L +++    I + K +H  A+ S     + ++TA++ +Y+K GSL  AR+LF+ M E + V WN MI AY +   P E + +F  M   G++    + + 
Subjt:  LRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALP

Query:  VISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMI
         + + + L  ++ G+  H   +  GLD  VSV NSLI MYC+C  +D+A  +F     +T++SW+AMI G+ ++GR + AL+ F +M+   ++ D  T +
Subjt:  VISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMI

Query:  NILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQV
        +++ A   +    + K++HG  ++  L     + T L+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +L+ +++    KP+ V
Subjt:  NILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQV

Query:  TFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKN
        TFL +++ C + GLVE G + F  M ENY  + S +HY  MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ P +
Subjt:  TFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKN

Query:  AGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEA
         G ++LL+NIY AA  W++V ++R  +  +GL+KTPGCS +EI  +V  F      HP ++ IY  LE L   IKEA
Subjt:  AGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEA

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 (e value: 2.9e-110; identity: 31.66%)
Query:  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQQVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFA
        QS+      S    C +   L+  H      G   + +   KL+     LG    L+ +++VF +     +  +YN+++R     G     +L++  M  
Subjt:  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQQVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFA

Query:  KSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNG-DENSLFSGRM
          +  D+ T+PF L +C        G +IHG +VK+G      V  +L   Y EC + ++  + FD+M  +++  W+S+I   ++     D   LF   +
Subjt:  KSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNG-DENSLFSGRM

Query:  RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS
        R E++ P+S+T + ++ + A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LF++   ++  + N M S Y R+G   E L +F  
Subjt:  RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS

Query:  MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGR---------
        M  SG+R D  + L  ISS SQLR + WGK  H ++LRNG +S  ++ N+LIDMY +C+  D+A +IF+  ++KTV++W++++ GYV++G          
Subjt:  MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGR---------

Query:  ---------------SLIALSLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRL
                        L+  SLF   + + C     EG+ AD +TM++I  A  H+GAL+  K+++ Y  K G+     L TTL+  +++CG  E A  +
Subjt:  ---------------SLIALSLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRL

Query:  FEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE
        F    + ++D+  W + I A A  G+  +  +L++ +     KPD V F+G LT C + GLV++GKE F  M++ +G  P   HY CMV+LLGRAGL+ E
Subjt:  FEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE

Query:  AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVA
        A +L+  MP++P+  +W  LL+AC++    ++A +AAE++  + P+  G+Y+LLSN+YA+AGRW+ +AK+R  +++KGL+K PG S ++I G+  EF   
Subjt:  AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVA

Query:  DQTHPRAEDIYTILENL
        D++HP   +I  +L+ +
Subjt:  DQTHPRAEDIYTILENL

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic (e value: 2.3e-107; identity: 31.96%)
Query:  HGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLV
        +GF  +S L  KL   Y N G L  + +VF  +    +  +N ++  L K G++  ++ ++++M +  +  D  T+  V +S   L +V  G ++HG ++
Subjt:  HGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLV

Query:  KLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIA
        K G    ++V  +L   Y +    ++  + FD+M  +D+  W+S+I+    N   ++      +M    +  D  T +++    A    I L + VH I 
Subjt:  KLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIA

Query:  IVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAH
        + +    +      +L +YSK G L  A+ +F +M +   V +  MI+ YAREG   E ++LF  M   GI  D++T   V++  ++ R +D GK+ H  
Subjt:  IVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAH

Query:  ILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF-LRMKCEGIQADFITMINILPAFVHIGALENVKYLH
        I  N L   + V N+L+DMY +C  +  A  +F+    K +ISW+ +I GY K+  +  ALSLF L ++ +    D  T+  +LPA   + A +  + +H
Subjt:  ILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF-LRMKCEGIQADFITMINILPAFVHIGALENVKYLH

Query:  GYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGK
        GY ++ G  S   +  +L+  YAKCG + +A  LF++  +  KDL+ W  MI+ +  HG   +   L+NQ++ +  + D+++F+ LL  C + GLV++G 
Subjt:  GYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGK

Query:  EFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDR
         FF  M      +P+ EHYAC+V++L R G + +A   +  MPI PDA +WG LL  C++H   KLAE  AE++ ++EP+N G Y+L++NIYA A +W++
Subjt:  EFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDR

Query:  VAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE
        V ++R  +  +GL+K PGCSW+EI G+V  F   D ++P  E+I   L  +   + E
Subjt:  VAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300    2.2e-110    30.78
Query:  LHLRRS-----NPIIHSFV----------FCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILH-----GFHQNSTLSCKLIDCYANLGLLNL
        L LRRS     N II SFV          F FK   F  +    ++T   L   C + ++ + I   F+       G   N  ++  LI  Y   G +++
Subjt:  LHLRRS-----NPIIHSFV----------FCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILH-----GFHQNSTLSCKLIDCYANLGLLNL

Query:  SQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFE
          ++F  ++  +  ++N +L    K G  +  +  +  M    +  +  T+  VL  C     ++ G ++HG +V  GVD   ++  +L  MY +C  F+
Subjt:  SQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFE

Query:  NYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSL
        +  + F  M   D   W+ +IS   Q+   +E+  F   M +  ++PD++TF +LL S++ F +++  K +HC  +  ++  D+ + +A++  Y K   +
Subjt:  NYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSL

Query:  VDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNI
          A+ +F++    D VV+  MIS Y   G   + LE+F  + +  I  +  T + ++  +  L  +  G++ H  I++ G D++ ++  ++IDMY +C  
Subjt:  VDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNI

Query:  LDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCG
        ++ A +IF   + + ++SW++MI    +      A+ +F +M   GI  D +++   L A  ++ +    K +HG+ +K  L S     +TL+  YAKCG
Subjt:  LDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCG

Query:  CIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQL-KCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNL
         ++ A  +F  + + +K+++ WNS+I+A  NHG       L++++ + S  +PDQ+TFL ++++C + G V++G  FF+ M E+YG QP QEHYAC+V+L
Subjt:  CIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQL-KCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNL

Query:  LGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEIN
         GRAG + EA E V++MP  PDA VWG LL AC+LH   +LAE A+ +L+D++P N+G Y+L+SN +A A  W+ V K+RS ++++ ++K PG SW+EIN
Subjt:  LGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEIN

Query:  GQVTEFRVADQTHPRAEDIYTILENLEHEIK
         +   F   D  HP +  IY++L +L  E++
Subjt:  GQVTEFRVADQTHPRAEDIYTILENLEHEIK

Q9SVP7 Pentatricopeptide repeat-containing protein At4g13650    1.8e-104    29.96
Query:  SSLFNRCNSRQHL---EQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEET
        SS+ + C   + L   EQ+H   +  GF  ++ +   L+  Y +LG L  ++ +F ++   ++  YN ++  L++ G  E+ + +++ M    +  D  T
Subjt:  SSLFNRCNSRQHL---EQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEET

Query:  YPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSL
           ++ +C     +  G ++H    KLG  S + +  AL  +Y +C D E     F +  V+++  W+ ++       +   +     +M+ E++VP+  
Subjt:  YPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSL

Query:  TFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADL
        T+ ++L++      ++L + +H   I +N   +  V + ++ +Y+KLG L  A  +  +    D V W  MI+ Y +     + L  F  M   GIR+D 
Subjt:  TFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADL

Query:  FTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQAD
              +S+ + L+ +  G+Q HA    +G  S +   N+L+ +Y  C  ++ +   F  T     I+W+A++ G+ + G +  AL +F+RM  EGI  +
Subjt:  FTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQAD

Query:  FITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNS
          T  + + A      ++  K +H    K G  S   +   L+  YAKCG I  A++ F E  V  K+ + WN++I+A++ HG  S+    ++Q+  SN 
Subjt:  FITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNS

Query:  KPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLID
        +P+ VT +G+L+ C + GLV+KG  +F+ M   YG  P  EHY C+V++L RAGL++ A E ++ MPIKPDA VW  LLSAC +H   ++ EFAA  L++
Subjt:  KPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLID

Query:  MEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE
        +EP+++  Y+LLSN+YA + +WD     R  +++KG+KK PG SW+E+   +  F V DQ HP A++I+   ++L     E
Subjt:  MEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE

Arabidopsis top hits    e value    %identity    Alignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein    6.2e-116    32.5
Query:  LFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVL
        L  RC+S + L QI      +G +Q      KL+  +   G ++ + +VF  I    + LY+ +L+   K  + ++ L  +  M    +      + ++L
Subjt:  LFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVL

Query:  RSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINL
        + C   + +  G +IHG LVK G        T L  MY +C       + FD+M  +DL  W+++++  SQN            M  E L P  +T +++
Subjt:  RSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINL

Query:  LRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALP
        L +++    I + K +H  A+ S     + ++TA++ +Y+K GSL  AR+LF+ M E + V WN MI AY +   P E + +F  M   G++    + + 
Subjt:  LRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALP

Query:  VISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMI
         + + + L  ++ G+  H   +  GLD  VSV NSLI MYC+C  +D+A  +F     +T++SW+AMI G+ ++GR + AL+ F +M+   ++ D  T +
Subjt:  VISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMI

Query:  NILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQV
        +++ A   +    + K++HG  ++  L     + T L+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +L+ +++    KP+ V
Subjt:  NILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQV

Query:  TFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKN
        TFL +++ C + GLVE G + F  M ENY  + S +HY  MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++ P +
Subjt:  TFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKN

Query:  AGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEA
         G ++LL+NIY AA  W++V ++R  +  +GL+KTPGCS +EI  +V  F      HP ++ IY  LE L   IKEA
Subjt:  AGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEA

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885)    2.1e-111    31.66
Query:  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQQVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFA
        QS+      S    C +   L+  H      G   + +   KL+     LG    L+ +++VF +     +  +YN+++R     G     +L++  M  
Subjt:  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQQVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFA

Query:  KSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNG-DENSLFSGRM
          +  D+ T+PF L +C        G +IHG +VK+G      V  +L   Y EC + ++  + FD+M  +++  W+S+I   ++     D   LF   +
Subjt:  KSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNG-DENSLFSGRM

Query:  RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS
        R E++ P+S+T + ++ + A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LF++   ++  + N M S Y R+G   E L +F  
Subjt:  RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS

Query:  MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGR---------
        M  SG+R D  + L  ISS SQLR + WGK  H ++LRNG +S  ++ N+LIDMY +C+  D+A +IF+  ++KTV++W++++ GYV++G          
Subjt:  MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGR---------

Query:  ---------------SLIALSLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRL
                        L+  SLF   + + C     EG+ AD +TM++I  A  H+GAL+  K+++ Y  K G+     L TTL+  +++CG  E A  +
Subjt:  ---------------SLIALSLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRL

Query:  FEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE
        F    + ++D+  W + I A A  G+  +  +L++ +     KPD V F+G LT C + GLV++GKE F  M++ +G  P   HY CMV+LLGRAGL+ E
Subjt:  FEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE

Query:  AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVA
        A +L+  MP++P+  +W  LL+AC++    ++A +AAE++  + P+  G+Y+LLSN+YA+AGRW+ +AK+R  +++KGL+K PG S ++I G+  EF   
Subjt:  AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVA

Query:  DQTHPRAEDIYTILENL
        D++HP   +I  +L+ +
Subjt:  DQTHPRAEDIYTILENL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification    2.1e-111    31.66
Query:  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQQVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFA
        QS+      S    C +   L+  H      G   + +   KL+     LG    L+ +++VF +     +  +YN+++R     G     +L++  M  
Subjt:  QSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGL---LNLSQQVFYSIIDPNST-LYNAILRNLTKYGEYERTLLVYREMFA

Query:  KSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNG-DENSLFSGRM
          +  D+ T+PF L +C        G +IHG +VK+G      V  +L   Y EC + ++  + FD+M  +++  W+S+I   ++     D   LF   +
Subjt:  KSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNG-DENSLFSGRM

Query:  RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS
        R E++ P+S+T + ++ + A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LF++   ++  + N M S Y R+G   E L +F  
Subjt:  RTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMS

Query:  MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGR---------
        M  SG+R D  + L  ISS SQLR + WGK  H ++LRNG +S  ++ N+LIDMY +C+  D+A +IF+  ++KTV++W++++ GYV++G          
Subjt:  MARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGR---------

Query:  ---------------SLIALSLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRL
                        L+  SLF   + + C     EG+ AD +TM++I  A  H+GAL+  K+++ Y  K G+     L TTL+  +++CG  E A  +
Subjt:  ---------------SLIALSLF---LRMKC-----EGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRL

Query:  FEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE
        F    + ++D+  W + I A A  G+  +  +L++ +     KPD V F+G LT C + GLV++GKE F  M++ +G  P   HY CMV+LLGRAGL+ E
Subjt:  FEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINE

Query:  AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVA
        A +L+  MP++P+  +W  LL+AC++    ++A +AAE++  + P+  G+Y+LLSN+YA+AGRW+ +AK+R  +++KGL+K PG S ++I G+  EF   
Subjt:  AGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVA

Query:  DQTHPRAEDIYTILENL
        D++HP   +I  +L+ +
Subjt:  DQTHPRAEDIYTILENL

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein    1.6e-108    31.96
Query:  HGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLV
        +GF  +S L  KL   Y N G L  + +VF  +    +  +N ++  L K G++  ++ ++++M +  +  D  T+  V +S   L +V  G ++HG ++
Subjt:  HGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLV

Query:  KLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIA
        K G    ++V  +L   Y +    ++  + FD+M  +D+  W+S+I+    N   ++      +M    +  D  T +++    A    I L + VH I 
Subjt:  KLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIA

Query:  IVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAH
        + +    +      +L +YSK G L  A+ +F +M +   V +  MI+ YAREG   E ++LF  M   GI  D++T   V++  ++ R +D GK+ H  
Subjt:  IVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAH

Query:  ILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF-LRMKCEGIQADFITMINILPAFVHIGALENVKYLH
        I  N L   + V N+L+DMY +C  +  A  +F+    K +ISW+ +I GY K+  +  ALSLF L ++ +    D  T+  +LPA   + A +  + +H
Subjt:  ILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLF-LRMKCEGIQADFITMINILPAFVHIGALENVKYLH

Query:  GYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGK
        GY ++ G  S   +  +L+  YAKCG + +A  LF++  +  KDL+ W  MI+ +  HG   +   L+NQ++ +  + D+++F+ LL  C + GLV++G 
Subjt:  GYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGLVEKGK

Query:  EFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDR
         FF  M      +P+ EHYAC+V++L R G + +A   +  MPI PDA +WG LL  C++H   KLAE  AE++ ++EP+N G Y+L++NIYA A +W++
Subjt:  EFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDR

Query:  VAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE
        V ++R  +  +GL+K PGCSW+EI G+V  F   D ++P  E+I   L  +   + E
Subjt:  VAKMRSFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKE

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.6e-111    30.78
Query:  LHLRRS-----NPIIHSFV----------FCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILH-----GFHQNSTLSCKLIDCYANLGLLNL
        L LRRS     N II SFV          F FK   F  +    ++T   L   C + ++ + I   F+       G   N  ++  LI  Y   G +++
Subjt:  LHLRRS-----NPIIHSFV----------FCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILH-----GFHQNSTLSCKLIDCYANLGLLNL

Query:  SQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFE
          ++F  ++  +  ++N +L    K G  +  +  +  M    +  +  T+  VL  C     ++ G ++HG +V  GVD   ++  +L  MY +C  F+
Subjt:  SQQVFYSIIDPNSTLYNAILRNLTKYGEYERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFE

Query:  NYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSL
        +  + F  M   D   W+ +IS   Q+   +E+  F   M +  ++PD++TF +LL S++ F +++  K +HC  +  ++  D+ + +A++  Y K   +
Subjt:  NYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGRMRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSL

Query:  VDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNI
          A+ +F++    D VV+  MIS Y   G   + LE+F  + +  I  +  T + ++  +  L  +  G++ H  I++ G D++ ++  ++IDMY +C  
Subjt:  VDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRADLFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNI

Query:  LDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCG
        ++ A +IF   + + ++SW++MI    +      A+ +F +M   GI  D +++   L A  ++ +    K +HG+ +K  L S     +TL+  YAKCG
Subjt:  LDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILPAFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCG

Query:  CIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQL-KCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNL
         ++ A  +F  + + +K+++ WNS+I+A  NHG       L++++ + S  +PDQ+TFL ++++C + G V++G  FF+ M E+YG QP QEHYAC+V+L
Subjt:  CIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQL-KCSNSKPDQVTFLGLLTTCVNCGLVEKGKEFFKEMVENYGCQPSQEHYACMVNL

Query:  LGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEIN
         GRAG + EA E V++MP  PDA VWG LL AC+LH   +LAE A+ +L+D++P N+G Y+L+SN +A A  W+ V K+RS ++++ ++K PG SW+EIN
Subjt:  LGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMRSFLRDKGLKKTPGCSWLEIN

Query:  GQVTEFRVADQTHPRAEDIYTILENLEHEIK
         +   F   D  HP +  IY++L +L  E++
Subjt:  GQVTEFRVADQTHPRAEDIYTILENLEHEIK


Sequences
CDS sequence
ATGCTTCACCTTCGACGATCAAATCCCATTATTCATAGTTTCGTTTTTTGTTTCAAATTCCAGAACTTTCCTGCCACCCAATCAAGATTGCTCAACACGCTTTCCTCCCT
CTTCAATCGATGCAACTCACGTCAACACCTCGAACAAATTCATGCCAGATTCATTCTCCATGGTTTCCACCAAAACTCAACTCTCTCTTGCAAGCTTATTGACTGTTATG
CGAATCTTGGACTCCTTAATCTCTCTCAGCAAGTTTTCTACTCTATAATCGATCCCAATTCAACTCTTTATAATGCTATACTGAGAAATTTGACTAAATATGGTGAATAC
GAGCGTACATTGTTGGTGTACCGAGAAATGTTTGCCAAGTCCATGCACTCGGATGAAGAGACTTACCCTTTTGTTTTGCGATCCTGTTGTTGCTTATCAAATGTTGAATT
TGGGACGAAGATTCATGGGCGTTTGGTTAAACTTGGTGTTGATTCATATGATACGGTAGCCACTGCTCTAGCTGAGATGTATGATGAGTGCATTGATTTTGAGAATTATC
ATCAACCGTTTGATAAAATGTTTGTGAAGGATTTGGAATGCTGGAGTTCCTTGATTTCAAAGAATTCTCAAAATAGGAATGGAGATGAAAATTCCTTGTTCTCTGGGAGA
ATGAGAACAGAGCAATTAGTACCAGATTCACTCACATTCATCAATCTCTTGAGGTCCATTGCAGGTTTTAATTCAATTCAGCTTGCAAAGATTGTTCATTGTATTGCAAT
TGTGAGCAACTTGTGTGGAGATTTGTTAGTAAATACTGCTGTATTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTGGATGCTAGAAAATTATTTAACAAAATGCCAGAGA
ATGACTGTGTTGTATGGAATATAATGATATCAGCTTACGCCCGAGAAGGGAAACCGACAGAATGTCTCGAGCTCTTCATGTCCATGGCACGATCTGGGATTAGAGCTGAT
CTATTTACTGCACTCCCTGTTATCTCTTCAGTTTCACAGTTGAGATGTGTTGATTGGGGCAAACAAACCCATGCCCATATATTGAGGAATGGTTTAGACAGTCAAGTTTC
AGTTCATAACTCTCTCATTGACATGTACTGCGAGTGTAACATCTTAGATTCGGCTTGTAAGATCTTCAACGGGACGACAGACAAGACTGTAATTTCATGGAGTGCAATGA
TCAAGGGGTATGTCAAACATGGTCGGTCTCTCATTGCTTTGTCTCTCTTCCTTAGGATGAAATGTGAAGGGATTCAAGCTGATTTCATTACAATGATTAATATCTTACCT
GCATTTGTTCACATAGGAGCACTTGAAAATGTCAAATATTTACATGGGTACTCAGTGAAGCTAGGTCTGACTTCCCTTCCATCACTTAACACAACCCTCCTAATTACCTA
TGCAAAATGTGGCTGTATAGAGATGGCCCAAAGGCTATTTGAGGAAGAAAGAGTTGATGACAAGGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATG
GAGATTGGTCCCAATGTTTCAAGTTGTACAATCAACTGAAGTGCTCAAATTCAAAGCCAGATCAAGTAACATTTTTGGGACTACTAACAACTTGCGTCAATTGCGGTCTC
GTAGAAAAAGGAAAGGAGTTTTTCAAGGAGATGGTTGAAAATTATGGTTGCCAGCCAAGTCAAGAGCATTATGCTTGTATGGTTAATCTCTTAGGGAGAGCTGGACTTAT
CAATGAAGCTGGAGAACTTGTAAGAACCATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATTCTAGGTCCAAGCTTGCAGAGT
TTGCAGCAGAGCAGCTCATTGATATGGAGCCTAAAAATGCAGGGAATTACATATTGCTTTCGAACATATATGCTGCTGCAGGAAGATGGGATAGAGTTGCAAAAATGAGA
AGTTTCCTTAGGGATAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCAGGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGA
AGATATATATACCATCCTAGAAAACCTTGAACATGAAATCAAAGAAGCTAGAGCAAAGAGTCCAGAAAAATTGAGTAATCTTCTATAA
mRNA sequence
CGATGTTCCAATTTTATTTGAATTTGAAAGTGTGAGTGTGCTTGTGTTTTATTTTAGATTTGGAAGGGTTAAAAAGAAGAGTTGGGAGGGATAGGTTGGTCATTTTAACT
ATTTTAGTGTCAAAAATAAGAGAAACAAAGAAAATAGAAAGTTAAAAAAAAAAAAAACCCACCGACCTTTTCCTTTTGTTCTTCGCGGGCACACCGGAAGGTAAGGGGGG
TTTAGGGTTTAAGGGATCACTCGACGGTGCACGGAGGGGAAGGGACCGACAAACGATCCGAGTCAGCGGCGGTTTCCGGCAGATGTTTCTCCCTCTCCTTCGCGAATTTC
TTTTTCTCTCTCTTGCGCTCGAGCAAAACTCCGGACGCCATCCACTCCTTTGGGTCAGCGGCGGCATCTGCAGCAAGCGGTGGCCGATTGTAGTTGACGACTCCGATAGA
TCTGATTGACGTCGACACCCTGCGACGCCATTCCCTTGCATCAGATCTGTGCCATTGACACGAGCTTCGACTACAACGGCGATCTCACCGCTCGCGGACGATTCTTCTTC
AGTGACAAACGCAACCTTCGGCAAAGACCCACGCGTAATTGACTCCAGATCTGCAGTACCCAGTTTTTTTTCGGGTTGGATCGGTACACAAACGACTTTATCGCCTTTGT
TTGAGCTAAGAATCGAATACTCTTAGCCTTACAATGTCAGATCTGCGAGGTAATGTTTTTTTACTGCTTGGTTCTTGGTCGGTTTTGGTGTTTTGAAGATAACCCACAGT
AGTTCTAACTGTTTCTGATTACCCATAACATTTGGGTATTGGGTTTGCGGAATCAACCCATTCGGAGTCAAATTCAGCCAGGATTCAAATTAAAAAACTAAGAAGTGGTT
GGGCGAGCGAGAGCGAAGTTCGCCAGGTAGTCACTCTTGAGGTGATGTGCAACCATGAGGCGGATGATAGAAGAGATGATCGACCAAGGCTATGGAGACGTCATGAGAGT
TTCTCCCTAGGGTCATAGTTTAGTTTTGTCCTTGTCAAGTAACGTGGGGTAGATTTGTTTAATTGCAGTTTCCTTTGTGTTCTCTTTTGTTCAGTGGTTTCTCTTAGAGC
TAAAGTCTGTGGTTGGCCAAACTCCGTTTTTATTCGTAGTTTGGTGGATGTTGAGCGGTTCTTCTGAGATATAGAAGTATTTTTGTACCTTTCAGTCCCTTGTAAACTTT
TGGTTTCCTTGTAATTATATTGTTATTTTGTCGAAATTTCAGGCATAAGTGGTGTTGATCAGTTTCATGTCGTCCATAGCCTATCCAAGTTTAGGGTCATTATATTAATG
CTTTCTTGAATTCAAAAAGCGAACTCCGAGTCTCATATCCCTACCAAAAACGCGCTGCCCCTTTTGCCTAACATGCTTCACCTTCGACGATCAAATCCCATTATTCATAG
TTTCGTTTTTTGTTTCAAATTCCAGAACTTTCCTGCCACCCAATCAAGATTGCTCAACACGCTTTCCTCCCTCTTCAATCGATGCAACTCACGTCAACACCTCGAACAAA
TTCATGCCAGATTCATTCTCCATGGTTTCCACCAAAACTCAACTCTCTCTTGCAAGCTTATTGACTGTTATGCGAATCTTGGACTCCTTAATCTCTCTCAGCAAGTTTTC
TACTCTATAATCGATCCCAATTCAACTCTTTATAATGCTATACTGAGAAATTTGACTAAATATGGTGAATACGAGCGTACATTGTTGGTGTACCGAGAAATGTTTGCCAA
GTCCATGCACTCGGATGAAGAGACTTACCCTTTTGTTTTGCGATCCTGTTGTTGCTTATCAAATGTTGAATTTGGGACGAAGATTCATGGGCGTTTGGTTAAACTTGGTG
TTGATTCATATGATACGGTAGCCACTGCTCTAGCTGAGATGTATGATGAGTGCATTGATTTTGAGAATTATCATCAACCGTTTGATAAAATGTTTGTGAAGGATTTGGAA
TGCTGGAGTTCCTTGATTTCAAAGAATTCTCAAAATAGGAATGGAGATGAAAATTCCTTGTTCTCTGGGAGAATGAGAACAGAGCAATTAGTACCAGATTCACTCACATT
CATCAATCTCTTGAGGTCCATTGCAGGTTTTAATTCAATTCAGCTTGCAAAGATTGTTCATTGTATTGCAATTGTGAGCAACTTGTGTGGAGATTTGTTAGTAAATACTG
CTGTATTGTCTCTTTACTCAAAGTTAGGTAGCTTAGTGGATGCTAGAAAATTATTTAACAAAATGCCAGAGAATGACTGTGTTGTATGGAATATAATGATATCAGCTTAC
GCCCGAGAAGGGAAACCGACAGAATGTCTCGAGCTCTTCATGTCCATGGCACGATCTGGGATTAGAGCTGATCTATTTACTGCACTCCCTGTTATCTCTTCAGTTTCACA
GTTGAGATGTGTTGATTGGGGCAAACAAACCCATGCCCATATATTGAGGAATGGTTTAGACAGTCAAGTTTCAGTTCATAACTCTCTCATTGACATGTACTGCGAGTGTA
ACATCTTAGATTCGGCTTGTAAGATCTTCAACGGGACGACAGACAAGACTGTAATTTCATGGAGTGCAATGATCAAGGGGTATGTCAAACATGGTCGGTCTCTCATTGCT
TTGTCTCTCTTCCTTAGGATGAAATGTGAAGGGATTCAAGCTGATTTCATTACAATGATTAATATCTTACCTGCATTTGTTCACATAGGAGCACTTGAAAATGTCAAATA
TTTACATGGGTACTCAGTGAAGCTAGGTCTGACTTCCCTTCCATCACTTAACACAACCCTCCTAATTACCTATGCAAAATGTGGCTGTATAGAGATGGCCCAAAGGCTAT
TTGAGGAAGAAAGAGTTGATGACAAGGATTTGATAATGTGGAACTCCATGATCAGTGCCCATGCCAACCATGGAGATTGGTCCCAATGTTTCAAGTTGTACAATCAACTG
AAGTGCTCAAATTCAAAGCCAGATCAAGTAACATTTTTGGGACTACTAACAACTTGCGTCAATTGCGGTCTCGTAGAAAAAGGAAAGGAGTTTTTCAAGGAGATGGTTGA
AAATTATGGTTGCCAGCCAAGTCAAGAGCATTATGCTTGTATGGTTAATCTCTTAGGGAGAGCTGGACTTATCAATGAAGCTGGAGAACTTGTAAGAACCATGCCCATCA
AACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATTCTAGGTCCAAGCTTGCAGAGTTTGCAGCAGAGCAGCTCATTGATATGGAGCCTAAAAAT
GCAGGGAATTACATATTGCTTTCGAACATATATGCTGCTGCAGGAAGATGGGATAGAGTTGCAAAAATGAGAAGTTTCCTTAGGGATAAAGGGCTCAAGAAAACCCCTGG
TTGTAGTTGGCTGGAGATAAATGGCCAGGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCTAGAGCAGAAGATATATATACCATCCTAGAAAACCTTGAACATGAAA
TCAAAGAAGCTAGAGCAAAGAGTCCAGAAAAATTGAGTAATCTTCTATAACACTTGTATCCATTTTTTGTTTTTAATGATATATCTTCTCATATAACAGGTTTTACTTGC
TCATTTATTTACGTCTTCACATTACATTGTTTACATGACGATGTCCAATAAATATATTAATGTAAATGATCTCATTTCTTCAGCATTTATGTC
Protein sequence
MLHLRRSNPIIHSFVFCFKFQNFPATQSRLLNTLSSLFNRCNSRQHLEQIHARFILHGFHQNSTLSCKLIDCYANLGLLNLSQQVFYSIIDPNSTLYNAILRNLTKYGEY
ERTLLVYREMFAKSMHSDEETYPFVLRSCCCLSNVEFGTKIHGRLVKLGVDSYDTVATALAEMYDECIDFENYHQPFDKMFVKDLECWSSLISKNSQNRNGDENSLFSGR
MRTEQLVPDSLTFINLLRSIAGFNSIQLAKIVHCIAIVSNLCGDLLVNTAVLSLYSKLGSLVDARKLFNKMPENDCVVWNIMISAYAREGKPTECLELFMSMARSGIRAD
LFTALPVISSVSQLRCVDWGKQTHAHILRNGLDSQVSVHNSLIDMYCECNILDSACKIFNGTTDKTVISWSAMIKGYVKHGRSLIALSLFLRMKCEGIQADFITMINILP
AFVHIGALENVKYLHGYSVKLGLTSLPSLNTTLLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKLYNQLKCSNSKPDQVTFLGLLTTCVNCGL
VEKGKEFFKEMVENYGCQPSQEHYACMVNLLGRAGLINEAGELVRTMPIKPDARVWGPLLSACKLHSRSKLAEFAAEQLIDMEPKNAGNYILLSNIYAAAGRWDRVAKMR
SFLRDKGLKKTPGCSWLEINGQVTEFRVADQTHPRAEDIYTILENLEHEIKEARAKSPEKLSNLL