CuGenDBv2

Sed0024360 (gene) of Chayote v1 genome

Gene ID: Sed0024360
Organism: Sechium edule (Chayote v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG01:17072524..17075174
RNA-Seq Expression: Sed0024360
Synteny: Sed0024360
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6573373.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 0.0e+00 | % identity: 84.4
Query:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA
        M HLQRS P+FRF F NFPATQSR LNTLS LF+RC SRQ LQQIHARFVLHGFHQNPTLS KLID YAN GLLNLSH +F S+IDPNS LYNAILRNL 
Subjt:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA

Query:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG
         +GEYERTLLVYREMVAKSM+PDE+TYPFVLRSCCCLSNV+FG+NIHGCLIKLGVDSY  V   L EMY +CIDFENAHQLFDKMS KD +C S LI+  
Subjt:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG

Query:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI
         QNGNGD+I R  G   M+SE LV DSL F NLL+S++G +SIQLAK+VHCIAIVSNLC DLLVDTAVLSLYSKLGSLVDARKLF+K+PEKDRVVWNIMI
Subjt:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI

Query:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM
        A YAREG PMECLELF+SMA  GIRADLFTALPVISSISQLK  DWGKQTHA++LRNGSDSQVSV+NSLIDMYCECN L+SACKIFN +T+KTVISWS M
Subjt:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM

Query:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW
         KG VK+G  LIALSLF RMKS+GIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERV DKDLIMW
Subjt:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW

Query:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA
        NSMISAHANHGDWS CF LYNQMKCS+S PDQVTFLGLLTACVNSGLVEKGKE FKEM+ESY CQPSQEHYACMVNLLGRAGL+NEAGELV+NMPIKPDA
Subjt:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA

Query:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL
        RVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY IL
Subjt:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL

Query:  GNLEHEIKEAREKSPEKL
        GNLE +IKEA+E SPEKL
Subjt:  GNLEHEIKEAREKSPEKL

KAG7012542.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 0.0e+00 | % identity: 84.05
Query:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA
        M HLQRS P+FRF F NFPATQSR LNTLS LF+RC SRQ LQQIHARFVLHGFHQNPTLS KLID YAN GLLNLSH +F S+IDPNS LYNAILRNL 
Subjt:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA

Query:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG
         +GEYERTLLVYREMVAKSM+PDE+TYPFVLRSCCCLSNV+FG+NIHGCLIKLGVDSY  V   L EMY +CIDFENAHQLFDKMS KD +C S L+S+ 
Subjt:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG

Query:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI
         QNGNGD+I    G   M+SE LV DSL F NLL+SI+G +SIQLAK+VHCIAIVSNLC DLLVDTAVLSLYSKLGSLVDARKLF+K+PEKDRVVWNIMI
Subjt:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI

Query:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM
        A YAREG PMECLELF+SMA  GIRADLFTALPVISSISQLK  DWGKQTHA++LRNGSDSQVSV+NSLIDMYCECN L+SACKIFN +T+KTVISWS M
Subjt:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM

Query:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW
         KG VK+G  LIALSLF RMKS+GIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERV DKDLIMW
Subjt:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW

Query:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA
        NSMISAHANHGDWS CFKLYNQMKCS+S PDQVTFLGLLTACVNSGLVEKGKE FKEM+E Y CQPSQEHYACMVNLLGRAGL+NEAGELV+NMPIKPDA
Subjt:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA

Query:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL
        RVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY IL
Subjt:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL

Query:  GNLEHEIKEAREKSPEKLDNL
        GNLE +IKE +E SPEKL  L
Subjt:  GNLEHEIKEAREKSPEKLDNL

XP_022139869.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia] | e-value: 0.0e+00 | % identity: 85.65
Query:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA
        MLHLQRS P+FRF F NFPATQSRPLNTLS LF+RCSSRQ L+QIHARF+LHG HQNP LS +LIDSYANLGLL LS Q+F S+IDP STLY+AILRNL+
Subjt:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA

Query:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG
        ++GEYERTLLVYREM AKSM+PDEETYP VLRSCCCLSNVE+GR IHG L+KLGVD Y   A ALAEMYR+CI FEN H LFDKM  KDFEC + L S  
Subjt:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG

Query:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI
        SQNGNGDEIF+  G   MR+EQLV DSL F NLL+SI G NSIQLAK+VHC+AI SNLC DLLV+TAVLSLYSKLG LV+ARKLFDKMPEKDRVVWNIMI
Subjt:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI

Query:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM
        A Y REGNP ECLELFKSMA  GIRADLFTALPVISSISQLKCVDWGKQTHAH LRNGSD+QVSV+NSLIDMYCE NIL+SACKIF+WMT+KTVISWS M
Subjt:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM

Query:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW
         KG VK+GQSL ALSLF+RMKS+GIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERV DKDLIMW
Subjt:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW

Query:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA
        NSMISAHANHGDWS CFK+YNQMKCS+S+PDQVTFLGLLTACVNSGLVEKGKE FKEM+E+YGCQPSQEHYACMVNLLGRAGL+N+AG LV+NMPIKPDA
Subjt:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA

Query:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL
        RVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTIL
Subjt:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL

Query:  GNLEHEIKEAREKSPEKL
        GNLE EIKEAREKSPEKL
Subjt:  GNLEHEIKEAREKSPEKL

XP_022954531.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita moschata] | e-value: 0.0e+00 | % identity: 83.77
Query:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA
        M HLQRS P+FRF F NFPAT SR LNTLS LF+RC SRQ LQQIHARFVLHGFHQNPTLS KLID YAN GLLNLSH +F S+IDPNS LYNAILRNL 
Subjt:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA

Query:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG
         +GEYERTLLVYREMVAKSM+PDE+TYPFVLRSCCCLSNV+FG+NIHGCLIKLGVDSY  V   L EMY +CIDFENAHQLFDKMS KD +C S LI+  
Subjt:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG

Query:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI
         QNGNGD+I R  G   M+SE LV DSL F NLL+S++G +SIQLAK+VHCIAIVSNLC DLLVDTAVLSLYSKLGSLVDARKLF+K+PEKDRVVWNIMI
Subjt:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI

Query:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM
        A YAREG PMECLELF+SMA  GIRADLFT LPVISSISQLK  DWGKQTHA++LRNGSDSQVSV+NSLIDMYCECN L+SA KIFN +T+KTVISWS M
Subjt:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM

Query:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW
         KG VK+G  LIALSLF RMKS+GIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERV DKDLIMW
Subjt:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW

Query:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA
        NSMISAHANHGDWS CFKLYNQMKCS+S PDQVTFLGLLTACVNSGLVEKGKE FKEM+ESY CQPSQEHYACMVNLLGRAGL+NEAGELV+NMPIKPDA
Subjt:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA

Query:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL
        RVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY IL
Subjt:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL

Query:  GNLEHEIKEAREKSPEKLDNL
        GNLE +IKE +E SPEKL  L
Subjt:  GNLEHEIKEAREKSPEKLDNL

XP_022994744.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita maxima] | e-value: 0.0e+00 | % identity: 84.57
Query:  MLHLQRS-----NPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAI
        M HLQRS     +P+FRF F NFPATQSR LNTLS LF+RC SRQ L+QIHARFVLHGFHQNPTLS KLID YAN GLLN+SH +F S+IDPNSTLYNAI
Subjt:  MLHLQRS-----NPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAI

Query:  LRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSP
        LRNL  +GEYERTLLVYREMVAKSM+PDE+TYPFVL+SCCCLSNVEFG+NIHGCLIKLGVDSY  V   LAEMY +CIDFENAHQLFDKMS KD +C S 
Subjt:  LRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSP

Query:  LISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVV
        LIS   QNGNGDEI   LG   M+SE LV DSL F NLL+SI+G +SIQLAK+VHCIAIVSNLC DLLVDTAVLSLYSKLGSLVDARKLF+KMPEKDRVV
Subjt:  LISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVV

Query:  WNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVI
        WNIMIA YAREG PMECLELF+SMA  GIRADLFTALPVISSISQLKC DWGKQTHA++LRNGSDSQVSV+NSLIDMYCECN LESACKIFN +T+KTVI
Subjt:  WNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVI

Query:  SWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDK
        SWS M KG VK+G  LIALSLF  MKS+GIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCIEMAQRLFEEERV DK
Subjt:  SWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDK

Query:  DLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMP
        DLIMWNSMISAHANHGDWS CFKLYNQMKCS+S PDQVTFLGLLTACVNSGLVEKGKE FKEM+ESY CQPSQEHYACMVNLLGRAGL+NEAGELV+NMP
Subjt:  DLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMP

Query:  IKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAED
        IKPDARVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAED
Subjt:  IKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAED

Query:  IYTILGNLEHEIKEAREKSPEKLDNL
        IY ILGNLE +IKEA+E SPEKL  L
Subjt:  IYTILGNLEHEIKEAREKSPEKLDNL

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0M0Z6 Uncharacterized protein | e-value: 0.0e+00 | % identity: 83.08
Query:  MLHLQRSNPLFRFN-FQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL
        MLHL RS P+     F NFPATQSR LNTLSLLF+RC+S QHLQQIHARF+LHGFHQNPTLSSKLID YANLGLLN S Q+F SVIDPN TL+NAILRNL
Subjt:  MLHLQRSNPLFRFN-FQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL

Query:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN
          YGE ERTLLVY++MVAKSM+PDEETYPFVLRSC   SNV FGR IHG L+KLG D + +VA ALAEMY ECI+FENAHQLFDK S KD    S L + 
Subjt:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN

Query:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM
        G QN NG+ IFR  G   M +EQLVPDS  FFNLL+ IAG NSIQLAK+VHCIAIVS L  DLLV+TAVLSLYSKL SLVDARKLFDKMPEKDRVVWNIM
Subjt:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM

Query:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI
        IA YAREG P ECLELFKSMA  GIR+DLFTALPVISSI+QLKCVDWGKQTHAH+LRNGSDSQVSV+NSLIDMYCEC IL+SACKIFNWMTDK+VISWS 
Subjt:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI

Query:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIM
        M KGYVK GQSL ALSLF++MKS+GIQADF+ +INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE++ DKDLIM
Subjt:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIM

Query:  WNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPD
        WNSMISAHANHGDWS CFKLYN+MKCS+SKPDQVTFLGLLTACVNSGLVEKGKE FKEM ESYGCQPSQEHYACMVNLLGRAGL++EAGELV+NMPIKPD
Subjt:  WNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPD

Query:  ARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTI
        ARVWGPLLSACK+HPGSKLAE+AAEKLI++EP+NAGNY+LLSNIYAAAGKWDGVAKMRSFLR+KGLKK PGCSWLEINGHVTEFRVADQTHPRA DIYTI
Subjt:  ARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTI

Query:  LGNLEHEIKEAREKSPEKLDN
        LGNLE EIKE REKSP+ L N
Subjt:  LGNLEHEIKEAREKSPEKLDN

A0A5D3DB69 Pentatricopeptide repeat-containing protein | e-value: 0.0e+00 | % identity: 81.97
Query:  MLHLQRSNPLFRFN-FQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL
        MLHLQRS P+       NFPATQSR LNTLSLLFNRC+S QHLQQIHARF+LHGFHQNPTLSSKLID YANLGLL  S Q+F S+IDPN TL+NAILRNL
Subjt:  MLHLQRSNPLFRFN-FQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL

Query:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN
          YGE ER LLVY++MVAKSM+PDEETYPF+ RSC   SNV FGR IHG L+KLG DS+ +VA ALAEMY + I FENAHQLFDK S KD    S L + 
Subjt:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN

Query:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM
        GSQNGNG+ IFR    V MR+EQLVPDSL F NLL+ IAG NSIQLAK+VHCIAIVS L  DLLV TAVLSLYSKL SLVDAR+LFDKMPEKDRVVWNIM
Subjt:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM

Query:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI
        IA YAREG P ECLELFKSMA  GIR+DLFTALPVISSI+QLKCVDWGKQTHAH+LRNGSDSQVSV+NSLIDMYCEC +L+SAC IFNWMTDK+VISWS 
Subjt:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI

Query:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIM
        M KGYVK GQSL A SLF++MKS+GIQADF+T+INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEER+ DKDLIM
Subjt:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIM

Query:  WNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPD
        WNSMISAHANHGDWS CFKLYN+MKCS+SKPDQVTFLGLLTACVNSGL+EKGKE FKEM ESYGC PSQEH+ACMVNLLGRAGL++EAGELV+NMPIKPD
Subjt:  WNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPD

Query:  ARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTI
        ARVWGPLLSACK+HPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEING VTEFRVADQTHPRAEDIYTI
Subjt:  ARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTI

Query:  LGNLEHEIKEAREKSPEKLDN
        LGNLE EIKE REKS + L N
Subjt:  LGNLEHEIKEAREKSPEKLDN

A0A6J1CE61 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like | e-value: 0.0e+00 | % identity: 85.65
Query:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA
        MLHLQRS P+FRF F NFPATQSRPLNTLS LF+RCSSRQ L+QIHARF+LHG HQNP LS +LIDSYANLGLL LS Q+F S+IDP STLY+AILRNL+
Subjt:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA

Query:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG
        ++GEYERTLLVYREM AKSM+PDEETYP VLRSCCCLSNVE+GR IHG L+KLGVD Y   A ALAEMYR+CI FEN H LFDKM  KDFEC + L S  
Subjt:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG

Query:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI
        SQNGNGDEIF+  G   MR+EQLV DSL F NLL+SI G NSIQLAK+VHC+AI SNLC DLLV+TAVLSLYSKLG LV+ARKLFDKMPEKDRVVWNIMI
Subjt:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI

Query:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM
        A Y REGNP ECLELFKSMA  GIRADLFTALPVISSISQLKCVDWGKQTHAH LRNGSD+QVSV+NSLIDMYCE NIL+SACKIF+WMT+KTVISWS M
Subjt:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM

Query:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW
         KG VK+GQSL ALSLF+RMKS+GIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERV DKDLIMW
Subjt:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW

Query:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA
        NSMISAHANHGDWS CFK+YNQMKCS+S+PDQVTFLGLLTACVNSGLVEKGKE FKEM+E+YGCQPSQEHYACMVNLLGRAGL+N+AG LV+NMPIKPDA
Subjt:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA

Query:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL
        RVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVAD+THPRAEDIYTIL
Subjt:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL

Query:  GNLEHEIKEAREKSPEKL
        GNLE EIKEAREKSPEKL
Subjt:  GNLEHEIKEAREKSPEKL

A0A6J1GR57 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like | e-value: 0.0e+00 | % identity: 83.77
Query:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA
        M HLQRS P+FRF F NFPAT SR LNTLS LF+RC SRQ LQQIHARFVLHGFHQNPTLS KLID YAN GLLNLSH +F S+IDPNS LYNAILRNL 
Subjt:  MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLA

Query:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG
         +GEYERTLLVYREMVAKSM+PDE+TYPFVLRSCCCLSNV+FG+NIHGCLIKLGVDSY  V   L EMY +CIDFENAHQLFDKMS KD +C S LI+  
Subjt:  TYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNG

Query:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI
         QNGNGD+I R  G   M+SE LV DSL F NLL+S++G +SIQLAK+VHCIAIVSNLC DLLVDTAVLSLYSKLGSLVDARKLF+K+PEKDRVVWNIMI
Subjt:  SQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMI

Query:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM
        A YAREG PMECLELF+SMA  GIRADLFT LPVISSISQLK  DWGKQTHA++LRNGSDSQVSV+NSLIDMYCECN L+SA KIFN +T+KTVISWS M
Subjt:  ATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIM

Query:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW
         KG VK+G  LIALSLF RMKS+GIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERV DKDLIMW
Subjt:  FKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMW

Query:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA
        NSMISAHANHGDWS CFKLYNQMKCS+S PDQVTFLGLLTACVNSGLVEKGKE FKEM+ESY CQPSQEHYACMVNLLGRAGL+NEAGELV+NMPIKPDA
Subjt:  NSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDA

Query:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL
        RVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAEDIY IL
Subjt:  RVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTIL

Query:  GNLEHEIKEAREKSPEKLDNL
        GNLE +IKE +E SPEKL  L
Subjt:  GNLEHEIKEAREKSPEKLDNL

A0A6J1K3Q8 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like | e-value: 0.0e+00 | % identity: 84.57
Query:  MLHLQRS-----NPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAI
        M HLQRS     +P+FRF F NFPATQSR LNTLS LF+RC SRQ L+QIHARFVLHGFHQNPTLS KLID YAN GLLN+SH +F S+IDPNSTLYNAI
Subjt:  MLHLQRS-----NPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAI

Query:  LRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSP
        LRNL  +GEYERTLLVYREMVAKSM+PDE+TYPFVL+SCCCLSNVEFG+NIHGCLIKLGVDSY  V   LAEMY +CIDFENAHQLFDKMS KD +C S 
Subjt:  LRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSP

Query:  LISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVV
        LIS   QNGNGDEI   LG   M+SE LV DSL F NLL+SI+G +SIQLAK+VHCIAIVSNLC DLLVDTAVLSLYSKLGSLVDARKLF+KMPEKDRVV
Subjt:  LISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVV

Query:  WNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVI
        WNIMIA YAREG PMECLELF+SMA  GIRADLFTALPVISSISQLKC DWGKQTHA++LRNGSDSQVSV+NSLIDMYCECN LESACKIFN +T+KTVI
Subjt:  WNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVI

Query:  SWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDK
        SWS M KG VK+G  LIALSLF  MKS+GIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCIEMAQRLFEEERV DK
Subjt:  SWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDK

Query:  DLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMP
        DLIMWNSMISAHANHGDWS CFKLYNQMKCS+S PDQVTFLGLLTACVNSGLVEKGKE FKEM+ESY CQPSQEHYACMVNLLGRAGL+NEAGELV+NMP
Subjt:  DLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMP

Query:  IKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAED
        IKPDARVWGPLLSACKLHPGSKLAE+AAEKLID+EPKNAGNY+LLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVAD+THPRAED
Subjt:  IKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAED

Query:  IYTILGNLEHEIKEAREKSPEKLDNL
        IY ILGNLE +IKEA+E SPEKL  L
Subjt:  IYTILGNLEHEIKEAREKSPEKLDNL

SwissProt top hits (e-value, % identity, alignment)
O81767 Pentatricopeptide repeat-containing protein At4g33990 | e-value: 3.0e-107 | % identity: 33.04
Query:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYR-EMVAKSM
        +S+ ++ +  LF  C++ Q  + +HAR V+    QN  +S+KL++ Y  LG + L+   F  + + +   +N ++      G     +  +   M++  +
Subjt:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYR-EMVAKSM

Query:  YPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRS
         PD  T+P VL++C     V  G  IH   +K G      VAA+L  +Y       NA  LFD+M  +D    + +IS   Q+GN  E      + G+R+
Subjt:  YPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRS

Query:  EQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMA
             DS+   +LL +            +H  +I   L ++L V   ++ LY++ G L D +K+FD+M  +D + WN +I  Y     P+  + LF+ M 
Subjt:  EQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMA

Query:  GLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNG-SDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTR
           I+ D  T + + S +SQL  +   +      LR G     +++ N+++ MY +  +++SA  +FNW+ +  VISW+ +  GY + G +  A+ ++  
Subjt:  GLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNG-SDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTR

Query:  MKSEG-IQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFK
        M+ EG I A+  T +++LPA    GAL     LHG  +K GL     + T+L   Y KCG +E A  LF +  +   + + WN++I+ H  HG       
Subjt:  MKSEG-IQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFK

Query:  LYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKL
        L+ +M     KPD +TF+ LL+AC +SGLV++G+  F+ M   YG  PS +HY CMV++ GRAG +  A + +++M ++PDA +WG LLSAC++H    L
Subjt:  LYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKL

Query:  AEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIK
         + A+E L ++EP++ G +VLLSN+YA+AGKW+GV ++RS    KGL+KTPG S +E++  V  F   +QTHP  E++Y  L  L+ ++K
Subjt:  AEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIK

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | e-value: 2.5e-114 | % identity: 32.16
Query:  SLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPF
        +LL  RCSS + L+QI      +G +Q     +KL+  +   G ++ + ++F  +    + LY+ +L+  A   + ++ L  +  M    + P    + +
Subjt:  SLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPF

Query:  VLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLA
        +L+ C   + +  G+ IHG L+K G          L  MY +C     A ++FD+M  +D    + +++  SQNG        + +  M  E L P  + 
Subjt:  VLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLA

Query:  FFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLF
          ++L +++    I + K +H  A+ S   + + + TA++ +Y+K GSL  AR+LFD M E++ V WN MI  Y +  NP E + +F+ M   G++    
Subjt:  FFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLF

Query:  TALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADF
        + +  + + + L  ++ G+  H   +  G D  VSV NSLI MYC+C  +++A  +F  +  +T++SW+ M  G+ + G+ + AL+ F++M+S  ++ D 
Subjt:  TALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADF

Query:  ITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSK
         T ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +L+ +M+  + K
Subjt:  ITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSK

Query:  PDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDI
        P+ VTFL +++AC +SGLVE G + F  M E+Y  + S +HY  MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++
Subjt:  PDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDI

Query:  EPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIKEA
         P + G +VLL+NIY AA  W+ V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Subjt:  EPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIKEA

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | e-value: 4.2e-117 | % identity: 35.2
Query:  PLFRFNFQNFPATQSRPLN------TLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID---SYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL
        P   + F   P++   P +      +LSLL N C + Q L+ IHA+ +  G H      SKLI+      +   L  +  +F ++ +PN  ++N + R  
Subjt:  PLFRFNFQNFPATQSRPLN------TLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID---SYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL

Query:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN
        A   +    L +Y  M++  + P+  T+PFVL+SC      + G+ IHG ++KLG D    V  +L  MY +    E+AH++FDK   +D          
Subjt:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN

Query:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM
                                                              +VS         TA++  Y+  G + +A+KLFD++P KD V WN M
Subjt:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM

Query:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI
        I+ YA  GN  E LELFK M    +R D  T + V+S+ +Q   ++ G+Q H  +  +G  S + + N+LID+Y +C  LE+AC +F  +  K VISW+ 
Subjt:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI

Query:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDL
        +  GY        AL LF  M   G   + +T+++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    +  K L
Subjt:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDL

Query:  IMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIK
          WN+MI   A HG     F L+++M+    +PD +TF+GLL+AC +SG+++ G+ +F+ M + Y   P  EHY CM++LLG +GL  EA E++  M ++
Subjt:  IMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY
        PD  +W  LL ACK+H   +L E  AE LI IEP+N G+YVLLSNIYA+AG+W+ VAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY
Subjt:  PDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY

Query:  TILGNLEHEIKEA
         +L  +E  +++A
Subjt:  TILGNLEHEIKEA

Q9LUJ2 Pentatricopeptide repeat-containing protein At3g22690 (e value: 1.6e-108; % identity: 31.71)
Query:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGL---LNLSHQLFYSVIDPNST-LYNAILRNLATYGEYERTLLVYREMVA
        QS+           C +   L+  H      G   + +  +KL+     LG    L+ + ++F +     +  +YN+++R  A+ G     +L++  M+ 
Subjt:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGL---LNLSHQLFYSVIDPNST-LYNAILRNLATYGEYERTLLVYREMVA

Query:  KSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNG----DEIFRSL
          + PD+ T+PF L +C        G  IHG ++K+G      V  +L   Y EC + ++A ++FD+MS ++    + +I   ++        D  FR  
Subjt:  KSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNG----DEIFRSL

Query:  GTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECL
            +R E++ P+S+    ++ + A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L
Subjt:  GTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECL

Query:  ELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQ----
         +F  M   G+R D  + L  ISS SQL+ + WGK  H +VLRNG +S  ++ N+LIDMY +C+  ++A +IF+ M++KTV++W+ +  GYV+ G+    
Subjt:  ELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQ----

Query:  -------------------------SLI--ALSLFTRMKS-EGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE
                                 SL   A+ +F  M+S EG+ AD +T+++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++CG  E
Subjt:  -------------------------SLI--ALSLFTRMKS-EGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE

Query:  MAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRA
         A  +F    + ++D+  W + I A A  G+     +L++ M     KPD V F+G LTAC + GLV++GKE+F  ML+ +G  P   HY CMV+LLGRA
Subjt:  MAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRA

Query:  GLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVT
        GL+ EA +L+++MP++P+  +W  LL+AC++    ++A YAAEK+  + P+  G+YVLLSN+YA+AG+W+ +AK+R  +++KGL+K PG S ++I G   
Subjt:  GLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVT

Query:  EFRVADQTHPRAEDIYTIL
        EF   D++HP   +I  +L
Subjt:  EFRVADQTHPRAEDIYTIL

Q9STE1 Pentatricopeptide repeat-containing protein At4g21300 (e value: 1.8e-107; % identity: 31.81)
Query:  GFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIK
        G   N  ++S LI +Y   G +++  +LF  V+  +  ++N +L   A  G  +  +  +  M    + P+  T+  VL  C     ++ G  +HG ++ 
Subjt:  GFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIK

Query:  LGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCI
         GVD    +  +L  MY +C  F++A +LF  MS  D    + +IS   Q+G  +E         M S  ++PD++ F +LL S++ F +++  K +HC 
Subjt:  LGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCI

Query:  AIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHA
         +  ++  D+ + +A++  Y K   +  A+ +F +    D VV+  MI+ Y   G  ++ LE+F+ +  + I  +  T + ++  I  L  +  G++ H 
Subjt:  AIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHA

Query:  HVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLH
         +++ G D++ ++  ++IDMY +C  +  A +IF  ++ + ++SW+ M     +      A+ +F +M   GI  D +++   L A  ++ +    K +H
Subjt:  HVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLH

Query:  GYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQM-KCSSSKPDQVTFLGLLTACVNSGLVEKG
        G+ +K  L S     + L+  YAKCG ++ A  +F+  +  +K+++ WNS+I+A  NHG       L+++M + S  +PDQ+TFL ++++C + G V++G
Subjt:  GYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQM-KCSSSKPDQVTFLGLLTACVNSGLVEKG

Query:  KEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWD
           F+ M E YG QP QEHYAC+V+L GRAG + EA E V++MP  PDA VWG LL AC+LH   +LAE A+ KL+D++P N+G YVL+SN +A A +W+
Subjt:  KEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWD

Query:  GVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIK
         V K+RS ++++ ++K PG SW+EIN     F   D  HP +  IY++L +L  E++
Subjt:  GVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIK

Arabidopsis top hits (e value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 3.0e-118; % identity: 35.2)
Query:  PLFRFNFQNFPATQSRPLN------TLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID---SYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL
        P   + F   P++   P +      +LSLL N C + Q L+ IHA+ +  G H      SKLI+      +   L  +  +F ++ +PN  ++N + R  
Subjt:  PLFRFNFQNFPATQSRPLN------TLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLID---SYANLGLLNLSHQLFYSVIDPNSTLYNAILRNL

Query:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN
        A   +    L +Y  M++  + P+  T+PFVL+SC      + G+ IHG ++KLG D    V  +L  MY +    E+AH++FDK   +D          
Subjt:  ATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISN

Query:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM
                                                              +VS         TA++  Y+  G + +A+KLFD++P KD V WN M
Subjt:  GSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIM

Query:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI
        I+ YA  GN  E LELFK M    +R D  T + V+S+ +Q   ++ G+Q H  +  +G  S + + N+LID+Y +C  LE+AC +F  +  K VISW+ 
Subjt:  IATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSI

Query:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDL
        +  GY        AL LF  M   G   + +T+++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    +  K L
Subjt:  MFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDL

Query:  IMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIK
          WN+MI   A HG     F L+++M+    +PD +TF+GLL+AC +SG+++ G+ +F+ M + Y   P  EHY CM++LLG +GL  EA E++  M ++
Subjt:  IMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY
        PD  +W  LL ACK+H   +L E  AE LI IEP+N G+YVLLSNIYA+AG+W+ VAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY
Subjt:  PDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIY

Query:  TILGNLEHEIKEA
         +L  +E  +++A
Subjt:  TILGNLEHEIKEA

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.8e-115; % identity: 32.16)
Query:  SLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPF
        +LL  RCSS + L+QI      +G +Q     +KL+  +   G ++ + ++F  +    + LY+ +L+  A   + ++ L  +  M    + P    + +
Subjt:  SLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPF

Query:  VLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLA
        +L+ C   + +  G+ IHG L+K G          L  MY +C     A ++FD+M  +D    + +++  SQNG        + +  M  E L P  + 
Subjt:  VLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLA

Query:  FFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLF
          ++L +++    I + K +H  A+ S   + + + TA++ +Y+K GSL  AR+LFD M E++ V WN MI  Y +  NP E + +F+ M   G++    
Subjt:  FFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLF

Query:  TALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADF
        + +  + + + L  ++ G+  H   +  G D  VSV NSLI MYC+C  +++A  +F  +  +T++SW+ M  G+ + G+ + AL+ F++M+S  ++ D 
Subjt:  TALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADF

Query:  ITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSK
         T ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +L+ +M+  + K
Subjt:  ITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSK

Query:  PDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDI
        P+ VTFL +++AC +SGLVE G + F  M E+Y  + S +HY  MV+LLGRAG +NEA + +  MP+KP   V+G +L AC++H     AE AAE+L ++
Subjt:  PDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDI

Query:  EPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIKEA
         P + G +VLL+NIY AA  W+ V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Subjt:  EPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIKEA

AT3G22690.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF1685 (InterPro:IPR012881), Pentatricopeptide repeat (InterPro:IPR002885) (e value: 1.1e-109; % identity: 31.71)
Query:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGL---LNLSHQLFYSVIDPNST-LYNAILRNLATYGEYERTLLVYREMVA
        QS+           C +   L+  H      G   + +  +KL+     LG    L+ + ++F +     +  +YN+++R  A+ G     +L++  M+ 
Subjt:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGL---LNLSHQLFYSVIDPNST-LYNAILRNLATYGEYERTLLVYREMVA

Query:  KSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNG----DEIFRSL
          + PD+ T+PF L +C        G  IHG ++K+G      V  +L   Y EC + ++A ++FD+MS ++    + +I   ++        D  FR  
Subjt:  KSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNG----DEIFRSL

Query:  GTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECL
            +R E++ P+S+    ++ + A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L
Subjt:  GTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECL

Query:  ELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQ----
         +F  M   G+R D  + L  ISS SQL+ + WGK  H +VLRNG +S  ++ N+LIDMY +C+  ++A +IF+ M++KTV++W+ +  GYV+ G+    
Subjt:  ELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQ----

Query:  -------------------------SLI--ALSLFTRMKS-EGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE
                                 SL   A+ +F  M+S EG+ AD +T+++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++CG  E
Subjt:  -------------------------SLI--ALSLFTRMKS-EGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE

Query:  MAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRA
         A  +F    + ++D+  W + I A A  G+     +L++ M     KPD V F+G LTAC + GLV++GKE+F  ML+ +G  P   HY CMV+LLGRA
Subjt:  MAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRA

Query:  GLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVT
        GL+ EA +L+++MP++P+  +W  LL+AC++    ++A YAAEK+  + P+  G+YVLLSN+YA+AG+W+ +AK+R  +++KGL+K PG S ++I G   
Subjt:  GLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVT

Query:  EFRVADQTHPRAEDIYTIL
        EF   D++HP   +I  +L
Subjt:  EFRVADQTHPRAEDIYTIL

AT3G22690.2 INVOLVED IN: photosystem II assembly, regulation of chlorophyll biosynthetic process, photosystem I assembly, thylakoid membrane organization, RNA modification (e value: 1.1e-109; % identity: 31.71)
Query:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGL---LNLSHQLFYSVIDPNST-LYNAILRNLATYGEYERTLLVYREMVA
        QS+           C +   L+  H      G   + +  +KL+     LG    L+ + ++F +     +  +YN+++R  A+ G     +L++  M+ 
Subjt:  QSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGL---LNLSHQLFYSVIDPNST-LYNAILRNLATYGEYERTLLVYREMVA

Query:  KSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNG----DEIFRSL
          + PD+ T+PF L +C        G  IHG ++K+G      V  +L   Y EC + ++A ++FD+MS ++    + +I   ++        D  FR  
Subjt:  KSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNG----DEIFRSL

Query:  GTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECL
            +R E++ P+S+    ++ + A    ++  + V+     S +  + L+ +A++ +Y K  ++  A++LFD+    +  + N M + Y R+G   E L
Subjt:  GTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECL

Query:  ELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQ----
         +F  M   G+R D  + L  ISS SQL+ + WGK  H +VLRNG +S  ++ N+LIDMY +C+  ++A +IF+ M++KTV++W+ +  GYV+ G+    
Subjt:  ELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQ----

Query:  -------------------------SLI--ALSLFTRMKS-EGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE
                                 SL   A+ +F  M+S EG+ AD +T+++I  A  H+GAL+  K+++ Y  K G+     L T L+  +++CG  E
Subjt:  -------------------------SLI--ALSLFTRMKS-EGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIE

Query:  MAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRA
         A  +F    + ++D+  W + I A A  G+     +L++ M     KPD V F+G LTAC + GLV++GKE+F  ML+ +G  P   HY CMV+LLGRA
Subjt:  MAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEKGKEVFKEMLESYGCQPSQEHYACMVNLLGRA

Query:  GLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVT
        GL+ EA +L+++MP++P+  +W  LL+AC++    ++A YAAEK+  + P+  G+YVLLSN+YA+AG+W+ +AK+R  +++KGL+K PG S ++I G   
Subjt:  GLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVT

Query:  EFRVADQTHPRAEDIYTIL
        EF   D++HP   +I  +L
Subjt:  EFRVADQTHPRAEDIYTIL

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.2e-108; % identity: 31.81)
Query:  GFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIK
        G   N  ++S LI +Y   G +++  +LF  V+  +  ++N +L   A  G  +  +  +  M    + P+  T+  VL  C     ++ G  +HG ++ 
Subjt:  GFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLLVYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIK

Query:  LGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCI
         GVD    +  +L  MY +C  F++A +LF  MS  D    + +IS   Q+G  +E         M S  ++PD++ F +LL S++ F +++  K +HC 
Subjt:  LGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRSEQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCI

Query:  AIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHA
         +  ++  D+ + +A++  Y K   +  A+ +F +    D VV+  MI+ Y   G  ++ LE+F+ +  + I  +  T + ++  I  L  +  G++ H 
Subjt:  AIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLFTALPVISSISQLKCVDWGKQTHA

Query:  HVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLH
         +++ G D++ ++  ++IDMY +C  +  A +IF  ++ + ++SW+ M     +      A+ +F +M   GI  D +++   L A  ++ +    K +H
Subjt:  HVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFVHIGALENVKYLH

Query:  GYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQM-KCSSSKPDQVTFLGLLTACVNSGLVEKG
        G+ +K  L S     + L+  YAKCG ++ A  +F+  +  +K+++ WNS+I+A  NHG       L+++M + S  +PDQ+TFL ++++C + G V++G
Subjt:  GYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQM-KCSSSKPDQVTFLGLLTACVNSGLVEKG

Query:  KEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWD
           F+ M E YG QP QEHYAC+V+L GRAG + EA E V++MP  PDA VWG LL AC+LH   +LAE A+ KL+D++P N+G YVL+SN +A A +W+
Subjt:  KEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWD

Query:  GVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIK
         V K+RS ++++ ++K PG SW+EIN     F   D  HP +  IY++L +L  E++
Subjt:  GVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIK


Sequences
CDS sequence
ATGCTTCACCTTCAACGATCAAATCCCCTTTTCCGTTTCAATTTCCAGAACTTTCCCGCCACCCAATCCAGACCACTCAACACGCTTTCGCTTCTCTTCAATCGATGCAG
CTCTCGCCAACACCTCCAGCAGATCCACGCCAGGTTCGTCCTTCATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTCATTGATTCTTATGCCAATCTTGGACTCC
TTAATCTCTCTCACCAACTTTTCTACTCTGTAATCGATCCTAATTCGACCCTTTACAATGCCATACTCAGAAATTTGGCTACGTATGGTGAGTATGAGCGGACCCTGTTG
GTGTATCGAGAAATGGTCGCCAAATCCATGTACCCGGATGAAGAAACTTACCCTTTTGTTTTGCGATCGTGCTGTTGTTTGTCGAATGTAGAATTTGGGAGGAACATTCA
TGGGTGTTTGATTAAGCTTGGTGTTGATTCTTATCATTTGGTAGCTGCTGCTCTGGCTGAGATGTACCGAGAGTGCATTGATTTTGAGAATGCTCATCAACTGTTTGATA
AAATGTCTGGAAAGGATTTTGAATGCTGCAGTCCTTTGATTTCCAACGGTTCTCAAAATGGGAATGGGGATGAAATTTTCCGATCACTTGGGACTGTGGGAATGAGATCA
GAGCAATTAGTACCTGACTCGCTCGCATTTTTCAATCTCTTGAAGTCCATTGCGGGTTTCAATTCCATTCAGCTTGCAAAGGTTGTCCATTGTATTGCAATTGTGAGCAA
CTTGTGTGCAGATTTGCTAGTAGATACTGCTGTGTTGTCTCTGTACTCTAAGTTAGGTAGCTTAGTTGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAGGACCGTG
TTGTATGGAATATAATGATAGCAACTTATGCCCGAGAAGGGAACCCAATGGAATGTCTCGAGCTCTTCAAGTCCATGGCTGGATTGGGGATTAGAGCTGATCTGTTTACT
GCACTTCCTGTTATCTCTTCAATTTCACAGTTGAAATGTGTTGATTGGGGTAAACAAACCCATGCCCATGTATTGAGGAATGGTTCAGACAGTCAAGTTTCAGTTTATAA
CTCACTCATTGACATGTACTGCGAATGTAACATCTTAGAGTCAGCTTGTAAGATCTTCAACTGGATGACAGACAAGACTGTAATTTCATGGAGTATTATGTTCAAGGGGT
ATGTCAAATATGGACAGTCTCTCATTGCTTTGTCTCTCTTCACCAGGATGAAATCTGAAGGTATTCAAGCTGATTTCATTACAGTGATCAATATCTTGCCTGCATTTGTT
CACATTGGAGCACTTGAAAATGTTAAATATTTACATGGGTACTCAATGAAGCTAGGTCTGACTTCACTTCCATCACTTAACACAGCCCTTTTAATCACCTATGCAAAATG
TGGATGTATAGAAATGGCCCAGAGGCTATTCGAGGAAGAGAGAGTGTACGACAAAGATTTGATAATGTGGAACTCTATGATCAGTGCCCATGCCAACCATGGAGACTGGT
CTCATTGTTTCAAGCTATACAATCAAATGAAATGCTCAAGTTCAAAGCCAGACCAGGTAACATTTCTGGGACTTCTAACAGCTTGTGTCAATTCTGGTCTCGTAGAAAAG
GGGAAGGAGGTTTTCAAGGAGATGCTTGAAAGTTATGGTTGCCAACCAAGTCAGGAGCATTACGCTTGTATGGTTAATCTCTTAGGGAGAGCTGGGCTTGTCAATGAAGC
TGGAGAACTTGTTCAAAACATGCCAATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATCCTGGGTCCAAGCTTGCAGAGTATGCGGCCG
AGAAGCTCATAGATATTGAGCCTAAAAATGCAGGGAATTACGTGTTGCTCTCAAACATATATGCTGCTGCAGGTAAATGGGATGGAGTGGCAAAGATGAGAAGTTTCCTT
AGGGACAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTGGCTGGAGATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCCAGAGCAGAAGATATTTA
TACCATCCTAGGGAACCTTGAACATGAAATCAAAGAGGCTAGAGAAAAGAGTCCAGAAAAATTGGATAATCTTTGA
mRNA sequence
CAGTAAAATCGATAGTCAAGGTGTGATTAAATTTTGTAAGTTCAACGCAGAGCGCCTCTTTCACCGGGCATTCTCGCCGGTAATGGTCGTCGTCTTCTAGATTCTCCGGC
GAGTCCTTATACTTCACCCATCGCCTTTGAAACGACCTCGCTTCATTCTTCCTCTTCCGCCCGACAATGCTTCACCTTCAACGATCAAATCCCCTTTTCCGTTTCAATTT
CCAGAACTTTCCCGCCACCCAATCCAGACCACTCAACACGCTTTCGCTTCTCTTCAATCGATGCAGCTCTCGCCAACACCTCCAGCAGATCCACGCCAGGTTCGTCCTTC
ATGGCTTCCACCAAAACCCAACTCTCTCTTCCAAACTCATTGATTCTTATGCCAATCTTGGACTCCTTAATCTCTCTCACCAACTTTTCTACTCTGTAATCGATCCTAAT
TCGACCCTTTACAATGCCATACTCAGAAATTTGGCTACGTATGGTGAGTATGAGCGGACCCTGTTGGTGTATCGAGAAATGGTCGCCAAATCCATGTACCCGGATGAAGA
AACTTACCCTTTTGTTTTGCGATCGTGCTGTTGTTTGTCGAATGTAGAATTTGGGAGGAACATTCATGGGTGTTTGATTAAGCTTGGTGTTGATTCTTATCATTTGGTAG
CTGCTGCTCTGGCTGAGATGTACCGAGAGTGCATTGATTTTGAGAATGCTCATCAACTGTTTGATAAAATGTCTGGAAAGGATTTTGAATGCTGCAGTCCTTTGATTTCC
AACGGTTCTCAAAATGGGAATGGGGATGAAATTTTCCGATCACTTGGGACTGTGGGAATGAGATCAGAGCAATTAGTACCTGACTCGCTCGCATTTTTCAATCTCTTGAA
GTCCATTGCGGGTTTCAATTCCATTCAGCTTGCAAAGGTTGTCCATTGTATTGCAATTGTGAGCAACTTGTGTGCAGATTTGCTAGTAGATACTGCTGTGTTGTCTCTGT
ACTCTAAGTTAGGTAGCTTAGTTGATGCTAGAAAGTTATTTGACAAAATGCCAGAGAAGGACCGTGTTGTATGGAATATAATGATAGCAACTTATGCCCGAGAAGGGAAC
CCAATGGAATGTCTCGAGCTCTTCAAGTCCATGGCTGGATTGGGGATTAGAGCTGATCTGTTTACTGCACTTCCTGTTATCTCTTCAATTTCACAGTTGAAATGTGTTGA
TTGGGGTAAACAAACCCATGCCCATGTATTGAGGAATGGTTCAGACAGTCAAGTTTCAGTTTATAACTCACTCATTGACATGTACTGCGAATGTAACATCTTAGAGTCAG
CTTGTAAGATCTTCAACTGGATGACAGACAAGACTGTAATTTCATGGAGTATTATGTTCAAGGGGTATGTCAAATATGGACAGTCTCTCATTGCTTTGTCTCTCTTCACC
AGGATGAAATCTGAAGGTATTCAAGCTGATTTCATTACAGTGATCAATATCTTGCCTGCATTTGTTCACATTGGAGCACTTGAAAATGTTAAATATTTACATGGGTACTC
AATGAAGCTAGGTCTGACTTCACTTCCATCACTTAACACAGCCCTTTTAATCACCTATGCAAAATGTGGATGTATAGAAATGGCCCAGAGGCTATTCGAGGAAGAGAGAG
TGTACGACAAAGATTTGATAATGTGGAACTCTATGATCAGTGCCCATGCCAACCATGGAGACTGGTCTCATTGTTTCAAGCTATACAATCAAATGAAATGCTCAAGTTCA
AAGCCAGACCAGGTAACATTTCTGGGACTTCTAACAGCTTGTGTCAATTCTGGTCTCGTAGAAAAGGGGAAGGAGGTTTTCAAGGAGATGCTTGAAAGTTATGGTTGCCA
ACCAAGTCAGGAGCATTACGCTTGTATGGTTAATCTCTTAGGGAGAGCTGGGCTTGTCAATGAAGCTGGAGAACTTGTTCAAAACATGCCAATCAAACCCGATGCTCGAG
TTTGGGGTCCATTGTTGAGTGCTTGTAAGTTGCATCCTGGGTCCAAGCTTGCAGAGTATGCGGCCGAGAAGCTCATAGATATTGAGCCTAAAAATGCAGGGAATTACGTG
TTGCTCTCAAACATATATGCTGCTGCAGGTAAATGGGATGGAGTGGCAAAGATGAGAAGTTTCCTTAGGGACAAAGGGCTCAAGAAAACCCCTGGTTGTAGTTGGCTGGA
GATAAATGGCCATGTAACTGAGTTTCGTGTTGCTGATCAAACTCATCCCAGAGCAGAAGATATTTATACCATCCTAGGGAACCTTGAACATGAAATCAAAGAGGCTAGAG
AAAAGAGTCCAGAAAAATTGGATAATCTTTGACACTTGCATCCATTTCTTTCTAATGATATCTTCTCGAATAACAGGGATTACTCGCTCATTTATTTATATCTTCACATT
ACATTGTTTACATGATCTATTTTAATGTATCATCATATCACGATGTCGAACAAACCATATTAATGTAAATGATCTCATTTCATTAGTCTCATTTTTCTCTTAAAGATTCA
AGTAGTTGCATAGCTGACTATTGATTCCAACTTCATCTTCTGATATTTTCTATTCTCACCATGTTGTTGATGTTAAGTTGCTTGTACTTTACGGCTTAAGTTGTGAATTT
ACAGAATGTCA
Protein sequence
MLHLQRSNPLFRFNFQNFPATQSRPLNTLSLLFNRCSSRQHLQQIHARFVLHGFHQNPTLSSKLIDSYANLGLLNLSHQLFYSVIDPNSTLYNAILRNLATYGEYERTLL
VYREMVAKSMYPDEETYPFVLRSCCCLSNVEFGRNIHGCLIKLGVDSYHLVAAALAEMYRECIDFENAHQLFDKMSGKDFECCSPLISNGSQNGNGDEIFRSLGTVGMRS
EQLVPDSLAFFNLLKSIAGFNSIQLAKVVHCIAIVSNLCADLLVDTAVLSLYSKLGSLVDARKLFDKMPEKDRVVWNIMIATYAREGNPMECLELFKSMAGLGIRADLFT
ALPVISSISQLKCVDWGKQTHAHVLRNGSDSQVSVYNSLIDMYCECNILESACKIFNWMTDKTVISWSIMFKGYVKYGQSLIALSLFTRMKSEGIQADFITVINILPAFV
HIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVYDKDLIMWNSMISAHANHGDWSHCFKLYNQMKCSSSKPDQVTFLGLLTACVNSGLVEK
GKEVFKEMLESYGCQPSQEHYACMVNLLGRAGLVNEAGELVQNMPIKPDARVWGPLLSACKLHPGSKLAEYAAEKLIDIEPKNAGNYVLLSNIYAAAGKWDGVAKMRSFL
RDKGLKKTPGCSWLEINGHVTEFRVADQTHPRAEDIYTILGNLEHEIKEAREKSPEKLDNL