; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g25700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g25700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr3:18480304..18482466
RNA-Seq ExpressionMoc03g25700
SyntenyMoc03g25700
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573373.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0085.61Show/hide
Query:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
        M HLQRSKPIFRF+F NFPATQSR LNTLS LFSRC SRQQL+QIHARF+LHG HQNP LSC+LID YAN GLL LS  VFNSIIDP S LY+AILRNL+
Subjt:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS

Query:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
         FGEYERTLLVYREM AKSMHPDE+TYP VLRSCCCLSNV++G+ IHG L+KLGVD YD+  T L EMY KCI FEN H LFDKM +KD +CW+SL +EA
Subjt:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA

Query:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
         QNGNGD+I +LFGRM++E LV+DSLTFINLLRS+ GL+SIQLAKIVHC+AI SNLCGDLLV+TAVLSLYSKLG LV+ARKLF+K+PEKDRVVWNIMIAA
Subjt:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA

Query:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
        Y REG P ECLELF+SMARSGIRADLFTALPVISSISQLK  DWGKQTHA+ LRNGSD+QVSVHNSLIDMYCE N LDSACKIF+ +TNKTVISWSAMIK
Subjt:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK

Query:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
        G VKHG  L ALSLF RMKSDGIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERVDDKDLIMWNS
Subjt:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS

Query:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
        MISAHANHGDWSQCF +YNQMKCSNS PDQVTFLGLLTACVNSGLVEKGKE FKEMIE+Y CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDARV
Subjt:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV

Query:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
        WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVADRTHPRAEDIY ILGN
Subjt:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN

Query:  LELEIKEAREKSPEKL
        LEL+IKEA+E SPEKL
Subjt:  LELEIKEAREKSPEKL

KAG7012542.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0085.56Show/hide
Query:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
        M HLQRSKPIFRF+F NFPATQSR LNTLS LFSRC SRQQL+QIHARF+LHG HQNP LSC+LID YAN GLL LS  VFNSIIDP S LY+AILRNL+
Subjt:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS

Query:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
         FGEYERTLLVYREM AKSMHPDE+TYP VLRSCCCLSNV++G+ IHG L+KLGVD YD+  T L EMY KCI FEN H LFDKM +KD +CW+SL S+A
Subjt:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA

Query:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
         QNGNGD+I  LFGRM++E LV+DSLTFINLLRSI GL+SIQLAK+VHC+AI SNLCGDLLV+TAVLSLYSKLG LV+ARKLF+K+PEKDRVVWNIMIAA
Subjt:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA

Query:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
        Y REG P ECLELF+SMARSGIRADLFTALPVISSISQLK  DWGKQTHA+ LRNGSD+QVSVHNSLIDMYCE N LDSACKIF+ +TNKTVISWSAMIK
Subjt:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK

Query:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
        G VKHG  L ALSLF RMKSDGIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERVDDKDLIMWNS
Subjt:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS

Query:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
        MISAHANHGDWSQCFK+YNQMKCSNS PDQVTFLGLLTACVNSGLVEKGKE FKEMIE Y CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDARV
Subjt:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV

Query:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
        WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVADRTHPRAEDIY ILGN
Subjt:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN

Query:  LELEIKEAREKSPEKLGILL
        LEL+IKE +E SPEKLG LL
Subjt:  LELEIKEAREKSPEKLGILL

XP_022139869.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Momordica charantia]0.0e+00100Show/hide
Query:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
        MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
Subjt:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS

Query:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
        SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
Subjt:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA

Query:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
        SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
Subjt:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA

Query:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
        YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
Subjt:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK

Query:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
        GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
Subjt:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS

Query:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
        MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
Subjt:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV

Query:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
        WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
Subjt:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN

Query:  LELEIKEAREKSPEKLGILL
        LELEIKEAREKSPEKLGILL
Subjt:  LELEIKEAREKSPEKLGILL

XP_022954531.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita moschata]0.0e+0085Show/hide
Query:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
        M HLQRSKPIFRF+F NFPAT SR LNTLS LFSRC SRQQL+QIHARF+LHG HQNP LSC+LID YAN GLL LS  VFNSIIDP S LY+AILRNL+
Subjt:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS

Query:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
         FGEYERTLLVYREM AKSMHPDE+TYP VLRSCCCLSNV++G+ IHG L+KLGVD YD+  T L EMY KCI FEN H LFDKM +KD +CW+SL +EA
Subjt:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA

Query:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
         QNGNGD+I +LFGRM++E LV+DSLTFINLLRS+ GL+SIQLAKIVHC+AI SNLCGDLLV+TAVLSLYSKLG LV+ARKLF+K+PEKDRVVWNIMIAA
Subjt:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA

Query:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
        Y REG P ECLELF+SMARSGIRADLFT LPVISSISQLK  DWGKQTHA+ LRNGSD+QVSVHNSLIDMYCE N LDSA KIF+ +TNKTVISWSAMIK
Subjt:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK

Query:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
        G VKHG  L ALSLF RMKSDGIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERV+DKDLIMWNS
Subjt:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS

Query:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
        MISAHANHGDWSQCFK+YNQMKCSNS PDQVTFLGLLTACVNSGLVEKGKE FKEMIE+Y CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDARV
Subjt:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV

Query:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
        WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVADRTHPRAEDIY ILGN
Subjt:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN

Query:  LELEIKEAREKSPEKLGILL
        LEL+IKE +E SPEKLG LL
Subjt:  LELEIKEAREKSPEKLGILL

XP_022994744.1 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like [Cucurbita maxima]0.0e+0085.66Show/hide
Query:  MLHLQRSK-----PIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAI
        M HLQRSK     PIFRF+F NFPATQSR LNTLS LFSRC SRQQLEQIHARF+LHG HQNP LSC+LID YAN GLL +S  VFNSIIDP STLY+AI
Subjt:  MLHLQRSK-----PIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAI

Query:  LRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNS
        LRNL+ FGEYERTLLVYREM AKSMHPDE+TYP VL+SCCCLSNVE+G+ IHG L+KLGVD YD+  T LAEMY KCI FEN H LFDKM +KD +CW+S
Subjt:  LRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNS

Query:  LNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWN
        L SEA QNGNGDEI  L GRM++E LV+DSLTFINLLRSI GL+SIQLAKIVHC+AI SNLCGDLLV+TAVLSLYSKLG LV+ARKLF+KMPEKDRVVWN
Subjt:  LNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWN

Query:  IMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISW
        IMIAAY REG P ECLELF+SMARSGIRADLFTALPVISSISQLKC DWGKQTHA+ LRNGSD+QVSVHNSLIDMYCE N L+SACKIF+ +TNKTVISW
Subjt:  IMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISW

Query:  SAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  L ALSLF  MKSDGIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCIEMAQRLFEEERV+DKDL
Subjt:  SAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK
        IMWNSMISAHANHGDWSQCFK+YNQMKCSNS PDQVTFLGLLTACVNSGLVEKGKE FKEMIE+Y CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIK
Subjt:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY
        PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVADRTHPRAEDIY
Subjt:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY

Query:  TILGNLELEIKEAREKSPEKLGILL
         ILGNLEL+IKEA+E SPEKLG LL
Subjt:  TILGNLELEIKEAREKSPEKLGILL

TrEMBL top hitse value%identityAlignment
A0A0A0M0Z6 Uncharacterized protein0.0e+0082.71Show/hide
Query:  MLHLQRSKPIFRFE-FSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNL
        MLHL RSKPI     F NFPATQSR LNTLS LFSRC+S Q L+QIHARFILHG HQNP LS +LID YANLGLL  S QVF S+IDP  TL++AILRNL
Subjt:  MLHLQRSKPIFRFE-FSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNL

Query:  SSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSE
        + +GE ERTLLVY++M AKSMHPDEETYP VLRSC   SNV +GR IHG+LVKLG DL+D  ATALAEMY +CI FEN H LFDK  +KD    +SL +E
Subjt:  SSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSE

Query:  ASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIA
          QN NG+ IF++FGRM  EQLV DS TF NLLR I GLNSIQLAKIVHC+AI S L GDLLVNTAVLSLYSKL  LV+ARKLFDKMPEKDRVVWNIMIA
Subjt:  ASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIA

Query:  AYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMI
        AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMYCE  ILDSACKIF+WMT+K+VISWSAMI
Subjt:  AYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMI

Query:  KGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWN
        KG VK+GQSL ALSLFS+MKSDGIQADF+ +INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEE++DDKDLIMWN
Subjt:  KGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWN

Query:  SMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDAR
        SMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTACVNSGLVEKGKE FKEM E+YGCQPSQEHYACMVNLLGRAGLI++AG LV+NMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLI+MEP+NAGNYILLSNIYAAAGKWDGVAKMRSFLR+KGLKK PGCSWLEINGHVTEFRVAD+THPRA DIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILG

Query:  NLELEIKEAREKSPEKL
        NLELEIKE REKSP+ L
Subjt:  NLELEIKEAREKSPEKL

A0A5D3DB69 Pentatricopeptide repeat-containing protein0.0e+0081.73Show/hide
Query:  MLHLQRSKPIFRFE-FSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNL
        MLHLQRSKPI       NFPATQSR LNTLS LF+RC+S Q L+QIHARFILHG HQNP LS +LID YANLGLL  S QVF SIIDP  TL++AILRNL
Subjt:  MLHLQRSKPIFRFE-FSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNL

Query:  SSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSE
        + +GE ER LLVY++M AKSMHPDEETYP + RSC   SNV +GR IHG+LVKLG D +D  ATALAEMY K I FEN H LFDK  +KD    +SL +E
Subjt:  SSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSE

Query:  ASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIA
         SQNGNG+ IF++F RMR EQLV DSLTF+NLLR I GLNSIQLAKIVHC+AI S L GDLLV TAVLSLYSKL  LV+AR+LFDKMPEKDRVVWNIMIA
Subjt:  ASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIA

Query:  AYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMI
        AY REG P ECLELFKSMARSGIR+DLFTALPVISSI+QLKCVDWGKQTHAH LRNGSD+QVSVHNSLIDMYCE  +LDSAC IF+WMT+K+VISWSAMI
Subjt:  AYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMI

Query:  KGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWN
        KG VK+GQSL A SLFS+MKSDGIQADF+T+INILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCG IEMAQRLFEEER+DDKDLIMWN
Subjt:  KGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWN

Query:  SMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDAR
        SMISAHANHGDWSQCFK+YN+MKCSNS+PDQVTFLGLLTACVNSGL+EKGKE FKEM E+YGC PSQEH+ACMVNLLGRAGLI++AG LVRNMPIKPDAR
Subjt:  SMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDAR

Query:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILG
        VWGPLLSACK+HPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKW+ VAKMRSFLR+KGLKKTPGCS LEING VTEFRVAD+THPRAEDIYTILG
Subjt:  VWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILG

Query:  NLELEIKEAREKSPEKL
        NLELEIKE REKS + L
Subjt:  NLELEIKEAREKSPEKL

A0A6J1CE61 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+00100Show/hide
Query:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
        MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
Subjt:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS

Query:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
        SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
Subjt:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA

Query:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
        SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
Subjt:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA

Query:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
        YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
Subjt:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK

Query:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
        GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
Subjt:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS

Query:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
        MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
Subjt:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV

Query:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
        WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
Subjt:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN

Query:  LELEIKEAREKSPEKLGILL
        LELEIKEAREKSPEKLGILL
Subjt:  LELEIKEAREKSPEKLGILL

A0A6J1GR57 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0085Show/hide
Query:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS
        M HLQRSKPIFRF+F NFPAT SR LNTLS LFSRC SRQQL+QIHARF+LHG HQNP LSC+LID YAN GLL LS  VFNSIIDP S LY+AILRNL+
Subjt:  MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLS

Query:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA
         FGEYERTLLVYREM AKSMHPDE+TYP VLRSCCCLSNV++G+ IHG L+KLGVD YD+  T L EMY KCI FEN H LFDKM +KD +CW+SL +EA
Subjt:  SFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEA

Query:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA
         QNGNGD+I +LFGRM++E LV+DSLTFINLLRS+ GL+SIQLAKIVHC+AI SNLCGDLLV+TAVLSLYSKLG LV+ARKLF+K+PEKDRVVWNIMIAA
Subjt:  SQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAA

Query:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK
        Y REG P ECLELF+SMARSGIRADLFT LPVISSISQLK  DWGKQTHA+ LRNGSD+QVSVHNSLIDMYCE N LDSA KIF+ +TNKTVISWSAMIK
Subjt:  YDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIK

Query:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS
        G VKHG  L ALSLF RMKSDGIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCI+MAQRLFEEERV+DKDLIMWNS
Subjt:  GCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNS

Query:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV
        MISAHANHGDWSQCFK+YNQMKCSNS PDQVTFLGLLTACVNSGLVEKGKE FKEMIE+Y CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIKPDARV
Subjt:  MISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARV

Query:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN
        WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVADRTHPRAEDIY ILGN
Subjt:  WGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGN

Query:  LELEIKEAREKSPEKLGILL
        LEL+IKE +E SPEKLG LL
Subjt:  LELEIKEAREKSPEKLGILL

A0A6J1K3Q8 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like0.0e+0085.66Show/hide
Query:  MLHLQRSK-----PIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAI
        M HLQRSK     PIFRF+F NFPATQSR LNTLS LFSRC SRQQLEQIHARF+LHG HQNP LSC+LID YAN GLL +S  VFNSIIDP STLY+AI
Subjt:  MLHLQRSK-----PIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAI

Query:  LRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNS
        LRNL+ FGEYERTLLVYREM AKSMHPDE+TYP VL+SCCCLSNVE+G+ IHG L+KLGVD YD+  T LAEMY KCI FEN H LFDKM +KD +CW+S
Subjt:  LRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNS

Query:  LNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWN
        L SEA QNGNGDEI  L GRM++E LV+DSLTFINLLRSI GL+SIQLAKIVHC+AI SNLCGDLLV+TAVLSLYSKLG LV+ARKLF+KMPEKDRVVWN
Subjt:  LNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWN

Query:  IMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISW
        IMIAAY REG P ECLELF+SMARSGIRADLFTALPVISSISQLKC DWGKQTHA+ LRNGSD+QVSVHNSLIDMYCE N L+SACKIF+ +TNKTVISW
Subjt:  IMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISW

Query:  SAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL
        SAMIKG VKHG  L ALSLF  MKSDGIQADFITVINI+PAFV IGALENVKYLHGYS+KL LTSLPSLNTALLITYAKCGCIEMAQRLFEEERV+DKDL
Subjt:  SAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK
        IMWNSMISAHANHGDWSQCFK+YNQMKCSNS PDQVTFLGLLTACVNSGLVEKGKE FKEMIE+Y CQPSQEHYACMVNLLGRAGLIN+AG LVRNMPIK
Subjt:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY
        PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEING V EFRVADRTHPRAEDIY
Subjt:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY

Query:  TILGNLELEIKEAREKSPEKLGILL
         ILGNLEL+IKEA+E SPEKLG LL
Subjt:  TILGNLELEIKEAREKSPEKLGILL

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339901.7e-11533.33Show/hide
Query:  QSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYR-EMFAKSM
        +S+ ++ +  LF  C++ Q  + +HAR ++    QN  +S +L++ Y  LG + L++  F+ I +     ++ ++      G     +  +   M +  +
Subjt:  QSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYR-EMFAKSM

Query:  HPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGV--DLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRT
         PD  T+PSVL++C     V  G KIH   +K G   D+Y   A +L  +Y +     N   LFD+MP++D   WN++ S   Q+GN  E   L   +R 
Subjt:  HPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGV--DLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRT

Query:  EQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMA
             DS+T ++LL +            +H  +I   L  +L V+  ++ LY++ G L + +K+FD+M  +D + WN +I AY+    P   + LF+ M 
Subjt:  EQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMA

Query:  RSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNG-SDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSR
         S I+ D  T + + S +SQL  +   +     TLR G     +++ N+++ MY +L ++DSA  +F+W+ N  VISW+ +I G  ++G +  A+ +++ 
Subjt:  RSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNG-SDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSR

Query:  MKSDG-IQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFK
        M+ +G I A+  T +++LPA    GAL     LHG  +K GL     + T+L   Y KCG +E A  LF +  +   + + WN++I+ H  HG   +   
Subjt:  MKSDG-IQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFK

Query:  IYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKL
        ++ +M     +PD +TF+ LL+AC +SGLV++G+ CF+ M  +YG  PS +HY CMV++ GRAG +  A   +++M ++PDA +WG LLSAC++H    L
Subjt:  IYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKL

Query:  AEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK
         + A+E L ++EP++ G ++LLSN+YA+AGKW+GV ++RS    KGL+KTPG S +E++  V  F   ++THP  E++Y  L  L+ ++K
Subjt:  AEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic1.4e-12033.43Show/hide
Query:  SFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPS
        + L  RCSS ++L QI      +GL+Q      +L+  +   G +  + +VF  I    + LY  +L+  +   + ++ L  +  M    + P    +  
Subjt:  SFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPS

Query:  VLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFI
        +L+ C   + +  G++IHG LVK G  L     T L  MY KC        +FD+MP +D   WN++ +  SQNG      ++   M  E L    +T +
Subjt:  VLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFI

Query:  NLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTA
        ++L ++  L  I + K +H  A+ S     + ++TA++ +Y+K G L  AR+LFD M E++ V WN MI AY +  NP E + +F+ M   G++    + 
Subjt:  NLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTA

Query:  LPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFIT
        +  + + + L  ++ G+  H  ++  G D  VSV NSLI MYC+   +D+A  +F  + ++T++SW+AMI G  ++G+ ++AL+ FS+M+S  ++ D  T
Subjt:  LPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFIT

Query:  VINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPD
         ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +++ +M+    +P+
Subjt:  VINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPD

Query:  QVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEP
         VTFL +++AC +SGLVE G +CF  M ENY  + S +HY  MV+LLGRAG +N+A   +  MP+KP   V+G +L AC++H     AE AAE+L ++ P
Subjt:  QVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEP

Query:  KNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEA
         + G ++LL+NIY AA  W+ V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Subjt:  KNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEA

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic5.6e-11433.8Show/hide
Query:  PIFRFEFSNFPATQSRPLNTLS-----FLFSRCSSRQQLEQIHARFILHGLHQ-NPALS-----CELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILR
        P   + F   P++   P +++       L   C + Q L  IHA+ I  GLH  N ALS     C L   +     L  +  VF +I +P   +++ + R
Subjt:  PIFRFEFSNFPATQSRPLNTLS-----FLFSRCSSRQQLEQIHARFILHGLHQ-NPALS-----CELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILR

Query:  NLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLN
          +   +    L +Y  M +  + P+  T+P VL+SC      + G++IHGH++KLG DL     T+L  MY +    E+ H +FDK P +         
Subjt:  NLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLN

Query:  SEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIM
                                                                     D++  TA++  Y+  G + NA+KLFD++P KD V WN M
Subjt:  SEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIM

Query:  IAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSA
        I+ Y   GN  E LELFK M ++ +R D  T + V+S+ +Q   ++ G+Q H     +G  + + + N+LID+Y +   L++AC +F  +  K VISW+ 
Subjt:  IAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSA

Query:  MIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL
        +I G         AL LF  M   G   + +T+++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    +  K L
Subjt:  MIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK
          WN+MI   A HG     F ++++M+    +PD +TF+GLL+AC +SG+++ G+  F+ M ++Y   P  EHY CM++LLG +GL  +A  ++  M ++
Subjt:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY
        PD  +W  LL ACK+H   +L E  AE LI +EP+N G+Y+LLSNIYA+AG+W+ VAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY
Subjt:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY

Query:  TILGNLELEIKEA
         +L  +E+ +++A
Subjt:  TILGNLELEIKEA

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic4.3e-10631.9Show/hide
Query:  LSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYD
        L  +L   Y N G L  + +VF+ +    +  ++ ++  L+  G++  ++ ++++M +  +  D  T+  V +S   L +V  G ++HG ++K G    +
Subjt:  LSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYD

Query:  STATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGD
        S   +L   Y K    ++   +FD+M  +D   WNS+ +    NG  ++   +F +M    +  D  T +++         I L + VH + + +    +
Subjt:  STATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGD

Query:  LLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDN
              +L +YSK G L +A+ +F +M ++  V +  MIA Y REG   E ++LF+ M   GI  D++T   V++  ++ + +D GK+ H     N    
Subjt:  LLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDN

Query:  QVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFS-RMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGL
         + V N+L+DMY +   +  A  +FS M  K +ISW+ +I G  K+  +  ALSLF+  ++      D  TV  +LPA   + A +  + +HGY M+ G 
Subjt:  QVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFS-RMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGL

Query:  TSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIE
         S   +  +L+  YAKCG + +A  LF++  +  KDL+ W  MI+ +  HG   +   ++NQM+ +    D+++F+ LL AC +SGLV++G   F  M  
Subjt:  TSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIE

Query:  NYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFL
            +P+ EHYAC+V++L R G +  A   + NMPI PDA +WG LL  C++H   KLAE  AEK+ ++EP+N G Y+L++NIYA A KW+ V ++R  +
Subjt:  NYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFL

Query:  RDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKE
          +GL+K PGCSW+EI G V  F   D ++P  E+I   L  +   + E
Subjt:  RDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKE

Q9STE1 Pentatricopeptide repeat-containing protein At4g213001.6e-10831.15Show/hide
Query:  GLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVK
        G+  N  ++  LI +Y   G + +  ++F+ ++     +++ +L   +  G  +  +  +  M    + P+  T+  VL  C     ++ G ++HG +V 
Subjt:  GLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVK

Query:  LGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAI
         GVD   S   +L  MY KC  F++   LF  M   D   WN + S   Q+G  +E    F  M +  ++ D++TF +LL S+    +++  K +HC  +
Subjt:  LGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAI

Query:  TSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHT
          ++  D+ + +A++  Y K   +  A+ +F +    D VV+  MI+ Y   G   + LE+F+ + +  I  +  T + ++  I  L  +  G++ H   
Subjt:  TSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHT

Query:  LRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGY
        ++ G DN+ ++  ++IDMY +   ++ A +IF  ++ + ++SW++MI  C +      A+ +F +M   GI  D +++   L A  ++ +    K +HG+
Subjt:  LRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGY

Query:  SMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQM-KCSNSRPDQVTFLGLLTACVNSGLVEKGKE
         +K  L S     + L+  YAKCG ++ A  +F  + + +K+++ WNS+I+A  NHG       ++++M + S  RPDQ+TFL ++++C + G V++G  
Subjt:  SMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQM-KCSNSRPDQVTFLGLLTACVNSGLVEKGKE

Query:  CFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGV
         F+ M E+YG QP QEHYAC+V+L GRAG + +A   V++MP  PDA VWG LL AC+LH   +LAE A+ KL+D++P N+G Y+L+SN +A A +W+ V
Subjt:  CFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGV

Query:  AKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK
         K+RS ++++ ++K PG SW+EIN     F   D  HP +  IY++L +L  E++
Subjt:  AKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.0e-11533.8Show/hide
Query:  PIFRFEFSNFPATQSRPLNTLS-----FLFSRCSSRQQLEQIHARFILHGLHQ-NPALS-----CELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILR
        P   + F   P++   P +++       L   C + Q L  IHA+ I  GLH  N ALS     C L   +     L  +  VF +I +P   +++ + R
Subjt:  PIFRFEFSNFPATQSRPLNTLS-----FLFSRCSSRQQLEQIHARFILHGLHQ-NPALS-----CELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILR

Query:  NLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLN
          +   +    L +Y  M +  + P+  T+P VL+SC      + G++IHGH++KLG DL     T+L  MY +    E+ H +FDK P +         
Subjt:  NLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLN

Query:  SEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIM
                                                                     D++  TA++  Y+  G + NA+KLFD++P KD V WN M
Subjt:  SEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIM

Query:  IAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSA
        I+ Y   GN  E LELFK M ++ +R D  T + V+S+ +Q   ++ G+Q H     +G  + + + N+LID+Y +   L++AC +F  +  K VISW+ 
Subjt:  IAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSA

Query:  MIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL
        +I G         AL LF  M   G   + +T+++ILPA  H+GA++  +++H Y  K   G+T+  SL T+L+  YAKCG IE A ++F    +  K L
Subjt:  MIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMK--LGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDL

Query:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK
          WN+MI   A HG     F ++++M+    +PD +TF+GLL+AC +SG+++ G+  F+ M ++Y   P  EHY CM++LLG +GL  +A  ++  M ++
Subjt:  IMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIK

Query:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY
        PD  +W  LL ACK+H   +L E  AE LI +EP+N G+Y+LLSNIYA+AG+W+ VAK R+ L DKG+KK PGCS +EI+  V EF + D+ HPR  +IY
Subjt:  PDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIY

Query:  TILGNLELEIKEA
         +L  +E+ +++A
Subjt:  TILGNLELEIKEA

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein9.8e-12233.43Show/hide
Query:  SFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPS
        + L  RCSS ++L QI      +GL+Q      +L+  +   G +  + +VF  I    + LY  +L+  +   + ++ L  +  M    + P    +  
Subjt:  SFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPS

Query:  VLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFI
        +L+ C   + +  G++IHG LVK G  L     T L  MY KC        +FD+MP +D   WN++ +  SQNG      ++   M  E L    +T +
Subjt:  VLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFI

Query:  NLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTA
        ++L ++  L  I + K +H  A+ S     + ++TA++ +Y+K G L  AR+LFD M E++ V WN MI AY +  NP E + +F+ M   G++    + 
Subjt:  NLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTA

Query:  LPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFIT
        +  + + + L  ++ G+  H  ++  G D  VSV NSLI MYC+   +D+A  +F  + ++T++SW+AMI G  ++G+ ++AL+ FS+M+S  ++ D  T
Subjt:  LPVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFIT

Query:  VINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPD
         ++++ A   +    + K++HG  M+  L     + TAL+  YAKCG I +A+ +F  + + ++ +  WN+MI  +  HG      +++ +M+    +P+
Subjt:  VINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPD

Query:  QVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEP
         VTFL +++AC +SGLVE G +CF  M ENY  + S +HY  MV+LLGRAG +N+A   +  MP+KP   V+G +L AC++H     AE AAE+L ++ P
Subjt:  QVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEP

Query:  KNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEA
         + G ++LL+NIY AA  W+ V ++R  +  +GL+KTPGCS +EI   V  F      HP ++ IY  L  L   IKEA
Subjt:  KNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEA

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein3.1e-10731.9Show/hide
Query:  LSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYD
        L  +L   Y N G L  + +VF+ +    +  ++ ++  L+  G++  ++ ++++M +  +  D  T+  V +S   L +V  G ++HG ++K G    +
Subjt:  LSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYD

Query:  STATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGD
        S   +L   Y K    ++   +FD+M  +D   WNS+ +    NG  ++   +F +M    +  D  T +++         I L + VH + + +    +
Subjt:  STATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGD

Query:  LLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDN
              +L +YSK G L +A+ +F +M ++  V +  MIA Y REG   E ++LF+ M   GI  D++T   V++  ++ + +D GK+ H     N    
Subjt:  LLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNGSDN

Query:  QVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFS-RMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGL
         + V N+L+DMY +   +  A  +FS M  K +ISW+ +I G  K+  +  ALSLF+  ++      D  TV  +LPA   + A +  + +HGY M+ G 
Subjt:  QVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFS-RMKSDGIQADFITVINILPAFVHIGALENVKYLHGYSMKLGL

Query:  TSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIE
         S   +  +L+  YAKCG + +A  LF++  +  KDL+ W  MI+ +  HG   +   ++NQM+ +    D+++F+ LL AC +SGLV++G   F  M  
Subjt:  TSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIE

Query:  NYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFL
            +P+ EHYAC+V++L R G +  A   + NMPI PDA +WG LL  C++H   KLAE  AEK+ ++EP+N G Y+L++NIYA A KW+ V ++R  +
Subjt:  NYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFL

Query:  RDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKE
          +GL+K PGCSW+EI G V  F   D ++P  E+I   L  +   + E
Subjt:  RDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKE

AT4G21300.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-10931.15Show/hide
Query:  GLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVK
        G+  N  ++  LI +Y   G + +  ++F+ ++     +++ +L   +  G  +  +  +  M    + P+  T+  VL  C     ++ G ++HG +V 
Subjt:  GLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVK

Query:  LGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAI
         GVD   S   +L  MY KC  F++   LF  M   D   WN + S   Q+G  +E    F  M +  ++ D++TF +LL S+    +++  K +HC  +
Subjt:  LGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAI

Query:  TSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHT
          ++  D+ + +A++  Y K   +  A+ +F +    D VV+  MI+ Y   G   + LE+F+ + +  I  +  T + ++  I  L  +  G++ H   
Subjt:  TSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTALPVISSISQLKCVDWGKQTHAHT

Query:  LRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGY
        ++ G DN+ ++  ++IDMY +   ++ A +IF  ++ + ++SW++MI  C +      A+ +F +M   GI  D +++   L A  ++ +    K +HG+
Subjt:  LRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHIGALENVKYLHGY

Query:  SMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQM-KCSNSRPDQVTFLGLLTACVNSGLVEKGKE
         +K  L S     + L+  YAKCG ++ A  +F  + + +K+++ WNS+I+A  NHG       ++++M + S  RPDQ+TFL ++++C + G V++G  
Subjt:  SMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQM-KCSNSRPDQVTFLGLLTACVNSGLVEKGKE

Query:  CFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGV
         F+ M E+YG QP QEHYAC+V+L GRAG + +A   V++MP  PDA VWG LL AC+LH   +LAE A+ KL+D++P N+G Y+L+SN +A A +W+ V
Subjt:  CFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGV

Query:  AKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK
         K+RS ++++ ++K PG SW+EIN     F   D  HP +  IY++L +L  E++
Subjt:  AKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-11633.33Show/hide
Query:  QSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYR-EMFAKSM
        +S+ ++ +  LF  C++ Q  + +HAR ++    QN  +S +L++ Y  LG + L++  F+ I +     ++ ++      G     +  +   M +  +
Subjt:  QSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLLVYR-EMFAKSM

Query:  HPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGV--DLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRT
         PD  T+PSVL++C     V  G KIH   +K G   D+Y   A +L  +Y +     N   LFD+MP++D   WN++ S   Q+GN  E   L   +R 
Subjt:  HPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGV--DLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRT

Query:  EQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMA
             DS+T ++LL +            +H  +I   L  +L V+  ++ LY++ G L + +K+FD+M  +D + WN +I AY+    P   + LF+ M 
Subjt:  EQLVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMA

Query:  RSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNG-SDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSR
         S I+ D  T + + S +SQL  +   +     TLR G     +++ N+++ MY +L ++DSA  +F+W+ N  VISW+ +I G  ++G +  A+ +++ 
Subjt:  RSGIRADLFTALPVISSISQLKCVDWGKQTHAHTLRNG-SDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSR

Query:  MKSDG-IQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFK
        M+ +G I A+  T +++LPA    GAL     LHG  +K GL     + T+L   Y KCG +E A  LF +  +   + + WN++I+ H  HG   +   
Subjt:  MKSDG-IQADFITVINILPAFVHIGALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFK

Query:  IYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKL
        ++ +M     +PD +TF+ LL+AC +SGLV++G+ CF+ M  +YG  PS +HY CMV++ GRAG +  A   +++M ++PDA +WG LLSAC++H    L
Subjt:  IYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGKECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKL

Query:  AEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK
         + A+E L ++EP++ G ++LLSN+YA+AGKW+GV ++RS    KGL+KTPG S +E++  V  F   ++THP  E++Y  L  L+ ++K
Subjt:  AEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRDKGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCATCTTCAACGATCGAAACCCATTTTCCGTTTCGAATTCTCGAATTTTCCCGCCACCCAATCGAGACCGCTCAACACGCTTTCGTTCCTCTTCAGCCGATGCAG
CTCACGTCAACAACTCGAGCAGATTCACGCTAGGTTCATCCTCCATGGCTTGCACCAAAACCCAGCTCTCTCTTGCGAACTTATCGACTCCTATGCGAATCTTGGACTTC
TTACTCTCTCTCAACAAGTTTTCAACTCTATAATCGACCCCACTTCAACTCTTTACAGTGCGATACTGAGAAATTTGTCGAGCTTTGGAGAATATGAGCGGACCCTGTTG
GTGTACCGGGAAATGTTCGCAAAATCTATGCACCCGGATGAAGAAACTTACCCTTCTGTATTGCGATCGTGTTGCTGTTTATCAAATGTTGAATATGGGAGGAAGATTCA
CGGGCATTTGGTTAAACTTGGTGTTGATTTATATGATTCGACAGCCACTGCTCTGGCTGAGATGTATCGGAAGTGCATAGGTTTTGAGAATGGTCATGATCTGTTCGACA
AAATGCCTATGAAGGATTTTGAATGCTGGAATTCGTTGAATTCAGAGGCTTCTCAAAATGGGAACGGGGACGAAATTTTCCAGCTCTTTGGGAGAATGAGAACAGAACAA
CTGGTATCGGACTCACTCACATTTATCAATCTCTTGAGGTCCATAGTGGGTTTGAATTCAATTCAGCTTGCAAAGATTGTGCATTGTGTTGCAATCACGAGCAACTTGTG
CGGAGACTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCAAAGTTGGGTTGCTTAGTCAATGCTAGAAAATTGTTTGACAAAATGCCAGAGAAGGACCGTGTTGTAT
GGAATATAATGATAGCAGCTTATGACCGAGAAGGGAACCCGGCAGAATGTCTCGAGCTTTTCAAGTCCATGGCACGATCAGGGATTAGAGCTGATCTATTTACTGCACTT
CCTGTCATCTCTTCAATCTCACAGTTGAAATGTGTTGATTGGGGCAAACAAACTCATGCCCATACATTGAGGAATGGCTCAGACAATCAAGTTTCAGTTCATAACTCTCT
CATTGATATGTACTGCGAGTTGAACATTTTAGATTCAGCCTGTAAGATCTTCAGCTGGATGACTAACAAGACTGTAATTTCATGGAGTGCAATGATTAAGGGGTGTGTGA
AACATGGTCAATCTCTTAATGCTTTGTCCCTCTTCTCCAGGATGAAGTCAGATGGGATTCAAGCTGATTTTATTACAGTGATCAATATCTTGCCTGCTTTTGTTCACATA
GGGGCACTAGAGAATGTCAAATATTTACATGGTTACTCAATGAAGCTAGGCCTGACTTCTCTTCCATCACTTAACACAGCCCTCCTAATCACCTATGCAAAATGTGGGTG
TATTGAGATGGCCCAGAGGCTATTTGAGGAAGAGAGAGTTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCACGCCAACCATGGAGACTGGTCTCAAT
GTTTCAAGATATACAATCAAATGAAGTGCTCAAATTCAAGGCCCGATCAGGTAACCTTTCTGGGACTATTAACAGCTTGTGTCAATTCCGGTCTCGTGGAAAAAGGAAAG
GAGTGTTTCAAGGAGATGATTGAAAATTATGGTTGCCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAATCTCTTGGGGAGAGCCGGGCTTATCAATGATGCTGGAGC
ACTTGTGAGAAACATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCCTGTAAGTTGCATCCTGGGTCAAAGCTTGCAGAGTTTGCTGCCGAGAAGC
TCATCGATATGGAGCCGAAAAATGCAGGGAATTACATATTGCTCTCAAACATATATGCTGCAGCAGGAAAATGGGATGGAGTGGCAAAAATGAGAAGTTTCCTCAGAGAT
AAAGGGCTCAAGAAAACCCCCGGTTGTAGTTGGCTGGAAATAAATGGCCATGTAACCGAGTTTCGTGTTGCAGATCGAACTCATCCCAGAGCAGAAGATATATATACCAT
CCTAGGAAACCTTGAACTCGAAATCAAGGAGGCCAGAGAAAAGAGTCCAGAGAAATTGGGTATTCTTCTATAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCATCTTCAACGATCGAAACCCATTTTCCGTTTCGAATTCTCGAATTTTCCCGCCACCCAATCGAGACCGCTCAACACGCTTTCGTTCCTCTTCAGCCGATGCAG
CTCACGTCAACAACTCGAGCAGATTCACGCTAGGTTCATCCTCCATGGCTTGCACCAAAACCCAGCTCTCTCTTGCGAACTTATCGACTCCTATGCGAATCTTGGACTTC
TTACTCTCTCTCAACAAGTTTTCAACTCTATAATCGACCCCACTTCAACTCTTTACAGTGCGATACTGAGAAATTTGTCGAGCTTTGGAGAATATGAGCGGACCCTGTTG
GTGTACCGGGAAATGTTCGCAAAATCTATGCACCCGGATGAAGAAACTTACCCTTCTGTATTGCGATCGTGTTGCTGTTTATCAAATGTTGAATATGGGAGGAAGATTCA
CGGGCATTTGGTTAAACTTGGTGTTGATTTATATGATTCGACAGCCACTGCTCTGGCTGAGATGTATCGGAAGTGCATAGGTTTTGAGAATGGTCATGATCTGTTCGACA
AAATGCCTATGAAGGATTTTGAATGCTGGAATTCGTTGAATTCAGAGGCTTCTCAAAATGGGAACGGGGACGAAATTTTCCAGCTCTTTGGGAGAATGAGAACAGAACAA
CTGGTATCGGACTCACTCACATTTATCAATCTCTTGAGGTCCATAGTGGGTTTGAATTCAATTCAGCTTGCAAAGATTGTGCATTGTGTTGCAATCACGAGCAACTTGTG
CGGAGACTTGTTAGTAAATACTGCTGTGTTGTCTCTTTACTCAAAGTTGGGTTGCTTAGTCAATGCTAGAAAATTGTTTGACAAAATGCCAGAGAAGGACCGTGTTGTAT
GGAATATAATGATAGCAGCTTATGACCGAGAAGGGAACCCGGCAGAATGTCTCGAGCTTTTCAAGTCCATGGCACGATCAGGGATTAGAGCTGATCTATTTACTGCACTT
CCTGTCATCTCTTCAATCTCACAGTTGAAATGTGTTGATTGGGGCAAACAAACTCATGCCCATACATTGAGGAATGGCTCAGACAATCAAGTTTCAGTTCATAACTCTCT
CATTGATATGTACTGCGAGTTGAACATTTTAGATTCAGCCTGTAAGATCTTCAGCTGGATGACTAACAAGACTGTAATTTCATGGAGTGCAATGATTAAGGGGTGTGTGA
AACATGGTCAATCTCTTAATGCTTTGTCCCTCTTCTCCAGGATGAAGTCAGATGGGATTCAAGCTGATTTTATTACAGTGATCAATATCTTGCCTGCTTTTGTTCACATA
GGGGCACTAGAGAATGTCAAATATTTACATGGTTACTCAATGAAGCTAGGCCTGACTTCTCTTCCATCACTTAACACAGCCCTCCTAATCACCTATGCAAAATGTGGGTG
TATTGAGATGGCCCAGAGGCTATTTGAGGAAGAGAGAGTTGATGATAAAGATTTGATAATGTGGAACTCCATGATCAGTGCCCACGCCAACCATGGAGACTGGTCTCAAT
GTTTCAAGATATACAATCAAATGAAGTGCTCAAATTCAAGGCCCGATCAGGTAACCTTTCTGGGACTATTAACAGCTTGTGTCAATTCCGGTCTCGTGGAAAAAGGAAAG
GAGTGTTTCAAGGAGATGATTGAAAATTATGGTTGCCAACCAAGTCAAGAGCATTATGCTTGTATGGTTAATCTCTTGGGGAGAGCCGGGCTTATCAATGATGCTGGAGC
ACTTGTGAGAAACATGCCCATCAAACCCGATGCTCGAGTTTGGGGTCCATTGTTGAGTGCCTGTAAGTTGCATCCTGGGTCAAAGCTTGCAGAGTTTGCTGCCGAGAAGC
TCATCGATATGGAGCCGAAAAATGCAGGGAATTACATATTGCTCTCAAACATATATGCTGCAGCAGGAAAATGGGATGGAGTGGCAAAAATGAGAAGTTTCCTCAGAGAT
AAAGGGCTCAAGAAAACCCCCGGTTGTAGTTGGCTGGAAATAAATGGCCATGTAACCGAGTTTCGTGTTGCAGATCGAACTCATCCCAGAGCAGAAGATATATATACCAT
CCTAGGAAACCTTGAACTCGAAATCAAGGAGGCCAGAGAAAAGAGTCCAGAGAAATTGGGTATTCTTCTATAA
Protein sequenceShow/hide protein sequence
MLHLQRSKPIFRFEFSNFPATQSRPLNTLSFLFSRCSSRQQLEQIHARFILHGLHQNPALSCELIDSYANLGLLTLSQQVFNSIIDPTSTLYSAILRNLSSFGEYERTLL
VYREMFAKSMHPDEETYPSVLRSCCCLSNVEYGRKIHGHLVKLGVDLYDSTATALAEMYRKCIGFENGHDLFDKMPMKDFECWNSLNSEASQNGNGDEIFQLFGRMRTEQ
LVSDSLTFINLLRSIVGLNSIQLAKIVHCVAITSNLCGDLLVNTAVLSLYSKLGCLVNARKLFDKMPEKDRVVWNIMIAAYDREGNPAECLELFKSMARSGIRADLFTAL
PVISSISQLKCVDWGKQTHAHTLRNGSDNQVSVHNSLIDMYCELNILDSACKIFSWMTNKTVISWSAMIKGCVKHGQSLNALSLFSRMKSDGIQADFITVINILPAFVHI
GALENVKYLHGYSMKLGLTSLPSLNTALLITYAKCGCIEMAQRLFEEERVDDKDLIMWNSMISAHANHGDWSQCFKIYNQMKCSNSRPDQVTFLGLLTACVNSGLVEKGK
ECFKEMIENYGCQPSQEHYACMVNLLGRAGLINDAGALVRNMPIKPDARVWGPLLSACKLHPGSKLAEFAAEKLIDMEPKNAGNYILLSNIYAAAGKWDGVAKMRSFLRD
KGLKKTPGCSWLEINGHVTEFRVADRTHPRAEDIYTILGNLELEIKEAREKSPEKLGILL