; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022067 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022067
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
Genome locationchr7:17281050..17283047
RNA-Seq ExpressionLag0022067
SyntenyLag0022067
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597728.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.62Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        MDLLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LC+AHQ FD+IPIWDTFAWN+LIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        +ICA+R YGDLQ+GKQLHAQAFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWT+LAKLYLMEDKPSF++DLFYQMVELAADIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGA KLLQHGRNIHH+ARIH LEFDVLVSN LLKMYLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGR+AAHKHGREIHGYVLKN  ++NLIVQNALVDMYVKSGCIQSA KIFSRMKEKD+VSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVL ACSTA MV+EGDFYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALLDGCR HHQ+KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        +LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFRNKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRI KNLRVCHSCHESAKFISK VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

KAG7029175.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0086.62Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        MDLLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LC+AHQ FD+IPIWDTFAWN+LIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        +ICA+R YGDLQ+GKQLHAQAFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWT+LAKLYLMEDKPSF++DLFYQMVELAADIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGA KLLQHGRNIHH+ARIH LEFDVLVSN LLKMYLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGR+AAHKHGREIHGYVLKN  ++NLIVQNALVDMYVKSGCIQSA KIFSRMKEKDMVSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVL ACSTA MV+EGDFYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALLDGCR HHQ+KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        +LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFRNKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRI KNLRVCHSCHESAKFIS  VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

XP_004137884.2 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucumis sativus]0.0e+0087.37Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        M+LLLSTH H L ITQKPNH YHRH  FNN PHVRT T ENYANLC+AHQ FD+IPIWDTFAWN+LIQTHLTNGD+GHVISTY+QMLFRGVRPDKHTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        IICATRQYGDLQVGKQLHAQAFKLGFSSNLYV+TSLIELYGILD ADTAKWLHDKS CRNSVSWT+LAKLYL EDKPS A+DLFYQMVELA DIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGALK+L HGRNIHH+AR+H LEF++LVSNSLLKMY+DC SIKDARGFFD+MP KD+ISWTELIH YVKKGGINE FKLFRQMNMDG LK DP T
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGRMAAHKHG+EIHGYV+KNA +ENLIVQNALVDMYVKSGCIQSASK FS MKEKDMVSW++M LGYSLHGQGKLGVSLF+EME+N ++ RDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVLHAC+TA MVDEGD YF+CIT+PTVAH ALKVALLARAGRLDEARTFVEK KLDKH E+LRALLDGCRNH Q+KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        ILLSNWYACNEKWDMVE+LRETIRDMGLRPKKAYSW+EF NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDG KP PDF  HDVDEERECV IGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRITKNLRVCHSCHESAKFISK+VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

XP_008465161.1 PREDICTED: pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucumis melo]0.0e+0087.07Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        M+LLLSTH H L ITQKP H YHRH  FNN PHVRTTT ENYA+LC+AHQ FDEIPIWDTFAWN+LIQTHLTNGD GHVIS Y+QMLFRGVRPDKHTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        IICATRQYGDL VGKQLHAQAFKLGFSS+LYV+TSLIELYGILD ADTAKWLHDKS CRNSVSWT+LAKLYL EDKPSFA+DLFYQMVELA DID+VALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGALK+L HGRNIHH+ARIH LEF++LVSNSLLKMYLDC SIKDARGFFD+MP KDVISWTELIH YVKKGGINE FKLFRQMNMDG LK DPLT
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGRMAAHKHG+EIHGYVLKN  +ENLIVQNALVDMYVKSGCIQSASK FS MKEKDMVSW++M LGYSLHGQGKLGV LF+EME+NL++HRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVLHAC+TA MVDEGDFYF+ IT+PTVAH ALKVALLARAGRLDEARTFVEK KL+KH E+LRALLDGCRNH Q+KLGKRIIEQLCDLEPLN ENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        ILLSNWYACN+KWDMVE LRETIRDMGLRPKKAYSW+EF NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDG K  P+F  HDVDEERECV IGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRITKNLRVCHSCHESAKFISK+VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

XP_023539701.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo]0.0e+0086.32Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        M+LLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LC+AHQ FD+IPIWDTFAWN+LIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        +ICA+R YGDLQ+GKQLHAQAFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWT+LAKLYLMEDKPSF++DLFYQMVELAADIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TA+GACGA KLLQHGRNIHH+ARIH LEFDVLVSN LLKMYLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGR+AAHKHGREIHGYVLKN  ++NLIVQNALVDMYVKSGCIQSA KIFSRMKEKDMVSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVL ACSTA MV+EGDFYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALLDGCR HHQ+KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        +LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFRNKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRI+KNLRVCHSCHESAKFIS  VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

TrEMBL top hitse value%identityAlignment
A0A0A0L9N4 DYW_deaminase domain-containing protein0.0e+0085.65Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        M+LLLSTH H L ITQKPNH YHRH  FNN PHVRT T ENYANLC+AHQ FD+IPIWDTFAWN+LIQTHLTNGD+GHVISTY+QMLFRGVRPDKHTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        IICATRQYGDLQVGKQLHAQAFKLGFSSNLYV+TSLIELYGILD ADTAKWLHDKS CRNSVSWT+LAKLYL EDKPS A+DLFYQMVELA DIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGALK+L HGRNIHH+AR+H LEF++LVSNSLLKMY+DC SIKDARGFFD+MP KD+ISWTELIH YVKKGGINE FKLFRQMNMDG LK DP T
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGRMAAHKHG+EIHGYV+KNA +ENLIVQNALVDMYVKSGCIQSASK FS MKEKDMVSW++M LGYSLHGQGKLGVSLF+EME+N ++ RDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVLHAC+TA MVDEGD YF+CIT+PTVAH ALKVALLARAGRLDEARTFVEK KLDKH E+LRALLDGCRNH Q+KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        ILLSNWYACNEKWDMVE+LRETIRDMGLRPKKAYSW+EF NKIHVFGTGDVSHPRSQNIYWNLQCLMK+MEEDG KP PDF  HDVDEERECV IGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR
        LAISFGLISTEAGR IRITKNLR+  +  ++  F++ I GR
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR

A0A1S3CPR5 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like0.0e+0087.07Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        M+LLLSTH H L ITQKP H YHRH  FNN PHVRTTT ENYA+LC+AHQ FDEIPIWDTFAWN+LIQTHLTNGD GHVIS Y+QMLFRGVRPDKHTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        IICATRQYGDL VGKQLHAQAFKLGFSS+LYV+TSLIELYGILD ADTAKWLHDKS CRNSVSWT+LAKLYL EDKPSFA+DLFYQMVELA DID+VALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGALK+L HGRNIHH+ARIH LEF++LVSNSLLKMYLDC SIKDARGFFD+MP KDVISWTELIH YVKKGGINE FKLFRQMNMDG LK DPLT
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGRMAAHKHG+EIHGYVLKN  +ENLIVQNALVDMYVKSGCIQSASK FS MKEKDMVSW++M LGYSLHGQGKLGV LF+EME+NL++HRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVLHAC+TA MVDEGDFYF+ IT+PTVAH ALKVALLARAGRLDEARTFVEK KL+KH E+LRALLDGCRNH Q+KLGKRIIEQLCDLEPLN ENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        ILLSNWYACN+KWDMVE LRETIRDMGLRPKKAYSW+EF NKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDG K  P+F  HDVDEERECV IGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRITKNLRVCHSCHESAKFISK+VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

A0A6J1E0A4 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like0.0e+0087.82Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        MDLLLSTH  RL IT K + TY R R FNNPPHVRT   ENYANLC AH PFDEIP WDTFAWN+LIQTHLTNGDVG VISTY+QML RGVRPD HTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        II A+RQ GDLQVGKQLHAQ FKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWT+LAKLY+MEDKPSFA+DLFYQMVELAADIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACG+LKLLQHGRNIH +AR H LEFDVLVSNSLLKMYLDCGSI+DARGFF+RMP KDVISWTELI AYVKKGGINEGFKLFRQMNMDGGLK DP+T
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGRMAAHKHGREIHGYVLK+AI+ NLIVQNALVDMYVKSGCIQSA KIFSRMKEKD +SWTVMILGYSLHGQGKLGVSLF+ MERNLR+HRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYT+VLHACSTA +V+EGDFYFNCI EPT +HFALKVALLARAGRLDEAR FVE+HKLDKH E+LRALLDGCR H  +KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        ILLSNWYACN K DMVE+ RE +RDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNL+CLMKKME+DG KPKPDF FHDVDEERECVLIGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR I ITKNLRVCHSCHESAKFISKIVGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

A0A6J1EXC6 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like0.0e+0085.86Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        MDLLLST IHRL +TQKPNHTY RHRLFNNPPHVRTTTAE  A+LC+AHQ FD+IPIWDTFAWN+LIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        +ICA+R YGDLQ+GKQLHAQAFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWT+LAKLYLMEDKPSF++DLFYQMVELAADIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGA KLLQHGRNIHH+ARIH LEFD+LVSN LLKMYLDCGSIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGR+ AHKHGREIHGYVLKN  ++NLIVQNALVDMYVKSGCIQSA KIFSRMKEKDMVSWTV+I GYSLHGQGKLGV LF+EM+RN  VHRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVL ACSTA MV+EGDFYFNCITEPT+AHF LKVALL RAGR +EARTFV+KHKLDK+ E+LRALLDGCR HHQ+KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        +LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFRNKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC LIGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRI KNLRVCHSCHESAKFIS  VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

A0A6J1I9E1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like0.0e+0086.32Show/hide
Query:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR
        MDLLLST IHRL +TQKPNHTYHRHRLFNNPPHVRTTTAE  A+LC+AHQ FD+IPIWDTFAWN+LIQTHLT+GDVGHVISTYQQML RGVRPD HTLPR
Subjt:  MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPR

Query:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA
        +ICA+R YGDLQ+GKQLHAQAFKLG  SNLYV TSLIELYGILD ADTA+WLHDKSACRN+VSWT+LAKLYLMEDKPSF++DLFYQMVELAADIDAVALA
Subjt:  IICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALA

Query:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT
        TAIGACGA KLLQHGRNIHH+ARIH LEFDVLVSN LLKMYLDC SIKDARG F+RMP++D+ISWT+LIH YVK GGINE  KLFRQMNMDG LK DPLT
Subjt:  TAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLT

Query:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE
        ISS+LPACGR+AAHKHGREIHGYVLKN  ++NLIVQNALVDMYVKSGCIQSA KIFSRMKEKDMVSWTVMI GYSLHGQGKLGV LF+EM+RN RVHRDE
Subjt:  ISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDE

Query:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY
        ITYTAVL +CSTA MV+EGDFYFNCITEPT+AHF LKVALL RAGR DEARTFV+KHKLDK++E+LRALLDGCR HHQ KLGKRIIEQLCDLEPLNAENY
Subjt:  ITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENY

Query:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL
        +LLSNWYA NE+W+MVE+LR+TIRDMGLRPKKAYSWMEFRNKIH FGTGDVSHPRSQ IYWNLQCLMKKMEEDGFK   DFRFHDVDEEREC  IGHSEL
Subjt:  ILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSEL

Query:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC
        LAISFGLISTEAGR IRI+KNLRVCHSCHESAKFIS  VGREIIV+DPY FHH KDG CSCE FC
Subjt:  LAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAFC

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic3.0e-10632.71Show/hide
Query:  LLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRG-VRPDKHTLPRII
        L  TH H ++ T   +  Y   +LF            ++A+L  A + FDEIP  ++FAWN+LI+ + +  D    I  +  M+      P+K+T P +I
Subjt:  LLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRG-VRPDKHTLPRII

Query:  CATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA
         A  +   L +G+ LH  A K    S+++V  SLI  Y      D+A  +      ++ VSW  +   ++ +  P  A++LF +M         V +   
Subjt:  CATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA

Query:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFD-------------------------------RMPYKDVISWTELIHA
        + AC  ++ L+ GR +      + +  ++ ++N++L MY  CGSI+DA+  FD                                MP KD+++W  LI A
Subjt:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFD-------------------------------RMPYKDVISWTELIHA

Query:  YVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMI
        Y + G  NE   +F ++ +   +K + +T+ S L AC ++ A + GR IH Y+ K+ I  N  V +AL+ MY K G ++ + ++F+ ++++D+  W+ MI
Subjt:  YVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMI

Query:  LGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVL
         G ++HG G   V +F +M+    V  + +T+T V  ACS   +VDE +  F+ +       P   H+A  V +L R+G L++A  F+E   +     V 
Subjt:  LGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVL

Query:  RALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCL
         ALL  C+ H    L +    +L +LEP N   ++LLSN YA   KW+ V  LR+ +R  GL+ +   S +E    IH F +GD +HP S+ +Y  L  +
Subjt:  RALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCL

Query:  MKKMEEDGFKPKPDFRFHDVDEE--RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAF
        M+K++ +G++P+       ++EE  +E  L  HSE LAI +GLISTEA + IR+ KNLRVC  CH  AK IS++  REIIVRD Y FHH ++G CSC  F
Subjt:  MKKMEEDGFKPKPDFRFHDVDEE--RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAF

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127702.7e-11834.92Show/hide
Query:  AENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE
        + ++ ++  A Q FD++P    F WN++I+ +  N      +  Y  M    V PD  T P ++ A      LQ+G+ +HAQ F+LGF ++++V   LI 
Subjt:  AENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE

Query:  LYGILDGADTAKWLHDKSAC--RNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNS
        LY       +A+ + +      R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR+IH       LE +  +  S
Subjt:  LYGILDGADTAKWLHDKSAC--RNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNS

Query:  LLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQ
        L  MY  CG +  A+  FD+M   ++I W  +I  Y K G   E   +F +M ++  ++ D ++I+S + AC ++ + +  R ++ YV ++   +++ + 
Subjt:  LLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQ

Query:  NALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE----PTVA
        +AL+DM+ K G ++ A  +F R  ++D+V W+ MI+GY LHG+ +  +SL++ MER   VH +++T+  +L AC+ + MV EG ++FN + +    P   
Subjt:  NALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE----PTVA

Query:  HFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKK
        H+A  + LL RAG LD+A   ++   +     V  ALL  C+ H   +LG+   +QL  ++P N  +Y+ LSN YA    WD V  +R  +++ GL    
Subjt:  HFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKK

Query:  AYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHES
          SW+E R ++  F  GD SHPR + I   ++ +  +++E GF    D   HD+ DEE E  L  HSE +AI++GLIST  G  +RITKNLR C +CH +
Subjt:  AYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHES

Query:  AKFISKIVGREIIVRDPYTFHHLKDGYCSC
         K ISK+V REI+VRD   FHH KDG CSC
Subjt:  AKFISKIVGREIIVRDPYTFHHLKDGYCSC

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233301.5e-10833.68Show/hide
Query:  YANLCIAHQP---FDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE
        Y NL + H+    F  +      AW S+I+           ++++ +M   G  PD +  P ++ +     DL+ G+ +H    +LG   +LY   +L+ 
Subjt:  YANLCIAHQP---FDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE

Query:  LYGIL------------------------------------DGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA
        +Y  L                                     G D+ + + +    ++ VS+  +   Y        A+ +  +M       D+  L++ 
Subjt:  LYGIL------------------------------------DGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA

Query:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTIS
        +        +  G+ IH       ++ DV + +SL+ MY     I+D+   F R+  +D ISW  L+  YV+ G  NE  +LFRQM +   +K   +  S
Subjt:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTIS

Query:  SLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEIT
        S++PAC  +A    G+++HGYVL+     N+ + +ALVDMY K G I++A KIF RM   D VSWT +I+G++LHG G   VSLF+EM+R   V  +++ 
Subjt:  SLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEIT

Query:  YTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNA
        + AVL ACS   +VDE   YFN +T+       + H+A    LL RAG+L+EA  F+ K  ++    V   LL  C  H   +L +++ E++  ++  N 
Subjt:  YTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNA

Query:  ENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIG
          Y+L+ N YA N +W  + +LR  +R  GLR K A SW+E +NK H F +GD SHP    I   L+ +M++ME++G+        HDVDEE +  +L G
Subjt:  ENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIG

Query:  HSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSC
        HSE LA++FG+I+TE G  IR+TKN+R+C  CH + KFISKI  REIIVRD   FHH   G CSC
Subjt:  HSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSC

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g220702.8e-10731.87Show/hide
Query:  ITQKPNHTYHRHRLFNNPPHVRTTTAEN--------YANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICAT
        +  K  +  H  +LF+  P +RT  + N          ++    + FD++P  D+ +W ++I  +   G     I     M+  G+ P + TL  ++ + 
Subjt:  ITQKPNHTYHRHRLFNNPPHVRTTTAEN--------YANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICAT

Query:  RQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVE----------------
             ++ GK++H+   KLG   N+ V  SL+ +Y        AK++ D+   R+  SW  +  L++   +   A+  F QM E                
Subjt:  RQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVE----------------

Query:  ----LAADI------------DAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDAR--------------GF------
             A DI            D   LA+ + AC  L+ L  G+ IH        +   +V N+L+ MY  CG ++ AR              GF      
Subjt:  ----LAADI------------DAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDAR--------------GF------

Query:  -------------FDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALV
                     F  +  +DV++WT +I  Y + G   E   LFR M + GG + +  T++++L     +A+  HG++IHG  +K+    ++ V NAL+
Subjt:  -------------FDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALV

Query:  DMYVKSGCIQSASKIFSRMK-EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHF
         MY K+G I SAS+ F  ++ E+D VSWT MI+  + HG  +  + LF+ M     +  D ITY  V  AC+ A +V++G  YF+ + +     PT++H+
Subjt:  DMYVKSGCIQSASKIFSRMK-EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHF

Query:  ALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAY
        A  V L  RAG L EA+ F+EK  ++       +LL  CR H    LGK   E+L  LEP N+  Y  L+N Y+   KW+   ++R++++D  ++ ++ +
Subjt:  ALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAY

Query:  SWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAK
        SW+E ++K+HVFG  D +HP    IY  ++ +  ++++ G+ P      HD++EE +E +L  HSE LAI+FGLIST     +RI KNLRVC+ CH + K
Subjt:  SWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAK

Query:  FISKIVGREIIVRDPYTFHHLKDGYCSCEAF
        FISK+VGREIIVRD   FHH KDG+CSC  +
Subjt:  FISKIVGREIIVRDPYTFHHLKDGYCSCEAF

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.2e-12938.52Show/hide
Query:  AHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGAD
        A + FDE+   D  +WNS+I  +++NG     +S + QML  G+  D  T+  +         + +G+ +H+   K  FS       +L+++Y      D
Subjt:  AHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGAD

Query:  TAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSI
        +AK +  + + R+ VS+T +   Y  E     AV LF +M E     D   +   +  C   +LL  G+ +H   + + L FD+ VSN+L+ MY  CGS+
Subjt:  TAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSI

Query:  KDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSG
        ++A   F  M  KD+ISW  +I  Y K    NE   LF  +  +     D  T++ +LPAC  ++A   GREIHGY+++N    +  V N+LVDMY K G
Subjt:  KDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSG

Query:  CIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFN-----CITEPTVAHFALKVALLA
         +  A  +F  +  KD+VSWTVMI GY +HG GK  ++LF +M R   +  DEI++ ++L+ACS + +VDEG  +FN     C  EPTV H+A  V +LA
Subjt:  CIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFN-----CITEPTVAHFALKVALLA

Query:  RAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNK
        R G L +A  F+E   +   A +  ALL GCR HH  KL +++ E++ +LEP N   Y+L++N YA  EKW+ V+RLR+ I   GLR     SW+E + +
Subjt:  RAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNK

Query:  IHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR
        +++F  GD S+P ++NI   L+ +  +M E+G+ P   +   D +E E+E  L GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Subjt:  IHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR

Query:  EIIVRDPYTFHHLKDGYCSCEAF
        EI++RD   FH  KDG+CSC  F
Subjt:  EIIVRDPYTFHHLKDGYCSCEAF

Arabidopsis top hitse value%identityAlignment
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein2.0e-10831.87Show/hide
Query:  ITQKPNHTYHRHRLFNNPPHVRTTTAEN--------YANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICAT
        +  K  +  H  +LF+  P +RT  + N          ++    + FD++P  D+ +W ++I  +   G     I     M+  G+ P + TL  ++ + 
Subjt:  ITQKPNHTYHRHRLFNNPPHVRTTTAEN--------YANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICAT

Query:  RQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVE----------------
             ++ GK++H+   KLG   N+ V  SL+ +Y        AK++ D+   R+  SW  +  L++   +   A+  F QM E                
Subjt:  RQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVE----------------

Query:  ----LAADI------------DAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDAR--------------GF------
             A DI            D   LA+ + AC  L+ L  G+ IH        +   +V N+L+ MY  CG ++ AR              GF      
Subjt:  ----LAADI------------DAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDAR--------------GF------

Query:  -------------FDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALV
                     F  +  +DV++WT +I  Y + G   E   LFR M + GG + +  T++++L     +A+  HG++IHG  +K+    ++ V NAL+
Subjt:  -------------FDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALV

Query:  DMYVKSGCIQSASKIFSRMK-EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHF
         MY K+G I SAS+ F  ++ E+D VSWT MI+  + HG  +  + LF+ M     +  D ITY  V  AC+ A +V++G  YF+ + +     PT++H+
Subjt:  DMYVKSGCIQSASKIFSRMK-EKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHF

Query:  ALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAY
        A  V L  RAG L EA+ F+EK  ++       +LL  CR H    LGK   E+L  LEP N+  Y  L+N Y+   KW+   ++R++++D  ++ ++ +
Subjt:  ALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAY

Query:  SWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAK
        SW+E ++K+HVFG  D +HP    IY  ++ +  ++++ G+ P      HD++EE +E +L  HSE LAI+FGLIST     +RI KNLRVC+ CH + K
Subjt:  SWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAK

Query:  FISKIVGREIIVRDPYTFHHLKDGYCSCEAF
        FISK+VGREIIVRD   FHH KDG+CSC  +
Subjt:  FISKIVGREIIVRDPYTFHHLKDGYCSCEAF

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-10732.71Show/hide
Query:  LLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRG-VRPDKHTLPRII
        L  TH H ++ T   +  Y   +LF            ++A+L  A + FDEIP  ++FAWN+LI+ + +  D    I  +  M+      P+K+T P +I
Subjt:  LLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRG-VRPDKHTLPRII

Query:  CATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA
         A  +   L +G+ LH  A K    S+++V  SLI  Y      D+A  +      ++ VSW  +   ++ +  P  A++LF +M         V +   
Subjt:  CATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA

Query:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFD-------------------------------RMPYKDVISWTELIHA
        + AC  ++ L+ GR +      + +  ++ ++N++L MY  CGSI+DA+  FD                                MP KD+++W  LI A
Subjt:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFD-------------------------------RMPYKDVISWTELIHA

Query:  YVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMI
        Y + G  NE   +F ++ +   +K + +T+ S L AC ++ A + GR IH Y+ K+ I  N  V +AL+ MY K G ++ + ++F+ ++++D+  W+ MI
Subjt:  YVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMI

Query:  LGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVL
         G ++HG G   V +F +M+    V  + +T+T V  ACS   +VDE +  F+ +       P   H+A  V +L R+G L++A  F+E   +     V 
Subjt:  LGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVL

Query:  RALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCL
         ALL  C+ H    L +    +L +LEP N   ++LLSN YA   KW+ V  LR+ +R  GL+ +   S +E    IH F +GD +HP S+ +Y  L  +
Subjt:  RALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCL

Query:  MKKMEEDGFKPKPDFRFHDVDEE--RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAF
        M+K++ +G++P+       ++EE  +E  L  HSE LAI +GLISTEA + IR+ KNLRVC  CH  AK IS++  REIIVRD Y FHH ++G CSC  F
Subjt:  MKKMEEDGFKPKPDFRFHDVDEE--RECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSCEAF

AT3G12770.1 mitochondrial editing factor 221.9e-11934.92Show/hide
Query:  AENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE
        + ++ ++  A Q FD++P    F WN++I+ +  N      +  Y  M    V PD  T P ++ A      LQ+G+ +HAQ F+LGF ++++V   LI 
Subjt:  AENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE

Query:  LYGILDGADTAKWLHDKSAC--RNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNS
        LY       +A+ + +      R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR+IH       LE +  +  S
Subjt:  LYGILDGADTAKWLHDKSAC--RNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNS

Query:  LLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQ
        L  MY  CG +  A+  FD+M   ++I W  +I  Y K G   E   +F +M ++  ++ D ++I+S + AC ++ + +  R ++ YV ++   +++ + 
Subjt:  LLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQ

Query:  NALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE----PTVA
        +AL+DM+ K G ++ A  +F R  ++D+V W+ MI+GY LHG+ +  +SL++ MER   VH +++T+  +L AC+ + MV EG ++FN + +    P   
Subjt:  NALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITE----PTVA

Query:  HFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKK
        H+A  + LL RAG LD+A   ++   +     V  ALL  C+ H   +LG+   +QL  ++P N  +Y+ LSN YA    WD V  +R  +++ GL    
Subjt:  HFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKK

Query:  AYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHES
          SW+E R ++  F  GD SHPR + I   ++ +  +++E GF    D   HD+ DEE E  L  HSE +AI++GLIST  G  +RITKNLR C +CH +
Subjt:  AYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHES

Query:  AKFISKIVGREIIVRDPYTFHHLKDGYCSC
         K ISK+V REI+VRD   FHH KDG CSC
Subjt:  AKFISKIVGREIIVRDPYTFHHLKDGYCSC

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-10933.68Show/hide
Query:  YANLCIAHQP---FDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE
        Y NL + H+    F  +      AW S+I+           ++++ +M   G  PD +  P ++ +     DL+ G+ +H    +LG   +LY   +L+ 
Subjt:  YANLCIAHQP---FDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIE

Query:  LYGIL------------------------------------DGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA
        +Y  L                                     G D+ + + +    ++ VS+  +   Y        A+ +  +M       D+  L++ 
Subjt:  LYGIL------------------------------------DGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATA

Query:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTIS
        +        +  G+ IH       ++ DV + +SL+ MY     I+D+   F R+  +D ISW  L+  YV+ G  NE  +LFRQM +   +K   +  S
Subjt:  IGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTIS

Query:  SLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEIT
        S++PAC  +A    G+++HGYVL+     N+ + +ALVDMY K G I++A KIF RM   D VSWT +I+G++LHG G   VSLF+EM+R   V  +++ 
Subjt:  SLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEIT

Query:  YTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNA
        + AVL ACS   +VDE   YFN +T+       + H+A    LL RAG+L+EA  F+ K  ++    V   LL  C  H   +L +++ E++  ++  N 
Subjt:  YTAVLHACSTARMVDEGDFYFNCITE-----PTVAHFALKVALLARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNA

Query:  ENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIG
          Y+L+ N YA N +W  + +LR  +R  GLR K A SW+E +NK H F +GD SHP    I   L+ +M++ME++G+        HDVDEE +  +L G
Subjt:  ENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEE-RECVLIG

Query:  HSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSC
        HSE LA++FG+I+TE G  IR+TKN+R+C  CH + KFISKI  REIIVRD   FHH   G CSC
Subjt:  HSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCSC

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein8.2e-13138.52Show/hide
Query:  AHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGAD
        A + FDE+   D  +WNS+I  +++NG     +S + QML  G+  D  T+  +         + +G+ +H+   K  FS       +L+++Y      D
Subjt:  AHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGDLQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGAD

Query:  TAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSI
        +AK +  + + R+ VS+T +   Y  E     AV LF +M E     D   +   +  C   +LL  G+ +H   + + L FD+ VSN+L+ MY  CGS+
Subjt:  TAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHHIARIHALEFDVLVSNSLLKMYLDCGSI

Query:  KDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSG
        ++A   F  M  KD+ISW  +I  Y K    NE   LF  +  +     D  T++ +LPAC  ++A   GREIHGY+++N    +  V N+LVDMY K G
Subjt:  KDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAINENLIVQNALVDMYVKSG

Query:  CIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFN-----CITEPTVAHFALKVALLA
         +  A  +F  +  KD+VSWTVMI GY +HG GK  ++LF +M R   +  DEI++ ++L+ACS + +VDEG  +FN     C  EPTV H+A  V +LA
Subjt:  CIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFN-----CITEPTVAHFALKVALLA

Query:  RAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNK
        R G L +A  F+E   +   A +  ALL GCR HH  KL +++ E++ +LEP N   Y+L++N YA  EKW+ V+RLR+ I   GLR     SW+E + +
Subjt:  RAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNK

Query:  IHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR
        +++F  GD S+P ++NI   L+ +  +M E+G+ P   +   D +E E+E  L GHSE LA++ G+IS+  G+ IR+TKNLRVC  CHE AKF+SK+  R
Subjt:  IHVFGTGDVSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGR

Query:  EIIVRDPYTFHHLKDGYCSCEAF
        EI++RD   FH  KDG+CSC  F
Subjt:  EIIVRDPYTFHHLKDGYCSCEAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCTCCTCCTCTCCACCCACATTCATCGTCTCCTCATTACTCAGAAACCGAATCACACATATCATCGCCACCGACTATTTAATAATCCCCCTCATGTTCGCACCAC
AACTGCCGAGAATTATGCCAATTTATGTATAGCCCACCAACCGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAGTCTTATCCAAACCCATCTCACCAATG
GAGATGTGGGGCATGTTATTTCAACATATCAACAGATGCTGTTTCGAGGTGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGCGCTACACGCCAGTATGGTGAT
CTGCAGGTTGGCAAACAGCTCCATGCTCAAGCTTTCAAACTTGGGTTCTCCTCTAACCTCTATGTAATTACTTCCTTGATTGAATTGTATGGGATTCTTGACGGTGCTGA
CACTGCAAAGTGGCTCCATGACAAATCGGCTTGCAGAAACTCTGTTTCTTGGACACTGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCGTAGACTTGT
TTTATCAAATGGTGGAGTTGGCCGCTGATATTGATGCAGTGGCATTGGCCACGGCCATTGGTGCCTGTGGTGCACTCAAATTGCTGCAACACGGAAGAAATATCCACCAT
ATCGCTAGAATTCATGCCTTGGAATTTGACGTCCTGGTCAGTAATTCCCTATTGAAAATGTACCTTGATTGTGGTAGTATCAAAGATGCTCGGGGATTCTTTGACCGAAT
GCCGTACAAAGATGTCATTTCGTGGACAGAACTCATCCATGCGTATGTTAAGAAAGGTGGAATCAATGAGGGCTTTAAGCTGTTTCGGCAGATGAATATGGATGGAGGAT
TGAAGGCTGATCCTCTTACAATTAGCAGCCTTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAGAGAGATTCATGGATATGTGCTTAAAAATGCTATTAAT
GAGAATCTCATTGTCCAAAATGCTTTGGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAATTTTCTCGAGGATGAAGGAGAAAGATATGGTTTCGTG
GACCGTCATGATCTTGGGCTACAGCTTACATGGCCAAGGAAAACTCGGAGTCAGTTTGTTCCAGGAAATGGAGAGGAACTTGAGGGTGCATAGAGATGAGATCACTTATA
CTGCAGTTTTGCATGCTTGTAGTACTGCAAGAATGGTAGATGAAGGGGATTTTTACTTCAATTGCATTACTGAACCAACTGTGGCACACTTTGCTTTAAAGGTGGCTCTT
TTAGCCCGTGCAGGACGACTGGATGAAGCAAGGACCTTTGTCGAAAAACATAAACTTGACAAACATGCTGAGGTTTTGAGAGCATTACTCGACGGATGCAGGAACCACCA
TCAAGAAAAATTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCCGAGAATTACATCCTACTTTCAAATTGGTATGCCTGCAACGAAAAATGGG
ATATGGTGGAAAGGTTGAGAGAAACAATTAGAGACATGGGATTAAGACCAAAAAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATTCATGTGTTTGGGACAGGGGAT
GTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGCCGAAACCAGATTTTAGATTCCACGACGTGGA
TGAGGAGCGAGAGTGTGTTCTAATAGGACACAGTGAGCTCTTAGCAATTTCGTTCGGGCTGATTAGTACAGAAGCAGGAAGGAGAATTCGTATTACAAAGAACCTTCGTG
TATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATTGTTGGGCGAGAAATCATAGTAAGAGATCCTTATACTTTCCATCATTTGAAGGATGGCTATTGTTCT
TGTGAAGCTTTTTGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATCTCCTCCTCTCCACCCACATTCATCGTCTCCTCATTACTCAGAAACCGAATCACACATATCATCGCCACCGACTATTTAATAATCCCCCTCATGTTCGCACCAC
AACTGCCGAGAATTATGCCAATTTATGTATAGCCCACCAACCGTTCGACGAAATTCCTATATGGGATACTTTTGCTTGGAACAGTCTTATCCAAACCCATCTCACCAATG
GAGATGTGGGGCATGTTATTTCAACATATCAACAGATGCTGTTTCGAGGTGTTCGCCCTGACAAACACACCCTTCCTCGAATTATATGCGCTACACGCCAGTATGGTGAT
CTGCAGGTTGGCAAACAGCTCCATGCTCAAGCTTTCAAACTTGGGTTCTCCTCTAACCTCTATGTAATTACTTCCTTGATTGAATTGTATGGGATTCTTGACGGTGCTGA
CACTGCAAAGTGGCTCCATGACAAATCGGCTTGCAGAAACTCTGTTTCTTGGACACTGTTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCGTAGACTTGT
TTTATCAAATGGTGGAGTTGGCCGCTGATATTGATGCAGTGGCATTGGCCACGGCCATTGGTGCCTGTGGTGCACTCAAATTGCTGCAACACGGAAGAAATATCCACCAT
ATCGCTAGAATTCATGCCTTGGAATTTGACGTCCTGGTCAGTAATTCCCTATTGAAAATGTACCTTGATTGTGGTAGTATCAAAGATGCTCGGGGATTCTTTGACCGAAT
GCCGTACAAAGATGTCATTTCGTGGACAGAACTCATCCATGCGTATGTTAAGAAAGGTGGAATCAATGAGGGCTTTAAGCTGTTTCGGCAGATGAATATGGATGGAGGAT
TGAAGGCTGATCCTCTTACAATTAGCAGCCTTCTCCCAGCCTGTGGAAGAATGGCTGCGCATAAGCATGGAAGAGAGATTCATGGATATGTGCTTAAAAATGCTATTAAT
GAGAATCTCATTGTCCAAAATGCTTTGGTTGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCAAAAATTTTCTCGAGGATGAAGGAGAAAGATATGGTTTCGTG
GACCGTCATGATCTTGGGCTACAGCTTACATGGCCAAGGAAAACTCGGAGTCAGTTTGTTCCAGGAAATGGAGAGGAACTTGAGGGTGCATAGAGATGAGATCACTTATA
CTGCAGTTTTGCATGCTTGTAGTACTGCAAGAATGGTAGATGAAGGGGATTTTTACTTCAATTGCATTACTGAACCAACTGTGGCACACTTTGCTTTAAAGGTGGCTCTT
TTAGCCCGTGCAGGACGACTGGATGAAGCAAGGACCTTTGTCGAAAAACATAAACTTGACAAACATGCTGAGGTTTTGAGAGCATTACTCGACGGATGCAGGAACCACCA
TCAAGAAAAATTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTGGAACCTCTAAATGCCGAGAATTACATCCTACTTTCAAATTGGTATGCCTGCAACGAAAAATGGG
ATATGGTGGAAAGGTTGAGAGAAACAATTAGAGACATGGGATTAAGACCAAAAAAGGCTTACAGTTGGATGGAGTTCCGCAACAAAATTCATGTGTTTGGGACAGGGGAT
GTATCCCACCCGAGATCACAGAACATATATTGGAATTTACAGTGCCTGATGAAGAAAATGGAAGAAGATGGTTTCAAGCCGAAACCAGATTTTAGATTCCACGACGTGGA
TGAGGAGCGAGAGTGTGTTCTAATAGGACACAGTGAGCTCTTAGCAATTTCGTTCGGGCTGATTAGTACAGAAGCAGGAAGGAGAATTCGTATTACAAAGAACCTTCGTG
TATGCCATAGTTGTCATGAGTCTGCAAAGTTCATATCCAAGATTGTTGGGCGAGAAATCATAGTAAGAGATCCTTATACTTTCCATCATTTGAAGGATGGCTATTGTTCT
TGTGAAGCTTTTTGTTAA
Protein sequenceShow/hide protein sequence
MDLLLSTHIHRLLITQKPNHTYHRHRLFNNPPHVRTTTAENYANLCIAHQPFDEIPIWDTFAWNSLIQTHLTNGDVGHVISTYQQMLFRGVRPDKHTLPRIICATRQYGD
LQVGKQLHAQAFKLGFSSNLYVITSLIELYGILDGADTAKWLHDKSACRNSVSWTLLAKLYLMEDKPSFAVDLFYQMVELAADIDAVALATAIGACGALKLLQHGRNIHH
IARIHALEFDVLVSNSLLKMYLDCGSIKDARGFFDRMPYKDVISWTELIHAYVKKGGINEGFKLFRQMNMDGGLKADPLTISSLLPACGRMAAHKHGREIHGYVLKNAIN
ENLIVQNALVDMYVKSGCIQSASKIFSRMKEKDMVSWTVMILGYSLHGQGKLGVSLFQEMERNLRVHRDEITYTAVLHACSTARMVDEGDFYFNCITEPTVAHFALKVAL
LARAGRLDEARTFVEKHKLDKHAEVLRALLDGCRNHHQEKLGKRIIEQLCDLEPLNAENYILLSNWYACNEKWDMVERLRETIRDMGLRPKKAYSWMEFRNKIHVFGTGD
VSHPRSQNIYWNLQCLMKKMEEDGFKPKPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRRIRITKNLRVCHSCHESAKFISKIVGREIIVRDPYTFHHLKDGYCS
CEAFC