CuGenDBv2

Tan0022633 (gene) of Snake gourd v1 genome

Gene ID: Tan0022633
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein DOT4
Genome location: LG10:21362020..21364413
RNA-Seq Expression: Tan0022633
Synteny: Tan0022633
Gene Ontology terms:
    GO:0005515 - protein binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR032867 - DYW domain
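
The genome location above can be turned into a span length with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG10:21362020..21364413")
length = span_length(start, end)
print(seqid, start, end, length)
```

Under that assumption the gene spans 2394 bp on LG10; with 0-based, half-open coordinates the figure would be one less.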


Homology
GenBank top hits (e value, % identity, alignment)
KAG6597728.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.3e-279 | % identity: 88.17
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+IDLFYQMVELA DIDAVAL+TAIGACGA KLLQHGRNIHHVARIHGLEFDVLVSN LLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDC SIKDARG FNRMP +D+ISWT+LIH YVK GGINE  KLFR+MNMDG LKPDPLTISSILPACGR+AAHKHGREIHGYVLKN FD NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSA KIF RMKEKD+VSWT+MI GYSLHGQGKLGV LFREM+RN RVHRDEITYTAVL ACSTASMV+EG FYFNCITEPTMAHF LKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
         RAGR DEARTFV++HKLDK++EILRALLDGCR HHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNE W+MVEKLR+TIRDMGLRPKKAYSW+EF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIH FGTGDV+HPRSQ IYW LQCLM+KMEEDGFK N DFRFHDVDEEREC LIGHSELLAISFGLISTEAGRTIRI KNLRVCH+CHESAKFISK V R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDGRCSCEDFC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC
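
The unlabelled middle line of each alignment block appears to follow the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for a single block under that assumption. `midline_stats` is a hypothetical helper, and the result is a per-block figure, so it will generally differ from the overall % identity the database reports for the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    ASSUMES: letters in the midline are identities, '+' are conservative
    substitutions, and anything else (spaces) are mismatches or gaps.
    """
    columns = len(query)
    # Pad the midline in case trailing spaces were stripped during extraction.
    midline = midline.ljust(columns)[:columns]
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "columns": columns,
        "identities": identities,
        "positives": identities + conservative,
        "percent_identity": 100.0 * identities / columns,
    }


# Last block of the KAG6597728.1 alignment, copied from the record above.
query = "EIVVKDPYVFHHFKDGRCSCEDFC"
midline = "EI+VKDPYVFHHFKDGRCSCEDFC"
stats = midline_stats(query, midline)
print(stats)
```

For this 24-column block the count is 23 identities and 24 positives; summing the counts over every block of a hit would give an estimate for the displayed region.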

KAG7029175.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.7e-279 | % identity: 88.17
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+IDLFYQMVELA DIDAVAL+TAIGACGA KLLQHGRNIHHVARIHGLEFDVLVSN LLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDC SIKDARG FNRMP +D+ISWT+LIH YVK GGINE  KLFR+MNMDG LKPDPLTISSILPACGR+AAHKHGREIHGYVLKN FD NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSA KIF RMKEKDMVSWT+MI GYSLHGQGKLGV LFREM+RN RVHRDEITYTAVL ACSTASMV+EG FYFNCITEPTMAHF LKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
         RAGR DEARTFV++HKLDK++EILRALLDGCR HHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNE W+MVEKLR+TIRDMGLRPKKAYSW+EF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIH FGTGDV+HPRSQ IYW LQCLM+KMEEDGFK N DFRFHDVDEEREC LIGHSELLAISFGLISTEAGRTIRI KNLRVCH+CHESAKFIS  V R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDGRCSCEDFC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

XP_022158739.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Momordica charantia] | e value: 4.3e-278 | % identity: 87.79
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++  ADTAKWLHDKSACRNSVSWTMLAKLY+MEDKPSFAIDLFYQMVELA DIDAVAL+TAIGACG+LKLLQHGRNIH +AR HGLEFDVLVSNSLLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDCGSI+DARGFFNRMPSKDVISWTELI  YVKKGGINEGFKLFR+MNMDGGLKPDP+TISSILPACGRMAAHKHGREIHGYVLK+A D NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSA KIF RMKEKD +SWT+MILGYSLHGQGKLGV LFR MERNLR+HRDEITYT+VLHACSTAS+V+EG FYFNCI EPT +HFALKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
        ARAGRLDEAR FVE+HKLDKH EILRALLDGCR H  +KLGKRIIEQLCDLEPLNAENY+LLSNWYA N   DMVEK RE +RDMGLRPKKAYSW+EF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIHVFGTGDV+HPRSQNIYW L+CLM+KME+DG KP PDF FHDVDEERECVLIGHSELLAISFGLISTEAGRTI ITKNLRVCH+CHESAKFISKIV R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDG CSCEDFC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

XP_023539701.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like [Cucurbita pepo subsp. pepo] | e value: 2.3e-279 | % identity: 87.98
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+IDLFYQMVELA DIDAVAL+TA+GACGA KLLQHGRNIHHVARIHGLEFDVLVSN LLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDC SIKDARG FNRMP +D+ISWT+LIH YVK GGINE  KLFR+MNMDG LKPDPLTISSILPACGR+AAHKHGREIHGYVLKN FD NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSA KIF RMKEKDMVSWT+MI GYSLHGQGKLGV LFREM+RN RVHRDEITYTAVL ACSTASMV+EG FYFNCITEPTMAHF LKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
         RAGR DEARTFV++HKLDK++EILRALLDGCR HHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNE W+MVEKLR+TIRDMGLRPKKAYSW+EF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIH FGTGDV+HPRSQ IYW LQCLM+KMEEDGFK N DFRFHDVDEEREC LIGHSELLAISFGLISTEAGRTIRI+KNLRVCH+CHESAKFIS  V R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDGRCSCEDFC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

XP_038905218.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like isoform X1 [Benincasa hispida] | e value: 9.6e-278 | % identity: 88
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPS AIDLFYQMVELA DIDAVAL+TAIGACGALK+LQHGRNIH +ARIHGLEF+VLVSNSLLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDC SIKDARGFF+RMPSKDVISWTELIH+YVKKGGINE FKLFR+MN DGGLKPDPLTISSILPACGRMAAHKHG+EIHGYVLKNAFD+NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVH-RDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVAL
        MYVKSGCIQSAS+ F  MKEKDMVSWTIM LGYSLHGQGKLGV LFRE+ERNLR+H RD+ITYTAVLHAC+TA+MVDEG FYF+CITEPT+AH ALKVAL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVH-RDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVAL

Query:  LARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFH
        LARAGRLDEA TFVE++KLDKHA ILRALLDGCR HHQ+KLGK+IIE+LCDLEPLNAENY+LLSNWYA N+ WDMVEKLRET+RDMGLRPKKAYSW+EF 
Subjt:  LARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFH

Query:  NKIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH
        NKIHVFGTGDV+HPRS+NIYW LQCLM+KMEEDG KPNPDF FHDVDEERECV IGHSELLAISFGLIST+AGRTIRITKNLRVCH+CHESAKFISK+V 
Subjt:  NKIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH

Query:  REIVVKDPYVFHHFKDGRCSCEDFC
        REI+VKDPYVFHHFKDG CSCED C
Subjt:  REIVVKDPYVFHHFKDGRCSCEDFC

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CPR5 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like | e value: 4.3e-276 | % identity: 87.02
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTAKWLHDKS CRNSVSWT+LAKLYL EDKPSFAIDLFYQMVELA DID+VAL+TAIGACGALK+L HGRNIHH+ARIHGLEF++LVSNSLLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDC SIKDARGFF++MPSKDVISWTELIH+YVKKGGINE FKLFR+MNMDG LKPDPLTISSILPACGRMAAHKHG+EIHGYVLKN FD+NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSASK F  MKEKDMVSW+IM LGYSLHGQGKLGV LFREME+NL++HRDEITYTAVLHAC+TA+MVDEG FYF+ IT+PT+AH ALKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
        ARAGRLDEARTFVE+ KL+KH EILRALLDGCRNH QQKLGKRIIEQLCDLEPLN ENY+LLSNWYA N+ WDMVE+LRETIRDMGLRPKKAYSWIEF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIHVFGTGDV+HPRSQNIYW LQCLM+KMEEDG K NP+F  HDVDEERECV IGHSELLAISFGLISTEAGRTIRITKNLRVCH+CHESAKFISK+V R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDG CSCE+FC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

A0A5A7T6A7 Pentatricopeptide repeat-containing protein DOT4 | e value: 4.3e-276 | % identity: 87.02
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTAKWLHDKS CRNSVSWT+LAKLYL EDKPSFAIDLFYQMVELA DID+VAL+TAIGACGALK+L HGRNIHH+ARIHGLEF++LVSNSLLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDC SIKDARGFF++MPSKDVISWTELIH+YVKKGGINE FKLFR+MNMDG LKPDPLTISSILPACGRMAAHKHG+EIHGYVLKN FD+NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSASK F  MKEKDMVSW+IM LGYSLHGQGKLGV LFREME+NL++HRDEITYTAVLHAC+TA+MVDEG FYF+ IT+PT+AH ALKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
        ARAGRLDEARTFVE+ KL+KH EILRALLDGCRNH QQKLGKRIIEQLCDLEPLN ENY+LLSNWYA N+ WDMVE+LRETIRDMGLRPKKAYSWIEF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIHVFGTGDV+HPRSQNIYW LQCLM+KMEEDG K NP+F  HDVDEERECV IGHSELLAISFGLISTEAGRTIRITKNLRVCH+CHESAKFISK+V R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDG CSCE+FC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

A0A6J1E0A4 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like | e value: 2.1e-278 | % identity: 87.79
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++  ADTAKWLHDKSACRNSVSWTMLAKLY+MEDKPSFAIDLFYQMVELA DIDAVAL+TAIGACG+LKLLQHGRNIH +AR HGLEFDVLVSNSLLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDCGSI+DARGFFNRMPSKDVISWTELI  YVKKGGINEGFKLFR+MNMDGGLKPDP+TISSILPACGRMAAHKHGREIHGYVLK+A D NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSA KIF RMKEKD +SWT+MILGYSLHGQGKLGV LFR MERNLR+HRDEITYT+VLHACSTAS+V+EG FYFNCI EPT +HFALKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
        ARAGRLDEAR FVE+HKLDKH EILRALLDGCR H  +KLGKRIIEQLCDLEPLNAENY+LLSNWYA N   DMVEK RE +RDMGLRPKKAYSW+EF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIHVFGTGDV+HPRSQNIYW L+CLM+KME+DG KP PDF FHDVDEERECVLIGHSELLAISFGLISTEAGRTI ITKNLRVCH+CHESAKFISKIV R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDG CSCEDFC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

A0A6J1EXC6 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like | e value: 6.1e-278 | % identity: 87.4
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF+IDLFYQMVELA DIDAVAL+TAIGACGA KLLQHGRNIHHVARIHGLEFD+LVSN LLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDCGSIKDARG FNRMP +D+ISWT+LIH YVK GGINE  KLFR+MNMDG LKPDPLTISSILPACGR+ AHKHGREIHGYVLKN FD NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSA KIF RMKEKDMVSWT++I GYSLHGQGKLGV LFREM+RN  VHRDEITYTAVL ACSTASMV+EG FYFNCITEPTMAHF LKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
         RAGR +EARTFV++HKLDK+ EILRALLDGCR HHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNE W+MVEKLR+TIRDMGLRPKKAYSW+EF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIH FGTGDV+HPRSQ IYW LQCLM+KMEEDGFK N DFRFHDVDEEREC LIGHSELLAISFGLISTEAGRTIRI KNLRVCH+CHESAKFIS  V R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDGRCSCEDFC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

A0A6J1I9E1 pentatricopeptide repeat-containing protein DOT4, chloroplastic-like | e value: 6.1e-278 | % identity: 87.4
Query:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY
        ++ SADTA+WLHDKSACRN+VSWTMLAKLYLMEDKPSF++DLFYQMVELA DIDAVAL+TAIGACGA KLLQHGRNIHHVARIHGLEFDVLVSN LLKMY
Subjt:  VICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMY

Query:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD
        LDC SIKDARG FNRMP +D+ISWT+LIH YVK GGINE  KLFR+MNMDG LKPDPLTISSILPACGR+AAHKHGREIHGYVLKN FD NLIVQNALVD
Subjt:  LDCGSIKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVD

Query:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL
        MYVKSGCIQSA KIF RMKEKDMVSWT+MI GYSLHGQGKLGV LFREM+RN RVHRDEITYTAVL +CSTASMV+EG FYFNCITEPTMAHF LKVALL
Subjt:  MYVKSGCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
         RAGR DEARTFV++HKLDK++EILRALLDGCR HHQ KLGKRIIEQLCDLEPLNAENYVLLSNWYASNE W+MVEKLR+TIRDMGLRPKKAYSW+EF N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR
        KIH FGTGDV+HPRSQ IYW LQCLM+KMEEDGFK N DFRFHDVDEEREC  IGHSELLAISFGLISTEAGRTIRI+KNLRVCH+CHESAKFIS  V R
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHR

Query:  EIVVKDPYVFHHFKDGRCSCEDFC
        EI+VKDPYVFHHFKDGRCSCEDFC
Subjt:  EIVVKDPYVFHHFKDGRCSCEDFC

SwissProt top hits (e value, % identity, alignment)
Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | e value: 1.2e-102 | % identity: 37.18
Query:  RNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSIKDARGFFNRMP
        R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR+IH      GLE +  +  SL  MY  CG +  A+  F++M 
Subjt:  RNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSIKDARGFFNRMP

Query:  SKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIFLR
        S ++I W  +I  Y K G   E   +F  M ++  ++PD ++I+S + AC ++ + +  R ++ YV ++ +  ++ + +AL+DM+ K G ++ A  +F R
Subjt:  SKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIFLR

Query:  MKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE----PTMAHFALKVALLARAGRLDEARTFV
          ++D+V W+ MI+GY LHG+ +  + L+R MER   VH +++T+  +L AC+ + MV EG ++FN + +    P   H+A  + LL RAG LD+A   +
Subjt:  MKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE----PTMAHFALKVALLARAGRLDEARTFV

Query:  EEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHP
        +   +     +  ALL  C+ H   +LG+   +QL  ++P N  +YV LSN YA+   WD V ++R  +++ GL      SW+E   ++  F  GD +HP
Subjt:  EEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHP

Query:  RSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHH
        R + I  +++ +  +++E GF  N D   HD+ DEE E  L  HSE +AI++GLIST  G  +RITKNLR C NCH + K ISK+V REIVV+D   FHH
Subjt:  RSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHH

Query:  FKDGRCSCEDF
        FKDG CSC D+
Subjt:  FKDGRCSCEDF

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g23330 | e value: 3.5e-105 | % identity: 38.74
Query:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS
        D+ + + +    ++ VS+  +   Y        A+ +  +M       D+  LS+ +        +  G+ IH      G++ DV + +SL+ MY     
Subjt:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS

Query:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS
        I+D+   F+R+  +D ISW  L+  YV+ G  NE  +LFR+M +   +KP  +  SS++PAC  +A    G+++HGYVL+  F  N+ + +ALVDMY K 
Subjt:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS

Query:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALL
        G I++A KIF RM   D VSWT +I+G++LHG G   V LF EM+R   V  +++ + AVL ACS   +VDE   YFN +T+       + H+A    LL
Subjt:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
         RAG+L+EA  F+ +  ++    +   LL  C  H   +L +++ E++  ++  N   YVL+ N YASN  W  + KLR  +R  GLR K A SWIE  N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH
        K H F +GD +HP    I   L+ +ME+ME++G+  +     HDVDEE +  +L GHSE LA++FG+I+TE G TIR+TKN+R+C +CH + KFISKI  
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH

Query:  REIVVKDPYVFHHFKDGRCSCEDF
        REI+V+D   FHHF  G CSC D+
Subjt:  REIVVKDPYVFHHFKDGRCSCEDF

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 | e value: 5.6e-95 | % identity: 35.22
Query:  MVICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMV-ELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLK
        M +   D A    ++ A R+ V+W  +   +        A+D+F +M+ +     D   L++ + AC  L+ L  G+ IH      G +   +V N+L+ 
Subjt:  MVICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMV-ELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLK

Query:  MYLDCGSIKDAR--------------GF-------------------FNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILP
        MY  CG ++ AR              GF                   F  +  +DV++WT +I  Y + G   E   LFR M + GG +P+  T++++L 
Subjt:  MYLDCGSIKDAR--------------GF-------------------FNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILP

Query:  ACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIF-LRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTA
            +A+  HG++IHG  +K+    ++ V NAL+ MY K+G I SAS+ F L   E+D VSWT MI+  + HG  +  + LF  M     +  D ITY  
Subjt:  ACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIF-LRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTA

Query:  VLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALLARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENY
        V  AC+ A +V++G  YF+ + +     PT++H+A  V L  RAG L EA+ F+E+  ++       +LL  CR H    LGK   E+L  LEP N+  Y
Subjt:  VLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALLARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENY

Query:  VLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSE
          L+N Y++   W+   K+R++++D  ++ ++ +SWIE  +K+HVFG  D  HP    IY  ++ + +++++ G+ P+     HD++EE +E +L  HSE
Subjt:  VLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSE

Query:  LLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHHFKDGRCSCEDF
         LAI+FGLIST    T+RI KNLRVC++CH + KFISK+V REI+V+D   FHHFKDG CSC D+
Subjt:  LLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHHFKDGRCSCEDF

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | e value: 5.2e-117 | % identity: 40.84
Query:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS
        D+AK +  + + R+ VS+T +   Y  E     A+ LF +M E     D   ++  +  C   +LL  G+ +H   + + L FD+ VSN+L+ MY  CGS
Subjt:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS

Query:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS
        +++A   F+ M  KD+ISW  +I  Y K    NE   LF  +  +    PD  T++ +LPAC  ++A   GREIHGY+++N +  +  V N+LVDMY K 
Subjt:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS

Query:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFN-----CITEPTMAHFALKVALL
        G +  A  +F  +  KD+VSWT+MI GY +HG GK  + LF +M R   +  DEI++ ++L+ACS + +VDEG  +FN     C  EPT+ H+A  V +L
Subjt:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFN-----CITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
        AR G L +A  F+E   +   A I  ALL GCR HH  KL +++ E++ +LEP N   YVL++N YA  E W+ V++LR+ I   GLR     SWIE   
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH
        ++++F  GD ++P ++NI   L+ +  +M E+G+ P   +   D +E E+E  L GHSE LA++ G+IS+  G+ IR+TKNLRVC +CHE AKF+SK+  
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH

Query:  REIVVKDPYVFHHFKDGRCSCEDF
        REIV++D   FH FKDG CSC  F
Subjt:  REIVVKDPYVFHHFKDGRCSCEDF

Q9SS60 Pentatricopeptide repeat-containing protein At3g03580 | e value: 7.3e-95 | % identity: 34.03
Query:  TAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSI
        TA+ + +   C+++VSW  +   Y+       A+ LF  M+ +    D +     I     L  L+ G+ +H      G+  D+ VSN+L+ MY  CG +
Subjt:  TAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSI

Query:  KDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSG
         D+   F+ M + D ++W  +I   V+ G    G ++  +M     + PD  T    LP C  +AA + G+EIH  +L+  ++  L + NAL++MY K G
Subjt:  KDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSG

Query:  CIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCI-----TEPTMAHFALKVALLA
        C++++S++F RM  +D+V+WT MI  Y ++G+G+  +  F +ME++  +  D + + A+++ACS + +VDEG   F  +      +P + H+A  V LL+
Subjt:  CIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCI-----TEPTMAHFALKVALLA

Query:  RAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNK
        R+ ++ +A  F++   +   A I  ++L  CR     +  +R+  ++ +L P +    +L SN YA+   WD V  +R++++D  +     YSWIE    
Subjt:  RAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNK

Query:  IHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERE--CVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH
        +HVF +GD + P+S+ IY  L+ L   M ++G+ P+P     +++EE E   ++ GHSE LAI+FGL++TE G  +++ KNLRVC +CHE  K ISKIV 
Subjt:  IHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERE--CVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH

Query:  REIVVKDPYVFHHFKDGRCSCED
        REI+V+D   FH FKDG CSC+D
Subjt:  REIVVKDPYVFHHFKDGRCSCED

Arabidopsis top hits (e value, % identity, alignment)
AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein | e value: 4.0e-96 | % identity: 35.22
Query:  MVICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMV-ELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLK
        M +   D A    ++ A R+ V+W  +   +        A+D+F +M+ +     D   L++ + AC  L+ L  G+ IH      G +   +V N+L+ 
Subjt:  MVICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMV-ELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLK

Query:  MYLDCGSIKDAR--------------GF-------------------FNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILP
        MY  CG ++ AR              GF                   F  +  +DV++WT +I  Y + G   E   LFR M + GG +P+  T++++L 
Subjt:  MYLDCGSIKDAR--------------GF-------------------FNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILP

Query:  ACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIF-LRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTA
            +A+  HG++IHG  +K+    ++ V NAL+ MY K+G I SAS+ F L   E+D VSWT MI+  + HG  +  + LF  M     +  D ITY  
Subjt:  ACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIF-LRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTA

Query:  VLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALLARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENY
        V  AC+ A +V++G  YF+ + +     PT++H+A  V L  RAG L EA+ F+E+  ++       +LL  CR H    LGK   E+L  LEP N+  Y
Subjt:  VLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALLARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENY

Query:  VLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSE
          L+N Y++   W+   K+R++++D  ++ ++ +SWIE  +K+HVFG  D  HP    IY  ++ + +++++ G+ P+     HD++EE +E +L  HSE
Subjt:  VLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSE

Query:  LLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHHFKDGRCSCEDF
         LAI+FGLIST    T+RI KNLRVC++CH + KFISK+V REI+V+D   FHHFKDG CSC D+
Subjt:  LLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHHFKDGRCSCEDF

AT3G03580.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 5.2e-96 | % identity: 34.03
Query:  TAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSI
        TA+ + +   C+++VSW  +   Y+       A+ LF  M+ +    D +     I     L  L+ G+ +H      G+  D+ VSN+L+ MY  CG +
Subjt:  TAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSI

Query:  KDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSG
         D+   F+ M + D ++W  +I   V+ G    G ++  +M     + PD  T    LP C  +AA + G+EIH  +L+  ++  L + NAL++MY K G
Subjt:  KDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSG

Query:  CIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCI-----TEPTMAHFALKVALLA
        C++++S++F RM  +D+V+WT MI  Y ++G+G+  +  F +ME++  +  D + + A+++ACS + +VDEG   F  +      +P + H+A  V LL+
Subjt:  CIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCI-----TEPTMAHFALKVALLA

Query:  RAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNK
        R+ ++ +A  F++   +   A I  ++L  CR     +  +R+  ++ +L P +    +L SN YA+   WD V  +R++++D  +     YSWIE    
Subjt:  RAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNK

Query:  IHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERE--CVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH
        +HVF +GD + P+S+ IY  L+ L   M ++G+ P+P     +++EE E   ++ GHSE LAI+FGL++TE G  +++ KNLRVC +CHE  K ISKIV 
Subjt:  IHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEERE--CVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH

Query:  REIVVKDPYVFHHFKDGRCSCED
        REI+V+D   FH FKDG CSC+D
Subjt:  REIVVKDPYVFHHFKDGRCSCED

AT3G12770.1 mitochondrial editing factor 22 | e value: 8.8e-104 | %identity: 37.18
Query:  RNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSIKDARGFFNRMP
        R  VSWT +   Y    +P  A+++F QM ++    D VAL + + A   L+ L+ GR+IH      GLE +  +  SL  MY  CG +  A+  F++M 
Subjt:  RNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSIKDARGFFNRMP

Query:  SKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIFLR
        S ++I W  +I  Y K G   E   +F  M ++  ++PD ++I+S + AC ++ + +  R ++ YV ++ +  ++ + +AL+DM+ K G ++ A  +F R
Subjt:  SKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIFLR

Query:  MKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE----PTMAHFALKVALLARAGRLDEARTFV
          ++D+V W+ MI+GY LHG+ +  + L+R MER   VH +++T+  +L AC+ + MV EG ++FN + +    P   H+A  + LL RAG LD+A   +
Subjt:  MKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE----PTMAHFALKVALLARAGRLDEARTFV

Query:  EEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHP
        +   +     +  ALL  C+ H   +LG+   +QL  ++P N  +YV LSN YA+   WD V ++R  +++ GL      SW+E   ++  F  GD +HP
Subjt:  EEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHP

Query:  RSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHH
        R + I  +++ +  +++E GF  N D   HD+ DEE E  L  HSE +AI++GLIST  G  +RITKNLR C NCH + K ISK+V REIVV+D   FHH
Subjt:  RSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDV-DEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHH

Query:  FKDGRCSCEDF
        FKDG CSC D+
Subjt:  FKDGRCSCEDF

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.5e-106 | %identity: 38.74
Query:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS
        D+ + + +    ++ VS+  +   Y        A+ +  +M       D+  LS+ +        +  G+ IH      G++ DV + +SL+ MY     
Subjt:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS

Query:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS
        I+D+   F+R+  +D ISW  L+  YV+ G  NE  +LFR+M +   +KP  +  SS++PAC  +A    G+++HGYVL+  F  N+ + +ALVDMY K 
Subjt:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS

Query:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALL
        G I++A KIF RM   D VSWT +I+G++LHG G   V LF EM+R   V  +++ + AVL ACS   +VDE   YFN +T+       + H+A    LL
Subjt:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITE-----PTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
         RAG+L+EA  F+ +  ++    +   LL  C  H   +L +++ E++  ++  N   YVL+ N YASN  W  + KLR  +R  GLR K A SWIE  N
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH
        K H F +GD +HP    I   L+ +ME+ME++G+  +     HDVDEE +  +L GHSE LA++FG+I+TE G TIR+TKN+R+C +CH + KFISKI  
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDEE-RECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH

Query:  REIVVKDPYVFHHFKDGRCSCEDF
        REI+V+D   FHHF  G CSC D+
Subjt:  REIVVKDPYVFHHFKDGRCSCEDF

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 3.7e-118 | %identity: 40.84
Query:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS
        D+AK +  + + R+ VS+T +   Y  E     A+ LF +M E     D   ++  +  C   +LL  G+ +H   + + L FD+ VSN+L+ MY  CGS
Subjt:  DTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGS

Query:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS
        +++A   F+ M  KD+ISW  +I  Y K    NE   LF  +  +    PD  T++ +LPAC  ++A   GREIHGY+++N +  +  V N+LVDMY K 
Subjt:  IKDARGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKS

Query:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFN-----CITEPTMAHFALKVALL
        G +  A  +F  +  KD+VSWT+MI GY +HG GK  + LF +M R   +  DEI++ ++L+ACS + +VDEG  +FN     C  EPT+ H+A  V +L
Subjt:  GCIQSASKIFLRMKEKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFN-----CITEPTMAHFALKVALL

Query:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN
        AR G L +A  F+E   +   A I  ALL GCR HH  KL +++ E++ +LEP N   YVL++N YA  E W+ V++LR+ I   GLR     SWIE   
Subjt:  ARAGRLDEARTFVEEHKLDKHAEILRALLDGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHN

Query:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH
        ++++F  GD ++P ++NI   L+ +  +M E+G+ P   +   D +E E+E  L GHSE LA++ G+IS+  G+ IR+TKNLRVC +CHE AKF+SK+  
Subjt:  KIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPDFRFHDVDE-ERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVH

Query:  REIVVKDPYVFHHFKDGRCSCEDF
        REIV++D   FH FKDG CSC  F
Subjt:  REIVVKDPYVFHHFKDGRCSCEDF
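
The alignment rows above carry a midline between the Query and Subjt rows: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the identity of a single row can be counted from that midline. The helper name `midline_stats` is hypothetical, and the sketch assumes the leading "Query:" label and its padding have already been stripped from both strings. It scores one row only, so it will not reproduce the whole-alignment %identity reported for each hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives for one BLAST-style alignment row.

    Assumes `query` and `midline` start at the same column, i.e. the
    "Query:" label and its padding have already been removed.
    """
    # Trailing spaces of the midline are often lost in text dumps; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")

    identities = sum(1 for ch in midline if ch.isalpha())  # letters = identical residues
    positives = identities + midline.count("+")            # '+' = conservative substitution
    length = len(query)                                     # gap columns count toward length
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length,
    }


# Last row of the AT4G18750.1 alignment, copied from this record.
stats = midline_stats(
    "REIVVKDPYVFHHFKDGRCSCEDF",
    "REIV++D   FH FKDG CSC  F",
)
print(stats)
```

Summing `identities` and `length` over every row of a hit before dividing would give a figure comparable to the per-hit %identity column.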


Sequences
CDS sequence
ATGGTGATCTGCAGTGCTGACACTGCAAAGTGGCTCCATGACAAGTCGGCTTGCAGAAACTCTGTTTCTTGGACCATGCTAGCCAAGCTGTACTTGATGGAAGATAAACC
CAGTTTTGCCATAGACTTGTTTTACCAAATGGTGGAGTTGGCGACTGATATTGATGCAGTGGCATTGTCCACGGCTATTGGTGCATGTGGTGCACTCAAATTGCTGCAAC
ACGGAAGAAATATCCACCATGTCGCCAGAATTCATGGTTTGGAATTTGATGTCTTGGTCAGTAATTCCCTGTTGAAAATGTACCTTGACTGTGGTAGTATCAAAGATGCT
CGGGGGTTCTTCAATCGAATGCCGTCCAAAGATGTCATTTCGTGGACAGAACTCATCCATGTGTATGTTAAGAAAGGTGGAATCAATGAGGGCTTTAAGCTGTTTCGACG
GATGAATATGGATGGAGGATTGAAGCCTGATCCTCTTACAATCAGCAGCATTCTCCCAGCCTGTGGAAGAATGGCTGCTCATAAGCATGGAAGAGAGATTCATGGATACG
TGCTTAAAAATGCTTTTGATCAGAATCTCATTGTCCAAAACGCTTTGGTCGACATGTATGTCAAATCTGGATGTATCCAATCTGCATCGAAAATTTTTTTGAGGATGAAG
GAGAAAGATATGGTTTCGTGGACGATCATGATCTTGGGCTACAGCTTACATGGCCAAGGAAAACTTGGAGTCCGTTTGTTCCGTGAGATGGAGAGGAACTTGAGGGTGCA
TAGAGATGAGATCACTTATACTGCAGTTTTGCATGCTTGTAGTACTGCAAGCATGGTAGATGAAGGGGGTTTTTACTTCAATTGCATTACCGAACCAACCATGGCACACT
TTGCTCTAAAGGTGGCACTTTTAGCCCGAGCAGGACGACTGGATGAAGCAAGGACATTTGTTGAAGAACATAAACTTGACAAACATGCTGAGATTTTGAGAGCACTGCTT
GATGGATGCAGGAACCACCATCAACAAAAATTAGGCAAGCGAATCATTGAGCAGCTGTGTGATTTAGAACCTCTAAATGCTGAGAATTACGTTCTACTTTCGAACTGGTA
TGCCAGCAACGAAAACTGGGACATGGTCGAAAAGTTGAGAGAAACAATTAGAGACATGGGATTAAGACCAAAGAAGGCTTACAGTTGGATTGAGTTCCACAACAAAATTC
ATGTGTTTGGGACAGGGGATGTAGCCCACCCGAGATCACAGAACATATATTGGAAATTACAGTGCTTGATGGAGAAAATGGAAGAAGATGGTTTCAAACCGAATCCCGAT
TTCAGATTCCACGATGTCGATGAGGAGCGAGAGTGTGTTCTAATAGGACACAGTGAGCTCTTGGCAATTTCATTCGGGCTTATTAGTACAGAAGCAGGAAGGACAATTCG
TATTACAAAGAACCTTCGTGTATGCCATAATTGTCATGAATCTGCAAAGTTCATATCCAAAATTGTTCATCGAGAAATCGTAGTAAAAGATCCTTATGTTTTCCATCATT
TCAAGGATGGTCGTTGTTCTTGTGAAGATTTTTGTTAA
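
A minimal sketch of how the CDS above relates to the protein sequence in this record, assuming the standard genetic code (NCBI translation table 1) applies. The function name `translate` and the compact codon-table construction are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped listing
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS listed above.
print(translate("ATGGTGATCTGCAGTGCTGAC"))
print(translate("TGTGAAGATTTTTGTTAA"))
```

If the assumption holds, passing the full CDS (with its line breaks) to `translate` should reproduce the protein sequence given in this record.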
mRNA sequence
ATTCTCTTCCGGTGCCCCCATCTGCCCATTCATCACATCGTGACAGACGCCCAATCTTACACTTCTGGGAAGCAACGACAGAAGCAAGAAATGGCCTCCTTAGAATGCTG
AAGAAATGAATTAGCTCAGTATCCTATCCTCTCTCTTTCAGTCTTTCTTGATTTCGCTTCCTTTCCTACAATTTTTTGCAACATGATACATTTATATAATGACTGACAAT
GTAAGGCTGAAATTCTCCTTTTTGCTTTTCAATGGCTATTTCCTGAAATTCCACAATGGATCTCCTCCTATTCACCCACGTTCATCGTCTTCCCCTTACTCAAAAACCCA
ATTACACATACCATCGCCACCGACTATTTAATAATCCCCCTCATGCTCGTACGACAACTGTAGAGAATTATGCTACTTTATGCGTAGCCCACCAACTGTTCGACGAAATT
CCTATATGGGATACTTTTGCTTGGAACAATCTGATTCAAACCCATCTCACCAATGGAGATGTGGGGCACGTAATTTCTACGTATCAACAGATGTTGTTTCGAGGGGTTCG
CCCTGACAAACACACTCTTCCTCGAATTATATGCGCTTCCCGTCAGAATGGTGATCTGCAGTGCTGACACTGCAAAGTGGCTCCATGACAAGTCGGCTTGCAGAAACTCT
GTTTCTTGGACCATGCTAGCCAAGCTGTACTTGATGGAAGATAAACCCAGTTTTGCCATAGACTTGTTTTACCAAATGGTGGAGTTGGCGACTGATATTGATGCAGTGGC
ATTGTCCACGGCTATTGGTGCATGTGGTGCACTCAAATTGCTGCAACACGGAAGAAATATCCACCATGTCGCCAGAATTCATGGTTTGGAATTTGATGTCTTGGTCAGTA
ATTCCCTGTTGAAAATGTACCTTGACTGTGGTAGTATCAAAGATGCTCGGGGGTTCTTCAATCGAATGCCGTCCAAAGATGTCATTTCGTGGACAGAACTCATCCATGTG
TATGTTAAGAAAGGTGGAATCAATGAGGGCTTTAAGCTGTTTCGACGGATGAATATGGATGGAGGATTGAAGCCTGATCCTCTTACAATCAGCAGCATTCTCCCAGCCTG
TGGAAGAATGGCTGCTCATAAGCATGGAAGAGAGATTCATGGATACGTGCTTAAAAATGCTTTTGATCAGAATCTCATTGTCCAAAACGCTTTGGTCGACATGTATGTCA
AATCTGGATGTATCCAATCTGCATCGAAAATTTTTTTGAGGATGAAGGAGAAAGATATGGTTTCGTGGACGATCATGATCTTGGGCTACAGCTTACATGGCCAAGGAAAA
CTTGGAGTCCGTTTGTTCCGTGAGATGGAGAGGAACTTGAGGGTGCATAGAGATGAGATCACTTATACTGCAGTTTTGCATGCTTGTAGTACTGCAAGCATGGTAGATGA
AGGGGGTTTTTACTTCAATTGCATTACCGAACCAACCATGGCACACTTTGCTCTAAAGGTGGCACTTTTAGCCCGAGCAGGACGACTGGATGAAGCAAGGACATTTGTTG
AAGAACATAAACTTGACAAACATGCTGAGATTTTGAGAGCACTGCTTGATGGATGCAGGAACCACCATCAACAAAAATTAGGCAAGCGAATCATTGAGCAGCTGTGTGAT
TTAGAACCTCTAAATGCTGAGAATTACGTTCTACTTTCGAACTGGTATGCCAGCAACGAAAACTGGGACATGGTCGAAAAGTTGAGAGAAACAATTAGAGACATGGGATT
AAGACCAAAGAAGGCTTACAGTTGGATTGAGTTCCACAACAAAATTCATGTGTTTGGGACAGGGGATGTAGCCCACCCGAGATCACAGAACATATATTGGAAATTACAGT
GCTTGATGGAGAAAATGGAAGAAGATGGTTTCAAACCGAATCCCGATTTCAGATTCCACGATGTCGATGAGGAGCGAGAGTGTGTTCTAATAGGACACAGTGAGCTCTTG
GCAATTTCATTCGGGCTTATTAGTACAGAAGCAGGAAGGACAATTCGTATTACAAAGAACCTTCGTGTATGCCATAATTGTCATGAATCTGCAAAGTTCATATCCAAAAT
TGTTCATCGAGAAATCGTAGTAAAAGATCCTTATGTTTTCCATCATTTCAAGGATGGTCGTTGTTCTTGTGAAGATTTTTGTTAACCTTATCTGTTCTGTTCGTTGTTTT
AGATTATAGAGATCCCCATTGCTCCCAATTCGACTGCTTGTTACTCTTTCTTCTATAACATGACAAGGGCAATAAGATTTTGATAGCAAAGATGGC
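
The mRNA sequence contains the CDS flanked by untranslated regions. As a hedged sketch, the UTR lengths can be found by locating the CDS string inside the mRNA string. `locate_cds` is a hypothetical helper, and the example joins short fragments copied from this record into a toy transcript rather than using the full sequences.

```python
def locate_cds(mrna: str, cds: str) -> dict:
    """Return 5' UTR, CDS and 3' UTR lengths for a CDS embedded in an mRNA."""
    mrna = "".join(mrna.split()).upper()  # remove line breaks from wrapped listings
    cds = "".join(cds.split()).upper()
    start = mrna.find(cds)                # first exact occurrence, 0-based
    if start == -1:
        raise ValueError("CDS not found in mRNA (spliced or edited sequence?)")
    end = start + len(cds)
    return {
        "five_prime_utr": start,
        "cds": len(cds),
        "three_prime_utr": len(mrna) - end,
    }


# Toy transcript: bases just upstream of the start codon, the first CDS codons,
# the stop codon, and a few bases downstream of it.
toy_cds = "ATGGTGATCTGCAGTGCTGAC" + "TAA"
toy_mrna = "CCCGTCAGA" + toy_cds + "CCTTATC"
print(locate_cds(toy_mrna, toy_cds))
```

An exact string search is enough here because the mRNA and CDS are both given as spliced sequences; it would not work against the unspliced genomic region.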
Protein sequence
MVICSADTAKWLHDKSACRNSVSWTMLAKLYLMEDKPSFAIDLFYQMVELATDIDAVALSTAIGACGALKLLQHGRNIHHVARIHGLEFDVLVSNSLLKMYLDCGSIKDA
RGFFNRMPSKDVISWTELIHVYVKKGGINEGFKLFRRMNMDGGLKPDPLTISSILPACGRMAAHKHGREIHGYVLKNAFDQNLIVQNALVDMYVKSGCIQSASKIFLRMK
EKDMVSWTIMILGYSLHGQGKLGVRLFREMERNLRVHRDEITYTAVLHACSTASMVDEGGFYFNCITEPTMAHFALKVALLARAGRLDEARTFVEEHKLDKHAEILRALL
DGCRNHHQQKLGKRIIEQLCDLEPLNAENYVLLSNWYASNENWDMVEKLRETIRDMGLRPKKAYSWIEFHNKIHVFGTGDVAHPRSQNIYWKLQCLMEKMEEDGFKPNPD
FRFHDVDEERECVLIGHSELLAISFGLISTEAGRTIRITKNLRVCHNCHESAKFISKIVHREIVVKDPYVFHHFKDGRCSCEDFC