; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021994 (gene) of Snake gourd v1 genome

Gene IDTan0021994
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG03:72783363..72787412
RNA-Seq ExpressionTan0021994
SyntenyTan0021994
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008440398.1 PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucumis melo]0.0e+0085.43Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+ +NAPIPSPKFEPDT+KIKR LLQKGVYPTP+IVRSLRKKEIQK+NRKLKRL +RQ+DQ+PPLSESQKQLI EET FLTLRSEYKEFSKAIEAKPA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+N  ELT  RTGYNRD+LKKESLRELRKLFE RKLEEL+W LDDDVELK+EWL+SENG    VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLGMARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLKVIECMRQQPSKK+RNK RKSWDPAVEP LVIYN ILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMK LSHMKPLVVTFTGMILSSF+GGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+K D SS  S VPS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL TQ++YEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC NNLKQL DAL +C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATVSNLSRSLQSLCK GI E+TSQS+ACD +ATDGL+LP S+NMEN+KLHPD       +SLD  PV  ASLN+KV S+S M+PWS S+S+GVLGT +
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN
        F D S NE STFD  D SEDDE ELN+LLD F DSYDSNLP+ NEIL+TWKE+RK DGLFLHPLN
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN

XP_022950896.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita moschata]0.0e+0086.1Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+  NA +PSPKFEPD EKIKRTLLQKGV+PTPKIVRSL KKEIQKHNRKLKRL +RQ  Q+PPLSESQKQLI EET+FLTLRSEYKEFSKAIEA+PA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+NLKELT  RT YNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWL SENGH   VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLG ARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLK+IE MRQQPSKK+RN  RK WDPAVEP LVIYNAILNAC+PTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMK LSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRK DHSSQ S V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL TQD+YEQAVTLV+ M YAPFQVSERQWTELFE NTDRICWNNLK+LSDALS+C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATV NLS SLQSLCK GIPEN SQS+A DHD TDGLQLPG ENM+N+KLH D  T  C  SLD  PV+ ASLN     DS+M PWSLSLS+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP
        F DRS NE STFDL D SEDDEEEL++LLD F DSYDSNLPSV+EILKTWKE+RK DGL+LHP
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP

XP_023543129.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0086.21Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+  NA +PSPKFEPD EKIKRTLLQKGV+PTPKIVRSL KKEIQKHNRKLKRL +RQ  Q+PPLSESQKQLI+EET FLTLRSEYKEFSKAIEA+PA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+NLKELT  RT YNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELK+EWL SENG    VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLG ARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLK+IECMRQQPSKK+RN  RK WDPAVEP LVIYNAILNACIPTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMK LSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRK DHSSQ S V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTE+ILQL TQD+YEQAVTLV+ M YAPFQVSERQWTE+FE NTDRICWNNLK+LSDALS+C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATVSNLSRSLQ LCK GIPENTSQS+  DHD TDGLQLPGSENMEN+KLH D  T  C  SLD  PV+ ASLN     DSKM PWSLSLS+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP
        F DRS NE STFDL D SEDDEEEL++LLD F D YDSNLPSV+EILKTWKE+RK DGL+LHP
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP

XP_038881784.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X1 [Benincasa hispida]0.0e+0087.51Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+  N+PIPSPKFEPD EKIKRTL+ KGV+PTP+IVRSLRKKEIQK+NRKLKRL +RQADQ+PPLSESQKQLI EET FLTLRSEYKEFSKAIEAKPA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+NLKELT  RTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELK+EWLESEN H   ++RRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMR--------GDGQIYPD
        MRDWKFSRMMIRSGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLGM+RKPQEALQIF+ MR        GDGQIYPD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMR--------GDGQIYPD

Query:  MAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSG
        MAAYHSIAVTLGQAG LKQLLKV+ECMRQQPS+K+RNK RKSWDPAVEP LV+YNAILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSG
Subjt:  MAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSG

Query:  KYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMIL
        KYE +HKLFTK+KKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRW+DALVEVEKMK LSHMKPLVVTFTGMIL
Subjt:  KYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMIL

Query:  SSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMAL
        SSFDGGHIDDCISIF+YMKQ CAPNIGTIN+MLKVYGRNDMF KAKDLFEEIKRK D SS  S VPS++PDEYTYGSMLEAAASALQWEYFENVYREMAL
Subjt:  SSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMAL

Query:  SGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLS
        SGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMIL L TQD+YEQAVTLV+ MGYAPFQVSERQWTELFE N DRICW NLK+L 
Subjt:  SGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLS

Query:  DALSNCNISEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLS
        DAL NC+ SEATVSNLSRSLQSLCK GIPENTSQSVACDHD TDGLQLPGSEN EN+KLHPD  T  C +SLD  PV+ ASLN+KV+SDS+++PWS S S
Subjt:  DALSNCNISEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLS

Query:  EGVLGTDKFMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN
        EGVLGTD+F DRS NELST DLCD SEDDEE LN+LLD F DSYDSNLPSVNEILKTWKE+RKTDGLFLHPLN
Subjt:  EGVLGTDKFMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN

XP_038881786.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X2 [Benincasa hispida]0.0e+0088.32Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+  N+PIPSPKFEPD EKIKRTL+ KGV+PTP+IVRSLRKKEIQK+NRKLKRL +RQADQ+PPLSESQKQLI EET FLTLRSEYKEFSKAIEAKPA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+NLKELT  RTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELK+EWLESEN H   ++RRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLGM+RKPQEALQIF+ MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLKV+ECMRQQPS+K+RNK RKSWDPAVEP LV+YNAILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +HKL
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTK+KKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRW+DALVEVEKMK LSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTIN+MLKVYGRNDMF KAKDLFEEIKRK D SS  S VPS++PDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMIL L TQD+YEQAVTLV+ MGYAPFQVSERQWTELFE N DRICW NLK+L DAL NC+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATVSNLSRSLQSLCK GIPENTSQSVACDHD TDGLQLPGSEN EN+KLHPD  T  C +SLD  PV+ ASLN+KV+SDS+++PWS S SEGVLGTD+
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN
        F DRS NELST DLCD SEDDEE LN+LLD F DSYDSNLPSVNEILKTWKE+RKTDGLFLHPLN
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN

TrEMBL top hitse value%identityAlignment
A0A0A0KFY8 Uncharacterized protein0.0e+0085.43Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+ +N PIPSPKFEPDT+KIKR LLQKGVYPTP+IVRSLRKKEIQK+NRKLKR+ +RQ+ Q+PPLSESQKQLI EET FLTLRSEYKEFSKAIEAKPA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+N KELT  RTGYNRD+LKKESLRELRKLFE RKLEE QW LDDDVELK+EWLESEN     VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLGMARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLKVIE MRQQPSKK+RNK RKSWDPAVEP LVIYNAILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMKK+G+TLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMK LSHMKPLVVTFTGMI SSF+GGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKVYGRNDM+SKAKDLFEEIKRK D SS  S VPS++PDEYTY SMLEAAAS+LQWEYFE+VYREMALSGY+LDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL TQD+YEQAVTLV+ MGYAPFQVSERQWTELFE NTDRI  NNLKQL  AL +C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATVSNLSRSLQSLCK  IPENTSQSVACDHDATD LQLP SENMEN+KLHPD       +SLD  PV  ASLN+KV S+SKM+PWS+S+S+G LGT +
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN
        F D S N  S FDLC  SEDDEEELN LLD F D+YDSNLP+VNEIL+TWKE+RK DGLFLH LN
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN

A0A1S3B127 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0085.43Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+ +NAPIPSPKFEPDT+KIKR LLQKGVYPTP+IVRSLRKKEIQK+NRKLKRL +RQ+DQ+PPLSESQKQLI EET FLTLRSEYKEFSKAIEAKPA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+N  ELT  RTGYNRD+LKKESLRELRKLFE RKLEEL+W LDDDVELK+EWL+SENG    VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLGMARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLKVIECMRQQPSKK+RNK RKSWDPAVEP LVIYN ILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMK LSHMKPLVVTFTGMILSSF+GGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+K D SS  S VPS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL TQ++YEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC NNLKQL DAL +C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATVSNLSRSLQSLCK GI E+TSQS+ACD +ATDGL+LP S+NMEN+KLHPD       +SLD  PV  ASLN+KV S+S M+PWS S+S+GVLGT +
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN
        F D S NE STFD  D SEDDE ELN+LLD F DSYDSNLP+ NEIL+TWKE+RK DGLFLHPLN
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN

A0A5D3CLI0 Pentatricopeptide repeat-containing protein0.0e+0085.43Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+ +NAPIPSPKFEPDT+KIKR LLQKGVYPTP+IVRSLRKKEIQK+NRKLKRL +RQ+DQ+PPLSESQKQLI EET FLTLRSEYKEFSKAIEAKPA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+N  ELT  RTGYNRD+LKKESLRELRKLFE RKLEEL+W LDDDVELK+EWL+SENG    VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMIRSGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLGMARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLKVIECMRQQPSKK+RNK RKSWDPAVEP LVIYN ILNACIPTLEWKGV+WVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMK LSHMKPLVVTFTGMILSSF+GGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+K D SS  S VPS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL TQ++YEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC NNLKQL DAL +C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATVSNLSRSLQSLCK GI E+TSQS+ACD +ATDGL+LP S+NMEN+KLHPD       +SLD  PV  ASLN+KV S+S M+PWS S+S+GVLGT +
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN
        F D S NE STFD  D SEDDE ELN+LLD F DSYDSNLP+ NEIL+TWKE+RK DGLFLHPLN
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN

A0A6J1GG29 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0086.1Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+  NA +PSPKFEPD EKIKRTLLQKGV+PTPKIVRSL KKEIQKHNRKLKRL +RQ  Q+PPLSESQKQLI EET+FLTLRSEYKEFSKAIEA+PA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+NLKELT  RT YNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWL SENGH   VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILDALGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLG ARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLK+IE MRQQPSKK+RN  RK WDPAVEP LVIYNAILNAC+PTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMK LSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRK DHSSQ S V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL TQD+YEQAVTLV+ M YAPFQVSERQWTELFE NTDRICWNNLK+LSDALS+C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATV NLS SLQSLCK GIPEN SQS+A DHD TDGLQLPG ENM+N+KLH D  T  C  SLD  PV+ ASLN     DS+M PWSLSLS+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP
        F DRS NE STFDL D SEDDEEEL++LLD F DSYDSNLPSV+EILKTWKE+RK DGL+LHP
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP

A0A6J1ILT7 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0085.75Show/hide
Query:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA
        MEA+  NA +PSPKFEPD EKIKRTL+QKGV+PTPKIVRSL KKEIQKHNRKLKRL +RQ  Q+PPLSESQKQLI+EET FLTLRSEYKEFSKAIEA+PA
Subjt:  MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPA

Query:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS
        GGLMVGRPWERLER+NLKE T  RT YNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELK+EWL SENGH   VKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA
        MRDWKFSRMMI+SGLQFNEGQLL+ILD+LGAKGCWKQALSVVEWVYNLKSHS+SKSRFVYTKLLAVLG ARKPQEALQIFN MRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAG LKQLLK+IE MRQQPSKK+RN  RK WDPAVEP LVIYNAILNACIPTLEWK V+WVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI
        FTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMK LSHMKPLVVTFTGMILSSFDGGHI
Subjt:  FTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHI

Query:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIF+YMKQ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRK DHSSQ S V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI
        KHA LLVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL TQD+YEQA+TLV+ M YAPFQVSERQWTELFE NTDRICWNNLK+LSDALS+C+ 
Subjt:  KHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNI

Query:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK
        SEATVSNLS SLQSLCK GIPENTSQS+A DHD TDGLQLPGSEN +N+KLH D  T  C  SLD  PV+ ASLN     DSKM PWSLSLS+GVL T K
Subjt:  SEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVSRASLNIKVESDSKMAPWSLSLSEGVLGTDK

Query:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP
        F DRS NE STFDL D SEDDEEEL++LLD F DSY SNLPSV+EILKTW+E+RK DGL+LHP
Subjt:  FMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHP

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic1.5e-1721.13Show/hide
Query:  MIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHL
        M + G+  +E     ++DA    G W+ A  +++    +++     S +V++++LA        Q+A  +   M+  G + PD   Y+ +  T G+   L
Subjt:  MIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHL

Query:  KQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE
           +     MR++                +EP +V +N +++A            +F ++R+S   P   TY + + ++ +   +E V  + ++MK+ G 
Subjt:  KQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE

Query:  TLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKY
             TY  LV  +   G    AI+ +  M+  G+  S ++Y+ L       G    AL  V+ MK    ++  ++    +I +  +   + +  S+ ++
Subjt:  TLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKY

Query:  MKQN-CAPNIGTINTMLKVYGRNDMFSKAKDLFEEI
        M++N   P++ T  T++K   R + F K   ++EE+
Subjt:  MKQN-CAPNIGTINTMLKVYGRNDMFSKAKDLFEEI

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099001.2e-1722.44Show/hide
Query:  KPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRK
        K ++A +I   + G G + PD+  Y+ +     +AG +   L V++ M                  +V P +V YN IL +   + + K    V  ++ +
Subjt:  KPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRK

Query:  SGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEV
            P+  TY + +E   +     H  KL  +M+  G T    TY VLV    +EG ++ AI+ + DM   G   +   +  +   +C  GRW DA   +
Subjt:  SGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEV

Query:  EKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQN-CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSM
          M +     P VVTF  +I      G +   I I + M Q+ C PN  + N +L  + +     +A +  E +           V     PD  TY +M
Subjt:  EKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQN-CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSM

Query:  LEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTL---VKAMGYAPFQV
        L A     + E    +  +++  G       +  ++   ++AGK        D +      P  + ++ ++  L  +   ++A+      + MG  P  V
Subjt:  LEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTL---VKAMGYAPFQV

Query:  S
        +
Subjt:  S

Q8S9D1 Pentatricopeptide repeat-containing protein At5g212223.1e-1823.65Show/hide
Query:  TKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWK
        TKL+  L    +PQEA  IFN +  +G   P +  Y ++   L +  H   LL +I       SK  +N         ++P  +++NAI+NA   +    
Subjt:  TKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWK

Query:  GVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKAN--TYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCL
            +F ++++SG KP  +T+   ++   K GK E   +L   M +  E L+ N  T  +LV+A+  +  +  A   V  M+  GV      +  LA   
Subjt:  GVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKAN--TYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCL

Query:  CYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQ-NCAPNIGTINTMLKVY-GRNDM--FSKAKDLFEEIKRKTDHSSQI
           G    A   +    + + +KP V T   ++    + G +++ +  F  MK+    PN+   N+++K +   NDM    +  DL EE           
Subjt:  CYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQ-NCAPNIGTINTMLKVY-GRNDM--FSKAKDLFEEIKRKTDHSSQI

Query:  SVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAV
             + PD  T+ +++ A +S    +  E +Y +M   G   D    + L    +RAG+    +   + + + G  P+ +++T++I    +    ++A+
Subjt:  SVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAV

Query:  TLVKAM
         + K M
Subjt:  TLVKAM

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic7.2e-24159.44Show/hide
Query:  PKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE
        P+FEPD EKIKR LL+ GV PTPKI+ +LRKKEIQKHNR+ KR  + +A+     +E+QKQ ++EE RF TLR EYK+F+++I  K  G  GLMVG PWE
Subjt:  PKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE

Query:  RLERLNLKELTDS--RTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEISMRDWKFSR
         +ER+ LKEL     R   +   LKKE+L+EL+K+ E    ++L+WVLDDDV++++  L+ E       KR R +GE +R LVDRLS  EI+ + WKF R
Subjt:  RLERLNLKELTDS--RTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEISMRDWKFSR

Query:  MMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGH
        MM +SGLQF E Q+L+I+D LG K  WKQA +VV WVY+ K   + +SRFVYTKLL+VLG AR+PQEALQIFN M GD Q+YPDMAAYH IAVTLGQAG 
Subjt:  MMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGH

Query:  LKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EP LV+YNAILNAC+PTL+WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ VH  F KMK SG
Subjt:  LKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG

Query:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFK
        E  KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL +TFTG+I +S +GGH+DDC++IF+
Subjt:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFK

Query:  YMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLV
        YMK  C PNIGT N MLKVYGRNDMFS+AK+LFEEI  RK  H         ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGY++DQ+KHA +L+
Subjt:  YMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLV

Query:  EASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCN-ISEATVS
        EASRAGKW LL+HAFD++LE G+IPHPL FTE++     +  +++A+TL+  +  A FQ+SE +WT+LFE + D +  +NL +LSD L  C+ +SE TVS
Subjt:  EASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCN-ISEATVS

Query:  NLSRSLQSLC
        NLS+SL+S C
Subjt:  NLSRSLQSLC

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic2.5e-10038.63Show/hide
Query:  IRFLVDRLSSMEISMRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGD
        I  L   L+  +I+M +W+FS+ +  + +++ +  ++R++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R+P EAL +F+ M   
Subjt:  IRFLVDRLSSMEISMRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSME
           YPDM AY SIAVTLGQAGH+K+L  VI+ MR  P KK +    + WDP +EP +V+YNA+LNAC+   +W+G FWV  QL++ G KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKMLSHM-
        VML   KY  VH+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L     V    +K++ ++ 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKMLSHM-

Query:  ---------------------KPLVVTFTGMILSSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPS
                             KPLVVT+TG+I +  D G+I +   IF  MK+ C+PN+ T N MLK Y +  +F +A++LF+++    +H    S   S
Subjt:  ---------------------KPLVVTFTGMILSSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPS

Query:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVT
         ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H R+++EASRAGK  +++  ++ +  + +IP   L  E   + + +  +  A++
Subjt:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVT

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein8.4e-1922.44Show/hide
Query:  KPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRK
        K ++A +I   + G G + PD+  Y+ +     +AG +   L V++ M                  +V P +V YN IL +   + + K    V  ++ +
Subjt:  KPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRK

Query:  SGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEV
            P+  TY + +E   +     H  KL  +M+  G T    TY VLV    +EG ++ AI+ + DM   G   +   +  +   +C  GRW DA   +
Subjt:  SGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEV

Query:  EKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQN-CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSM
          M +     P VVTF  +I      G +   I I + M Q+ C PN  + N +L  + +     +A +  E +           V     PD  TY +M
Subjt:  EKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQN-CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPSIIPDEYTYGSM

Query:  LEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTL---VKAMGYAPFQV
        L A     + E    +  +++  G       +  ++   ++AGK        D +      P  + ++ ++  L  +   ++A+      + MG  P  V
Subjt:  LEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTL---VKAMGYAPFQV

Query:  S
        +
Subjt:  S

AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein1.8e-10138.63Show/hide
Query:  IRFLVDRLSSMEISMRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGD
        I  L   L+  +I+M +W+FS+ +  + +++ +  ++R++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R+P EAL +F+ M   
Subjt:  IRFLVDRLSSMEISMRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSME
           YPDM AY SIAVTLGQAGH+K+L  VI+ MR  P KK +    + WDP +EP +V+YNA+LNAC+   +W+G FWV  QL++ G KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKMLSHM-
        VML   KY  VH+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L     V    +K++ ++ 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKMLSHM-

Query:  ---------------------KPLVVTFTGMILSSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPS
                             KPLVVT+TG+I +  D G+I +   IF  MK+ C+PN+ T N MLK Y +  +F +A++LF+++    +H    S   S
Subjt:  ---------------------KPLVVTFTGMILSSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPS

Query:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVT
         ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H R+++EASRAGK  +++  ++ +  + +IP   L  E   + + +  +  A++
Subjt:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVT

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein1.3e-10440.64Show/hide
Query:  IRFLVDRLSSMEISMRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGD
        I  L   L+  +I+M +W+FS+ +  + +++ +  ++R++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R+P EAL +F+ M   
Subjt:  IRFLVDRLSSMEISMRDWKFSRMMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSME
           YPDM AY SIAVTLGQAGH+K+L  VI+ MR  P KK +    + WDP +EP +V+YNA+LNAC+   +W+G FWV  QL++ G KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVT
        VML   KY  VH+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L  ++K+  +++ KPLVVT
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPS-IIPDEYTYGSMLEAAASALQWEYFEN
        +TG+I +  D G+I +   IF  MK+ C+PN+ T N MLK Y +  +F +A++LF+++    +H    S   S ++PD YT+ +ML+  A   +W+ F  
Subjt:  FTGMILSSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDHSSQISVVPS-IIPDEYTYGSMLEAAASALQWEYFEN

Query:  VYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVT
         YREM   GY  +  +H R+++EASRAGK  +++  ++ +  + +IP   L  E   + + +  +  A++
Subjt:  VYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVT

AT5G21222.1 protein kinase family protein2.2e-1923.65Show/hide
Query:  TKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWK
        TKL+  L    +PQEA  IFN +  +G   P +  Y ++   L +  H   LL +I       SK  +N         ++P  +++NAI+NA   +    
Subjt:  TKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWK

Query:  GVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKAN--TYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCL
            +F ++++SG KP  +T+   ++   K GK E   +L   M +  E L+ N  T  +LV+A+  +  +  A   V  M+  GV      +  LA   
Subjt:  GVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKAN--TYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCL

Query:  CYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQ-NCAPNIGTINTMLKVY-GRNDM--FSKAKDLFEEIKRKTDHSSQI
           G    A   +    + + +KP V T   ++    + G +++ +  F  MK+    PN+   N+++K +   NDM    +  DL EE           
Subjt:  CYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQ-NCAPNIGTINTMLKVY-GRNDM--FSKAKDLFEEIKRKTDHSSQI

Query:  SVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAV
             + PD  T+ +++ A +S    +  E +Y +M   G   D    + L    +RAG+    +   + + + G  P+ +++T++I    +    ++A+
Subjt:  SVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAV

Query:  TLVKAM
         + K M
Subjt:  TLVKAM

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.1e-24259.44Show/hide
Query:  PKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE
        P+FEPD EKIKR LL+ GV PTPKI+ +LRKKEIQKHNR+ KR  + +A+     +E+QKQ ++EE RF TLR EYK+F+++I  K  G  GLMVG PWE
Subjt:  PKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPAG--GLMVGRPWE

Query:  RLERLNLKELTDS--RTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEISMRDWKFSR
         +ER+ LKEL     R   +   LKKE+L+EL+K+ E    ++L+WVLDDDV++++  L+ E       KR R +GE +R LVDRLS  EI+ + WKF R
Subjt:  RLERLNLKELTDS--RTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEISMRDWKFSR

Query:  MMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGH
        MM +SGLQF E Q+L+I+D LG K  WKQA +VV WVY+ K   + +SRFVYTKLL+VLG AR+PQEALQIFN M GD Q+YPDMAAYH IAVTLGQAG 
Subjt:  MMIRSGLQFNEGQLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGH

Query:  LKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EP LV+YNAILNAC+PTL+WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ VH  F KMK SG
Subjt:  LKQLLKVIECMRQQPSKKIRNKYRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG

Query:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFK
        E  KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL +TFTG+I +S +GGH+DDC++IF+
Subjt:  ETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFK

Query:  YMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLV
        YMK  C PNIGT N MLKVYGRNDMFS+AK+LFEEI  RK  H         ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGY++DQ+KHA +L+
Subjt:  YMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEI-KRKTDHSSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLV

Query:  EASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCN-ISEATVS
        EASRAGKW LL+HAFD++LE G+IPHPL FTE++     +  +++A+TL+  +  A FQ+SE +WT+LFE + D +  +NL +LSD L  C+ +SE TVS
Subjt:  EASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCN-ISEATVS

Query:  NLSRSLQSLC
        NLS+SL+S C
Subjt:  NLSRSLQSLC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCAGTGAGAGCAAATGCACCCATTCCTTCACCGAAGTTCGAACCAGATACCGAGAAGATCAAGCGTACGCTCCTCCAAAAGGGTGTTTATCCCACTCCCAAAAT
CGTCCGCTCACTCCGCAAGAAAGAAATTCAGAAGCACAATCGCAAACTCAAGAGACTCCCGAAACGACAAGCCGATCAGACGCCACCCCTTTCTGAGTCCCAAAAGCAAC
TAATCGACGAGGAAACTCGTTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATTGAGGCAAAACCAGCCGGGGGCTTGATGGTCGGAAGGCCTTGGGAG
AGACTGGAAAGACTTAACCTTAAAGAACTTACCGATTCTAGGACGGGATACAATCGGGATAATCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAGTTGTTTGAGGCTCG
TAAGCTCGAGGAATTGCAGTGGGTTTTAGATGACGATGTGGAACTGAAGGATGAGTGGTTAGAGAGTGAAAATGGTCACTGTGGCACCGTAAAACGGAGGCGCGGTGACG
GAGAGGTTATTCGGTTTCTTGTTGACAGGCTCAGTTCGATGGAAATTTCCATGAGAGATTGGAAATTCTCGAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAACTTCTTAGAATTTTGGATGCCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTTATTCGAAAAGCAG
ATTTGTTTATACAAAGCTCTTAGCGGTTCTTGGGATGGCAAGGAAACCTCAGGAAGCCCTTCAGATATTCAATTTCATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATTGCTGTTACGCTGGGTCAAGCTGGTCACTTGAAACAATTGCTGAAAGTTATTGAGTGCATGAGGCAGCAACCGTCTAAAAAAATTAGAAACAAG
TACCGCAAATCTTGGGATCCAGCAGTTGAACCCAGCCTTGTTATATATAATGCTATTTTAAACGCATGCATTCCCACCCTTGAGTGGAAAGGTGTCTTCTGGGTGTTCAC
ACAATTGAGAAAGAGTGGTTTGAAACCTAACGGAGCAACATATGGGCTTTCTATGGAGGTAATGCTGAAATCTGGAAAGTATGAGCATGTCCACAAGCTTTTTACAAAAA
TGAAGAAGAGTGGAGAAACACTAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTCAATGGAGCTATTGAAGCAGTCAGGGATATGGAA
CAAAGAGGAGTAGTTGGATCTGCCAGTGTCTACTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCGAGATGCATTAGTGGAGGTTGAGAAAATGAAAATGCT
ATCGCACATGAAACCATTGGTGGTTACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGTATATCTATTTTCAAGTACATGAAGCAAAATT
GTGCGCCAAATATAGGGACCATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAGGCTAAAGATTTATTTGAAGAGATAAAGAGAAAAACTGATCAT
TCCTCCCAGATTAGTGTTGTTCCTTCTATTATCCCAGATGAATATACGTATGGCTCAATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTTGAAAATGTATA
CAGGGAAATGGCTCTGTCTGGATACCGGCTTGATCAAAGTAAACATGCAAGGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTATTAGATCATGCATTTGACT
CAATCTTGGAGGCTGGACAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAACTTATGACTCAAGATAGCTATGAGCAGGCTGTCACCCTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACAGAACTTTTTGAAGCGAACACGGACAGGATCTGTTGGAATAACTTGAAGCAACTGTCGGATGCTCTTAGCAA
CTGTAATATATCAGAAGCCACAGTCTCGAACTTGTCAAGATCATTGCAGTCTCTTTGCAAACCTGGCATACCAGAAAACACCTCCCAGTCTGTAGCCTGCGATCATGATG
CAACAGATGGATTACAACTTCCTGGTTCTGAAAACATGGAGAATTTGAAACTTCACCCAGATTGTTTCACAGGTCGTTGTGGCAAGTCGTTGGATAATACTCCTGTTAGC
CGTGCTAGCTTGAACATAAAGGTGGAGAGTGACTCAAAGATGGCCCCTTGGTCGCTGAGTCTTTCTGAAGGTGTTTTAGGGACTGACAAGTTTATGGATCGTTCCGAGAA
TGAGCTCTCGACTTTTGATTTGTGTGATGGCAGTGAAGATGATGAGGAAGAACTTAATATATTACTCGATAGATTTGGCGATTCTTACGATTCAAACTTGCCTTCTGTCA
ATGAAATACTGAAAACCTGGAAAGAAGACAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCAGTGAGAGCAAATGCACCCATTCCTTCACCGAAGTTCGAACCAGATACCGAGAAGATCAAGCGTACGCTCCTCCAAAAGGGTGTTTATCCCACTCCCAAAAT
CGTCCGCTCACTCCGCAAGAAAGAAATTCAGAAGCACAATCGCAAACTCAAGAGACTCCCGAAACGACAAGCCGATCAGACGCCACCCCTTTCTGAGTCCCAAAAGCAAC
TAATCGACGAGGAAACTCGTTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATTGAGGCAAAACCAGCCGGGGGCTTGATGGTCGGAAGGCCTTGGGAG
AGACTGGAAAGACTTAACCTTAAAGAACTTACCGATTCTAGGACGGGATACAATCGGGATAATCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAGTTGTTTGAGGCTCG
TAAGCTCGAGGAATTGCAGTGGGTTTTAGATGACGATGTGGAACTGAAGGATGAGTGGTTAGAGAGTGAAAATGGTCACTGTGGCACCGTAAAACGGAGGCGCGGTGACG
GAGAGGTTATTCGGTTTCTTGTTGACAGGCTCAGTTCGATGGAAATTTCCATGAGAGATTGGAAATTCTCGAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAACTTCTTAGAATTTTGGATGCCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTTATTCGAAAAGCAG
ATTTGTTTATACAAAGCTCTTAGCGGTTCTTGGGATGGCAAGGAAACCTCAGGAAGCCCTTCAGATATTCAATTTCATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATTGCTGTTACGCTGGGTCAAGCTGGTCACTTGAAACAATTGCTGAAAGTTATTGAGTGCATGAGGCAGCAACCGTCTAAAAAAATTAGAAACAAG
TACCGCAAATCTTGGGATCCAGCAGTTGAACCCAGCCTTGTTATATATAATGCTATTTTAAACGCATGCATTCCCACCCTTGAGTGGAAAGGTGTCTTCTGGGTGTTCAC
ACAATTGAGAAAGAGTGGTTTGAAACCTAACGGAGCAACATATGGGCTTTCTATGGAGGTAATGCTGAAATCTGGAAAGTATGAGCATGTCCACAAGCTTTTTACAAAAA
TGAAGAAGAGTGGAGAAACACTAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTCAATGGAGCTATTGAAGCAGTCAGGGATATGGAA
CAAAGAGGAGTAGTTGGATCTGCCAGTGTCTACTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCGAGATGCATTAGTGGAGGTTGAGAAAATGAAAATGCT
ATCGCACATGAAACCATTGGTGGTTACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGTATATCTATTTTCAAGTACATGAAGCAAAATT
GTGCGCCAAATATAGGGACCATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAGGCTAAAGATTTATTTGAAGAGATAAAGAGAAAAACTGATCAT
TCCTCCCAGATTAGTGTTGTTCCTTCTATTATCCCAGATGAATATACGTATGGCTCAATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTTGAAAATGTATA
CAGGGAAATGGCTCTGTCTGGATACCGGCTTGATCAAAGTAAACATGCAAGGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTATTAGATCATGCATTTGACT
CAATCTTGGAGGCTGGACAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAACTTATGACTCAAGATAGCTATGAGCAGGCTGTCACCCTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACAGAACTTTTTGAAGCGAACACGGACAGGATCTGTTGGAATAACTTGAAGCAACTGTCGGATGCTCTTAGCAA
CTGTAATATATCAGAAGCCACAGTCTCGAACTTGTCAAGATCATTGCAGTCTCTTTGCAAACCTGGCATACCAGAAAACACCTCCCAGTCTGTAGCCTGCGATCATGATG
CAACAGATGGATTACAACTTCCTGGTTCTGAAAACATGGAGAATTTGAAACTTCACCCAGATTGTTTCACAGGTCGTTGTGGCAAGTCGTTGGATAATACTCCTGTTAGC
CGTGCTAGCTTGAACATAAAGGTGGAGAGTGACTCAAAGATGGCCCCTTGGTCGCTGAGTCTTTCTGAAGGTGTTTTAGGGACTGACAAGTTTATGGATCGTTCCGAGAA
TGAGCTCTCGACTTTTGATTTGTGTGATGGCAGTGAAGATGATGAGGAAGAACTTAATATATTACTCGATAGATTTGGCGATTCTTACGATTCAAACTTGCCTTCTGTCA
ATGAAATACTGAAAACCTGGAAAGAAGACAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAATTAG
Protein sequenceShow/hide protein sequence
MEAVRANAPIPSPKFEPDTEKIKRTLLQKGVYPTPKIVRSLRKKEIQKHNRKLKRLPKRQADQTPPLSESQKQLIDEETRFLTLRSEYKEFSKAIEAKPAGGLMVGRPWE
RLERLNLKELTDSRTGYNRDNLKKESLRELRKLFEARKLEELQWVLDDDVELKDEWLESENGHCGTVKRRRGDGEVIRFLVDRLSSMEISMRDWKFSRMMIRSGLQFNEG
QLLRILDALGAKGCWKQALSVVEWVYNLKSHSYSKSRFVYTKLLAVLGMARKPQEALQIFNFMRGDGQIYPDMAAYHSIAVTLGQAGHLKQLLKVIECMRQQPSKKIRNK
YRKSWDPAVEPSLVIYNAILNACIPTLEWKGVFWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDME
QRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKMLSHMKPLVVTFTGMILSSFDGGHIDDCISIFKYMKQNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKTDH
SSQISVVPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHARLLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLMTQDSYEQAVTLVKAM
GYAPFQVSERQWTELFEANTDRICWNNLKQLSDALSNCNISEATVSNLSRSLQSLCKPGIPENTSQSVACDHDATDGLQLPGSENMENLKLHPDCFTGRCGKSLDNTPVS
RASLNIKVESDSKMAPWSLSLSEGVLGTDKFMDRSENELSTFDLCDGSEDDEEELNILLDRFGDSYDSNLPSVNEILKTWKEDRKTDGLFLHPLN