; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026223 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026223
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153031:2986202..2991672
RNA-Seq ExpressionSgr026223
SyntenySgr026223
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141982.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucumis sativus]0.0e+0084.92Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +N PIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+ QSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN KELTG RTGY+ D+LKKESLRELRKLFE RKLEE QW LDDDVELKEEW ESEN  +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIE MRQQPSKK+RNKCRKSWDPAVEPDLV+YNAILNACIPTLEWKGVYWVFTQLRKS L+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKK+G+T KANTYRVLVKAFWEEG VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMI SSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDM+SKAKDLFEEIKRKAD SS  S+ PS++PDEYTY SMLEAAAS+LQWEYFE+VYREMALSGY+LDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE NTDRI  NNLK+L  AL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL SLCK  IPENTSQSVACD DATD L +  SENMENM   KLHPD      DESLD+I V+H S NMKV S+SKM+PWS+S+++G LG
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNN  S FD C +SEDDEEEL+ LLD FDD+YDSNLP+VNEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_008440398.1 PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucumis melo]0.0e+0085.15Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +NAPIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+DQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN  ELTG RTGY+ D+LKKESLRELRKLFE RKLEEL+W LDDDVELKEEW +SENG +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPTLEWKGVYWVFTQLRKS L+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKKSGET KANTYRVLVKAFWEEG  +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMILSSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDMFSKAKDLFEEIK+KAD SS +S+ PS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC NNLK+L DAL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL SLCK GI E+TSQS+ACD +ATDGL +  S+NMENM   KLHPD+     DESLD+I V+H S NMKV S+S M+PWS S+++GVLG
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNNE S+FDS D+SEDDE EL+MLLD FDDSYDSNLP+ NEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_023543129.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0085.73Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  NA +P+PKFEPDIEKIKR LLQKGV+P+PKIVRSL KKEIQKHNRKLKR AERQ  QSPPLSES+KQ+I EETHFLTLRSEYKEFSKAIEA+P 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRT Y+ DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW  SENG  D VKRRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMI+SGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG AR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLK+IECMRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPTLEWK VYWVFTQLRK+ LKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMK SG T KANTYRVLVKAFWEEG V+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+V+GRNDMFSKAKDL+EEIKRKAD SS SS+  SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHAMLLVEASRAGKWYLLDHA+DTILEAGQIPHPLLFTE+ILQL  QDNYEQAVTLV+ M YAPFQVSERQWTE+FE NTDRIC NNLKKLSDAL+DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL  LCK GIPENTSQS+  D D TDGL + GSENMENM   KLH D   + C+ SLD+I VNH S N     DSKM PWSLSL++GVL 
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T KFSD SNNE S+FD  DDSEDDEEEL MLLD FDD YDSNLPSV+EILKTWKE+RKNDGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_038881784.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X1 [Benincasa hispida]0.0e+0086.55Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  N+PIP+PKFEPDIEKIKR L+ KGV+P+P+IVRSLRKKEIQK+NRKLKR  ERQADQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRTGY+ DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW ESEN H D ++RRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMR--------GDGQIYPD
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGM+R+PQEALQIF+LMR        GDGQIYPD
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMR--------GDGQIYPD

Query:  MAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSG
        MAAYHSIAVTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKS L+PNGATYGLSMEVMLKSG
Subjt:  MAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSG

Query:  KYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMIL
        KYE +HKLFTK+KKSGET KANTYRVLVKAFWEEG VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRW+DALVEVEKMKTLSHMKPLVV FTGMIL
Subjt:  KYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMIL

Query:  SSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMAL
        SSFDGGHIDDCISIFEYMKQ CAPNIG IN+ML+VYGRNDMF KAKDLFEEIKRKAD SS SS+ PS++PDEYTYGSMLEAAASALQWEYFENVYREMAL
Subjt:  SSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMAL

Query:  SGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLS
        SGYRLDQSKHA LLVEASRAGKWYLLDHA+D+ILEAGQIPHPLLFTEMIL L  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC  NLKKL 
Subjt:  SGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLS

Query:  DALNDCNASEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSL
        DAL +C+ASEATVSNLS SL SLCK GIPENTSQSVACD D TDGL + GSEN ENM   KLHPDR  + CDESLD+I VNH S NMKV+SDS+++PWS 
Subjt:  DALNDCNASEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSL

Query:  SLTEGVLGTQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        S +EGVLGT +FSD S NE+S+ D CDDSEDDEE L+MLLD FDDSYDSNLPSVNEILKTWKE+RK DGL
Subjt:  SLTEGVLGTQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_038881786.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X2 [Benincasa hispida]0.0e+0087.35Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  N+PIP+PKFEPDIEKIKR L+ KGV+P+P+IVRSLRKKEIQK+NRKLKR  ERQADQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRTGY+ DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW ESEN H D ++RRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGM+R+PQEALQIF+LMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKS L+PNGATYGLSMEVMLKSGKYE +HKL
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTK+KKSGET KANTYRVLVKAFWEEG VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG IN+ML+VYGRNDMF KAKDLFEEIKRKAD SS SS+ PS++PDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHA LLVEASRAGKWYLLDHA+D+ILEAGQIPHPLLFTEMIL L  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC  NLKKL DAL +C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL SLCK GIPENTSQSVACD D TDGL + GSEN ENM   KLHPDR  + CDESLD+I VNH S NMKV+SDS+++PWS S +EGVLG
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD S NE+S+ D CDDSEDDEE L+MLLD FDDSYDSNLPSVNEILKTWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

TrEMBL top hitse value%identityAlignment
A0A0A0KFY8 Uncharacterized protein0.0e+0084.92Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +N PIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+ QSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN KELTG RTGY+ D+LKKESLRELRKLFE RKLEE QW LDDDVELKEEW ESEN  +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIE MRQQPSKK+RNKCRKSWDPAVEPDLV+YNAILNACIPTLEWKGVYWVFTQLRKS L+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKK+G+T KANTYRVLVKAFWEEG VNGAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMI SSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDM+SKAKDLFEEIKRKAD SS  S+ PS++PDEYTY SMLEAAAS+LQWEYFE+VYREMALSGY+LDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE NTDRI  NNLK+L  AL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL SLCK  IPENTSQSVACD DATD L +  SENMENM   KLHPD      DESLD+I V+H S NMKV S+SKM+PWS+S+++G LG
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNN  S FD C +SEDDEEEL+ LLD FDD+YDSNLP+VNEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A1S3B127 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0085.15Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +NAPIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+DQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN  ELTG RTGY+ D+LKKESLRELRKLFE RKLEEL+W LDDDVELKEEW +SENG +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPTLEWKGVYWVFTQLRKS L+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKKSGET KANTYRVLVKAFWEEG  +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMILSSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDMFSKAKDLFEEIK+KAD SS +S+ PS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC NNLK+L DAL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL SLCK GI E+TSQS+ACD +ATDGL +  S+NMENM   KLHPD+     DESLD+I V+H S NMKV S+S M+PWS S+++GVLG
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNNE S+FDS D+SEDDE EL+MLLD FDDSYDSNLP+ NEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A5D3CLI0 Pentatricopeptide repeat-containing protein0.0e+0085.15Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +NAPIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+DQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN  ELTG RTGY+ D+LKKESLRELRKLFE RKLEEL+W LDDDVELKEEW +SENG +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPTLEWKGVYWVFTQLRKS L+PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKKSGET KANTYRVLVKAFWEEG  +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMILSSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDMFSKAKDLFEEIK+KAD SS +S+ PS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC NNLK+L DAL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL SLCK GI E+TSQS+ACD +ATDGL +  S+NMENM   KLHPD+     DESLD+I V+H S NMKV S+S M+PWS S+++GVLG
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNNE S+FDS D+SEDDE EL+MLLD FDDSYDSNLP+ NEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A6J1GG29 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0085.38Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  NA +P+PKFEPDIEKIKR LLQKGV+P+PKIVRSL KKEIQKHNRKLKR AERQ  QSPPLSES+KQ+I EET FLTLRSEYKEFSKAIEA+P 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRT Y+ DNLKKESLRELRKLFEARKLEELQWVLDDDVELK+EW  SENGH D VKRRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMI+SGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG AR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNAC+PTLEWK VYWVFTQLRK+ LKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMK SG T KANTYRVLVKAFWEEG V+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+V+GRNDMFSKAKDL+EEIKRKAD SS SS+  SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHAMLLVEASRAGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQAVTLV+ M YAPFQVSERQWTELFE NTDRIC NNLKKLSDAL+DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATV NLS SL SLCK GIPEN SQS+A D D TDGL + G ENM+NM   KLH D   + C+ SLD+I VNH S N     DS+M PWSLSL++GVL 
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T KFSD SNNE S+FD  DDSEDDEEEL MLLD FDDSYDSNLPSV+EILKTWKE+RKNDGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A6J1ILT7 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0085.38Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  NA +P+PKFEPDIEKIKR L+QKGV+P+PKIVRSL KKEIQKHNRKLKR AERQ  QSPPLSES+KQ+I EETHFLTLRSEYKEFSKAIEA+P 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKE TGFRT Y+ DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW  SENGH D VKRRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMI+SGLQFNEGQLLKILD+LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG AR+PQEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPTLEWK VYWVFTQLRK+ LKPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMK SG T KANTYRVLVKAFWEEG V+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+V+GRNDMFSKAKDL+EEIKRKAD SS SS+  SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA
        KHAMLLVEASRAGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQA+TLV+ M YAPFQVSERQWTELFE NTDRIC NNLKKLSDAL+DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNA

Query:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG
        SEATVSNLS SL SLCK GIPENTSQS+A D D TDGL + GSEN +NM   KLH D   + C+ SLD+I VNH S N     DSKM PWSLSL++GVL 
Subjt:  SEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T KFSD SNNE S+FD  DDSEDDEEEL MLLD FDDSY SNLPSV+EILKTW+E+RKNDGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic2.8e-1921.13Show/hide
Query:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL
        M + G+  +E     ++DA    G W+ A  +++    +++     S +V++++LA      + Q+A  +   M+  G + PD   Y+ +  T G+   L
Subjt:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL

Query:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE
           +     MR++                +EPD+V +N +++A            +F ++R+S   P   TY + + ++ +   +E V  + ++MK+ G 
Subjt:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE

Query:  TPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY
         P   TY  LV  +   G+   AI+ +  M+  G+  S ++Y+ L       G    AL  V+ MK    ++  ++    +I +  +   + +  S+ ++
Subjt:  TPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY

Query:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI
        M++N   P++    T+++   R + F K   ++EE+
Subjt:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI

Q84ZD2 Pentatricopeptide repeat-containing protein CRP1 homolog, chloroplastic3.7e-1921.43Show/hide
Query:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL
        M + G+  +E     ++DA    G W+ A  +++    +++     S +V++++LA      E Q+A  +   M   G + PD   Y+ +  T G+   L
Subjt:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL

Query:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE
           +   + MR++                +EPD+V +N +++A            +F ++R+S       TY + + ++ +  ++E V  +  +MK+ G 
Subjt:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE

Query:  TPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY
         P   TY  LV  +   G+   A++ +  M+  G+  S ++Y+ L       G    AL  V+ M+    ++   V    +I +  +   I +  S+ ++
Subjt:  TPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY

Query:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI
        MK+N   P++    T+++   R + F K   ++EE+
Subjt:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic1.3e-24252.93Show/hide
Query:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE
        P+FEPD+EKIKR LL+ GV P+PKI+ +LRKKEIQKHNR+ KR+ E +A+     +E++KQ + EE  F TLR EYK+F+++I  K GG  GLMVG PWE
Subjt:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE

Query:  RLERVNLKELTG--FRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK
         +ERV LKEL     R       LKKE+L+EL+K+ E    ++L+WVLDDDV+++E   + E   FD  KR R +GE +R LVDRLS REI  + WKF +
Subjt:  RLERVNLKELTG--FRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK

Query:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL
        MM +SGLQF E Q+LKI+D LG K  WKQA +VV WVY+ K   H +SRFVYTKLL+VLG AR PQEALQIFN M GD Q+YPDMAAYH IAVTLGQAGL
Subjt:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL

Query:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PTL+WK V WVF +LRK+ L+PNGATYGL+MEVML+SGK++ VH  F KMK SG
Subjt:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG

Query:  ETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE
        E PKA TY+VLV+A W EGK+  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL + FTG+I +S +GGH+DDC++IF+
Subjt:  ETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE

Query:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE
        YMK  C PNIG  N ML+VYGRNDMFS+AK+LFEEI         S     ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGY++DQ+KHA +L+E
Subjt:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE

Query:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCN-ASEATVSN
        ASRAGKW LL+HA+D +LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ+SE +WT+LFE + D +  +NL KLSD L +C+  SE TVSN
Subjt:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCN-ASEATVSN

Query:  LSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTE-GVLGTQKFSD
        LS SL S C  G   +++Q +      T     +  E++   +TT                          M+ ++ +    W  + TE   LG ++   
Subjt:  LSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTE-GVLGTQKFSD

Query:  PSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND
                     + +DDEE           S DS+  SV +ILK W+E  K +
Subjt:  PSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028604.5e-2522.54Show/hide
Query:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR
        +LL  L  LG    +  AL   +W    K +       V   ++++LG       A  +FN ++ DG    D+ +Y S+      +G  ++ + V + M 
Subjt:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR

Query:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHK
        +                      P  KI +   K     + PD   YN ++  C      +    VF +++ +    +  TY   ++V  KS + +   K
Subjt:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHK

Query:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH
        +  +M  +G +P   TY  L+ A+  +G ++ A+E    M ++G       Y  L       G+   A+   E+M+  +  KP +  F   I    + G 
Subjt:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH

Query:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD
          + + IF+ +     +P+I   NT+L V+G+N M S+   +F+E+KR              +P+  T+ +++ A +    +E    VYR M  +G   D
Subjt:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD

Query:  QSKHAMLLVEASRAGKW
         S +  +L   +R G W
Subjt:  QSKHAMLLVEASRAGKW

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic5.8e-9735.56Show/hide
Query:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGD
        I  L   L+  +I M +W+FSK +  + +++ +  +++++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R P EAL +F+ M   
Subjt:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++   KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-
        VML   KY  VH+ F KM+KS   P A  YRVLV   W+EGK + A+  V DME RG+VGSA++YY+LA CLC  GR  + L     V    +K + ++ 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-

Query:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS
                             KPLVV +TG+I +  D G+I +   IF+ MK+ C+PN+   N ML+ Y +  +F +A++LF+++    +    SS   S
Subjt:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS

Query:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK
         ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ + 
Subjt:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK

Query:  AMGYAPFQVSERQW-TELFEANTDRICSNNLKKLSDALNDCNASEATVSNLSI-SLHSLCKPGIPENT
         +     +   R + T  +     R   +++ +L D +N    S +  S+  + +L S CK  +   T
Subjt:  AMGYAPFQVSERQW-TELFEANTDRICSNNLKKLSDALNDCNASEATVSNLSI-SLHSLCKPGIPENT

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.9e-1922.33Show/hide
Query:  EWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPD
        +++ N+  H +       T L+       + ++A +I  ++ G G + PD+  Y+ +     +AG +   L V++ M                  +V PD
Subjt:  EWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPD

Query:  LVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQR
        +V YN IL +   + + K    V  ++ +    P+  TY + +E   +     H  KL  +M+  G TP   TY VLV    +EG+++ AI+ + DM   
Subjt:  LVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQR

Query:  GVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEYMKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLF
        G   +   +  +   +C  GRW DA   +  M       P VV F  +I      G +   I I E M Q+ C PN  + N +L  + +     +A    
Subjt:  GVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEYMKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLF

Query:  EEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMI
        E ++R   R           PD  TY +ML A     + E    +  +++  G       +  ++   ++AGK        D +      P  + ++ ++
Subjt:  EEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMI

Query:  LQLIVQDNYEQAVTL---VKAMGYAPFQVS
          L  +   ++A+      + MG  P  V+
Subjt:  LQLIVQDNYEQAVTL---VKAMGYAPFQVS

AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein4.1e-9835.56Show/hide
Query:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGD
        I  L   L+  +I M +W+FSK +  + +++ +  +++++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R P EAL +F+ M   
Subjt:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++   KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-
        VML   KY  VH+ F KM+KS   P A  YRVLV   W+EGK + A+  V DME RG+VGSA++YY+LA CLC  GR  + L     V    +K + ++ 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-

Query:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS
                             KPLVV +TG+I +  D G+I +   IF+ MK+ C+PN+   N ML+ Y +  +F +A++LF+++    +    SS   S
Subjt:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS

Query:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK
         ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ + 
Subjt:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK

Query:  AMGYAPFQVSERQW-TELFEANTDRICSNNLKKLSDALNDCNASEATVSNLSI-SLHSLCKPGIPENT
         +     +   R + T  +     R   +++ +L D +N    S +  S+  + +L S CK  +   T
Subjt:  AMGYAPFQVSERQW-TELFEANTDRICSNNLKKLSDALNDCNASEATVSNLSI-SLHSLCKPGIPENT

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein1.4e-10137.15Show/hide
Query:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGD
        I  L   L+  +I M +W+FSK +  + +++ +  +++++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R P EAL +F+ M   
Subjt:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++   KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVA
        VML   KY  VH+ F KM+KS   P A  YRVLV   W+EGK + A+  V DME RG+VGSA++YY+LA CLC  GR  + L  ++K+  +++ KPLVV 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVA

Query:  FTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS-IIPDEYTYGSMLEAAASALQWEYFEN
        +TG+I +  D G+I +   IF+ MK+ C+PN+   N ML+ Y +  +F +A++LF+++    +    SS   S ++PD YT+ +ML+  A   +W+ F  
Subjt:  FTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS-IIPDEYTYGSMLEAAASALQWEYFEN

Query:  VYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQW-TELFEANTDRIC
         YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ +  +     +   R + T  +     R  
Subjt:  VYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQW-TELFEANTDRIC

Query:  SNNLKKLSDALNDCNASEATVSNLSI-SLHSLCKPGIPENT
         +++ +L D +N    S +  S+  + +L S CK  +   T
Subjt:  SNNLKKLSDALNDCNASEATVSNLSI-SLHSLCKPGIPENT

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein3.2e-2622.54Show/hide
Query:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR
        +LL  L  LG    +  AL   +W    K +       V   ++++LG       A  +FN ++ DG    D+ +Y S+      +G  ++ + V + M 
Subjt:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR

Query:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHK
        +                      P  KI +   K     + PD   YN ++  C      +    VF +++ +    +  TY   ++V  KS + +   K
Subjt:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHK

Query:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH
        +  +M  +G +P   TY  L+ A+  +G ++ A+E    M ++G       Y  L       G+   A+   E+M+  +  KP +  F   I    + G 
Subjt:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH

Query:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD
          + + IF+ +     +P+I   NT+L V+G+N M S+   +F+E+KR              +P+  T+ +++ A +    +E    VYR M  +G   D
Subjt:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD

Query:  QSKHAMLLVEASRAGKW
         S +  +L   +R G W
Subjt:  QSKHAMLLVEASRAGKW

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.3e-24452.93Show/hide
Query:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE
        P+FEPD+EKIKR LL+ GV P+PKI+ +LRKKEIQKHNR+ KR+ E +A+     +E++KQ + EE  F TLR EYK+F+++I  K GG  GLMVG PWE
Subjt:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE

Query:  RLERVNLKELTG--FRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK
         +ERV LKEL     R       LKKE+L+EL+K+ E    ++L+WVLDDDV+++E   + E   FD  KR R +GE +R LVDRLS REI  + WKF +
Subjt:  RLERVNLKELTG--FRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK

Query:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL
        MM +SGLQF E Q+LKI+D LG K  WKQA +VV WVY+ K   H +SRFVYTKLL+VLG AR PQEALQIFN M GD Q+YPDMAAYH IAVTLGQAGL
Subjt:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL

Query:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PTL+WK V WVF +LRK+ L+PNGATYGL+MEVML+SGK++ VH  F KMK SG
Subjt:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG

Query:  ETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE
        E PKA TY+VLV+A W EGK+  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL + FTG+I +S +GGH+DDC++IF+
Subjt:  ETPKANTYRVLVKAFWEEGKVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE

Query:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE
        YMK  C PNIG  N ML+VYGRNDMFS+AK+LFEEI         S     ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGY++DQ+KHA +L+E
Subjt:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE

Query:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCN-ASEATVSN
        ASRAGKW LL+HA+D +LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ+SE +WT+LFE + D +  +NL KLSD L +C+  SE TVSN
Subjt:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCN-ASEATVSN

Query:  LSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTE-GVLGTQKFSD
        LS SL S C  G   +++Q +      T     +  E++   +TT                          M+ ++ +    W  + TE   LG ++   
Subjt:  LSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSLTE-GVLGTQKFSD

Query:  PSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND
                     + +DDEE           S DS+  SV +ILK W+E  K +
Subjt:  PSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTTTAGGCGCAAATGCACCAATTCCTGCGCCGAAGTTCGAACCAGATATCGAGAAAATTAAGCGTGCGCTCCTCCAGAAGGGTGTTTATCCGTCTCCCAAAAT
CGTCCGCTCACTTCGCAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAAGCGACAGGCTGAACGACAAGCCGATCAGTCGCCGCCCCTTTCTGAGTCCGAAAAGCAAG
TAATTGCGGAGGAAACCCATTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGGAGGCCGTGGGAG
AGACTAGAAAGAGTTAACCTTAAAGAACTTACCGGTTTTAGAACAGGATACGATGGGGATAATCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAACTGTTTGAGGCTCG
TAAGCTCGAGGAATTGCAGTGGGTTTTAGACGACGATGTAGAACTGAAGGAAGAGTGGTCGGAGAGTGAAAATGGTCATTTCGATACCGTAAAACGGAGGCGCGGCGATG
GAGAGGTTATTCGGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTGCCATGAGAGACTGGAAATTCTCAAAGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAACTACTTAAAATTTTGGATGCCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAGAGTCACAGTCATTCTAAAAGCAG
GTTTGTCTACACAAAGCTCTTAGCTGTTCTTGGGATGGCAAGGGAACCTCAGGAAGCCCTTCAGATATTTAATTTGATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTCTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGGCAGCAACCGTCTAAAAAAATTAGAAACAAG
TGCCGCAAATCTTGGGATCCTGCAGTTGAACCTGACCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCAACCCTTGAGTGGAAAGGTGTCTACTGGGTGTTCAC
CCAATTGAGAAAGAGTTGTTTGAAACCTAATGGAGCGACATATGGACTTTCTATGGAGGTAATGCTTAAATCTGGAAAGTATGAGCATGTCCACAAGCTTTTTACAAAAA
TGAAGAAGAGTGGGGAAACTCCAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAAAGTCAATGGAGCTATTGAAGCAGTCAGGGATATGGAA
CAGAGAGGAGTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCGAGATGCCTTAGTAGAGGTTGAAAAAATGAAAACACT
ATCACACATGAAACCATTGGTGGTGGCCTTCACCGGCATGATCTTATCTTCCTTTGACGGTGGACATATTGATGATTGTATATCTATTTTCGAGTACATGAAGCAAAATT
GTGCGCCGAATATAGGGGCCATTAATACCATGCTTAGAGTCTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCCGACCGT
TCCTCTCCAAGTAGTTCTGCTCCTTCTATAATCCCAGATGAATATACGTATGGCTCAATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTCGAGAACGTATA
CAGGGAAATGGCTCTGTCGGGATACCGGCTAGATCAAAGTAAACATGCAATGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTGTTAGATCATGCATATGACA
CAATCTTGGAAGCTGGACAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGATAACTATGAGCAGGCTGTCACCTTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACCGAACTTTTTGAAGCAAACACTGACAGAATTTGTTCGAATAACTTGAAGAAACTGTCGGATGCTCTAAACGA
CTGCAATGCATCAGAAGCCACAGTCTCGAACTTGTCAATATCACTGCACTCTCTTTGCAAACCTGGCATACCAGAAAACACCTCCCAGTCTGTAGCCTGTGACCGTGATG
CAACAGATGGATTGCCAATTCGTGGTTCTGAAAACATGGAGAACATGGAGACTACGAAGCTTCACCCAGATCGCCATGTGAATCACTGTGATGAGTCATTGGACATGATT
TCTGTTAACCATCTCAGCTCGAACATGAAGGTTGAGAGTGACTCAAAGATGGCCCCTTGGTCACTGAGTCTTACTGAAGGTGTTTTAGGGACTCAGAAATTTTCAGATCC
TTCCAACAATGAGGTCTCGAGTTTTGATTCGTGCGACGACAGTGAAGATGACGAGGAAGAACTTGACATGTTACTTGATAGATTTGATGATTCTTATGATTCGAACTTGC
CTTCAGTCAATGAAATACTGAAAACCTGGAAGGAAGACAGGAAAAACGATGGGTTAATTAGTACAATTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTTTAGGCGCAAATGCACCAATTCCTGCGCCGAAGTTCGAACCAGATATCGAGAAAATTAAGCGTGCGCTCCTCCAGAAGGGTGTTTATCCGTCTCCCAAAAT
CGTCCGCTCACTTCGCAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAAGCGACAGGCTGAACGACAAGCCGATCAGTCGCCGCCCCTTTCTGAGTCCGAAAAGCAAG
TAATTGCGGAGGAAACCCATTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGGAGGCCGTGGGAG
AGACTAGAAAGAGTTAACCTTAAAGAACTTACCGGTTTTAGAACAGGATACGATGGGGATAATCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAACTGTTTGAGGCTCG
TAAGCTCGAGGAATTGCAGTGGGTTTTAGACGACGATGTAGAACTGAAGGAAGAGTGGTCGGAGAGTGAAAATGGTCATTTCGATACCGTAAAACGGAGGCGCGGCGATG
GAGAGGTTATTCGGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTGCCATGAGAGACTGGAAATTCTCAAAGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAACTACTTAAAATTTTGGATGCCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAGAGTCACAGTCATTCTAAAAGCAG
GTTTGTCTACACAAAGCTCTTAGCTGTTCTTGGGATGGCAAGGGAACCTCAGGAAGCCCTTCAGATATTTAATTTGATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTCTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGGCAGCAACCGTCTAAAAAAATTAGAAACAAG
TGCCGCAAATCTTGGGATCCTGCAGTTGAACCTGACCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCAACCCTTGAGTGGAAAGGTGTCTACTGGGTGTTCAC
CCAATTGAGAAAGAGTTGTTTGAAACCTAATGGAGCGACATATGGACTTTCTATGGAGGTAATGCTTAAATCTGGAAAGTATGAGCATGTCCACAAGCTTTTTACAAAAA
TGAAGAAGAGTGGGGAAACTCCAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAAAGTCAATGGAGCTATTGAAGCAGTCAGGGATATGGAA
CAGAGAGGAGTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCGAGATGCCTTAGTAGAGGTTGAAAAAATGAAAACACT
ATCACACATGAAACCATTGGTGGTGGCCTTCACCGGCATGATCTTATCTTCCTTTGACGGTGGACATATTGATGATTGTATATCTATTTTCGAGTACATGAAGCAAAATT
GTGCGCCGAATATAGGGGCCATTAATACCATGCTTAGAGTCTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCCGACCGT
TCCTCTCCAAGTAGTTCTGCTCCTTCTATAATCCCAGATGAATATACGTATGGCTCAATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTCGAGAACGTATA
CAGGGAAATGGCTCTGTCGGGATACCGGCTAGATCAAAGTAAACATGCAATGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTGTTAGATCATGCATATGACA
CAATCTTGGAAGCTGGACAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGATAACTATGAGCAGGCTGTCACCTTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACCGAACTTTTTGAAGCAAACACTGACAGAATTTGTTCGAATAACTTGAAGAAACTGTCGGATGCTCTAAACGA
CTGCAATGCATCAGAAGCCACAGTCTCGAACTTGTCAATATCACTGCACTCTCTTTGCAAACCTGGCATACCAGAAAACACCTCCCAGTCTGTAGCCTGTGACCGTGATG
CAACAGATGGATTGCCAATTCGTGGTTCTGAAAACATGGAGAACATGGAGACTACGAAGCTTCACCCAGATCGCCATGTGAATCACTGTGATGAGTCATTGGACATGATT
TCTGTTAACCATCTCAGCTCGAACATGAAGGTTGAGAGTGACTCAAAGATGGCCCCTTGGTCACTGAGTCTTACTGAAGGTGTTTTAGGGACTCAGAAATTTTCAGATCC
TTCCAACAATGAGGTCTCGAGTTTTGATTCGTGCGACGACAGTGAAGATGACGAGGAAGAACTTGACATGTTACTTGATAGATTTGATGATTCTTATGATTCGAACTTGC
CTTCAGTCAATGAAATACTGAAAACCTGGAAGGAAGACAGGAAAAACGATGGGTTAATTAGTACAATTCAGTAG
Protein sequenceShow/hide protein sequence
MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGGGLMVGRPWE
RLERVNLKELTGFRTGYDGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEG
QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAREPQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNK
CRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCLKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAIEAVRDME
QRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADR
SSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAM
GYAPFQVSERQWTELFEANTDRICSNNLKKLSDALNDCNASEATVSNLSISLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETTKLHPDRHVNHCDESLDMI
SVNHLSSNMKVESDSKMAPWSLSLTEGVLGTQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGLISTIQ