; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr002331 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr002331
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00001478:11289..15938
RNA-Seq ExpressionSgr002331
SyntenySgr002331
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141982.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucumis sativus]0.0e+0084.69Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +N PIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+ QSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN KELTG RTGYN D+LKKESLRELRKLFE RKLEE QW LDDDVELKEEW ESEN  +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIE MRQQPSKK+RNKCRKSWDPAVEPDLV+YNAILNACIPTLEWKGVYWVFTQLRKS ++PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKK+G+T KANTYRVLVKAFWEEG VNGA+EAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMI SSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDM+SKAKDLFEEIKRKAD SS  S+ PS++PDEYTY SMLEAAAS+LQWEYFE+VYREMALSGY+LDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE NTDRI  +NLK+L  AL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLSRSL SLCK  IPENTSQSVACD DATD L +  SENMEN   MKLHPD      DESLD+I V+H S NMKV S+SKM+PWS+S ++G LG
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNN  S FD C +SEDDEEEL+ LLD FDD+YDSNLP+VNEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_008440398.1 PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucumis melo]0.0e+0084.92Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +NAPIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+DQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN  ELTG RTGYN D+LKKESLRELRKLFE RKLEEL+W LDDDVELKEEW +SENG +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPTLEWKGVYWVFTQLRKS ++PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKKSGET KANTYRVLVKAFWEEG  +GA+EAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMILSSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDMFSKAKDLFEEIK+KAD SS +S+ PS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC +NLK+L DAL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLSRSL SLCK GI E+TSQS+ACD +ATDGL +  S+NMEN   MKLHPD+     DESLD+I V+H S NMKV S+S M+PWS S ++GVLG
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNNE S+FDS D+SEDDE EL+MLLD FDDSYDSNLP+ NEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_023543129.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0085.5Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  NA +P+PKFEPDIEKIKR LLQKGV+P+PKIVRSL KKEIQKHNRKLKR AERQ  QSPPLSES+KQ+I EETHFLTLRSEYKEFSKAIEA+P 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRT YN DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW  SENG  D VKRRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMI+SGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG AR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLK+IECMRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPTLEWK VYWVFTQLRK+ +KPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMK SG T KANTYRVLVKAFWEEG V+GA+EAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+V+GRNDMFSKAKDL+EEIKRKAD SS SS+  SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHAMLLVEASRAGKWYLLDHA+DTILEAGQIPHPLLFTE+ILQL  QDNYEQAVTLV+ M YAPFQVSERQWTE+FE NTDRIC +NLKKLSDALSDC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLSRSL  LCK GIPENTSQS+  D D TDGL + GSENMEN   MKLH D   + C+ SLD+I VNH S N     DSKM PWSLS ++GVL 
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T KFSD SNNE S+FD  DDSEDDEEEL MLLD FDD YDSNLPSV+EILKTWKE+RKNDGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_038881784.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X1 [Benincasa hispida]0.0e+0086.44Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  N+PIP+PKFEPDIEKIKR L+ KGV+P+P+IVRSLRKKEIQK+NRKLKR  ERQADQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRTGYN DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW ESEN H D ++RRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMR--------GDGQIYPD
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGM+R+ QEALQIF+LMR        GDGQIYPD
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMR--------GDGQIYPD

Query:  MAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSG
        MAAYHSIAVTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKS ++PNGATYGLSMEVMLKSG
Subjt:  MAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSG

Query:  KYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMIL
        KYE +HKLFTK+KKSGET KANTYRVLVKAFWEEG VNGA+EAVRDMEQRGVVGSASVYYELACCLCYNGRW+DALVEVEKMKTLSHMKPLVV FTGMIL
Subjt:  KYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMIL

Query:  SSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMAL
        SSFDGGHIDDCISIFEYMKQ CAPNIG IN+ML+VYGRNDMF KAKDLFEEIKRKAD SS SS+ PS++PDEYTYGSMLEAAASALQWEYFENVYREMAL
Subjt:  SSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMAL

Query:  SGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLS
        SGYRLDQSKHA LLVEASRAGKWYLLDHA+D+ILEAGQIPHPLLFTEMIL L  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC  NLKKL 
Subjt:  SGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLS

Query:  DALSDCNASEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSL
        DAL +C+ASEATVSNLSRSL SLCK GIPENTSQSVACD D TDGL + GSEN EN   MKLHPDR  + CDESLD+I VNH S NMKV+SDS+++PWS 
Subjt:  DALSDCNASEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSL

Query:  SFTEGVLGTQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        S +EGVLGT +FSD S NE+S+ D CDDSEDDEE L+MLLD FDDSYDSNLPSVNEILKTWKE+RK DGL
Subjt:  SFTEGVLGTQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

XP_038881786.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X2 [Benincasa hispida]0.0e+0087.24Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  N+PIP+PKFEPDIEKIKR L+ KGV+P+P+IVRSLRKKEIQK+NRKLKR  ERQADQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRTGYN DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW ESEN H D ++RRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGM+R+ QEALQIF+LMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKS ++PNGATYGLSMEVMLKSGKYE +HKL
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTK+KKSGET KANTYRVLVKAFWEEG VNGA+EAVRDMEQRGVVGSASVYYELACCLCYNGRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG IN+ML+VYGRNDMF KAKDLFEEIKRKAD SS SS+ PS++PDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHA LLVEASRAGKWYLLDHA+D+ILEAGQIPHPLLFTEMIL L  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC  NLKKL DAL +C+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLSRSL SLCK GIPENTSQSVACD D TDGL + GSEN EN   MKLHPDR  + CDESLD+I VNH S NMKV+SDS+++PWS S +EGVLG
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD S NE+S+ D CDDSEDDEE L+MLLD FDDSYDSNLPSVNEILKTWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

TrEMBL top hitse value%identityAlignment
A0A0A0KFY8 Uncharacterized protein0.0e+0084.69Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +N PIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+ QSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN KELTG RTGYN D+LKKESLRELRKLFE RKLEE QW LDDDVELKEEW ESEN  +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIE MRQQPSKK+RNKCRKSWDPAVEPDLV+YNAILNACIPTLEWKGVYWVFTQLRKS ++PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKK+G+T KANTYRVLVKAFWEEG VNGA+EAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMI SSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDM+SKAKDLFEEIKRKAD SS  S+ PS++PDEYTY SMLEAAAS+LQWEYFE+VYREMALSGY+LDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQAVTLV+ MGYAPFQVSERQWTELFE NTDRI  +NLK+L  AL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLSRSL SLCK  IPENTSQSVACD DATD L +  SENMEN   MKLHPD      DESLD+I V+H S NMKV S+SKM+PWS+S ++G LG
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNN  S FD C +SEDDEEEL+ LLD FDD+YDSNLP+VNEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A1S3B127 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0084.92Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +NAPIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+DQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN  ELTG RTGYN D+LKKESLRELRKLFE RKLEEL+W LDDDVELKEEW +SENG +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPTLEWKGVYWVFTQLRKS ++PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKKSGET KANTYRVLVKAFWEEG  +GA+EAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMILSSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDMFSKAKDLFEEIK+KAD SS +S+ PS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC +NLK+L DAL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLSRSL SLCK GI E+TSQS+ACD +ATDGL +  S+NMEN   MKLHPD+     DESLD+I V+H S NMKV S+S M+PWS S ++GVLG
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNNE S+FDS D+SEDDE EL+MLLD FDDSYDSNLP+ NEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A5D3CLI0 Pentatricopeptide repeat-containing protein0.0e+0084.92Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL +NAPIP+PKFEPD +KIKR LLQKGVYP+P+IVRSLRKKEIQK+NRKLKR AERQ+DQSPPLSES+KQ+IAEETHFLTLRSEYKEFSKAIEAKP 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVN  ELTG RTGYN D+LKKESLRELRKLFE RKLEEL+W LDDDVELKEEW +SENG +D VKRRRGDGEVIRFLVDRLSS  I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMAR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPTLEWKGVYWVFTQLRKS ++PNGATYGLSMEVMLKSGKYE +H L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMKKSGET KANTYRVLVKAFWEEG  +GA+EAVRDMEQRGVVGSASVYYELACCLCYNG+W+DALVEVEKMKTLSHMKPLVV FTGMILSSF+GGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+VYGRNDMFSKAKDLFEEIK+KAD SS +S+ PS++PDEYTY SML+AAAS+LQWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHA+LLVEAS+AGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ MGYAPFQVSERQWTELFE N DRIC +NLK+L DAL DC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLSRSL SLCK GI E+TSQS+ACD +ATDGL +  S+NMEN   MKLHPD+     DESLD+I V+H S NMKV S+S M+PWS S ++GVLG
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T +FSD SNNE S+FDS D+SEDDE EL+MLLD FDDSYDSNLP+ NEIL+TWKE+RK DGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A6J1GG29 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0085.03Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  NA +P+PKFEPDIEKIKR LLQKGV+P+PKIVRSL KKEIQKHNRKLKR AERQ  QSPPLSES+KQ+I EET FLTLRSEYKEFSKAIEA+P 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKELTGFRT YN DNLKKESLRELRKLFEARKLEELQWVLDDDVELK+EW  SENGH D VKRRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMI+SGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG AR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNAC+PTLEWK VYWVFTQLRK+ +KPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMK SG T KANTYRVLVKAFWEEG V+GA+EAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+V+GRNDMFSKAKDL+EEIKRKAD SS SS+  SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHAMLLVEASRAGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQAVTLV+ M YAPFQVSERQWTELFE NTDRIC +NLKKLSDALSDC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATV NLS SL SLCK GIPEN SQS+A D D TDGL + G ENM+N   MKLH D   + C+ SLD+I VNH S N     DS+M PWSLS ++GVL 
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T KFSD SNNE S+FD  DDSEDDEEEL MLLD FDDSYDSNLPSV+EILKTWKE+RKNDGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

A0A6J1ILT7 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0085.03Show/hide
Query:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG
        MEAL  NA +P+PKFEPDIEKIKR L+QKGV+P+PKIVRSL KKEIQKHNRKLKR AERQ  QSPPLSES+KQ+I EETHFLTLRSEYKEFSKAIEA+P 
Subjt:  MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA
        GGLMVGRPWERLERVNLKE TGFRT YN DNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEW  SENGH D VKRRRGDGEVIRFLVDRLSSR I+
Subjt:  GGLMVGRPWERLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIA

Query:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA
        MRDWKFS+MMI+SGLQFNEGQLLKILD+LGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLG AR+ QEALQIFNLMRGDGQIYPDMAAYHSIA
Subjt:  MRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIA

Query:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL
        VTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPTLEWK VYWVFTQLRK+ +KPNGATYGLSMEVMLKSGKYE VH L
Subjt:  VTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKL

Query:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI
        FTKMK SG T KANTYRVLVKAFWEEG V+GA+EAVRDMEQRGVVGSASVYYELACCLCY+GRW+DALVEVEKMKTLSHMKPLVV FTGMILSSFDGGHI
Subjt:  FTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHI

Query:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS
        DDCISIFEYMKQ CAPNIG INTML+V+GRNDMFSKAKDL+EEIKRKAD SS SS+  SI+PD+YTY SML+AAASA QWEYFENVYREMALSGYRLDQS
Subjt:  DDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQS

Query:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA
        KHAMLLVEASRAGKWYLLDHA+DTILEAGQIPHPLLFTEMILQL  QDNYEQA+TLV+ M YAPFQVSERQWTELFE NTDRIC +NLKKLSDALSDC+A
Subjt:  KHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNA

Query:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG
        SEATVSNLS SL SLCK GIPENTSQS+A D D TDGL + GSEN +N   MKLH D   + C+ SLD+I VNH S N     DSKM PWSLS ++GVL 
Subjt:  SEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLG

Query:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL
        T KFSD SNNE S+FD  DDSEDDEEEL MLLD FDDSY SNLPSV+EILKTW+E+RKNDGL
Subjt:  TQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGL

SwissProt top hitse value%identityAlignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic4.8e-1920.83Show/hide
Query:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL
        M + G+  +E     ++DA    G W+ A  +++    +++     S +V++++LA      + Q+A  +   M+  G + PD   Y+ +  T G+   L
Subjt:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL

Query:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE
           +     MR++                +EPD+V +N +++A            +F ++R+S   P   TY + + ++ +   +E V  + ++MK+ G 
Subjt:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE

Query:  TPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY
         P   TY  LV  +   G+   A++ +  M+  G+  S ++Y+ L       G    AL  V+ MK    ++  ++    +I +  +   + +  S+ ++
Subjt:  TPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY

Query:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI
        M++N   P++    T+++   R + F K   ++EE+
Subjt:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI

Q84ZD2 Pentatricopeptide repeat-containing protein CRP1 homolog, chloroplastic3.7e-1921.73Show/hide
Query:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL
        M + G+  +E     ++DA    G W+ A  +++    +++     S +V++++LA      E Q+A  +   M   G + PD   Y+ +  T G+   L
Subjt:  MIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLL

Query:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE
           +   + MR++                +EPD+V +N +++A            +F ++R+S       TY + + ++ +  ++E V  +  +MK+ G 
Subjt:  KQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGE

Query:  TPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY
         P   TY  LV  +   G+   AV+ +  M+  G+  S ++Y+ L       G    AL  V+ M+    ++   V    +I +  +   I +  S+ ++
Subjt:  TPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEY

Query:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI
        MK+N   P++    T+++   R + F K   ++EE+
Subjt:  MKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLFEEI

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic2.1e-24053.11Show/hide
Query:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE
        P+FEPD+EKIKR LL+ GV P+PKI+ +LRKKEIQKHNR+ KR+ E +A+     +E++KQ + EE  F TLR EYK+F+++I  K GG  GLMVG PWE
Subjt:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE

Query:  RLERVNLKELTG--FRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK
         +ERV LKEL     R   +   LKKE+L+EL+K+ E    ++L+WVLDDDV+++E   + E   FD  KR R +GE +R LVDRLS REI  + WKF +
Subjt:  RLERVNLKELTG--FRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK

Query:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL
        MM +SGLQF E Q+LKI+D LG K  WKQA +VV WVY+ K   H +SRFVYTKLL+VLG AR  QEALQIFN M GD Q+YPDMAAYH IAVTLGQAGL
Subjt:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL

Query:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PTL+WK V WVF +LRK+ ++PNGATYGL+MEVML+SGK++ VH  F KMK SG
Subjt:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG

Query:  ETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE
        E PKA TY+VLV+A W EGK+  AVEAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL + FTG+I +S +GGH+DDC++IF+
Subjt:  ETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE

Query:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE
        YMK  C PNIG  N ML+VYGRNDMFS+AK+LFEEI         S     ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGY++DQ+KHA +L+E
Subjt:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE

Query:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCN-ASEATVSN
        ASRAGKW LL+HA+D +LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ+SE +WT+LFE + D +  DNL KLSD L +C+  SE TVSN
Subjt:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCN-ASEATVSN

Query:  LSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLGTQKFSDP
        LS+SL S C       +S S A                                   + L  + V   S   K E D  +             T +  + 
Subjt:  LSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLGTQKFSDP

Query:  SNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND
        +N E   F   +      EEL+  +D  ++S DS+  SV +ILK W+E  K +
Subjt:  SNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028601.3e-2422.54Show/hide
Query:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR
        +LL  L  LG    +  AL   +W    K +       V   ++++LG       A  +FN ++ DG    D+ +Y S+      +G  ++ + V + M 
Subjt:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR

Query:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHK
        +                      P  KI +   K     + PD   YN ++  C      +    VF +++ +    +  TY   ++V  KS + +   K
Subjt:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHK

Query:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH
        +  +M  +G +P   TY  L+ A+  +G ++ A+E    M ++G       Y  L       G+   A+   E+M+  +  KP +  F   I    + G 
Subjt:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH

Query:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD
          + + IF+ +     +P+I   NT+L V+G+N M S+   +F+E+KR              +P+  T+ +++ A +    +E    VYR M  +G   D
Subjt:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD

Query:  QSKHAMLLVEASRAGKW
         S +  +L   +R G W
Subjt:  QSKHAMLLVEASRAGKW

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic4.9e-9635.58Show/hide
Query:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGD
        I  L   L+  +I M +W+FSK +  + +++ +  +++++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R   EAL +F+ M   
Subjt:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++   KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-
        VML   KY  VH+ F KM+KS   P A  YRVLV   W+EGK + AV  V DME RG+VGSA++YY+LA CLC  GR  + L     V    +K + ++ 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-

Query:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS
                             KPLVV +TG+I +  D G+I +   IF+ MK+ C+PN+   N ML+ Y +  +F +A++LF+++    +    SS   S
Subjt:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS

Query:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK
         ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ + 
Subjt:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK

Query:  AMGYAPFQVSERQW-TELFEANTDRICSDNLKKLSDAL-----SDCNASEATVSNLSRSLHSLCK
         +     +   R + T  +     R   D++ +L D +     S   +S++ + NL  S     K
Subjt:  AMGYAPFQVSERQW-TELFEANTDRICSDNLKKLSDAL-----SDCNASEATVSNLSRSLHSLCK

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein2.9e-1922.09Show/hide
Query:  EWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPD
        +++ N+  H +       T L+       ++++A +I  ++ G G + PD+  Y+ +     +AG +   L V++ M                  +V PD
Subjt:  EWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPD

Query:  LVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQR
        +V YN IL +   + + K    V  ++ +    P+  TY + +E   +     H  KL  +M+  G TP   TY VLV    +EG+++ A++ + DM   
Subjt:  LVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQR

Query:  GVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEYMKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLF
        G   +   +  +   +C  GRW DA   +  M       P VV F  +I      G +   I I E M Q+ C PN  + N +L  + +     +A    
Subjt:  GVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEYMKQN-CAPNIGAINTMLRVYGRNDMFSKAKDLF

Query:  EEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMI
        E ++R   R           PD  TY +ML A     + E    +  +++  G       +  ++   ++AGK        D +      P  + ++ ++
Subjt:  EEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMI

Query:  LQLIVQDNYEQAVTL---VKAMGYAPFQVS
          L  +   ++A+      + MG  P  V+
Subjt:  LQLIVQDNYEQAVTL---VKAMGYAPFQVS

AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein3.5e-9735.58Show/hide
Query:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGD
        I  L   L+  +I M +W+FSK +  + +++ +  +++++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R   EAL +F+ M   
Subjt:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++   KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-
        VML   KY  VH+ F KM+KS   P A  YRVLV   W+EGK + AV  V DME RG+VGSA++YY+LA CLC  GR  + L     V    +K + ++ 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDAL-----VEVEKMKTLSHM-

Query:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS
                             KPLVV +TG+I +  D G+I +   IF+ MK+ C+PN+   N ML+ Y +  +F +A++LF+++    +    SS   S
Subjt:  ---------------------KPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS

Query:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK
         ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ + 
Subjt:  -IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVK

Query:  AMGYAPFQVSERQW-TELFEANTDRICSDNLKKLSDAL-----SDCNASEATVSNLSRSLHSLCK
         +     +   R + T  +     R   D++ +L D +     S   +S++ + NL  S     K
Subjt:  AMGYAPFQVSERQW-TELFEANTDRICSDNLKKLSDAL-----SDCNASEATVSNLSRSLHSLCK

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein1.1e-10037.17Show/hide
Query:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGD
        I  L   L+  +I M +W+FSK +  + +++ +  +++++  LG  G W++ L V+EW+     +  +K R +YT  L VLG +R   EAL +F+ M   
Subjt:  IRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGD

Query:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSME
           YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++   KP+  TYGL ME
Subjt:  GQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSME

Query:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVA
        VML   KY  VH+ F KM+KS   P A  YRVLV   W+EGK + AV  V DME RG+VGSA++YY+LA CLC  GR  + L  ++K+  +++ KPLVV 
Subjt:  VMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVA

Query:  FTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS-IIPDEYTYGSMLEAAASALQWEYFEN
        +TG+I +  D G+I +   IF+ MK+ C+PN+   N ML+ Y +  +F +A++LF+++    +    SS   S ++PD YT+ +ML+  A   +W+ F  
Subjt:  FTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPS-IIPDEYTYGSMLEAAASALQWEYFEN

Query:  VYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQW-TELFEANTDRIC
         YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ +  +     +   R + T  +     R  
Subjt:  VYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQW-TELFEANTDRIC

Query:  SDNLKKLSDAL-----SDCNASEATVSNLSRSLHSLCK
         D++ +L D +     S   +S++ + NL  S     K
Subjt:  SDNLKKLSDAL-----SDCNASEATVSNLSRSLHSLCK

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein9.3e-2622.54Show/hide
Query:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR
        +LL  L  LG    +  AL   +W    K +       V   ++++LG       A  +FN ++ DG    D+ +Y S+      +G  ++ + V + M 
Subjt:  QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR

Query:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHK
        +                      P  KI +   K     + PD   YN ++  C      +    VF +++ +    +  TY   ++V  KS + +   K
Subjt:  QQ---------------------PSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHK

Query:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH
        +  +M  +G +P   TY  L+ A+  +G ++ A+E    M ++G       Y  L       G+   A+   E+M+  +  KP +  F   I    + G 
Subjt:  LFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGH

Query:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD
          + + IF+ +     +P+I   NT+L V+G+N M S+   +F+E+KR              +P+  T+ +++ A +    +E    VYR M  +G   D
Subjt:  IDDCISIFEYMKQ-NCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLD

Query:  QSKHAMLLVEASRAGKW
         S +  +L   +R G W
Subjt:  QSKHAMLLVEASRAGKW

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-24153.11Show/hide
Query:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE
        P+FEPD+EKIKR LL+ GV P+PKI+ +LRKKEIQKHNR+ KR+ E +A+     +E++KQ + EE  F TLR EYK+F+++I  K GG  GLMVG PWE
Subjt:  PKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGG--GLMVGRPWE

Query:  RLERVNLKELTG--FRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK
         +ERV LKEL     R   +   LKKE+L+EL+K+ E    ++L+WVLDDDV+++E   + E   FD  KR R +GE +R LVDRLS REI  + WKF +
Subjt:  RLERVNLKELTG--FRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSK

Query:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL
        MM +SGLQF E Q+LKI+D LG K  WKQA +VV WVY+ K   H +SRFVYTKLL+VLG AR  QEALQIFN M GD Q+YPDMAAYH IAVTLGQAGL
Subjt:  MMIRSGLQFNEGQLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGL

Query:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG
        LK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PTL+WK V WVF +LRK+ ++PNGATYGL+MEVML+SGK++ VH  F KMK SG
Subjt:  LKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSG

Query:  ETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE
        E PKA TY+VLV+A W EGK+  AVEAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL + FTG+I +S +GGH+DDC++IF+
Subjt:  ETPKANTYRVLVKAFWEEGKVNGAVEAVRDMEQRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFE

Query:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE
        YMK  C PNIG  N ML+VYGRNDMFS+AK+LFEEI         S     ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGY++DQ+KHA +L+E
Subjt:  YMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADRSSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVE

Query:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCN-ASEATVSN
        ASRAGKW LL+HA+D +LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ+SE +WT+LFE + D +  DNL KLSD L +C+  SE TVSN
Subjt:  ASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAMGYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCN-ASEATVSN

Query:  LSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLGTQKFSDP
        LS+SL S C       +S S A                                   + L  + V   S   K E D  +             T +  + 
Subjt:  LSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMISVNHLSSNMKVESDSKMAPWSLSFTEGVLGTQKFSDP

Query:  SNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND
        +N E   F   +      EEL+  +D  ++S DS+  SV +ILK W+E  K +
Subjt:  SNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGCTTTAGGCGCAAATGCACCAATTCCTGCACCGAAGTTCGAACCAGATATCGAGAAAATTAAGCGTGCGCTCCTCCAGAAGGGTGTTTATCCGTCTCCCAAAAT
CGTCCGCTCACTTCGCAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAAGCGACAGGCTGAACGACAAGCCGATCAGTCGCCGCCCCTTTCTGAGTCCGAAAAGCAAG
TAATTGCGGAGGAAACCCATTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGGAGGCCGTGGGAG
AGACTAGAAAGAGTTAACCTTAAAGAACTTACCGGTTTTAGAACAGGATACAATGGGGATAATCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAACTGTTTGAGGCTCG
TAAGCTCGAGGAATTGCAGTGGGTTTTAGACGACGATGTAGAACTGAAGGAAGAGTGGTCGGAGAGTGAAAATGGTCATTTCGATACCGTAAAACGGAGGCGCGGCGATG
GAGAGGTTATTCGGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTGCCATGAGAGACTGGAAATTCTCAAAGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAACTACTTAAAATTTTGGATGCCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAGAGTCACAGTCATTCTAAAAGCAG
GTTTGTCTACACAAAGCTCTTAGCTGTTCTTGGGATGGCAAGGGAATCTCAGGAAGCCCTTCAGATATTTAATTTGATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTCTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGGCAGCAACCGTCTAAAAAAATTAGAAACAAG
TGCCGCAAATCTTGGGATCCTGCAGTTGAACCTGACCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCAACCCTTGAGTGGAAAGGTGTCTACTGGGTGTTCAC
CCAATTGAGAAAGAGTTGTGTGAAACCTAATGGAGCGACATATGGACTTTCTATGGAGGTAATGCTTAAATCTGGAAAGTATGAGCATGTCCACAAGCTTTTTACAAAAA
TGAAGAAGAGTGGGGAAACTCCAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAAAGTCAATGGAGCTGTTGAAGCAGTCAGGGATATGGAA
CAGAGAGGAGTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCGAGATGCCTTAGTAGAGGTTGAAAAAATGAAAACACT
ATCACACATGAAACCATTGGTGGTGGCCTTCACCGGCATGATCTTATCTTCCTTTGACGGTGGACATATTGATGATTGTATATCTATTTTCGAGTACATGAAGCAAAATT
GTGCGCCGAATATAGGGGCCATTAATACCATGCTTAGAGTCTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCCGACCGT
TCCTCTCCAAGTAGTTCTGCTCCTTCTATAATCCCAGATGAATATACGTATGGCTCAATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTCGAGAACGTATA
CAGGGAAATGGCTCTGTCGGGATACCGGCTAGATCAAAGTAAACATGCAATGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTGTTAGATCATGCATATGACA
CAATCTTGGAAGCTGGACAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGATAACTATGAGCAGGCTGTCACCTTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACCGAACTTTTTGAAGCAAACACTGACAGAATTTGTTCGGATAACTTGAAGAAACTGTCGGATGCTCTAAGCGA
CTGCAATGCATCAGAAGCCACAGTCTCGAACTTGTCAAGATCACTGCACTCTCTTTGCAAACCTGGCATACCAGAAAACACCTCCCAGTCTGTAGCCTGTGACCGTGATG
CAACAGATGGATTGCCAATTCGTGGTTCTGAAAACATGGAGAACATGGAGACTATGAAGCTTCACCCAGATCGCCATGTGAATCACTGTGATGAGTCATTGGACATGATT
TCTGTTAACCATCTCAGCTCGAACATGAAGGTTGAGAGTGACTCAAAGATGGCCCCTTGGTCACTGAGTTTTACTGAAGGTGTTTTAGGGACTCAGAAATTTTCAGATCC
TTCCAACAATGAGGTCTCGAGTTTTGATTCGTGCGACGACAGTGAAGATGATGAGGAAGAACTTGACATGTTGCTTGATAGATTTGATGATTCTTATGATTCGAACTTGC
CTTCTGTCAATGAAATACTGAAAACCTGGAAGGAAGACAGGAAAAACGATGGGTTAATTAGTACAATTCAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGCTTTAGGCGCAAATGCACCAATTCCTGCACCGAAGTTCGAACCAGATATCGAGAAAATTAAGCGTGCGCTCCTCCAGAAGGGTGTTTATCCGTCTCCCAAAAT
CGTCCGCTCACTTCGCAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAAGCGACAGGCTGAACGACAAGCCGATCAGTCGCCGCCCCTTTCTGAGTCCGAAAAGCAAG
TAATTGCGGAGGAAACCCATTTTCTGACGTTGAGAAGCGAGTACAAGGAGTTCTCGAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGGAGGCCGTGGGAG
AGACTAGAAAGAGTTAACCTTAAAGAACTTACCGGTTTTAGAACAGGATACAATGGGGATAATCTGAAGAAGGAGAGTTTGAGAGAATTGAGGAAACTGTTTGAGGCTCG
TAAGCTCGAGGAATTGCAGTGGGTTTTAGACGACGATGTAGAACTGAAGGAAGAGTGGTCGGAGAGTGAAAATGGTCATTTCGATACCGTAAAACGGAGGCGCGGCGATG
GAGAGGTTATTCGGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTGCCATGAGAGACTGGAAATTCTCAAAGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGT
CAACTACTTAAAATTTTGGATGCCCTTGGTGCTAAGGGATGTTGGAAACAGGCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAGAGTCACAGTCATTCTAAAAGCAG
GTTTGTCTACACAAAGCTCTTAGCTGTTCTTGGGATGGCAAGGGAATCTCAGGAAGCCCTTCAGATATTTAATTTGATGCGTGGAGATGGCCAGATATATCCCGATATGG
CTGCATATCACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTCTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGGCAGCAACCGTCTAAAAAAATTAGAAACAAG
TGCCGCAAATCTTGGGATCCTGCAGTTGAACCTGACCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCAACCCTTGAGTGGAAAGGTGTCTACTGGGTGTTCAC
CCAATTGAGAAAGAGTTGTGTGAAACCTAATGGAGCGACATATGGACTTTCTATGGAGGTAATGCTTAAATCTGGAAAGTATGAGCATGTCCACAAGCTTTTTACAAAAA
TGAAGAAGAGTGGGGAAACTCCAAAGGCAAACACTTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAAAGTCAATGGAGCTGTTGAAGCAGTCAGGGATATGGAA
CAGAGAGGAGTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGCTACAATGGGAGGTGGCGAGATGCCTTAGTAGAGGTTGAAAAAATGAAAACACT
ATCACACATGAAACCATTGGTGGTGGCCTTCACCGGCATGATCTTATCTTCCTTTGACGGTGGACATATTGATGATTGTATATCTATTTTCGAGTACATGAAGCAAAATT
GTGCGCCGAATATAGGGGCCATTAATACCATGCTTAGAGTCTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCCGACCGT
TCCTCTCCAAGTAGTTCTGCTCCTTCTATAATCCCAGATGAATATACGTATGGCTCAATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTCGAGAACGTATA
CAGGGAAATGGCTCTGTCGGGATACCGGCTAGATCAAAGTAAACATGCAATGCTACTTGTGGAAGCATCCAGAGCTGGGAAGTGGTATCTGTTAGATCATGCATATGACA
CAATCTTGGAAGCTGGACAAATTCCCCATCCGCTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGATAACTATGAGCAGGCTGTCACCTTGGTTAAAGCCATG
GGTTATGCTCCATTTCAAGTAAGTGAAAGGCAATGGACCGAACTTTTTGAAGCAAACACTGACAGAATTTGTTCGGATAACTTGAAGAAACTGTCGGATGCTCTAAGCGA
CTGCAATGCATCAGAAGCCACAGTCTCGAACTTGTCAAGATCACTGCACTCTCTTTGCAAACCTGGCATACCAGAAAACACCTCCCAGTCTGTAGCCTGTGACCGTGATG
CAACAGATGGATTGCCAATTCGTGGTTCTGAAAACATGGAGAACATGGAGACTATGAAGCTTCACCCAGATCGCCATGTGAATCACTGTGATGAGTCATTGGACATGATT
TCTGTTAACCATCTCAGCTCGAACATGAAGGTTGAGAGTGACTCAAAGATGGCCCCTTGGTCACTGAGTTTTACTGAAGGTGTTTTAGGGACTCAGAAATTTTCAGATCC
TTCCAACAATGAGGTCTCGAGTTTTGATTCGTGCGACGACAGTGAAGATGATGAGGAAGAACTTGACATGTTGCTTGATAGATTTGATGATTCTTATGATTCGAACTTGC
CTTCTGTCAATGAAATACTGAAAACCTGGAAGGAAGACAGGAAAAACGATGGGTTAATTAGTACAATTCAGTAG
Protein sequenceShow/hide protein sequence
MEALGANAPIPAPKFEPDIEKIKRALLQKGVYPSPKIVRSLRKKEIQKHNRKLKRQAERQADQSPPLSESEKQVIAEETHFLTLRSEYKEFSKAIEAKPGGGLMVGRPWE
RLERVNLKELTGFRTGYNGDNLKKESLRELRKLFEARKLEELQWVLDDDVELKEEWSESENGHFDTVKRRRGDGEVIRFLVDRLSSREIAMRDWKFSKMMIRSGLQFNEG
QLLKILDALGAKGCWKQALSVVEWVYNLKSHSHSKSRFVYTKLLAVLGMARESQEALQIFNLMRGDGQIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNK
CRKSWDPAVEPDLVVYNAILNACIPTLEWKGVYWVFTQLRKSCVKPNGATYGLSMEVMLKSGKYEHVHKLFTKMKKSGETPKANTYRVLVKAFWEEGKVNGAVEAVRDME
QRGVVGSASVYYELACCLCYNGRWRDALVEVEKMKTLSHMKPLVVAFTGMILSSFDGGHIDDCISIFEYMKQNCAPNIGAINTMLRVYGRNDMFSKAKDLFEEIKRKADR
SSPSSSAPSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYRLDQSKHAMLLVEASRAGKWYLLDHAYDTILEAGQIPHPLLFTEMILQLIVQDNYEQAVTLVKAM
GYAPFQVSERQWTELFEANTDRICSDNLKKLSDALSDCNASEATVSNLSRSLHSLCKPGIPENTSQSVACDRDATDGLPIRGSENMENMETMKLHPDRHVNHCDESLDMI
SVNHLSSNMKVESDSKMAPWSLSFTEGVLGTQKFSDPSNNEVSSFDSCDDSEDDEEELDMLLDRFDDSYDSNLPSVNEILKTWKEDRKNDGLISTIQ