; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS011506 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS011506
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold239:2472087..2476267
RNA-Seq ExpressionMS011506
SyntenyMS011506
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008440398.1 PREDICTED: pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucumis melo]0.0e+0080Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+AL++NAPIP+PKFEPD +KIKR LLQKGV P+P+I+R+LRKKEIQK+NRKL RLA     QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VN  ELTG RTGY+ ++LKKE+L ELRKLFE RKLEEL+W LDDDVELK+EWL SENG+ DAVKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLGMARKPQEALQIFNLMRGD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME
        G IYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSME
Subjt:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT
        VMLKSGKYE LH LFTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDALVEV KMKTL HMKPLVVT
Subjt:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV
        FTGMILSSF+GGHIDDCISIFEYM++ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+KAD SS + +V S++PDEYTY SML+AAAS+LQWEYFENV
Subjt:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV

Query:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN
        YREMALSGY+LDQSKHA +LVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  QENYEQAVTLV+ MGYAPFQV+ERQWTELFE + DRIC N
Subjt:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN

Query:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME-------------------NMKVESDSNMAPWSPSL
        NLK+L DAL DCDASEATVSNLSRSL+ LCK  I E TSQS                   ENM+                   NMKV S+SNM+PWSPS+
Subjt:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME-------------------NMKVESDSNMAPWSPSL

Query:  SDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        SD G +GT  FS  SN+E STFD    SEDDE  LNMLLD  DDSYDSN P+ NEIL+TWKEERK DGLFLHPLN
Subjt:  SDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

XP_022133001.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Momordica charantia]0.0e+0097.6Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM
        MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG GLM
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM

Query:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
        VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
Subjt:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW

Query:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIY
        KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKS              RFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIY
Subjt:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIY

Query:  PDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLK
        PDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPT EWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLK
Subjt:  PDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLK

Query:  SGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGM
        SGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGM
Subjt:  SGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGM

Query:  ILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREM
        ILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREM
Subjt:  ILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREM

Query:  ALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKK
        ALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAM YAPFQVT+RQWTELFEVSMDRICWNNLKK
Subjt:  ALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKK

Query:  LSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRS
        LSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGA+GTNTFSGHSNDELSTFDLCV SEDDEEVLNMLLDRS
Subjt:  LSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRS

Query:  DDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        DDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
Subjt:  DDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

XP_023543129.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0080.32Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTNA +P+PKFEPD+EKIKRTLLQKGV P+PKI+R+L KKEIQKHNRKL RLA     QSPPLSESQKQLI EETHF TLRSEYKEFSKAIEA+P 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRT Y+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL SENG+SDAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        MRDWKFSRMMI+SGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLG ARKPQEALQIFNLMRGD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME
        G IYPDMAAYHSIAVTLGQAGLLKQLLK+IECMRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPT EWK VYWVFTQLRK+GLKPNGATYGLSME
Subjt:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT
        VMLKSGKYE +H LFTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDALVEV KMKTL HMKPLVVT
Subjt:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV
        FTGMILSSFDGGHIDDCISIFEYM++ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKAD SS S +V SI+PD+YTY SML+AAASA QWEYFENV
Subjt:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV

Query:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN
        YREMALSGY+LDQSKHA +LVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTE+ILQL  Q+NYEQAVTLV+ M YAPFQV+ERQWTE+FE + DRICWN
Subjt:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN

Query:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQ----------------SENMENMK-----------------------VESDSNMAPWSPS
        NLKKLSDALSDCDASEATVSNLSRSL+ LCK  IPE+TSQ                SENMENMK                       +  DS M PWS S
Subjt:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQ----------------SENMENMK-----------------------VESDSNMAPWSPS

Query:  LSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP
        LSD G + T  FS  SN+E STFDL   SEDDEE L+MLLD  DD YDSN PSV+EILKTWKEERK DGL+LHP
Subjt:  LSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP

XP_038881784.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X1 [Benincasa hispida]0.0e+0080.63Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTN+PIP+PKFEPD+EKIKRTL+ KGV P+P+I+R+LRKKEIQK+NRKL RL    A QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRTGY+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL+SEN   DA++RRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMR--
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLGM+RKPQEALQIF+LMR  
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMR--

Query:  ------GDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNG
              GDG IYPDMAAYHSIAVTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPT EWKGVYWVFTQLRKSGL+PNG
Subjt:  ------GDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNG

Query:  ATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLP
        ATYGLSMEVMLKSGKYE LHKLFTK+KKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV KMKTL 
Subjt:  ATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLP

Query:  HMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASAL
        HMKPLVVTFTGMILSSFDGGHIDDCISIFEYM++ CAPNIGTIN+MLKVYGRNDMF KAKDLFEEIKRKAD SS S +V S++PDEYTYGSMLEAAASAL
Subjt:  HMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASAL

Query:  QWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEV
        QWEYFENVYREMALSGY+LDQSKHA +LVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMIL L  Q+NYEQAVTLV+ MGYAPFQV+ERQWTELFE 
Subjt:  QWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEV

Query:  SMDRICWNNLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKV
        + DRICW NLKKL DAL +CDASEATVSNLSRSL+ LCK  IPE+TSQS                   ENM+                        NMKV
Subjt:  SMDRICWNNLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKV

Query:  ESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        +SDS ++PWS S S EG +GT+ FS  S +ELST DLC  SEDDEE LNMLLD  DDSYDSN PSVNEILKTWKEERKTDGLFLHPLN
Subjt:  ESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

XP_038881786.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X2 [Benincasa hispida]0.0e+0081.36Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTN+PIP+PKFEPD+EKIKRTL+ KGV P+P+I+R+LRKKEIQK+NRKL RL    A QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRTGY+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL+SEN   DA++RRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLGM+RKPQEALQIF+LMRGD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME
        G IYPDMAAYHSIAVTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSME
Subjt:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT
        VMLKSGKYE LHKLFTK+KKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV KMKTL HMKPLVVT
Subjt:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV
        FTGMILSSFDGGHIDDCISIFEYM++ CAPNIGTIN+MLKVYGRNDMF KAKDLFEEIKRKAD SS S +V S++PDEYTYGSMLEAAASALQWEYFENV
Subjt:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV

Query:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN
        YREMALSGY+LDQSKHA +LVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMIL L  Q+NYEQAVTLV+ MGYAPFQV+ERQWTELFE + DRICW 
Subjt:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN

Query:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKVESDSNMAP
        NLKKL DAL +CDASEATVSNLSRSL+ LCK  IPE+TSQS                   ENM+                        NMKV+SDS ++P
Subjt:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKVESDSNMAP

Query:  WSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        WS S S EG +GT+ FS  S +ELST DLC  SEDDEE LNMLLD  DDSYDSN PSVNEILKTWKEERKTDGLFLHPLN
Subjt:  WSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

TrEMBL top hitse value%identityAlignment
A0A1S3B127 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0080Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+AL++NAPIP+PKFEPD +KIKR LLQKGV P+P+I+R+LRKKEIQK+NRKL RLA     QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VN  ELTG RTGY+ ++LKKE+L ELRKLFE RKLEEL+W LDDDVELK+EWL SENG+ DAVKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLGMARKPQEALQIFNLMRGD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME
        G IYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSME
Subjt:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT
        VMLKSGKYE LH LFTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDALVEV KMKTL HMKPLVVT
Subjt:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV
        FTGMILSSF+GGHIDDCISIFEYM++ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+KAD SS + +V S++PDEYTY SML+AAAS+LQWEYFENV
Subjt:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV

Query:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN
        YREMALSGY+LDQSKHA +LVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  QENYEQAVTLV+ MGYAPFQV+ERQWTELFE + DRIC N
Subjt:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN

Query:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME-------------------NMKVESDSNMAPWSPSL
        NLK+L DAL DCDASEATVSNLSRSL+ LCK  I E TSQS                   ENM+                   NMKV S+SNM+PWSPS+
Subjt:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME-------------------NMKVESDSNMAPWSPSL

Query:  SDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        SD G +GT  FS  SN+E STFD    SEDDE  LNMLLD  DDSYDSN P+ NEIL+TWKEERK DGLFLHPLN
Subjt:  SDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

A0A5D3CLI0 Pentatricopeptide repeat-containing protein0.0e+0080Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+AL++NAPIP+PKFEPD +KIKR LLQKGV P+P+I+R+LRKKEIQK+NRKL RLA     QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VN  ELTG RTGY+ ++LKKE+L ELRKLFE RKLEEL+W LDDDVELK+EWL SENG+ DAVKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLGMARKPQEALQIFNLMRGD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME
        G IYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSME
Subjt:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT
        VMLKSGKYE LH LFTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDALVEV KMKTL HMKPLVVT
Subjt:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV
        FTGMILSSF+GGHIDDCISIFEYM++ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+KAD SS + +V S++PDEYTY SML+AAAS+LQWEYFENV
Subjt:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV

Query:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN
        YREMALSGY+LDQSKHA +LVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  QENYEQAVTLV+ MGYAPFQV+ERQWTELFE + DRIC N
Subjt:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN

Query:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME-------------------NMKVESDSNMAPWSPSL
        NLK+L DAL DCDASEATVSNLSRSL+ LCK  I E TSQS                   ENM+                   NMKV S+SNM+PWSPS+
Subjt:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME-------------------NMKVESDSNMAPWSPSL

Query:  SDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        SD G +GT  FS  SN+E STFD    SEDDE  LNMLLD  DDSYDSN P+ NEIL+TWKEERK DGLFLHPLN
Subjt:  SDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

A0A6J1BTT8 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0097.6Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM
        MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG GLM
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM

Query:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
        VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
Subjt:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW

Query:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIY
        KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKS              RFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIY
Subjt:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIY

Query:  PDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLK
        PDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPT EWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLK
Subjt:  PDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLK

Query:  SGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGM
        SGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGM
Subjt:  SGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGM

Query:  ILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREM
        ILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREM
Subjt:  ILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREM

Query:  ALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKK
        ALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAM YAPFQVT+RQWTELFEVSMDRICWNNLKK
Subjt:  ALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKK

Query:  LSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRS
        LSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGA+GTNTFSGHSNDELSTFDLCV SEDDEEVLNMLLDRS
Subjt:  LSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRS

Query:  DDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        DDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
Subjt:  DDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

A0A6J1GG29 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0079.98Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTNA +P+PKFEPD+EKIKRTLLQKGV P+PKI+R+L KKEIQKHNRKL RLA     QSPPLSESQKQLI EET F TLRSEYKEFSKAIEA+P 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRT Y+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELKDEWL SENG SDAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        MRDWKFSRMMI+SGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLG ARKPQEALQIFNLMRGD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME
        G IYPDMAAYHSIAVTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNAC+PT EWK VYWVFTQLRK+GLKPNGATYGLSME
Subjt:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT
        VMLKSGKYE +H LFTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDALVEV KMKTL HMKPLVVT
Subjt:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV
        FTGMILSSFDGGHIDDCISIFEYM++ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKAD SS S +V SI+PD+YTY SML+AAASA QWEYFENV
Subjt:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV

Query:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN
        YREMALSGY+LDQSKHA +LVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ M YAPFQV+ERQWTELFE + DRICWN
Subjt:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN

Query:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMK-----------------------VESDSNMAPWSPS
        NLKKLSDALSDCDASEATV NLS SL+ LCK  IPE+ SQS                ENM+NMK                       +  DS M PWS S
Subjt:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMK-----------------------VESDSNMAPWSPS

Query:  LSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP
        LSD G + T  FS  SN+E STFDL   SEDDEE L+MLLD  DDSYDSN PSV+EILKTWKEERK DGL+LHP
Subjt:  LSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP

A0A6J1ILT7 pentatricopeptide repeat-containing protein At5g67570, chloroplastic0.0e+0079.63Show/hide
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTNA +P+PKFEPD+EKIKRTL+QKGV P+PKI+R+L KKEIQKHNRKL RLA     QSPPLSESQKQLI EETHF TLRSEYKEFSKAIEA+P 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKE TGFRT Y+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL SENG SDAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD
        MRDWKFSRMMI+SGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KS              RFVYTKLLAVLG ARKPQEALQIFNLMRGD
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGD

Query:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME
        G IYPDMAAYHSIAVTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPT EWK VYWVFTQLRK+GLKPNGATYGLSME
Subjt:  GHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSME

Query:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT
        VMLKSGKYE +H LFTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDALVEV KMKTL HMKPLVVT
Subjt:  VMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVT

Query:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV
        FTGMILSSFDGGHIDDCISIFEYM++ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKAD SS S +V SI+PD+YTY SML+AAASA QWEYFENV
Subjt:  FTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENV

Query:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN
        YREMALSGY+LDQSKHA +LVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  Q+NYEQA+TLV+ M YAPFQV+ERQWTELFE + DRICWN
Subjt:  YREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWN

Query:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQ----------------SENMENMK-----------------------VESDSNMAPWSPS
        NLKKLSDALSDCDASEATVSNLS SL+ LCK  IPE+TSQ                SEN +NMK                       +  DS M PWS S
Subjt:  NLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQ----------------SENMENMK-----------------------VESDSNMAPWSPS

Query:  LSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP
        LSD G + T  FS  SN+E STFDL   SEDDEE L+MLLD  DDSY SN PSV+EILKTW+EERK DGL+LHP
Subjt:  LSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP

SwissProt top hitse value%identityAlignment
Q3EDF8 Pentatricopeptide repeat-containing protein At1g099004.6e-1923.93Show/hide
Query:  FKSRQSFLYYEKI-----CT-----FCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRN
        FK  ++ +Y+  +     CT     FCR         LG  RK   A +I  ++ G G + PD+  Y+ +     +AG +   L V++ M          
Subjt:  FKSRQSFLYYEKI-----CT-----FCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRN

Query:  KCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNV
                +V PD+V YN IL +   + + K    V  ++ +    P+  TY + +E   +     H  KL  +M+  G T    TY VLV    +EG +
Subjt:  KCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNV

Query:  NGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVY
        + AI+ + DM   G   +   +  +   +C  GRW DA   +  M       P VVTF  +I      G +   I I E M +  C PN  + N +L  +
Subjt:  NGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVY

Query:  GRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEA
         +     +A    E ++R   R           PD  TY +ML A     + E    +  +++  G       + TV+   ++AGK        D +   
Subjt:  GRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEA

Query:  GQIPHPLLFTEMILQLIVQENYEQAVTL---VKAMGYAPFQVT
           P  + ++ ++  L  +   ++A+      + MG  P  VT
Subjt:  GQIPHPLLFTEMILQLIVQENYEQAVTL---VKAMGYAPFQVT

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic2.2e-24254.33Show/hide
Query:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE
        P+FEPD+EKIKR LL+ GV P+PKIL  LRKKEIQKHNR+  R    ++   +E+QKQ + EE  F+TLR EYK+F+++I  K GG  GLMVG PWE +E
Subjt:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE

Query:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI
         V LKEL     R       LKKENL EL+K+ E    ++L+WVLDDDV++++  LD E    D  KR R +GE +R LVDRLS REI+ + WKF RMM 
Subjt:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI

Query:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYH
        +SGLQF E Q+LKI+D LG +  WKQ+ +VV WVY+ K   H +S              RFVYTKLL+VLG AR+PQEALQIFN M GD  +YPDMAAYH
Subjt:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYH

Query:  SIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHL
         IAVTLGQAGLLK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PT +WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ +
Subjt:  SIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHL

Query:  HKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDG
        H  F KMK SGE  KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL +TFTG+I +S +G
Subjt:  HKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDG

Query:  GHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQL
        GH+DDC++IF+YM+  C PNIGT N MLKVYGRNDMFS+AK+LFEEI  + +          ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGYQ+
Subjt:  GHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQL

Query:  DQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKKLSDALSD
        DQ+KHA++L+EASRAGKW LL+HAFD++LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ++E +WT+LFE   D +  +NL KLSD L +
Subjt:  DQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKKLSDALSD

Query:  CD-ASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDS
        CD  SE TVSNLS+SL+  C S     ++Q     ++  +S          L D      N+ +G + +   T    +G E+ E      +D  ++S DS
Subjt:  CD-ASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDS

Query:  NSPSVNEILKTWKEERKTD
        +S SV +ILK W+E  K +
Subjt:  NSPSVNEILKTWKEERKTD

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic3.0e-1822.33Show/hide
Query:  YTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREW
        Y  LL+   +     EA  +F  M  DG I PD+  Y  +  T G+   L++L KV + + +  S                PD+  YN +L A   +   
Subjt:  YTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREW

Query:  KGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC
        K    VF Q++ +G  PN  TY + + +  +SG+Y+ + +LF +MK S     A TY +L++ F E G     +    DM +  +      Y  +     
Subjt:  KGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC

Query:  YNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRR-NCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVH
          G  +DA  ++ +  T   + P    +TG+I +       ++ +  F  M      P+I T +++L  + R  +  +++ +   +            V 
Subjt:  YNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRR-NCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVH

Query:  SIIP-DEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLV
        S IP +  T+ + +EA     ++E     Y +M  S    D+     VL   S A         F+ +  +  +P  + +  M+      E ++    L+
Subjt:  SIIP-DEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLV

Query:  KAM
        + M
Subjt:  KAM

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic1.8e-9534.6Show/hide
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARK
        I  L   L+  +I+M +W+FS+ +  + +++ +  +++++  LG  G W++ L V+EW   L+    +KS +            R +YT  L VLG +R+
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARK

Query:  PQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKS
        P EAL +F+ M      YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+  ++W+G +WV  QL++ 
Subjt:  PQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKS

Query:  GLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL----
        G KP+  TYGL MEVML   KY  +H+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L    
Subjt:  GLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL----

Query:  ----VEVRKMKTLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK
            V ++ ++ L +                    KPLVVT+TG+I +  D G+I +   IF+ M++ C+PN+ T N MLK Y +  +F +A++LF+++ 
Subjt:  ----VEVRKMKTLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK

Query:  RKADRSSPSCSVHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQL
           +    S    S ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + 
Subjt:  RKADRSSPSCSVHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQL

Query:  IVQENYEQAVTLVKAMGYAPFQVTERQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR
        + + ++  A++ +  +     +   R + T  +   + R   +++ +L D +     S  ++S++ + NL  S +   K+R
Subjt:  IVQENYEQAVTLVKAMGYAPFQVTERQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR

Q9SR00 Pentatricopeptide repeat-containing protein At3g04760, chloroplastic3.9e-1825Show/hide
Query:  YTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREW
        YT L+    +     EAL++ + M   G + PDM  Y++I   + + G++ +  +++  +  +                 EPD++ YN +L A +   +W
Subjt:  YTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREW

Query:  KGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC
        +    + T++      PN  TY + +  + + GK E    L   MK+ G T  A +Y  L+ AF  EG ++ AIE +  M   G +     Y  +   LC
Subjt:  KGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC

Query:  YNGRWQDALVEVRKMKTL---PHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCS
         NG+   AL    K+  +   P+       F+ +  S   G  I     I E M     P+  T N+M+    R  M  +A +L  +++        SC 
Subjt:  YNGRWQDALVEVRKMKTL---PHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCS

Query:  VHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVE
         H   P   TY  +L     A + E   NV   M  +G + +++ + TVL+E
Subjt:  VHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVE

Arabidopsis top hitse value%identityAlignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.3e-2023.93Show/hide
Query:  FKSRQSFLYYEKI-----CT-----FCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRN
        FK  ++ +Y+  +     CT     FCR         LG  RK   A +I  ++ G G + PD+  Y+ +     +AG +   L V++ M          
Subjt:  FKSRQSFLYYEKI-----CT-----FCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRN

Query:  KCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNV
                +V PD+V YN IL +   + + K    V  ++ +    P+  TY + +E   +     H  KL  +M+  G T    TY VLV    +EG +
Subjt:  KCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNV

Query:  NGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVY
        + AI+ + DM   G   +   +  +   +C  GRW DA   +  M       P VVTF  +I      G +   I I E M +  C PN  + N +L  +
Subjt:  NGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVY

Query:  GRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEA
         +     +A    E ++R   R           PD  TY +ML A     + E    +  +++  G       + TV+   ++AGK        D +   
Subjt:  GRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEA

Query:  GQIPHPLLFTEMILQLIVQENYEQAVTL---VKAMGYAPFQVT
           P  + ++ ++  L  +   ++A+      + MG  P  VT
Subjt:  GQIPHPLLFTEMILQLIVQENYEQAVTL---VKAMGYAPFQVT

AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein1.3e-9634.6Show/hide
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARK
        I  L   L+  +I+M +W+FS+ +  + +++ +  +++++  LG  G W++ L V+EW   L+    +KS +            R +YT  L VLG +R+
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARK

Query:  PQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKS
        P EAL +F+ M      YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+  ++W+G +WV  QL++ 
Subjt:  PQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKS

Query:  GLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL----
        G KP+  TYGL MEVML   KY  +H+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L    
Subjt:  GLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL----

Query:  ----VEVRKMKTLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK
            V ++ ++ L +                    KPLVVT+TG+I +  D G+I +   IF+ M++ C+PN+ T N MLK Y +  +F +A++LF+++ 
Subjt:  ----VEVRKMKTLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK

Query:  RKADRSSPSCSVHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQL
           +    S    S ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + 
Subjt:  RKADRSSPSCSVHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQL

Query:  IVQENYEQAVTLVKAMGYAPFQVTERQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR
        + + ++  A++ +  +     +   R + T  +   + R   +++ +L D +     S  ++S++ + NL  S +   K+R
Subjt:  IVQENYEQAVTLVKAMGYAPFQVTERQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein3.2e-10036.1Show/hide
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARK
        I  L   L+  +I+M +W+FS+ +  + +++ +  +++++  LG  G W++ L V+EW   L+    +KS +            R +YT  L VLG +R+
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARK

Query:  PQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKS
        P EAL +F+ M      YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+  ++W+G +WV  QL++ 
Subjt:  PQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKS

Query:  GLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVR
        G KP+  TYGL MEVML   KY  +H+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L  ++
Subjt:  GLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVR

Query:  KMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHS-IIPDEYTYGSML
        K+  + + KPLVVT+TG+I +  D G+I +   IF+ M++ C+PN+ T N MLK Y +  +F +A++LF+++    +    S    S ++PD YT+ +ML
Subjt:  KMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHS-IIPDEYTYGSML

Query:  EAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQ
        +  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ +  +     +   R 
Subjt:  EAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQ

Query:  W-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR
        + T  +   + R   +++ +L D +     S  ++S++ + NL  S +   K+R
Subjt:  W-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR

AT1G74850.1 plastid transcriptionally active 22.1e-1922.33Show/hide
Query:  YTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREW
        Y  LL+   +     EA  +F  M  DG I PD+  Y  +  T G+   L++L KV + + +  S                PD+  YN +L A   +   
Subjt:  YTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREW

Query:  KGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC
        K    VF Q++ +G  PN  TY + + +  +SG+Y+ + +LF +MK S     A TY +L++ F E G     +    DM +  +      Y  +     
Subjt:  KGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLC

Query:  YNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRR-NCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVH
          G  +DA  ++ +  T   + P    +TG+I +       ++ +  F  M      P+I T +++L  + R  +  +++ +   +            V 
Subjt:  YNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRR-NCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVH

Query:  SIIP-DEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLV
        S IP +  T+ + +EA     ++E     Y +M  S    D+     VL   S A         F+ +  +  +P  + +  M+      E ++    L+
Subjt:  SIIP-DEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLV

Query:  KAM
        + M
Subjt:  KAM

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-24354.33Show/hide
Query:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE
        P+FEPD+EKIKR LL+ GV P+PKIL  LRKKEIQKHNR+  R    ++   +E+QKQ + EE  F+TLR EYK+F+++I  K GG  GLMVG PWE +E
Subjt:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE

Query:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI
         V LKEL     R       LKKENL EL+K+ E    ++L+WVLDDDV++++  LD E    D  KR R +GE +R LVDRLS REI+ + WKF RMM 
Subjt:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI

Query:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYH
        +SGLQF E Q+LKI+D LG +  WKQ+ +VV WVY+ K   H +S              RFVYTKLL+VLG AR+PQEALQIFN M GD  +YPDMAAYH
Subjt:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYH

Query:  SIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHL
         IAVTLGQAGLLK+LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PT +WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ +
Subjt:  SIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHL

Query:  HKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDG
        H  F KMK SGE  KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL +TFTG+I +S +G
Subjt:  HKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDG

Query:  GHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQL
        GH+DDC++IF+YM+  C PNIGT N MLKVYGRNDMFS+AK+LFEEI  + +          ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGYQ+
Subjt:  GHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQL

Query:  DQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKKLSDALSD
        DQ+KHA++L+EASRAGKW LL+HAFD++LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ++E +WT+LFE   D +  +NL KLSD L +
Subjt:  DQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKKLSDALSD

Query:  CD-ASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDS
        CD  SE TVSNLS+SL+  C S     ++Q     ++  +S          L D      N+ +G + +   T    +G E+ E      +D  ++S DS
Subjt:  CD-ASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFSGHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDS

Query:  NSPSVNEILKTWKEERKTD
        +S SV +ILK W+E  K +
Subjt:  NSPSVNEILKTWKEERKTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCATTGAGCACAAATGCACCAATTCCTGCACCGAAGTTCGAACCAGATTTGGAGAAAATTAAGCGAACGCTCCTCCAAAAGGGTGTCTGTCCCTCTCCTAAGAT
CCTCCGCGCACTTCGGAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAACCGACTGGCTCATCAGTCGCCGCCCCTTTCTGAGTCCCAAAAGCAGCTAATTGCCGAGG
AAACCCATTTCCGGACTTTGAGAAGCGAGTACAAGGAGTTCTCCAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGCAGGCCTTGGGAGAGACTGGAAGGA
GTAAACCTTAAAGAACTTACCGGTTTCAGAACAGGATACGATGGGGAGAATCTGAAGAAGGAGAATTTGACAGAGTTGAGGAAACTGTTTGAGGCTCGTAAGCTCGAGGA
GTTGCAGTGGGTTTTAGACGACGATGTGGAACTGAAGGATGAGTGGCTGGACAGTGAAAATGGTCGCTCTGATGCCGTAAAACGGAGGCGCGGCGACGGAGAGGTTATTC
GGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTTCCATGAGGGACTGGAAATTCTCCAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGTCAACTACTTAAA
ATTTTGGATGGCCTCGGTGCTAGGGGATGCTGGAAACAGTCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTCATTTTAAAAGCAGGCAATCCTTCCT
TTATTATGAAAAAATTTGTACCTTTTGCAGGTTTGTCTATACAAAGCTCTTAGCTGTTCTGGGGATGGCGAGGAAACCTCAGGAAGCCCTTCAGATATTTAATTTGATGC
GGGGAGATGGCCATATATATCCCGACATGGCTGCATATCACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTTTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGG
CAGCAGCCGTCCAAAAAAATTAGAAACAAGTGCCGAAAATCTTGGGATCCTGCAGTTGAACCCGATCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCCACGCG
CGAGTGGAAAGGTGTCTACTGGGTGTTCACACAGTTGAGAAAGAGTGGTTTGAAACCTAATGGAGCAACATATGGACTTTCTATGGAGGTAATGCTGAAATCTGGAAAGT
ATGAGCATCTCCACAAGCTTTTTACAAAAATGAAGAAGAGTGGGGAAACTCTAAAGGCGAACACGTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTTAAT
GGAGCTATTGAAGCAGTCAGGGATATGGAACAAAGAGGAGTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGTTACAATGGGAGGTGGCAAGATGC
ATTGGTAGAGGTAAGAAAAATGAAAACACTACCACATATGAAACCGTTGGTGGTGACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGCA
TATCTATCTTCGAGTACATGAGGCGAAATTGTGCGCCTAATATAGGGACTATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTA
TTTGAAGAAATAAAGAGAAAAGCTGATCGTTCCTCCCCAAGTTGTTCTGTTCATTCTATAATCCCAGATGAATATACGTATGGCTCGATGCTTGAGGCAGCTGCTAGTGC
ACTCCAGTGGGAATATTTTGAGAATGTATACAGGGAAATGGCTCTGTCTGGATACCAGCTAGATCAAAGTAAACATGCAACGGTACTTGTGGAAGCTTCCAGAGCTGGGA
AGTGGTATCTATTAGATCATGCATTTGACTCAATCTTGGAGGCTGGACAAATTCCCCATCCACTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGAGAACTAT
GAGCAGGCTGTCACCTTGGTTAAAGCCATGGGTTATGCTCCATTCCAAGTAACCGAAAGACAATGGACAGAACTTTTTGAAGTGAGCATGGACAGGATTTGTTGGAATAA
CTTGAAGAAACTATCAGACGCTCTTAGCGACTGTGATGCATCAGAAGCCACAGTCTCGAACTTGTCAAGGTCGCTGCGGGTTCTCTGCAAATCCAGGATACCAGAAGACA
CCTCCCAGTCTGAAAACATGGAGAACATGAAGGTCGAGAGTGACTCAAATATGGCCCCCTGGTCACCGAGTCTTTCTGATGAAGGTGCTGTAGGGACTAACACGTTTTCG
GGTCATTCCAACGATGAGCTCTCGACTTTCGATTTGTGTGTCGGCAGTGAAGATGATGAGGAAGTGCTTAACATGTTACTTGATAGATCTGATGATTCTTATGATTCAAA
CTCGCCTTCTGTTAATGAAATACTGAAAACTTGGAAGGAAGAGAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAAT
mRNA sequenceShow/hide mRNA sequence
ATGGATGCATTGAGCACAAATGCACCAATTCCTGCACCGAAGTTCGAACCAGATTTGGAGAAAATTAAGCGAACGCTCCTCCAAAAGGGTGTCTGTCCCTCTCCTAAGAT
CCTCCGCGCACTTCGGAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAACCGACTGGCTCATCAGTCGCCGCCCCTTTCTGAGTCCCAAAAGCAGCTAATTGCCGAGG
AAACCCATTTCCGGACTTTGAGAAGCGAGTACAAGGAGTTCTCCAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGCAGGCCTTGGGAGAGACTGGAAGGA
GTAAACCTTAAAGAACTTACCGGTTTCAGAACAGGATACGATGGGGAGAATCTGAAGAAGGAGAATTTGACAGAGTTGAGGAAACTGTTTGAGGCTCGTAAGCTCGAGGA
GTTGCAGTGGGTTTTAGACGACGATGTGGAACTGAAGGATGAGTGGCTGGACAGTGAAAATGGTCGCTCTGATGCCGTAAAACGGAGGCGCGGCGACGGAGAGGTTATTC
GGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTTCCATGAGGGACTGGAAATTCTCCAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGTCAACTACTTAAA
ATTTTGGATGGCCTCGGTGCTAGGGGATGCTGGAAACAGTCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTCATTTTAAAAGCAGGCAATCCTTCCT
TTATTATGAAAAAATTTGTACCTTTTGCAGGTTTGTCTATACAAAGCTCTTAGCTGTTCTGGGGATGGCGAGGAAACCTCAGGAAGCCCTTCAGATATTTAATTTGATGC
GGGGAGATGGCCATATATATCCCGACATGGCTGCATATCACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTTTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGG
CAGCAGCCGTCCAAAAAAATTAGAAACAAGTGCCGAAAATCTTGGGATCCTGCAGTTGAACCCGATCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCCACGCG
CGAGTGGAAAGGTGTCTACTGGGTGTTCACACAGTTGAGAAAGAGTGGTTTGAAACCTAATGGAGCAACATATGGACTTTCTATGGAGGTAATGCTGAAATCTGGAAAGT
ATGAGCATCTCCACAAGCTTTTTACAAAAATGAAGAAGAGTGGGGAAACTCTAAAGGCGAACACGTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTTAAT
GGAGCTATTGAAGCAGTCAGGGATATGGAACAAAGAGGAGTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGTTACAATGGGAGGTGGCAAGATGC
ATTGGTAGAGGTAAGAAAAATGAAAACACTACCACATATGAAACCGTTGGTGGTGACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGCA
TATCTATCTTCGAGTACATGAGGCGAAATTGTGCGCCTAATATAGGGACTATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTA
TTTGAAGAAATAAAGAGAAAAGCTGATCGTTCCTCCCCAAGTTGTTCTGTTCATTCTATAATCCCAGATGAATATACGTATGGCTCGATGCTTGAGGCAGCTGCTAGTGC
ACTCCAGTGGGAATATTTTGAGAATGTATACAGGGAAATGGCTCTGTCTGGATACCAGCTAGATCAAAGTAAACATGCAACGGTACTTGTGGAAGCTTCCAGAGCTGGGA
AGTGGTATCTATTAGATCATGCATTTGACTCAATCTTGGAGGCTGGACAAATTCCCCATCCACTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGAGAACTAT
GAGCAGGCTGTCACCTTGGTTAAAGCCATGGGTTATGCTCCATTCCAAGTAACCGAAAGACAATGGACAGAACTTTTTGAAGTGAGCATGGACAGGATTTGTTGGAATAA
CTTGAAGAAACTATCAGACGCTCTTAGCGACTGTGATGCATCAGAAGCCACAGTCTCGAACTTGTCAAGGTCGCTGCGGGTTCTCTGCAAATCCAGGATACCAGAAGACA
CCTCCCAGTCTGAAAACATGGAGAACATGAAGGTCGAGAGTGACTCAAATATGGCCCCCTGGTCACCGAGTCTTTCTGATGAAGGTGCTGTAGGGACTAACACGTTTTCG
GGTCATTCCAACGATGAGCTCTCGACTTTCGATTTGTGTGTCGGCAGTGAAGATGATGAGGAAGTGCTTAACATGTTACTTGATAGATCTGATGATTCTTATGATTCAAA
CTCGCCTTCTGTTAATGAAATACTGAAAACTTGGAAGGAAGAGAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAAT
Protein sequenceShow/hide protein sequence
MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLMVGRPWERLEG
VNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLK
ILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQSFLYYEKICTFCRFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMR
QQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTREWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVN
GAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDL
FEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENY
EQAVTLVKAMGYAPFQVTERQWTELFEVSMDRICWNNLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGAVGTNTFS
GHSNDELSTFDLCVGSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN