CuGenDBv2

MC01g0936 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g0936
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: MC01:15074736..15080059
RNA-Seq Expression: MC01g0936
Synteny: MC01g0936
Gene Ontology terms:
    GO:0009658 - chloroplast organization (biological process)
    GO:0009507 - chloroplast (cellular component)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like
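
The genome location in this record has the form SEQID:START..END. As a minimal sketch of how such a string could be split and the gene span computed, the Python below parses it with a regular expression. The helper names `parse_location` and `span_length` are hypothetical and not part of CuGenDB, and the length assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("MC01:15074736..15080059")
print(seqid, start, end, span_length(start, end))
# prints: MC01 15074736 15080059 5324
```

Under that coordinate assumption the gene model spans 5324 bp on MC01; with 0-based, half-open coordinates the figure would be 5323.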


Homology
GenBank top hits (e value, % identity, alignment)
KAG6604125.1 Calcium-transporting ATPase 2, plasma membrane-type, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0 | % identity: 81.42
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTNA +P+PKFEPD+EKIKRTLLQKGV P+PKI+R+L KKEIQKHNRKL RLA     QSPPLSESQKQLI EET F TLRSEYKEFSKAIEA+P 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRT Y+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELKDEWL SENG SDAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI
        MRDWKFSRMMI+SGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLG ARKPQEALQIFNLMRGDG IYPDMAAYHSI
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI

Query:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK
        AVTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNAC+PT EWK VYWVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE +H 
Subjt:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK

Query:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH
        LFTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDAL+EV KMKTL HMKPLVVTFTGMILSSFDGGH
Subjt:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH

Query:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ
        IDDCISIFEYM++ CAPNIGTINTMLKV+GR DMFSKAKDL+EEIKRKAD SS SC+V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGY+LDQ
Subjt:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ

Query:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD
        SKHA +LVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ M YAPFQV++RQWTELFE + DRICWNNLKKLSDALSDCD
Subjt:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD

Query:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS
        ASEATVSNLSRSL+ LCK  IPE+T QS                ENMENMK+                         DS M PWS SLSD G L T  FS
Subjt:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS

Query:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP
          SN+E STFDL   SEDDEE L+MLLD  DDSYDSN PSV+EILKTWKEERK DGL+LHP
Subjt:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP

XP_022133001.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Momordica charantia] | e value: 0.0 | % identity: 99.76
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM
        MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG GLM
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM

Query:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
        VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
Subjt:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW

Query:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTL
        KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR FVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTL
Subjt:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTL

Query:  GQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTK
        GQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTK
Subjt:  GQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTK

Query:  MKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDC
        MKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDC
Subjt:  MKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDC

Query:  ISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHA
        ISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHA
Subjt:  ISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHA

Query:  TVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCDASEA
        TVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCDASEA
Subjt:  TVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCDASEA

Query:  TVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNE
        TVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNE
Subjt:  TVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNE

Query:  ILKTWKEERKTDGLFLHPLN
        ILKTWKEERKTDGLFLHPLN
Subjt:  ILKTWKEERKTDGLFLHPLN

XP_023543129.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic [Cucurbita pepo subsp. pepo] | e value: 0.0 | % identity: 81.53
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTNA +P+PKFEPD+EKIKRTLLQKGV P+PKI+R+L KKEIQKHNRKL RLA     QSPPLSESQKQLI EETHF TLRSEYKEFSKAIEA+P 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRT Y+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL SENG+SDAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI
        MRDWKFSRMMI+SGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLG ARKPQEALQIFNLMRGDG IYPDMAAYHSI
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI

Query:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK
        AVTLGQAGLLKQLLK+IECMRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPT EWK VYWVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE +H 
Subjt:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK

Query:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH
        LFTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDALVEV KMKTL HMKPLVVTFTGMILSSFDGGH
Subjt:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH

Query:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ
        IDDCISIFEYM++ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKAD SS S +V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGY+LDQ
Subjt:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ

Query:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD
        SKHA +LVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTE+ILQL  Q+NYEQAVTLV+ M YAPFQV++RQWTE+FE + DRICWNNLKKLSDALSDCD
Subjt:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD

Query:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS
        ASEATVSNLSRSL+ LCK  IPE+TSQS                ENMENMK+                         DS M PWS SLSD G L T  FS
Subjt:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS

Query:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP
          SN+E STFDL   SEDDEE L+MLLD  DD YDSN PSV+EILKTWKEERK DGL+LHP
Subjt:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP

XP_038881784.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X1 [Benincasa hispida] | e value: 0.0 | % identity: 81.71
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTN+PIP+PKFEPD+EKIKRTL+ KGV P+P+I+R+LRKKEIQK+NRKL RL    A QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRTGY+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL+SEN   DA++RRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMR--------GDGHIYP
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLGM+RKPQEALQIF+LMR        GDG IYP
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMR--------GDGHIYP

Query:  DMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKS
        DMAAYHSIAVTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSMEVMLKS
Subjt:  DMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKS

Query:  GKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMI
        GKYE LHKLFTK+KKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV KMKTL HMKPLVVTFTGMI
Subjt:  GKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMI

Query:  LSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMA
        LSSFDGGHIDDCISIFEYM++ CAPNIGTIN+MLKVYGRNDMF KAKDLFEEIKRKAD SS S +V S++PDEYTYGSMLEAAASALQWEYFENVYREMA
Subjt:  LSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMA

Query:  LSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKL
        LSGY+LDQSKHA +LVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMIL L  Q+NYEQAVTLV+ M YAPFQV++RQWTELFE + DRICW NLKKL
Subjt:  LSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKL

Query:  SDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKVESDSNMAPWSPSL
         DAL +CDASEATVSNLSRSL+ LCK  IPE+TSQS                   ENM+                        NMKV+SDS ++PWS S 
Subjt:  SDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKVESDSNMAPWSPSL

Query:  SDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        S EG LGT+ FS  S +ELST DLC  SEDDEE LNMLLD  DDSYDSN PSVNEILKTWKEERKTDGLFLHPLN
Subjt:  SDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

XP_038881786.1 pentatricopeptide repeat-containing protein At5g67570, chloroplastic isoform X2 [Benincasa hispida] | e value: 0.0 | % identity: 82.47
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTN+PIP+PKFEPD+EKIKRTL+ KGV P+P+I+R+LRKKEIQK+NRKL RL    A QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRL----AHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRTGY+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL+SEN   DA++RRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLGM+RKPQEALQIF+LMRGDG IYPDMAAYHSI
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI

Query:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK
        AVTLGQAGLLKQLLKV+ECMRQQPS+K+RNKCRKSWDPAVEPDLVVYNAILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE LHK
Subjt:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK

Query:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH
        LFTK+KKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV KMKTL HMKPLVVTFTGMILSSFDGGH
Subjt:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH

Query:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ
        IDDCISIFEYM++ CAPNIGTIN+MLKVYGRNDMF KAKDLFEEIKRKAD SS S +V S++PDEYTYGSMLEAAASALQWEYFENVYREMALSGY+LDQ
Subjt:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ

Query:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD
        SKHA +LVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMIL L  Q+NYEQAVTLV+ M YAPFQV++RQWTELFE + DRICW NLKKL DAL +CD
Subjt:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD

Query:  ASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKVESDSNMAPWSPSLSDEGALGT
        ASEATVSNLSRSL+ LCK  IPE+TSQS                   ENM+                        NMKV+SDS ++PWS S S EG LGT
Subjt:  ASEATVSNLSRSLRVLCKSRIPEDTSQS-------------------ENME------------------------NMKVESDSNMAPWSPSLSDEGALGT

Query:  NTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
        + FS  S +ELST DLC  SEDDEE LNMLLD  DDSYDSN PSVNEILKTWKEERKTDGLFLHPLN
Subjt:  NTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B127 pentatricopeptide repeat-containing protein At5g67570, chloroplastic | e value: 0.0 | % identity: 81.09
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+AL++NAPIP+PKFEPD +KIKR LLQKGV P+P+I+R+LRKKEIQK+NRKL RLA     QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VN  ELTG RTGY+ ++LKKE+L ELRKLFE RKLEEL+W LDDDVELK+EWL SENG+ DAVKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLGMARKPQEALQIFNLMRGDG IYPDMAAYHSI
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI

Query:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK
        AVTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE LH 
Subjt:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK

Query:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH
        LFTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDALVEV KMKTL HMKPLVVTFTGMILSSF+GGH
Subjt:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH

Query:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ
        IDDCISIFEYM++ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+KAD SS + +V S++PDEYTY SML+AAAS+LQWEYFENVYREMALSGY+LDQ
Subjt:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ

Query:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD
        SKHA +LVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  QENYEQAVTLV+ M YAPFQV++RQWTELFE + DRIC NNLK+L DAL DCD
Subjt:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD

Query:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMK----------------------VESDSNMAPWSPSLSDEGALGTNTFSG
        ASEATVSNLSRSL+ LCK  I E TSQS                +NMENMK                      V S+SNM+PWSPS+SD G LGT  FS 
Subjt:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMK----------------------VESDSNMAPWSPSLSDEGALGTNTFSG

Query:  HSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
         SN+E STFD    SEDDE  LNMLLD  DDSYDSN P+ NEIL+TWKEERK DGLFLHPLN
Subjt:  HSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

A0A5D3CLI0 Pentatricopeptide repeat-containing protein | e value: 0.0 | % identity: 81.09
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+AL++NAPIP+PKFEPD +KIKR LLQKGV P+P+I+R+LRKKEIQK+NRKL RLA     QSPPLSESQKQLIAEETHF TLRSEYKEFSKAIEAKP 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VN  ELTG RTGY+ ++LKKE+L ELRKLFE RKLEEL+W LDDDVELK+EWL SENG+ DAVKRRRGDGEVIRFLVDRLSS  IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI
        MRDWKFSRMMIRSGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLGMARKPQEALQIFNLMRGDG IYPDMAAYHSI
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI

Query:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK
        AVTLGQAGLLKQLLKVIECMRQQPSKK+RNKCRKSWDPAVEPDLV+YN ILNACIPT EWKGVYWVFTQLRKSGL+PNGATYGLSMEVMLKSGKYE LH 
Subjt:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK

Query:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH
        LFTKMKKSGETLKANTYRVLVKAFWEEGN +GAIEAVRDMEQRGVVGSASVYYELACCLCYNG+WQDALVEV KMKTL HMKPLVVTFTGMILSSF+GGH
Subjt:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH

Query:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ
        IDDCISIFEYM++ CAPNIGTINTMLKVYGRNDMFSKAKDLFEEIK+KAD SS + +V S++PDEYTY SML+AAAS+LQWEYFENVYREMALSGY+LDQ
Subjt:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ

Query:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD
        SKHA +LVEAS+AGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  QENYEQAVTLV+ M YAPFQV++RQWTELFE + DRIC NNLK+L DAL DCD
Subjt:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD

Query:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMK----------------------VESDSNMAPWSPSLSDEGALGTNTFSG
        ASEATVSNLSRSL+ LCK  I E TSQS                +NMENMK                      V S+SNM+PWSPS+SD G LGT  FS 
Subjt:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMK----------------------VESDSNMAPWSPSLSDEGALGTNTFSG

Query:  HSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
         SN+E STFD    SEDDE  LNMLLD  DDSYDSN P+ NEIL+TWKEERK DGLFLHPLN
Subjt:  HSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN

A0A6J1BTT8 pentatricopeptide repeat-containing protein At5g67570, chloroplastic | e value: 0.0 | % identity: 99.76
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM
        MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG GLM
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLM

Query:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
        VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW
Subjt:  VGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDW

Query:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTL
        KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR FVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTL
Subjt:  KFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTL

Query:  GQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTK
        GQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTK
Subjt:  GQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTK

Query:  MKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDC
        MKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDC
Subjt:  MKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDC

Query:  ISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHA
        ISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHA
Subjt:  ISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHA

Query:  TVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCDASEA
        TVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCDASEA
Subjt:  TVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCDASEA

Query:  TVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNE
        TVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNE
Subjt:  TVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNE

Query:  ILKTWKEERKTDGLFLHPLN
        ILKTWKEERKTDGLFLHPLN
Subjt:  ILKTWKEERKTDGLFLHPLN

A0A6J1GG29 pentatricopeptide repeat-containing protein At5g67570, chloroplastic | e value: 0.0 | % identity: 81.18
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTNA +P+PKFEPD+EKIKRTLLQKGV P+PKI+R+L KKEIQKHNRKL RLA     QSPPLSESQKQLI EET F TLRSEYKEFSKAIEA+P 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKELTGFRT Y+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELKDEWL SENG SDAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI
        MRDWKFSRMMI+SGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLG ARKPQEALQIFNLMRGDG IYPDMAAYHSI
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI

Query:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK
        AVTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNAC+PT EWK VYWVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE +H 
Subjt:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK

Query:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH
        LFTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDALVEV KMKTL HMKPLVVTFTGMILSSFDGGH
Subjt:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH

Query:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ
        IDDCISIFEYM++ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKAD SS S +V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGY+LDQ
Subjt:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ

Query:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD
        SKHA +LVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  Q+NYEQAVTLV+ M YAPFQV++RQWTELFE + DRICWNNLKKLSDALSDCD
Subjt:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD

Query:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS
        ASEATV NLS SL+ LCK  IPE+ SQS                ENM+NMK+                         DS M PWS SLSD G L T  FS
Subjt:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS

Query:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP
          SN+E STFDL   SEDDEE L+MLLD  DDSYDSN PSV+EILKTWKEERK DGL+LHP
Subjt:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP

A0A6J1ILT7 pentatricopeptide repeat-containing protein At5g67570, chloroplastic | e value: 0.0 | % identity: 80.84
Query:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG
        M+ALSTNA +P+PKFEPD+EKIKRTL+QKGV P+PKI+R+L KKEIQKHNRKL RLA     QSPPLSESQKQLI EETHF TLRSEYKEFSKAIEA+P 
Subjt:  MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAH----QSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPG

Query:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS
        GGLMVGRPWERLE VNLKE TGFRT Y+ +NLKKE+L ELRKLFEARKLEELQWVLDDDVELK+EWL SENG SDAVKRRRGDGEVIRFLVDRLSSR IS
Subjt:  GGLMVGRPWERLEGVNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREIS

Query:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI
        MRDWKFSRMMI+SGLQFNEGQLLKILD LGA+GCWKQ+LSVVEWVYNLKSHSH KSR FVYTKLLAVLG ARKPQEALQIFNLMRGDG IYPDMAAYHSI
Subjt:  MRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSI

Query:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK
        AVTLGQAGLLKQLLK+IE MRQQPSKK+RN CRK WDPAVEPDLV+YNAILNACIPT EWK VYWVFTQLRK+GLKPNGATYGLSMEVMLKSGKYE +H 
Subjt:  AVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHK

Query:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH
        LFTKMK SG TLKANTYRVLVKAFWEEGNV+GAIEAVRDMEQRGVVGSASVYYELACCLCY+GRWQDALVEV KMKTL HMKPLVVTFTGMILSSFDGGH
Subjt:  LFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGH

Query:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ
        IDDCISIFEYM++ CAPNIGTINTMLKV+GRNDMFSKAKDL+EEIKRKAD SS S +V SI+PD+YTY SML+AAASA QWEYFENVYREMALSGY+LDQ
Subjt:  IDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQ

Query:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD
        SKHA +LVEASRAGKWYLLDHAFD+ILEAGQIPHPLLFTEMILQL  Q+NYEQA+TLV+ M YAPFQV++RQWTELFE + DRICWNNLKKLSDALSDCD
Subjt:  SKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD

Query:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS
        ASEATVSNLS SL+ LCK  IPE+TSQS                EN +NMK+                         DS M PWS SLSD G L T  FS
Subjt:  ASEATVSNLSRSLRVLCKSRIPEDTSQS----------------ENMENMKVE-----------------------SDSNMAPWSPSLSDEGALGTNTFS

Query:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP
          SN+E STFDL   SEDDEE L+MLLD  DDSY SN PSV+EILKTW+EERK DGL+LHP
Subjt:  GHSNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHP

SwissProt top hits (e value, % identity, alignment)
Q3EDF8 Pentatricopeptide repeat-containing protein At1g09900 | e value: 1.3e-18 | % identity: 23.44
Query:  KPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRK
        K ++A +I  ++ G G + PD+  Y+ +     +AG +   L V++ M                  +V PD+V YN IL +   + + K    V  ++ +
Subjt:  KPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRK

Query:  SGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV
            P+  TY + +E   +     H  KL  +M+  G T    TY VLV    +EG ++ AI+ + DM   G   +   +  +   +C  GRW DA   +
Subjt:  SGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV

Query:  RKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSM
          M       P VVTF  +I      G +   I I E M +  C PN  + N +L  + +     +A    E ++R   R           PD  TY +M
Subjt:  RKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSM

Query:  LEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAV
        L A     + E    +  +++  G       + TV+   ++AGK        D +      P  + ++ ++  L  +   ++A+
Subjt:  LEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAV

Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | e value: 3.5e-19 | % identity: 21.21
Query:  KQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKS
        K+++S  E V+     S      F Y  L+     A     AL +F+ M   G + P++  Y+++     +   +    K++  M  +            
Subjt:  KQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKS

Query:  WDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIE
            +EP+L+ YN ++N        K V +V T++ + G   +  TY   ++   K G +     +  +M + G T    TY  L+ +  + GN+N A+E
Subjt:  WDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIE

Query:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMR-RNCAPNIGTINTMLKVYGRNDM
         +  M  RG+  +   Y  L       G   +A   +R+M       P VVT+  +I      G ++D I++ E M+ +  +P++ + +T+L  + R+  
Subjt:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMR-RNCAPNIGTINTMLKVYGRNDM

Query:  FSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPH
          +A  +  E+  K            I PD  TY S+++      + +   ++Y EM   G   D+  +  ++      G         + ++E G +P 
Subjt:  FSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPH

Query:  PLLFTEMILQLIVQENYEQAVTLVKAMCY
         + ++ +I  L  Q    +A  L+  + Y
Subjt:  PLLFTEMILQLIVQENYEQAVTLVKAMCY

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic | 1.2e-242 | 55.06
Query:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE
        P+FEPD+EKIKR LL+ GV P+PKIL  LRKKEIQKHNR+  R    ++   +E+QKQ + EE  F+TLR EYK+F+++I  K GG  GLMVG PWE +E
Subjt:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE

Query:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI
         V LKEL     R       LKKENL EL+K+ E    ++L+WVLDDDV++++  LD E    D  KR R +GE +R LVDRLS REI+ + WKF RMM 
Subjt:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI

Query:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLK
        +SGLQF E Q+LKI+D LG +  WKQ+ +VV WVY+ K   H +SR FVYTKLL+VLG AR+PQEALQIFN M GD  +YPDMAAYH IAVTLGQAGLLK
Subjt:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLK

Query:  QLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGET
        +LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PT +WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ +H  F KMK SGE 
Subjt:  QLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGET

Query:  LKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM
         KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL +TFTG+I +S +GGH+DDC++IF+YM
Subjt:  LKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM

Query:  RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEAS
        +  C PNIGT N MLKVYGRNDMFS+AK+LFEEI  + +          ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGYQ+DQ+KHA++L+EAS
Subjt:  RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEAS

Query:  RAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD-ASEATVSNLS
        RAGKW LL+HAFD++LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ+++ +WT+LFE   D +  +NL KLSD L +CD  SE TVSNLS
Subjt:  RAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD-ASEATVSNLS

Query:  RSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGH----SNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEIL
        +SL+  C S     ++Q     ++  +S          L D      N+ +G     +  EL T  L     DD+E          +S DS+S SV +IL
Subjt:  RSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGH----SNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEIL

Query:  KTWKEERKTD
        K W+E  K +
Subjt:  KTWKEERKTD

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic | 3.0e-95 | 35.09
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR--QFVYTKLLAVLGMARKPQEALQIFNLM
        I  L   L+  +I+M +W+FS+ +  + +++ +  +++++  LG  G W++ L V+EW   L+    +KS   + +YT  L VLG +R+P EAL +F+ M
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR--QFVYTKLLAVLGMARKPQEALQIFNLM

Query:  RGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGL
              YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++ G KP+  TYGL
Subjt:  RGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGL

Query:  SMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL--------VEVRKMK
         MEVML   KY  +H+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L        V ++ ++
Subjt:  SMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL--------VEVRKMK

Query:  TLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCS
         L +                    KPLVVT+TG+I +  D G+I +   IF+ M++ C+PN+ T N MLK Y +  +F +A++LF+++    +    S  
Subjt:  TLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCS

Query:  VHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVT
          S ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++
Subjt:  VHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVT

Query:  LVKAMCYAPFQVTKRQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR
         +  +     +   R + T  +   + R   +++ +L D +     S  ++S++ + NL  S +   K+R
Subjt:  LVKAMCYAPFQVTKRQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR

Q9SR00 Pentatricopeptide repeat-containing protein At3g04760, chloroplastic | 1.7e-18 | 24.27
Query:  CWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCR
        C +  L +   V N     + +     YT L+    +     EAL++ + M   G + PDM  Y++I   + + G++ +  +++  +  +          
Subjt:  CWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCR

Query:  KSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGA
               EPD++ YN +L A +   +W+    + T++      PN  TY + +  + + GK E    L   MK+ G T  A +Y  L+ AF  EG ++ A
Subjt:  KSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGA

Query:  IEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTL---PHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYG
        IE +  M   G +     Y  +   LC NG+   AL    K+  +   P+       F+ +  S   G  I     I E M     P+  T N+M+    
Subjt:  IEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTL---PHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYG

Query:  RNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVE
        R  M  +A +L  +++        SC  H   P   TY  +L     A + E   NV   M  +G + +++ + TVL+E
Subjt:  RNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVE

Arabidopsis top hits | e value | % identity | Alignment
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein | 9.4e-20 | 23.44
Query:  KPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRK
        K ++A +I  ++ G G + PD+  Y+ +     +AG +   L V++ M                  +V PD+V YN IL +   + + K    V  ++ +
Subjt:  KPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRK

Query:  SGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV
            P+  TY + +E   +     H  KL  +M+  G T    TY VLV    +EG ++ AI+ + DM   G   +   +  +   +C  GRW DA   +
Subjt:  SGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEV

Query:  RKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSM
          M       P VVTF  +I      G +   I I E M +  C PN  + N +L  + +     +A    E ++R   R           PD  TY +M
Subjt:  RKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM-RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSM

Query:  LEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAV
        L A     + E    +  +++  G       + TV+   ++AGK        D +      P  + ++ ++  L  +   ++A+
Subjt:  LEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAV

AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein | 2.1e-96 | 35.09
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR--QFVYTKLLAVLGMARKPQEALQIFNLM
        I  L   L+  +I+M +W+FS+ +  + +++ +  +++++  LG  G W++ L V+EW   L+    +KS   + +YT  L VLG +R+P EAL +F+ M
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR--QFVYTKLLAVLGMARKPQEALQIFNLM

Query:  RGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGL
              YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++ G KP+  TYGL
Subjt:  RGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGL

Query:  SMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL--------VEVRKMK
         MEVML   KY  +H+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L        V ++ ++
Subjt:  SMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDAL--------VEVRKMK

Query:  TLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCS
         L +                    KPLVVT+TG+I +  D G+I +   IF+ M++ C+PN+ T N MLK Y +  +F +A++LF+++    +    S  
Subjt:  TLPHM-------------------KPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCS

Query:  VHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVT
          S ++PD YT+ +ML+  A   +W+ F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++
Subjt:  VHS-IIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVT

Query:  LVKAMCYAPFQVTKRQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR
         +  +     +   R + T  +   + R   +++ +L D +     S  ++S++ + NL  S +   K+R
Subjt:  LVKAMCYAPFQVTKRQW-TELFEVSMDRICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein | 4.1e-100 | 36.65
Query:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR--QFVYTKLLAVLGMARKPQEALQIFNLM
        I  L   L+  +I+M +W+FS+ +  + +++ +  +++++  LG  G W++ L V+EW   L+    +KS   + +YT  L VLG +R+P EAL +F+ M
Subjt:  IRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSR--QFVYTKLLAVLGMARKPQEALQIFNLM

Query:  RGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGL
              YPDM AY SIAVTLGQAG +K+L  VI+ MR  P KK +    + WDP +EPD+VVYNA+LNAC+   +W+G +WV  QL++ G KP+  TYGL
Subjt:  RGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGL

Query:  SMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPL
         MEVML   KY  +H+ F KM+KS     A  YRVLV   W+EG  + A+  V DME RG+VGSA++YY+LA CLC  GR  + L  ++K+  + + KPL
Subjt:  SMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPL

Query:  VVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHS-IIPDEYTYGSMLEAAASALQWEY
        VVT+TG+I +  D G+I +   IF+ M++ C+PN+ T N MLK Y +  +F +A++LF+++    +    S    S ++PD YT+ +ML+  A   +W+ 
Subjt:  VVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHS-IIPDEYTYGSMLEAAASALQWEY

Query:  FENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQW-TELFEVSMD
        F   YREM   GY  +  +H  +++EASRAGK  +++  ++ +  + +IP   L  E   + + + ++  A++ +  +     +   R + T  +   + 
Subjt:  FENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQW-TELFEVSMD

Query:  RICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR
        R   +++ +L D +     S  ++S++ + NL  S +   K+R
Subjt:  RICWNNLKKLSDAL-----SDCDASEATVSNLSRSLRVLCKSR

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.5e-20 | 21.21
Query:  KQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKS
        K+++S  E V+     S      F Y  L+     A     AL +F+ M   G + P++  Y+++     +   +    K++  M  +            
Subjt:  KQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRKS

Query:  WDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIE
            +EP+L+ YN ++N        K V +V T++ + G   +  TY   ++   K G +     +  +M + G T    TY  L+ +  + GN+N A+E
Subjt:  WDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIE

Query:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMR-RNCAPNIGTINTMLKVYGRNDM
         +  M  RG+  +   Y  L       G   +A   +R+M       P VVT+  +I      G ++D I++ E M+ +  +P++ + +T+L  + R+  
Subjt:  AVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMR-RNCAPNIGTINTMLKVYGRNDM

Query:  FSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPH
          +A  +  E+  K            I PD  TY S+++      + +   ++Y EM   G   D+  +  ++      G         + ++E G +P 
Subjt:  FSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPH

Query:  PLLFTEMILQLIVQENYEQAVTLVKAMCY
         + ++ +I  L  Q    +A  L+  + Y
Subjt:  PLLFTEMILQLIVQENYEQAVTLVKAMCY

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 8.8e-244 | 55.06
Query:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE
        P+FEPD+EKIKR LL+ GV P+PKIL  LRKKEIQKHNR+  R    ++   +E+QKQ + EE  F+TLR EYK+F+++I  K GG  GLMVG PWE +E
Subjt:  PKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNR-LAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGG--GLMVGRPWERLE

Query:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI
         V LKEL     R       LKKENL EL+K+ E    ++L+WVLDDDV++++  LD E    D  KR R +GE +R LVDRLS REI+ + WKF RMM 
Subjt:  GVNLKELTG--FRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMI

Query:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLK
        +SGLQF E Q+LKI+D LG +  WKQ+ +VV WVY+ K   H +SR FVYTKLL+VLG AR+PQEALQIFN M GD  +YPDMAAYH IAVTLGQAGLLK
Subjt:  RSGLQFNEGQLLKILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLK

Query:  QLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGET
        +LLKVIE MRQ+P+K  +N  +K+WDP +EPDLVVYNAILNAC+PT +WK V WVF +LRK+GL+PNGATYGL+MEVML+SGK++ +H  F KMK SGE 
Subjt:  QLLKVIECMRQQPSKKIRNKCRKSWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGET

Query:  LKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM
         KA TY+VLV+A W EG +  A+EAVRDMEQ+GV+G+ SVYYELACCLC NGRW DA++EV +MK L + +PL +TFTG+I +S +GGH+DDC++IF+YM
Subjt:  LKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRGVVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYM

Query:  RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEAS
        +  C PNIGT N MLKVYGRNDMFS+AK+LFEEI  + +          ++P+EYTY  MLEA+A +LQWEYFE+VY+ M LSGYQ+DQ+KHA++L+EAS
Subjt:  RRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSPSCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEAS

Query:  RAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD-ASEATVSNLS
        RAGKW LL+HAFD++LE G+IPHPL FTE++     + ++++A+TL+  +  A FQ+++ +WT+LFE   D +  +NL KLSD L +CD  SE TVSNLS
Subjt:  RAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYAPFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCD-ASEATVSNLS

Query:  RSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGH----SNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEIL
        +SL+  C S     ++Q     ++  +S          L D      N+ +G     +  EL T  L     DD+E          +S DS+S SV +IL
Subjt:  RSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGH----SNDELSTFDLCVSSEDDEEVLNMLLDRSDDSYDSNSPSVNEIL

Query:  KTWKEERKTD
        K W+E  K +
Subjt:  KTWKEERKTD


Sequences
CDS sequence
ATGGATGCATTGAGCACAAATGCACCAATTCCTGCACCGAAGTTCGAACCAGATTTGGAGAAAATTAAGCGAACGCTCCTCCAAAAGGGTGTCTGTCCCTCTCCTAAGAT
CCTCCGCGCACTTCGGAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAACCGACTGGCTCATCAGTCGCCGCCCCTTTCTGAGTCCCAAAAGCAGCTAATTGCCGAGG
AAACCCATTTCCGGACTTTGAGAAGCGAGTACAAGGAGTTCTCCAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGCAGGCCTTGGGAGAGACTGGAAGGA
GTAAACCTTAAAGAACTTACCGGTTTCAGAACAGGATACGATGGGGAGAATCTGAAGAAGGAGAATTTGACAGAGTTGAGGAAACTGTTTGAGGCTCGTAAGCTCGAGGA
GTTGCAGTGGGTTTTAGACGACGATGTGGAACTGAAGGATGAGTGGCTGGACAGTGAAAATGGTCGCTCTGATGCCGTAAAACGGAGGCGCGGCGACGGAGAGGTTATTC
GGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTTCCATGAGGGACTGGAAATTCTCCAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGTCAACTACTTAAA
ATTTTGGATGGCCTCGGTGCTAGGGGATGCTGGAAACAGTCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTCATTTTAAAAGCAGGCAGTTTGTCTA
TACAAAGCTCTTAGCTGTTCTGGGGATGGCGAGGAAACCTCAGGAAGCCCTTCAGATATTTAATTTGATGCGGGGAGATGGCCATATATATCCCGACATGGCTGCATATC
ACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTTTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGGCAGCAGCCGTCCAAAAAAATTAGAAACAAGTGCCGAAAA
TCTTGGGATCCTGCAGTTGAACCCGATCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCCACGCACGAGTGGAAAGGTGTCTACTGGGTGTTCACACAGTTGAG
AAAGAGTGGTTTGAAACCTAATGGAGCAACATATGGACTTTCTATGGAGGTAATGCTGAAATCTGGAAAGTATGAGCATCTCCACAAGCTTTTTACAAAAATGAAGAAGA
GTGGGGAAACTCTAAAGGCGAACACGTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTTAATGGAGCTATTGAAGCAGTCAGGGATATGGAACAAAGAGGA
GTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGTTACAATGGGAGGTGGCAAGATGCATTGGTAGAGGTAAGAAAAATGAAAACACTACCACATAT
GAAACCGTTGGTGGTGACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGCATATCTATCTTCGAGTACATGAGGCGAAATTGTGCGCCTA
ATATAGGGACTATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCTGATCGTTCCTCCCCA
AGTTGTTCTGTTCATTCTATAATCCCAGATGAATATACGTATGGCTCGATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTTGAGAATGTATACAGGGAAAT
GGCTCTGTCTGGATACCAGCTAGATCAAAGTAAACATGCAACGGTACTTGTGGAAGCTTCCAGAGCTGGGAAGTGGTATCTATTAGATCATGCATTTGACTCAATCTTGG
AGGCTGGACAAATTCCCCATCCACTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGAGAACTATGAGCAGGCTGTCACCTTGGTTAAAGCCATGTGTTATGCT
CCATTCCAAGTAACCAAAAGACAATGGACAGAACTTTTTGAAGTGAGCATGGACAGGATTTGTTGGAATAACTTGAAGAAACTATCAGACGCTCTTAGCGACTGTGATGC
ATCAGAAGCCACAGTCTCGAACTTGTCAAGGTCGCTGCGGGTTCTCTGCAAATCCAGGATACCAGAGGACACCTCCCAGTCTGAAAACATGGAGAACATGAAGGTCGAGA
GTGACTCAAATATGGCCCCCTGGTCACCGAGTCTTTCTGATGAAGGTGCTCTAGGGACTAACACGTTTTCGGGTCATTCCAACGATGAGCTCTCGACTTTCGATTTGTGT
GTCAGCAGTGAAGATGATGAGGAAGTGCTTAACATGTTACTTGATAGATCTGATGATTCTTATGATTCAAACTCGCCTTCTGTTAATGAAATACTGAAAACTTGGAAGGA
AGAGAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAATTAG
mRNA sequence
GTTACAAACTTTAGGAATTAAATTGGGAGAACAGAGAAATTAAATTTTCCACTAAATTAGGATATGGAATTTGGAGCAATGTGATTTATTTCAATTTCGCATCTTTTCAT
ATTACTTCTCTTAGTTTCTCCTCTCCCTCCCCGATGTGAAATTCTAAAGCTCATTTGATAGAAGTACCAAGTAGTGATGGGTGTACGTAGTTGATACTTGATATTAGTTC
ATCCAATTAGCTCGGTTCAGAACGACGAAAGCGATAATAGCCTTTGCTCTTCCACACTTATCCGTTGCCCTCCAGCCGCGTTTCTGTCGAAGCAATTGGTATCTTTGCTT
CCAAAAATGGATGCATTGAGCACAAATGCACCAATTCCTGCACCGAAGTTCGAACCAGATTTGGAGAAAATTAAGCGAACGCTCCTCCAAAAGGGTGTCTGTCCCTCTCC
TAAGATCCTCCGCGCACTTCGGAAGAAAGAAATTCAGAAGCACAACCGCAAACTCAACCGACTGGCTCATCAGTCGCCGCCCCTTTCTGAGTCCCAAAAGCAGCTAATTG
CCGAGGAAACCCATTTCCGGACTTTGAGAAGCGAGTACAAGGAGTTCTCCAAGGCCATAGAGGCGAAACCAGGCGGTGGCTTGATGGTCGGCAGGCCTTGGGAGAGACTG
GAAGGAGTAAACCTTAAAGAACTTACCGGTTTCAGAACAGGATACGATGGGGAGAATCTGAAGAAGGAGAATTTGACAGAGTTGAGGAAACTGTTTGAGGCTCGTAAGCT
CGAGGAGTTGCAGTGGGTTTTAGACGACGATGTGGAACTGAAGGATGAGTGGCTGGACAGTGAAAATGGTCGCTCTGATGCCGTAAAACGGAGGCGCGGCGACGGAGAGG
TTATTCGGTTCCTTGTTGACAGGCTCAGTTCGAGGGAGATTTCCATGAGGGACTGGAAATTCTCCAGGATGATGATACGGTCAGGATTGCAGTTTAATGAAGGTCAACTA
CTTAAAATTTTGGATGGCCTCGGTGCTAGGGGATGCTGGAAACAGTCCTTGTCAGTGGTCGAATGGGTGTACAATCTTAAAAGTCACAGTCATTTTAAAAGCAGGCAGTT
TGTCTATACAAAGCTCTTAGCTGTTCTGGGGATGGCGAGGAAACCTCAGGAAGCCCTTCAGATATTTAATTTGATGCGGGGAGATGGCCATATATATCCCGACATGGCTG
CATATCACAGTATCGCTGTTACGCTGGGTCAAGCTGGTCTTTTGAAACAATTGCTGAAAGTTATTGAATGCATGAGGCAGCAGCCGTCCAAAAAAATTAGAAACAAGTGC
CGAAAATCTTGGGATCCTGCAGTTGAACCCGATCTTGTTGTATATAATGCTATTTTGAACGCATGCATTCCCACGCACGAGTGGAAAGGTGTCTACTGGGTGTTCACACA
GTTGAGAAAGAGTGGTTTGAAACCTAATGGAGCAACATATGGACTTTCTATGGAGGTAATGCTGAAATCTGGAAAGTATGAGCATCTCCACAAGCTTTTTACAAAAATGA
AGAAGAGTGGGGAAACTCTAAAGGCGAACACGTACAGAGTTCTTGTCAAAGCTTTTTGGGAGGAAGGAAATGTTAATGGAGCTATTGAAGCAGTCAGGGATATGGAACAA
AGAGGAGTAGTTGGATCTGCCAGTGTCTATTATGAACTAGCTTGTTGTCTATGTTACAATGGGAGGTGGCAAGATGCATTGGTAGAGGTAAGAAAAATGAAAACACTACC
ACATATGAAACCGTTGGTGGTGACCTTCACTGGCATGATCTTATCTTCCTTTGATGGTGGACATATTGATGATTGCATATCTATCTTCGAGTACATGAGGCGAAATTGTG
CGCCTAATATAGGGACTATAAATACCATGCTTAAAGTTTATGGCCGAAATGATATGTTTTCTAAAGCTAAAGATTTATTTGAAGAAATAAAGAGAAAAGCTGATCGTTCC
TCCCCAAGTTGTTCTGTTCATTCTATAATCCCAGATGAATATACGTATGGCTCGATGCTTGAGGCAGCTGCTAGTGCACTCCAGTGGGAATATTTTGAGAATGTATACAG
GGAAATGGCTCTGTCTGGATACCAGCTAGATCAAAGTAAACATGCAACGGTACTTGTGGAAGCTTCCAGAGCTGGGAAGTGGTATCTATTAGATCATGCATTTGACTCAA
TCTTGGAGGCTGGACAAATTCCCCATCCACTGTTGTTCACAGAAATGATATTGCAGCTTATAGTTCAAGAGAACTATGAGCAGGCTGTCACCTTGGTTAAAGCCATGTGT
TATGCTCCATTCCAAGTAACCAAAAGACAATGGACAGAACTTTTTGAAGTGAGCATGGACAGGATTTGTTGGAATAACTTGAAGAAACTATCAGACGCTCTTAGCGACTG
TGATGCATCAGAAGCCACAGTCTCGAACTTGTCAAGGTCGCTGCGGGTTCTCTGCAAATCCAGGATACCAGAGGACACCTCCCAGTCTGAAAACATGGAGAACATGAAGG
TCGAGAGTGACTCAAATATGGCCCCCTGGTCACCGAGTCTTTCTGATGAAGGTGCTCTAGGGACTAACACGTTTTCGGGTCATTCCAACGATGAGCTCTCGACTTTCGAT
TTGTGTGTCAGCAGTGAAGATGATGAGGAAGTGCTTAACATGTTACTTGATAGATCTGATGATTCTTATGATTCAAACTCGCCTTCTGTTAATGAAATACTGAAAACTTG
GAAGGAAGAGAGGAAAACCGATGGGTTATTTCTCCACCCTTTGAATTAGTACACTTTAGTGCAGTCCTTTTATGTGTCTGTTTATTGTACATATCTACACAATTTGCATA
TGCATAGCTTTAAATCATTGCATTTTATCAAAATAATAAGGCGCAACAAGTGTTTGAATTCTTACCACTTGTCTCTTTCAAGGTTGAATCTATAGTGTCTTGGAATGATA
TTGAGATTGAGA
Protein sequence
MDALSTNAPIPAPKFEPDLEKIKRTLLQKGVCPSPKILRALRKKEIQKHNRKLNRLAHQSPPLSESQKQLIAEETHFRTLRSEYKEFSKAIEAKPGGGLMVGRPWERLEG
VNLKELTGFRTGYDGENLKKENLTELRKLFEARKLEELQWVLDDDVELKDEWLDSENGRSDAVKRRRGDGEVIRFLVDRLSSREISMRDWKFSRMMIRSGLQFNEGQLLK
ILDGLGARGCWKQSLSVVEWVYNLKSHSHFKSRQFVYTKLLAVLGMARKPQEALQIFNLMRGDGHIYPDMAAYHSIAVTLGQAGLLKQLLKVIECMRQQPSKKIRNKCRK
SWDPAVEPDLVVYNAILNACIPTHEWKGVYWVFTQLRKSGLKPNGATYGLSMEVMLKSGKYEHLHKLFTKMKKSGETLKANTYRVLVKAFWEEGNVNGAIEAVRDMEQRG
VVGSASVYYELACCLCYNGRWQDALVEVRKMKTLPHMKPLVVTFTGMILSSFDGGHIDDCISIFEYMRRNCAPNIGTINTMLKVYGRNDMFSKAKDLFEEIKRKADRSSP
SCSVHSIIPDEYTYGSMLEAAASALQWEYFENVYREMALSGYQLDQSKHATVLVEASRAGKWYLLDHAFDSILEAGQIPHPLLFTEMILQLIVQENYEQAVTLVKAMCYA
PFQVTKRQWTELFEVSMDRICWNNLKKLSDALSDCDASEATVSNLSRSLRVLCKSRIPEDTSQSENMENMKVESDSNMAPWSPSLSDEGALGTNTFSGHSNDELSTFDLC
VSSEDDEEVLNMLLDRSDDSYDSNSPSVNEILKTWKEERKTDGLFLHPLN
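
The CDS and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the variables `cds_start` and `cds_end` are illustrative and not part of the database. It translates the first ten codons and the last nine codons of the CDS listed above, which should reproduce the start and end of the listed protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 27 nt of the CDS, copied from the listing above.
cds_start = "ATGGATGCATTGAGCACAAATGCACCAATT"
cds_end = "GGGTTATTTCTCCACCCTTTGAATTAG"

print(translate(cds_start))  # MDALSTNAPI
print(translate(cds_end))    # GLFLHPLN
```

Passing the full CDS (all lines joined) to the same function would allow the whole protein sequence to be compared in one step.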