; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020796 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020796
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153573:171908..182255
RNA-Seq ExpressionSgr020796
SyntenySgr020796
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019446.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0081.48Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMANANLCIPC E NGFPA++ TQNSH L   SFFPS + G  LN G AK+RV R+RG+KCGAIKASSKGESDI+ ++GNLLEKDFQF+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVMESVR+ RYK+QSDDPN  KMKENASAK A   S S I        DV GN+DVKN    VD +DLF+N+E+I RK DLS N FD+KRKGVTR KDE+
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        KGKVT FDSQVN+KQHEEKR G+  + IEPK  RSN++  +  KANTLDVK E   V   SS K  + IWADD TKP KD LKV K GVQL  NYIPG+K
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V RKKT Q Y+GLSKSGK F E TEES LEVE AAFN+FDA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNW+RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AIQ ME+RGIVGSAALYYDFARCLCSAGRC+EALMQ+EKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMK FCSPNLVT NILLKGYL+
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MF+EA+ELFQN+SE+GRNIS +SDYRDRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A+
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        +TLPPPL+KERFC+ LARG+YSEALSCIS HHS D HHFS+S WLNLLKEKRFPKD+VI+LIHKVSMLL RND PNPV QNLL+S KEFCR+RITVAD +
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE++VCT+E+Q   VMH+
Subjt:  LEDIVCTDETQFTAVMHI

XP_022142514.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Momordica charantia]0.0e+0087.36Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMANAN+CIPC ERNGF A+H TQ+SHNL+  S FPSPI GI LNVG  KNR+FR RGNKCGAI+ SSKGESDIR  NGN+LE DF F+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVMESVR SRYKKQ DDPNKLKMKENASAK A   S S+IDNEK K  DV GNVDVKNMFK VDQK LFNNAER+ RKKDL  N FDNKRKG+TR KDE 
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        +GKVT FDSQVN+KQHEE+RK +  DCIEPKVRR NNE LV  KANTLD+KR+RQRVCDESS KT+E IWAD  TK AK  L+V KSGVQLARNY+PGEK
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V  KKTGQ YQGLSKSGKPF+ESTEES LEVERAA NNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNW+RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQRSSIPNALTYKVLVNTL KEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AIQNME+RGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMK FCSPNLVTYNILLKGYL+
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MFEEARELFQNLSE G++ISTISDY+DRVLPDIYTFN MLDA FA KRWDDFGYFY+QMFLYGYHFNPKRHLRMILEAGRAGKDE+LETTWKHLA  +
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        +TLPPPLVKERFCMKLARG+YSEALSCIS+HHS D HHFSES WLNLLKEK FPKDTVI LIHKVSMLLT N PPNPVFQNLL SCKEFCRTRITVAD+K
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE IVC DETQ  AVMHI
Subjt:  LEDIVCTDETQFTAVMHI

XP_023000737.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita maxima]0.0e+0081.26Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMANANLCIPC E NGF A++ TQNSH L   SFFPS + G  LN G AK+RV R+RG+KCGAIKASSKGESDI+ ++GNLLEKDFQF+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVMESVR+ RYK+QSDDPN  KMKENASAK A   S S I        DV GN+DVKN    VD +DLF+N+ERI RK DLS N FD+KRKGVTR KDE+
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        KGKVT FDSQ+N+KQHEEKR G+  + IEPKV RSN++  +  KANTLDVK E   V   SS K  E IWADD  KP KD LKV K GVQL  NYIPG+K
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V RKKT Q Y+GLSKSGK F E TEES LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNW+RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AIQ ME+RGIVGSAALYYDFARCLCSAGR +EALMQ+EKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMK FCSPNLVT NILLKGYL+
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MF EA+ELFQN+SE+GRNIS +SDYRDRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A+
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        + LPPPL+KERFC+ LARG+YSEALSCIS HHS D HHFS+S WLNLLKEKRFPKD+VIQLIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE++VCT+E+Q  AVMH+
Subjt:  LEDIVCTDETQFTAVMHI

XP_023519692.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0081.59Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMANANLCIPC E NGFPA+H TQNSH L   SFF S + G  LN G AK+RV R+RG+KCGAIKASSKGESDIR ++GNLLE DFQF+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVMESVR+ RYK+QSDDPN  KMKENASAK A   S S I        DV GN+DVK     VDQ+DLF+N+ERI RK DLS N FD+KRKGVTR KDE+
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        KGKVT FDSQVN+KQH EKR G+  + IEPKV RSN++  +  KANTLDVK E   V   SS K  E IWADD TK  KD LKV K GVQL  NYIPG+K
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V RKKT Q Y+GLSKSGK F E TEES LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNW+RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AIQ ME+RGIVGSAALYYDFARCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMK FCSPNLVT NILLKGYL+
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MF+EA+ELFQN+SE+GRNIS +SDYRDRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A+
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        +TLPPPL+KERFC+ LARG+YSEALSCIS HHS D HHFS+S WLNLLKEKRFPKD+VI+LIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE++VCT+E Q  AVMH+
Subjt:  LEDIVCTDETQFTAVMHI

XP_038894404.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 [Benincasa hispida]0.0e+0082.68Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMAN NLCIP  ERNGFPA+H TQNSHN +  SFFPS + G DLN GDAK+RV R+R +KCG+IKASS GESDIR  + NLLE DFQF+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVME+VR  RYK+QSDDPNKL MKENAS K A   S SKIDN KNK  DV GNVDVKNMFK VD+KDLFNN ERI R++DLS N  D+KRKG++R  DEV
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        KGKVT FDSQVN+KQHEEKR  +  +  EPKV R  NE  +  KANTLD+KRE  R  + SS +    IWA+D TKPAKD L   K  VQL RNYI G+K
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V RKKT Q Y+  SKSGK FLE TE+S LEVE AAFNNFDALDIMDKPRVSKMEMEERIQML KRLNGADIDMPEWMF++MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHF+SYPDLVAYHSIAVTLGQAGYM+ELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TG LEKWDPRL+PDIVIYNAVLNACVKRKN EGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AI+NME+RGIVGSAALYYDFARCLCSAGRCKEALMQ+EKICKVA KPLVVTYTGLIQACLDSK+++SAVYIFNHMK FCSPNLVTYN+LLKGYLE
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MFEEARELFQNLSEHGRNIST+SDYRDRVLPDIY FNTMLDASFAEKRWDDFGYFYDQM LYGYHFNPKRHLRMILEA RAGKDELLETTWKHLA A+
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        +T PPPL+KERFCMKLARG+YSEALSCIS+H S DVHHFSES WLNLLKEKRFPKDTVIQLI+KVSMLLTRND PNPVF+NLL+SCKEFCRTRI+VAD++
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE+ VCT+ETQ  AV+ I
Subjt:  LEDIVCTDETQFTAVMHI

TrEMBL top hitse value%identityAlignment
A0A0A0LVN7 Uncharacterized protein0.0e+0080.71Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMAN NLCIP  ER GFP +H T NSHN +  SFFPS + G D ++ DAKNRV R+R +KCG+IKA S GESDI   +GNLLE DFQF+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        +VME+VR  RYK+Q DDPNKL MKEN SAK A   S SKIDN KNK  DV  NVDVKNMFK VD+KDLFNN ERI  +KDLS N FD +RK VTR  D+V
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWA--DDATKPAKDALKVEKSGVQLARNYIPG
        KGK+T F S VN+KQHEEKR  +    IEP+V RSN++  +  KANTL+VK+E  RV D +S KT E IWA  DD  KPAK  LK  K G+QL R+Y PG
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWA--DDATKPAKDALKVEKSGVQLARNYIPG

Query:  EKVDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVI
        +KV RKKT Q Y+G S SGK FLE  E++ LEVE AAFNNFDA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMF++MMRSAKIRYSDHSILRVI
Subjt:  EKVDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVI

Query:  QVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKK
        QVLGKLGNWRRVLQ+IEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKK
Subjt:  QVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKK

Query:  FKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEG
        FKTG LEKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQELKKQ LQPSTSTYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEG
Subjt:  FKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEG

Query:  KTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGY
        KTDEAV+AI+NME RGIVGSAALYYDFARCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSPNLVTYNILLKGY
Subjt:  KTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGY

Query:  LEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAM
        LEH MFEEARELFQNLSE  RNIST+SDYRDRVLPDIY FNTMLDASFAEKRWDDF YFY+QMFLYGYHFNPKRHLRMILEA R GKDELLETTWKHLA 
Subjt:  LEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAM

Query:  ANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVAD
        A++T PPPL+KERFCMKLARG+YSEALS I  H+SGD HHFSES WLNLLKEKRFP+DTVI+LIHKV M+LTRN+ PNPVF+NLL+SCKEFCRTRI++AD
Subjt:  ANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVAD

Query:  NKLEDIV
        ++LE+ V
Subjt:  NKLEDIV

A0A1S3C8Z0 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0079.85Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIF--GIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDE
        MVGVIMAN NL IP  ER GFP +H T NSH  +  SFFPS +   G DLN  DAKNRV R+R +KCG+IKA S GESDI   NGNLLE DFQF+PSFDE
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIF--GIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDE

Query:  YVRVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKD
        YV+VME+VR  RYK+Q D PNKL MKEN SAK A   S SKIDN KNK  DV  NV+VKNMFK VD+KDLFNN ERI R+K LS N FD + KGVTR  D
Subjt:  YVRVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKD

Query:  EVKGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWA--DDATKPAKDALKVEKSGVQLARNYI
        +VKGK+T F S VN+KQHEEK+ G+    IEPKV RSN E  +  KAN L+ K+E  RV   +S KT E IWA  +D  KPAKD LK  K G+QL R+Y 
Subjt:  EVKGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWA--DDATKPAKDALKVEKSGVQLARNYI

Query:  PGEKVDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILR
        PG+KV RKKT Q Y+G S SGK FLE TEE+ LEVE AAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMF++MMR AKIRYSDHSILR
Subjt:  PGEKVDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILR

Query:  VIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPK
        VIQVLGKLGNWRRVLQVIEWLQMRERFKSHK RFIYTTALDVLGKARRPVEALN+FHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPK
Subjt:  VIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPK

Query:  KKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWK
        KKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN EGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWK
Subjt:  KKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWK

Query:  EGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLK
        EGKTDEAV+AI+NME RG+VGSAALYYDFARCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGLIQACLDSK+LQSAVY+FN MK FCSPNLVTYNILLK
Subjt:  EGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLK

Query:  GYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHL
        GYLEH MFEEAREL QNLSE  +NIST+SDYRDRVLPDIY FNTMLDASFAEKRWDDF YFY+QMFLYGYHFNPKRHLRMILEA R GKDELLETTWKHL
Subjt:  GYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHL

Query:  AMANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITV
        A A++T PPPL+KERFCMK+ARG+Y+EAL CIS+H+SGD HHFSES WLNLLKEKRFPKDTVI+LIHKV M+   N+ PNPVF+NLL+SCKEFCRTRI+V
Subjt:  AMANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITV

Query:  ADNKLEDIVCTDE
        AD++LE+ V T+E
Subjt:  ADNKLEDIVCTDE

A0A6J1CLQ9 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0087.36Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMANAN+CIPC ERNGF A+H TQ+SHNL+  S FPSPI GI LNVG  KNR+FR RGNKCGAI+ SSKGESDIR  NGN+LE DF F+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVMESVR SRYKKQ DDPNKLKMKENASAK A   S S+IDNEK K  DV GNVDVKNMFK VDQK LFNNAER+ RKKDL  N FDNKRKG+TR KDE 
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        +GKVT FDSQVN+KQHEE+RK +  DCIEPKVRR NNE LV  KANTLD+KR+RQRVCDESS KT+E IWAD  TK AK  L+V KSGVQLARNY+PGEK
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V  KKTGQ YQGLSKSGKPF+ESTEES LEVERAA NNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNW+RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQRSSIPNALTYKVLVNTL KEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AIQNME+RGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMK FCSPNLVTYNILLKGYL+
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MFEEARELFQNLSE G++ISTISDY+DRVLPDIYTFN MLDA FA KRWDDFGYFY+QMFLYGYHFNPKRHLRMILEAGRAGKDE+LETTWKHLA  +
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        +TLPPPLVKERFCMKLARG+YSEALSCIS+HHS D HHFSES WLNLLKEK FPKDTVI LIHKVSMLLT N PPNPVFQNLL SCKEFCRTRITVAD+K
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE IVC DETQ  AVMHI
Subjt:  LEDIVCTDETQFTAVMHI

A0A6J1EH18 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0080.72Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMANANLCIPC E NGFPA++ TQNSH L   S FPS + G  LN G AK+RV R+RG+KCGAIKASSKGESDI+ ++GNLLEKDFQF+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVMESVR+ RYK+QSDDPN  KMKENASAK A     S I        DV GN+DVKN    VD +DLF+N+E+I RK DLS N FD+KRKGVTR KDE+
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        KGKVT F+SQVN+KQHEEKR G+  + IEPK  RSN++  +  KANTLDVK E   V   SS K  + IWADD +KP KD LKV K GVQL  NYIPG+K
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V RKKT Q Y+GLSKSGK F E TEES LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLS RLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNW+RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AIQ ME+RGIVGSAALYYDFARCLCSAGRC+EALMQ+EKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMK FCSPNLVT NILLKGYL+
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MF+EA+ELFQN+SE+GRNIS +SDYRDRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A+
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        +TLPPPL+KERFC+ LARG+YSEALSCIS HHS D HHFS+S WLNLLKEKRFPKD+VI+LIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE++VCT+E+Q   VMH+
Subjt:  LEDIVCTDETQFTAVMHI

A0A6J1KEH7 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0081.26Show/hide
Query:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV
        MVGVIMANANLCIPC E NGF A++ TQNSH L   SFFPS + G  LN G AK+RV R+RG+KCGAIKASSKGESDI+ ++GNLLEKDFQF+PSFDEYV
Subjt:  MVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYV

Query:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV
        RVMESVR+ RYK+QSDDPN  KMKENASAK A   S S I        DV GN+DVKN    VD +DLF+N+ERI RK DLS N FD+KRKGVTR KDE+
Subjt:  RVMESVRNSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEV

Query:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK
        KGKVT FDSQ+N+KQHEEKR G+  + IEPKV RSN++  +  KANTLDVK E   V   SS K  E IWADD  KP KD LKV K GVQL  NYIPG+K
Subjt:  KGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEK

Query:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV
        V RKKT Q Y+GLSKSGK F E TEES LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQV
Subjt:  VDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQV

Query:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
        LGKLGNW+RVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK
Subjt:  LGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFK

Query:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT
        TGA EKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKT
Subjt:  TGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKT

Query:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE
        DEAV+AIQ ME+RGIVGSAALYYDFARCLCSAGR +EALMQ+EKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMK FCSPNLVT NILLKGYL+
Subjt:  DEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLE

Query:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN
        H MF EA+ELFQN+SE+GRNIS +SDYRDRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A+
Subjt:  HRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMAN

Query:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK
        + LPPPL+KERFC+ LARG+YSEALSCIS HHS D HHFS+S WLNLLKEKRFPKD+VIQLIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +
Subjt:  QTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNK

Query:  LEDIVCTDETQFTAVMHI
        LE++VCT+E+Q  AVMH+
Subjt:  LEDIVCTDETQFTAVMHI

SwissProt top hitse value%identityAlignment
Q3EDF8 Pentatricopeptide repeat-containing protein At1g099002.5e-2225.24Show/hide
Query:  LGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL
        LGK R+  + L I         + PD++ Y+ +     +AG +     V+D M                   + PD+V YN +L +       + A  VL
Subjt:  LGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVL

Query:  QELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQ-RSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKE
          + ++   P   TY +++E            +   +++ R   P+ +TY VLVN + KEG+ DEA+  + +M   G   +   +    R +CS GR  +
Subjt:  QELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQ-RSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKE

Query:  ALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDF-CSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIY
        A   +  + +    P VVT+  LI        L  A+ I   M    C PN ++YN LL G+ + +  + A E  + +   G              PDI 
Subjt:  ALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDF-CSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIY

Query:  TFNTMLDASFAEKRWDD
        T+NTML A   + + +D
Subjt:  TFNTMLDASFAEKRWDD

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic9.8e-11239.78Show/hide
Query:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ
        E +++L  RL+G +I+   W F RMM  + +++++  +L+++  LG+  +W++   V+ W+   ++ K  + RF+YT  L VLG ARRP EAL IF+ M 
Subjt:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ

Query:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV
             YPD+ AYH IAVTLGQAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV    W+   WV  EL+K GL+P+ +TYGL 
Subjt:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV

Query:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV
        MEVML  GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+++MEQ+G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Subjt:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV

Query:  VTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF
        +T+TGLI A L+  ++   + IF +MKD C PN+ T N++LK Y  + MF EA+ELF+ +         +S     ++P+ YT++ ML+AS    +W+ F
Subjt:  VTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF

Query:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFP
         + Y  M L GY  +  +H  M++EA RAGK  LLE  +  +    +   P    E  C   A+G++  A++ I+          SE  W +L +E    
Subjt:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFP

Query:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC
        +D + Q  +HK+S  L   D    P   NL  S K  C
Subjt:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC

Q9LQ16 Pentatricopeptide repeat-containing protein At1g629101.9e-2224.7Show/hide
Query:  LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIV
        L+  E+ K      IY T +D L K +   +ALN+F  M       PD+  Y S+   L   G   +   ++  M            +E+   ++ P++V
Subjt:  LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIV

Query:  IYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGI
         ++A+++A VK      A  +  E+ K+ + P   TY  ++       + +   H F   + +   PN +TY  L+    K  + +E +   + M QRG+
Subjt:  IYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGI

Query:  VGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNL
        VG+   Y         A  C  A M  +++  V   P ++TY  L+     +  L  A+ +F ++ +    P++ TYNI+++G  +    E+  ELF NL
Subjt:  VGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNL

Query:  SEHGRNISTISDYRDRVLPDIYTFNTML
        S  G            V P++  +NTM+
Subjt:  SEHGRNISTISDYRDRVLPDIYTFNTML

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic1.3e-21259.42Show/hide
Query:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV
        G  + G    +  ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNWRRV
Subjt:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV

Query:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR
        LQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALN+FHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MRSPPKKKFK   LEKWDPR
Subjt:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR

Query:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM
        L+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G +PS  TYGL+MEVMLAC KYNLVHEFFRK+Q+SSIPNAL Y+VLVNTLWKEGK+DEAV  +++M
Subjt:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM

Query:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKD
        E RGIVGSAALYYD ARCLCSAGRC E L                             Q++KIC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK 
Subjt:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKD

Query:  FCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR
         CSPNLVT NI+LK YL+  +FEEARELFQ +SE G +I   SD+  RVLPD YTFNTMLD    +++WDDFGY Y +M  +GYHFN KRHLRM+LEA R
Subjt:  FCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR

Query:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHH----SGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP
        AGK+E++E TW+H+  +N+  P PL+KERF  KL +G++  A+S ++D +      ++  FS S W  +L   RF +D+V++L+  V+  L +R++  + 
Subjt:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHH----SGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP

Query:  VFQNLLMSCKEFCRTR
        V  NLL SCK++ +TR
Subjt:  VFQNLLMSCKEFCRTR

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial1.1e-2224.84Show/hide
Query:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN
        IY T +D L K +   +ALN+F  M+      P++V Y S+   L   G   +   ++  M            +E+   ++ PD+  ++A+++A VK   
Subjt:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN

Query:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL
           A  +  E+ K+ + PS  TY  ++       + +   + F   V +   P+ +TY  L+    K  + +E +   + M QRG+VG+   Y    + L
Subjt:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYR
          AG C  A    +++      P ++TY  L+     +  L+ A+ +F ++ +    P + TYNI+++G  +    E+  +LF NLS  G          
Subjt:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTML
          V PD+  +NTM+
Subjt:  DRVLPDIYTFNTML

Arabidopsis top hitse value%identityAlignment
AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein9.2e-21459.42Show/hide
Query:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV
        G  + G    +  ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNWRRV
Subjt:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV

Query:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR
        LQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALN+FHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MRSPPKKKFK   LEKWDPR
Subjt:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR

Query:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM
        L+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G +PS  TYGL+MEVMLAC KYNLVHEFFRK+Q+SSIPNAL Y+VLVNTLWKEGK+DEAV  +++M
Subjt:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM

Query:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKD
        E RGIVGSAALYYD ARCLCSAGRC E L                             Q++KIC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK 
Subjt:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKD

Query:  FCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR
         CSPNLVT NI+LK YL+  +FEEARELFQ +SE G +I   SD+  RVLPD YTFNTMLD    +++WDDFGY Y +M  +GYHFN KRHLRM+LEA R
Subjt:  FCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR

Query:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHH----SGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP
        AGK+E++E TW+H+  +N+  P PL+KERF  KL +G++  A+S ++D +      ++  FS S W  +L   RF +D+V++L+  V+  L +R++  + 
Subjt:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHH----SGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP

Query:  VFQNLLMSCKEFCRTR
        V  NLL SCK++ +TR
Subjt:  VFQNLLMSCKEFCRTR

AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein1.3e-0247.5Show/hide
Query:  EKDFQFEPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN
        +K F+F+PSFD+Y+++MESV+ +R KK+ D   +LK++E+
Subjt:  EKDFQFEPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein1.8e-21762.07Show/hide
Query:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV
        G  + G    +  ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNWRRV
Subjt:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV

Query:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR
        LQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALN+FHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MRSPPKKKFK   LEKWDPR
Subjt:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR

Query:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM
        L+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G +PS  TYGL+MEVMLAC KYNLVHEFFRK+Q+SSIPNAL Y+VLVNTLWKEGK+DEAV  +++M
Subjt:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM

Query:  EQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLEHRMFEEAREL
        E RGIVGSAALYYD ARCLCSAGRC E L  ++KIC+VANKPLVVTYTGLIQAC+DS N+++A YIF+ MK  CSPNLVT NI+LK YL+  +FEEAREL
Subjt:  EQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLEHRMFEEAREL

Query:  FQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKE
        FQ +SE G +I   SD+  RVLPD YTFNTMLD    +++WDDFGY Y +M  +GYHFN KRHLRM+LEA RAGK+E++E TW+H+  +N+  P PL+KE
Subjt:  FQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKE

Query:  RFCMKLARGNYSEALSCISDHH----SGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNPVFQNLLMSCKEFCRTR
        RF  KL +G++  A+S ++D +      ++  FS S W  +L   RF +D+V++L+  V+  L +R++  + V  NLL SCK++ +TR
Subjt:  RFCMKLARGNYSEALSCISDHH----SGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNPVFQNLLMSCKEFCRTR

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein1.3e-0247.5Show/hide
Query:  EKDFQFEPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN
        +K F+F+PSFD+Y+++MESV+ +R KK+ D   +LK++E+
Subjt:  EKDFQFEPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN

AT1G62670.1 rna processing factor 27.9e-2424.84Show/hide
Query:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN
        IY T +D L K +   +ALN+F  M+      P++V Y S+   L   G   +   ++  M            +E+   ++ PD+  ++A+++A VK   
Subjt:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN

Query:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL
           A  +  E+ K+ + PS  TY  ++       + +   + F   V +   P+ +TY  L+    K  + +E +   + M QRG+VG+   Y    + L
Subjt:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYR
          AG C  A    +++      P ++TY  L+     +  L+ A+ +F ++ +    P + TYNI+++G  +    E+  +LF NLS  G          
Subjt:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTML
          V PD+  +NTM+
Subjt:  DRVLPDIYTFNTML

AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-2324.7Show/hide
Query:  LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIV
        L+  E+ K      IY T +D L K +   +ALN+F  M       PD+  Y S+   L   G   +   ++  M            +E+   ++ P++V
Subjt:  LQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIV

Query:  IYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGI
         ++A+++A VK      A  +  E+ K+ + P   TY  ++       + +   H F   + +   PN +TY  L+    K  + +E +   + M QRG+
Subjt:  IYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGI

Query:  VGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNL
        VG+   Y         A  C  A M  +++  V   P ++TY  L+     +  L  A+ +F ++ +    P++ TYNI+++G  +    E+  ELF NL
Subjt:  VGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKNLQSAVYIFNHM-KDFCSPNLVTYNILLKGYLEHRMFEEARELFQNL

Query:  SEHGRNISTISDYRDRVLPDIYTFNTML
        S  G            V P++  +NTM+
Subjt:  SEHGRNISTISDYRDRVLPDIYTFNTML

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.0e-11339.78Show/hide
Query:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ
        E +++L  RL+G +I+   W F RMM  + +++++  +L+++  LG+  +W++   V+ W+   ++ K  + RF+YT  L VLG ARRP EAL IF+ M 
Subjt:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ

Query:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV
             YPD+ AYH IAVTLGQAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV    W+   WV  EL+K GL+P+ +TYGL 
Subjt:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV

Query:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV
        MEVML  GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+++MEQ+G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Subjt:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV

Query:  VTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF
        +T+TGLI A L+  ++   + IF +MKD C PN+ T N++LK Y  + MF EA+ELF+ +         +S     ++P+ YT++ ML+AS    +W+ F
Subjt:  VTYTGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF

Query:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFP
         + Y  M L GY  +  +H  M++EA RAGK  LLE  +  +    +   P    E  C   A+G++  A++ I+          SE  W +L +E    
Subjt:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFP

Query:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC
        +D + Q  +HK+S  L   D    P   NL  S K  C
Subjt:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGGATATTTATCCAAATCCCCTCTGGTCTTGTCCTTTGTAAAGAAGAAACTCGCTTTTGCCTCTCTGTCGGAGCACCGCCAGGTTAAGTACTATCCGAGTTTGCA
ACGGAGCATTAGGTCGGTGCAAGAATCTAAAATGCCTTTGGAGCCGCAGATAGCCGGAGAAGATAGAACAGGAACAGGGAAGAAGACTGATGTAATCGTGTCGCGAGATG
TAGGGAGAGGTTGCTCCTCCGTCATTCTGGAAAATACTCCTACTAGTGACGAGCAGTTAGTTTTTCTTGTTTCACCGTTTGCTGTTCTACTTGCTTTTCTTTTTGGGAGT
AGCTTCGAAATGGTGGGAGTAATAATGGCGAATGCAAATTTGTGTATCCCTTGTTATGAAAGAAATGGATTTCCGGCTGTGCATTATACCCAGAATTCCCATAATTTATG
GGTGGCTTCGTTCTTTCCTAGTCCGATTTTTGGAATTGACTTAAATGTTGGCGACGCAAAGAATAGAGTTTTTCGGAATAGGGGAAATAAATGTGGAGCGATTAAGGCTT
CGTCAAAGGGAGAATCTGATATTCGATCGTCAAATGGGAATCTTCTCGAAAAGGATTTTCAATTTGAGCCATCGTTCGATGAATATGTGAGGGTTATGGAGTCCGTTAGA
AATAGTAGGTATAAGAAGCAGTCGGACGATCCTAATAAGCTGAAGATGAAGGAAAATGCGAGTGCAAAGAGAGCTGTAGGCATTTCCAAGTCTAAAATAGATAATGAAAA
AAACAAAGCGGCTGATGTTCATGGCAATGTTGATGTAAAGAACATGTTTAAACCTGTTGATCAGAAAGATTTGTTCAATAATGCAGAGAGAATTATACGTAAAAAAGATC
TGTCGCGAAACAATTTCGATAACAAAAGGAAAGGAGTTACAAGATTTAAGGACGAGGTTAAAGGCAAGGTGACCCGTTTTGACTCACAGGTTAATGAAAAACAACATGAA
GAGAAAAGGAAAGGACACTTATTTGATTGCATTGAGCCAAAAGTAAGAAGGTCGAACAATGAGACACTAGTTCGTTTGAAGGCTAATACATTGGATGTCAAAAGAGAAAG
GCAACGAGTATGTGATGAAAGTTCCACAAAAACATTGGAAACGATTTGGGCTGATGATGCTACTAAACCAGCTAAGGATGCTCTGAAGGTCGAGAAATCTGGTGTTCAGC
TTGCAAGGAACTATATTCCAGGTGAGAAGGTTGATAGAAAGAAAACTGGGCAGCCCTACCAAGGCTTATCCAAAAGTGGTAAGCCGTTCCTTGAATCTACTGAAGAGAGT
GGCTTGGAGGTAGAACGAGCAGCCTTCAACAATTTTGATGCATTAGACATCATGGATAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAA
GAGATTGAATGGTGCAGACATTGATATGCCCGAGTGGATGTTTGCTCGAATGATGAGGAGTGCAAAGATTAGATATTCAGATCACTCAATATTAAGGGTTATTCAAGTTC
TGGGTAAGCTAGGAAATTGGAGGAGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACACAAGCTCAGGTTTATATACACCACTGCCCTTGAT
GTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAACATATTCCATGCGATGCAGCAACACTTCTCCTCATATCCTGACTTAGTAGCATATCATAGTATTGCTGTCAC
TCTTGGACAAGCAGGATATATGAGGGAACTCTTTGACGTGATTGATAGCATGCGGTCTCCTCCAAAGAAGAAGTTTAAAACAGGGGCACTTGAAAAGTGGGACCCACGGC
TGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAGCGAAAGAATTGGGAAGGGGCATTTTGGGTCTTACAGGAATTGAAGAAACAAGGTCTACAG
CCTTCTACGTCAACATATGGATTGGTCATGGAGGTGATGCTTGCATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAGATCTTCCATTCCCAATGC
TTTAACGTATAAAGTTCTTGTCAATACACTTTGGAAAGAAGGTAAAACCGATGAGGCTGTAGTGGCCATTCAGAACATGGAACAACGAGGGATAGTGGGGTCAGCAGCTC
TTTATTACGACTTTGCTCGTTGTCTTTGCAGTGCTGGAAGGTGCAAAGAAGCCCTAATGCAGATTGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTCGTGACTTAC
ACCGGTTTGATTCAAGCTTGTTTGGATTCAAAAAACTTACAAAGTGCAGTATATATATTCAACCACATGAAGGACTTCTGCTCCCCCAATCTTGTTACTTATAATATATT
GCTGAAAGGTTACCTGGAGCATAGGATGTTCGAAGAAGCTAGAGAGCTGTTCCAGAATTTATCAGAACATGGACGAAATATCAGCACCATATCTGACTATAGAGATCGAG
TATTACCAGATATCTACACGTTCAACACCATGCTAGATGCATCTTTTGCAGAAAAGAGATGGGATGATTTTGGCTATTTCTATGACCAAATGTTTCTTTATGGGTATCAC
TTCAACCCAAAACGTCATCTGCGAATGATATTGGAGGCTGGAAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTATGGCTAACCAGACTCTGCC
GCCTCCACTTGTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGGGGCAACTACTCCGAAGCTCTCTCTTGCATTTCCGATCACCATAGTGGCGATGTGCATCATTTCTCTG
AGTCGGTATGGCTAAATTTACTCAAAGAGAAAAGGTTTCCCAAGGATACTGTTATTCAGTTAATTCATAAGGTTAGCATGCTTCTTACTAGAAATGACCCACCAAATCCA
GTGTTTCAGAATCTGCTAATGAGTTGTAAAGAATTTTGCAGAACTAGAATTACTGTAGCTGACAATAAACTTGAAGACATTGTTTGTACAGATGAAACCCAGTTTACTGC
TGTTATGCATATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGGATATTTATCCAAATCCCCTCTGGTCTTGTCCTTTGTAAAGAAGAAACTCGCTTTTGCCTCTCTGTCGGAGCACCGCCAGGTTAAGTACTATCCGAGTTTGCA
ACGGAGCATTAGGTCGGTGCAAGAATCTAAAATGCCTTTGGAGCCGCAGATAGCCGGAGAAGATAGAACAGGAACAGGGAAGAAGACTGATGTAATCGTGTCGCGAGATG
TAGGGAGAGGTTGCTCCTCCGTCATTCTGGAAAATACTCCTACTAGTGACGAGCAGTTAGTTTTTCTTGTTTCACCGTTTGCTGTTCTACTTGCTTTTCTTTTTGGGAGT
AGCTTCGAAATGGTGGGAGTAATAATGGCGAATGCAAATTTGTGTATCCCTTGTTATGAAAGAAATGGATTTCCGGCTGTGCATTATACCCAGAATTCCCATAATTTATG
GGTGGCTTCGTTCTTTCCTAGTCCGATTTTTGGAATTGACTTAAATGTTGGCGACGCAAAGAATAGAGTTTTTCGGAATAGGGGAAATAAATGTGGAGCGATTAAGGCTT
CGTCAAAGGGAGAATCTGATATTCGATCGTCAAATGGGAATCTTCTCGAAAAGGATTTTCAATTTGAGCCATCGTTCGATGAATATGTGAGGGTTATGGAGTCCGTTAGA
AATAGTAGGTATAAGAAGCAGTCGGACGATCCTAATAAGCTGAAGATGAAGGAAAATGCGAGTGCAAAGAGAGCTGTAGGCATTTCCAAGTCTAAAATAGATAATGAAAA
AAACAAAGCGGCTGATGTTCATGGCAATGTTGATGTAAAGAACATGTTTAAACCTGTTGATCAGAAAGATTTGTTCAATAATGCAGAGAGAATTATACGTAAAAAAGATC
TGTCGCGAAACAATTTCGATAACAAAAGGAAAGGAGTTACAAGATTTAAGGACGAGGTTAAAGGCAAGGTGACCCGTTTTGACTCACAGGTTAATGAAAAACAACATGAA
GAGAAAAGGAAAGGACACTTATTTGATTGCATTGAGCCAAAAGTAAGAAGGTCGAACAATGAGACACTAGTTCGTTTGAAGGCTAATACATTGGATGTCAAAAGAGAAAG
GCAACGAGTATGTGATGAAAGTTCCACAAAAACATTGGAAACGATTTGGGCTGATGATGCTACTAAACCAGCTAAGGATGCTCTGAAGGTCGAGAAATCTGGTGTTCAGC
TTGCAAGGAACTATATTCCAGGTGAGAAGGTTGATAGAAAGAAAACTGGGCAGCCCTACCAAGGCTTATCCAAAAGTGGTAAGCCGTTCCTTGAATCTACTGAAGAGAGT
GGCTTGGAGGTAGAACGAGCAGCCTTCAACAATTTTGATGCATTAGACATCATGGATAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTTTCTAA
GAGATTGAATGGTGCAGACATTGATATGCCCGAGTGGATGTTTGCTCGAATGATGAGGAGTGCAAAGATTAGATATTCAGATCACTCAATATTAAGGGTTATTCAAGTTC
TGGGTAAGCTAGGAAATTGGAGGAGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACACAAGCTCAGGTTTATATACACCACTGCCCTTGAT
GTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAACATATTCCATGCGATGCAGCAACACTTCTCCTCATATCCTGACTTAGTAGCATATCATAGTATTGCTGTCAC
TCTTGGACAAGCAGGATATATGAGGGAACTCTTTGACGTGATTGATAGCATGCGGTCTCCTCCAAAGAAGAAGTTTAAAACAGGGGCACTTGAAAAGTGGGACCCACGGC
TGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAGCGAAAGAATTGGGAAGGGGCATTTTGGGTCTTACAGGAATTGAAGAAACAAGGTCTACAG
CCTTCTACGTCAACATATGGATTGGTCATGGAGGTGATGCTTGCATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAGATCTTCCATTCCCAATGC
TTTAACGTATAAAGTTCTTGTCAATACACTTTGGAAAGAAGGTAAAACCGATGAGGCTGTAGTGGCCATTCAGAACATGGAACAACGAGGGATAGTGGGGTCAGCAGCTC
TTTATTACGACTTTGCTCGTTGTCTTTGCAGTGCTGGAAGGTGCAAAGAAGCCCTAATGCAGATTGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTCGTGACTTAC
ACCGGTTTGATTCAAGCTTGTTTGGATTCAAAAAACTTACAAAGTGCAGTATATATATTCAACCACATGAAGGACTTCTGCTCCCCCAATCTTGTTACTTATAATATATT
GCTGAAAGGTTACCTGGAGCATAGGATGTTCGAAGAAGCTAGAGAGCTGTTCCAGAATTTATCAGAACATGGACGAAATATCAGCACCATATCTGACTATAGAGATCGAG
TATTACCAGATATCTACACGTTCAACACCATGCTAGATGCATCTTTTGCAGAAAAGAGATGGGATGATTTTGGCTATTTCTATGACCAAATGTTTCTTTATGGGTATCAC
TTCAACCCAAAACGTCATCTGCGAATGATATTGGAGGCTGGAAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTTGCTATGGCTAACCAGACTCTGCC
GCCTCCACTTGTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGGGGCAACTACTCCGAAGCTCTCTCTTGCATTTCCGATCACCATAGTGGCGATGTGCATCATTTCTCTG
AGTCGGTATGGCTAAATTTACTCAAAGAGAAAAGGTTTCCCAAGGATACTGTTATTCAGTTAATTCATAAGGTTAGCATGCTTCTTACTAGAAATGACCCACCAAATCCA
GTGTTTCAGAATCTGCTAATGAGTTGTAAAGAATTTTGCAGAACTAGAATTACTGTAGCTGACAATAAACTTGAAGACATTGTTTGTACAGATGAAACCCAGTTTACTGC
TGTTATGCATATTTAG
Protein sequenceShow/hide protein sequence
MGGYLSKSPLVLSFVKKKLAFASLSEHRQVKYYPSLQRSIRSVQESKMPLEPQIAGEDRTGTGKKTDVIVSRDVGRGCSSVILENTPTSDEQLVFLVSPFAVLLAFLFGS
SFEMVGVIMANANLCIPCYERNGFPAVHYTQNSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGESDIRSSNGNLLEKDFQFEPSFDEYVRVMESVR
NSRYKKQSDDPNKLKMKENASAKRAVGISKSKIDNEKNKAADVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHE
EKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDESSTKTLETIWADDATKPAKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEES
GLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALD
VLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQ
PSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTY
TGLIQACLDSKNLQSAVYIFNHMKDFCSPNLVTYNILLKGYLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYH
FNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGNYSEALSCISDHHSGDVHHFSESVWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNP
VFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI