; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012809 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012809
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153561:21460..31652
RNA-Seq ExpressionSgr012809
SyntenySgr012809
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019446.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0080.79Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        NSH L   SFFPS + G  LN G AK+RV R+RG+KCGAIKASSKG+SDI+ ++GNLLEKDFQFKPSFDEYVRVMESVR+ RYK+QSDDPN  KMKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE  S S I       TDV GN+DVKN    VD +DLF+N+E+I RK DLS N FD+KRKGVTR KDE+KGKVT FDSQVN+KQHEEKR G+  + I
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRE--------RQRVCDE---------TKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPK  RSN++  +  KANTLDVK E          ++ D+         TKD LKV K GVQL  NYIPG+KV RKKT Q Y+GLSKSGK F E TEES 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRE--------RQRVCDE---------TKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVE AAFN+FDA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AIQ ME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGRC+EALMQ+EKICKVANKPLVVTYTGLIQACLDSK LQSAVYIFNHMK FCSPNLVT NILLKG L+H MF+EA+ELFQN+SE+GRNIS +SDYR
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A++TLPPPL+KERFC+ LARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S HHSSD HHFS+SAWLNLLKEKRFPKD+VI+LIHKVSMLL RND PNPV QNLL+S KEFCR+RITVAD +LE++VCT+E+Q   VMH+
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

XP_022142514.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Momordica charantia]0.0e+0086.63Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        +SHNL+  S FPSPI GI LNVG  KNR+FR RGNKCGAI+ SSKG+SDIR  NGN+LE DF FKPSFDEYVRVMESVR SRYKKQ DDPNKLKMKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE  S S+IDNEK K TDV GNVDVKNMFK VDQK LFNNAER+ RKKDL  N FDNKRKG+TR KDE +GKVT FDSQVN+KQHEE+RK +  DCI
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRERQRVCDET-----------------KDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPKVRR NNE LV  KANTLD+KR+RQRVCDE+                 K  L+V KSGVQLARNY+PGEKV  KKTGQ YQGLSKSGKPF+ESTEES 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRERQRVCDET-----------------KDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVERAA NNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQRSSIPNALTYKVLVNTL KEGKTDEAV+AIQNME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSK L SAVYIFNHMK FCSPNLVTYNILLKG L+H MFEEARELFQNLSE G++ISTISDY+
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIYTFN MLDA FA KRWDDFGYFY+QMFLYGYHFNPKRHLRMILEAGRAGKDE+LETTWKHLA  ++TLPPPLVKERFCMKLARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S+HHSSD HHFSESAWLNLLKEK FPKDTVI LIHKVSMLLT N PPNPVFQNLL SCKEFCRTRITVAD+KLE IVC DETQ  AVMHI
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

XP_023000737.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita maxima]0.0e+0080.67Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        NSH L   SFFPS + G  LN G AK+RV R+RG+KCGAIKASSKG+SDI+ ++GNLLEKDFQFKPSFDEYVRVMESVR+ RYK+QSDDPN  KMKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE  S S I       TDV GN+DVKN    VD +DLF+N+ERI RK DLS N FD+KRKGVTR KDE+KGKVT FDSQ+N+KQHEEKR G+  + I
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRERQRV-----------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPKV RSN++  +  KANTLDVK E   V                    TKD LKV K GVQL  NYIPG+KV RKKT Q Y+GLSKSGK F E TEES 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRERQRV-----------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AIQ ME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGR +EALMQ+EKICKVANKPLVVTYTGLIQACLDSK LQSAVYIFNHMK FCSPNLVT NILLKG L+H MF EA+ELFQN+SE+GRNIS +SDYR
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A++ LPPPL+KERFC+ LARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S HHSSD HHFS+SAWLNLLKEKRFPKD+VIQLIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +LE++VCT+E+Q  AVMH+
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

XP_023519692.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0080.79Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        NSH L   SFF S + G  LN G AK+RV R+RG+KCGAIKASSKG+SDIR ++GNLLE DFQFKPSFDEYVRVMESVR+ RYK+QSDDPN  KMKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE  S S I       TDV GN+DVK     VDQ+DLF+N+ERI RK DLS N FD+KRKGVTR KDE+KGKVT FDSQVN+KQH EKR G+  + I
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRERQRV-----------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPKV RSN++  +  KANTLDVK E   V                    TKD LKV K GVQL  NYIPG+KV RKKT Q Y+GLSKSGK F E TEES 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRERQRV-----------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AIQ ME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGRCKEALMQ+EKICKVANKPLVVTYTGLIQACLDSK LQSAVYIFNHMK FCSPNLVT NILLKG L+H MF+EA+ELFQN+SE+GRNIS +SDYR
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A++TLPPPL+KERFC+ LARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S HHSSD HHFS+SAWLNLLKEKRFPKD+VI+LIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +LE++VCT+E Q  AVMH+
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

XP_038894404.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 [Benincasa hispida]0.0e+0082.25Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        NSHN +  SFFPS + G DLN GDAK+RV R+R +KCG+IKASS G+SDIR  + NLLE DFQFKPSFDEYVRVME+VR  RYK+QSDDPNKL MKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
         KSAE  S SKIDN KNK TDV GNVDVKNMFK VD+KDLFNN ERI R++DLS N  D+KRKG++R  DEVKGKVT FDSQVN+KQHEEKR  +  +  
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRERQRVC--------------DET---KDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPKV R  NE  +  KANTLD+KRE  R                D+T   KD L   K  VQL RNYI G+KV RKKT Q Y+  SKSGK FLE TE+S 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRERQRVC--------------DET---KDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVE AAFNNFDALDIMDKPRVSKMEMEERIQML KRLNGADIDMPEWMF++MMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQQHF+SYPDLVAYHSIAVTLGQAGYM+ELFDVIDSMRSPPKKKFKTG LEKWDPRL+PDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KN EGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AI+NME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGRCKEALMQ+EKICKVA KPLVVTYTGLIQACLDSK ++SAVYIFNHMK FCSPNLVTYN+LLKG LEH MFEEARELFQNLSEHGRNIST+SDYR
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIY FNTMLDASFAEKRWDDFGYFYDQM LYGYHFNPKRHLRMILEA RAGKDELLETTWKHLA A++T PPPL+KERFCMKLARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S+H SSDVHHFSES WLNLLKEKRFPKDTVIQLI+KVSMLLTRND PNPVF+NLL+SCKEFCRTRI+VAD++LE+ VCT+ETQ  AV+ I
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

TrEMBL top hitse value%identityAlignment
A0A0A0LVN7 Uncharacterized protein0.0e+0079.98Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        NSHN +  SFFPS + G D ++ DAKNRV R+R +KCG+IKA S G+SDI   +GNLLE DFQFKPSFDEYV+VME+VR  RYK+Q DDPNKL MKEN S
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE  S SKIDN KNK TDV  NVDVKNMFK VD+KDLFNN ERI  +KDLS N FD +RK VTR  D+VKGK+T F S VN+KQHEEKR  +    I
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRERQRVCD-------------------ETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEE
        EP+V RSN++  +  KANTL+VK+E  RV D                     K  LK  K G+QL R+Y PG+KV RKKT Q Y+G S SGK FLE  E+
Subjt:  EPKVRRSNNETLVRLKANTLDVKRERQRVCD-------------------ETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEE

Query:  SGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSH
        + LEVE AAFNNFDA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMF++MMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWLQMRERFKSH
Subjt:  SGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSH

Query:  KLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACV
        KLRFIYTTALDVLGKARRPVEALN+FHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTG LEKWDPRLQPDIVIYNAVLNACV
Subjt:  KLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACV

Query:  KRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFA
        KRKN EGAFWVLQELKKQ LQPSTSTYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AI+NME RGIVGSAALYYDFA
Subjt:  KRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFA

Query:  RCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISD
        RCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGLIQACLDSK LQSAVYIFNHMK FCSPNLVTYNILLKG LEH MFEEARELFQNLSE  RNIST+SD
Subjt:  RCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISD

Query:  YRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALS
        YRDRVLPDIY FNTMLDASFAEKRWDDF YFY+QMFLYGYHFNPKRHLRMILEA R GKDELLETTWKHLA A++T PPPL+KERFCMKLARGDYSEALS
Subjt:  YRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALS

Query:  CISDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIV
         I  H+S D HHFSESAWLNLLKEKRFP+DTVI+LIHKV M+LTRN+ PNPVF+NLL+SCKEFCRTRI++AD++LE+ V
Subjt:  CISDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIV

A0A1S3C8Z0 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0079.32Show/hide
Query:  NSHNLWVASFFPSPIF--GIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN
        NSH  +  SFFPS +   G DLN  DAKNRV R+R +KCG+IKA S G+SDI   NGNLLE DFQFKPSFDEYV+VME+VR  RYK+Q D PNKL MKEN
Subjt:  NSHNLWVASFFPSPIF--GIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN

Query:  ASAKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFD
         SAKSAE  S SKIDN KNK TDV  NV+VKNMFK VD+KDLFNN ERI R+K LS N FD + KGVTR  D+VKGK+T F S VN+KQHEEK+ G+   
Subjt:  ASAKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFD

Query:  CIEPKVRRSNNETLVRLKANTLDVKRERQRV-------------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLEST
         IEPKV RSN E  +  KAN L+ K+E  RV                       KD LK  K G+QL R+Y PG+KV RKKT Q Y+G S SGK FLE T
Subjt:  CIEPKVRRSNNETLVRLKANTLDVKRERQRV-------------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLEST

Query:  EESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFK
        EE+ LEVE AAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMF++MMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFK
Subjt:  EESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFK

Query:  SHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNA
        SHK RFIYTTALDVLGKARRPVEALN+FHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNA
Subjt:  SHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNA

Query:  CVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYD
        CVKRKN EGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AI+NME RG+VGSAALYYD
Subjt:  CVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYD

Query:  FARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTI
        FARCLCSAGRCKEALMQ+EKICKVANKPLVVTYTGLIQACLDSK LQSAVY+FN MK FCSPNLVTYNILLKG LEH MFEEAREL QNLSE  +NIST+
Subjt:  FARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTI

Query:  SDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEA
        SDYRDRVLPDIY FNTMLDASFAEKRWDDF YFY+QMFLYGYHFNPKRHLRMILEA R GKDELLETTWKHLA A++T PPPL+KERFCMK+ARGDY+EA
Subjt:  SDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEA

Query:  LSCISDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDE
        L CIS+H+S D HHFSESAWLNLLKEKRFPKDTVI+LIHKV M+   N+ PNPVF+NLL+SCKEFCRTRI+VAD++LE+ V T+E
Subjt:  LSCISDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDE

A0A6J1CLQ9 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0086.63Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        +SHNL+  S FPSPI GI LNVG  KNR+FR RGNKCGAI+ SSKG+SDIR  NGN+LE DF FKPSFDEYVRVMESVR SRYKKQ DDPNKLKMKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE  S S+IDNEK K TDV GNVDVKNMFK VDQK LFNNAER+ RKKDL  N FDNKRKG+TR KDE +GKVT FDSQVN+KQHEE+RK +  DCI
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRERQRVCDET-----------------KDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPKVRR NNE LV  KANTLD+KR+RQRVCDE+                 K  L+V KSGVQLARNY+PGEKV  KKTGQ YQGLSKSGKPF+ESTEES 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRERQRVCDET-----------------KDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVERAA NNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVML CGKYNLVHEFFRKVQRSSIPNALTYKVLVNTL KEGKTDEAV+AIQNME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSK L SAVYIFNHMK FCSPNLVTYNILLKG L+H MFEEARELFQNLSE G++ISTISDY+
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIYTFN MLDA FA KRWDDFGYFY+QMFLYGYHFNPKRHLRMILEAGRAGKDE+LETTWKHLA  ++TLPPPLVKERFCMKLARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S+HHSSD HHFSESAWLNLLKEK FPKDTVI LIHKVSMLLT N PPNPVFQNLL SCKEFCRTRITVAD+KLE IVC DETQ  AVMHI
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

A0A6J1EH18 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0080.11Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        NSH L   S FPS + G  LN G AK+RV R+RG+KCGAIKASSKG+SDI+ ++GNLLEKDFQFKPSFDEYVRVMESVR+ RYK+QSDDPN  KMKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE    S I       TDV GN+DVKN    VD +DLF+N+E+I RK DLS N FD+KRKGVTR KDE+KGKVT F+SQVN+KQHEEKR G+  + I
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRE--------RQRVCDE---------TKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPK  RSN++  +  KANTLDVK E          ++ D+         TKD LKV K GVQL  NYIPG+KV RKKT Q Y+GLSKSGK F E TEES 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRE--------RQRVCDE---------TKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLS RLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AIQ ME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGRC+EALMQ+EKICKVANKPLVVTYTGLIQACLDSK LQSAVYIFNHMK FCSPNLVT NILLKG L+H MF+EA+ELFQN+SE+GRNIS +SDYR
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A++TLPPPL+KERFC+ LARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S HHSSD HHFS+SAWLNLLKEKRFPKD+VI+LIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +LE++VCT+E+Q   VMH+
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

A0A6J1KEH7 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0080.67Show/hide
Query:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS
        NSH L   SFFPS + G  LN G AK+RV R+RG+KCGAIKASSKG+SDI+ ++GNLLEKDFQFKPSFDEYVRVMESVR+ RYK+QSDDPN  KMKENAS
Subjt:  NSHNLWVASFFPSPIFGIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENAS

Query:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI
        AKSAE  S S I       TDV GN+DVKN    VD +DLF+N+ERI RK DLS N FD+KRKGVTR KDE+KGKVT FDSQ+N+KQHEEKR G+  + I
Subjt:  AKSAEGISKSKIDNEKNKATDVHGNVDVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCI

Query:  EPKVRRSNNETLVRLKANTLDVKRERQRV-----------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG
        EPKV RSN++  +  KANTLDVK E   V                    TKD LKV K GVQL  NYIPG+KV RKKT Q Y+GLSKSGK F E TEES 
Subjt:  EPKVRRSNNETLVRLKANTLDVKRERQRV-----------------CDETKDALKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESG

Query:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL
        LEVE AAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFA+MMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQMRERFKSHKL
Subjt:  LEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKL

Query:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR
        RFIYTTALDVLGKARRPVEALN+FHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGA EKWDPRLQPDIVIYNAVLNACVKR
Subjt:  RFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKR

Query:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC
        KNWEGAFWVLQELK+QGLQPST+TYGLVMEVML CGKYNLVHEFFRKVQ+SSIPNALTYKVLVNTLWKEGKTDEAV+AIQ ME+RGIVGSAALYYDFARC
Subjt:  KNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARC

Query:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        LCSAGR +EALMQ+EKICKVANKPLVVTYTGLIQACLDSK LQSAVYIFNHMK FCSPNLVT NILLKG L+H MF EA+ELFQN+SE+GRNIS +SDYR
Subjt:  LCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI
        DRVLPDIYTFNTMLDASFAEKRWDDF +FY+QM LYGYHFNPKRHLRMI+EA R GKDELLETTWKHLA A++ LPPPL+KERFC+ LARGDYSEALSCI
Subjt:  DRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCI

Query:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI
        S HHSSD HHFS+SAWLNLLKEKRFPKD+VIQLIHKVSMLL RND PNPV QNLL+S KEFCR+RI+VAD +LE++VCT+E+Q  AVMH+
Subjt:  SDHHSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI

SwissProt top hitse value%identityAlignment
Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic4.6e-11139.78Show/hide
Query:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ
        E +++L  RL+G +I+   W F RMM  + +++++  +L+++  LG+  +W++   V+ W+   ++ K  + RF+YT  L VLG ARRP EAL IF+ M 
Subjt:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ

Query:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV
             YPD+ AYH IAVTLGQAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV    W+   WV  EL+K GL+P+ +TYGL 
Subjt:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV

Query:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV
        MEVML  GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+++MEQ+G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Subjt:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV

Query:  VTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF
        +T+TGLI A L+   +   + IF +MKD C PN+ T N++LK    + MF EA+ELF+ +         +S     ++P+ YT++ ML+AS    +W+ F
Subjt:  VTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF

Query:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHHSSDVHHFSESAWLNLLKEKRFP
         + Y  M L GY  +  +H  M++EA RAGK  LLE  +  +    +   P    E  C   A+GD+  A++ I+    +     SE  W +L +E    
Subjt:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHHSSDVHHFSESAWLNLLKEKRFP

Query:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC
        +D + Q  +HK+S  L   D    P   NL  S K  C
Subjt:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC

Q9FMD3 Pentatricopeptide repeat-containing protein At5g16640, mitochondrial2.8e-2324.2Show/hide
Query:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN
        IY T +D L K+++   AL++ + M++     PD+V Y+S+   L  +G   +   ++  M                   + PD+  +NA+++ACVK   
Subjt:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN

Query:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL
           A    +E+ ++ L P   TY L++  +    + +   E F   V +   P+ +TY +L+N   K  K +  +     M QRG+V +   Y    +  
Subjt:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        C AG+   A     ++      P ++TY  L+    D+  ++ A+ I   M K+    ++VTYNI+++G  +     +A +++ +L+  G          
Subjt:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTML
          ++PDI+T+ TM+
Subjt:  DRVLPDIYTFNTML

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic2.7e-21259.42Show/hide
Query:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV
        G  + G    +  ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNWRRV
Subjt:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV

Query:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR
        LQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALN+FHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MRSPPKKKFK   LEKWDPR
Subjt:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR

Query:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM
        L+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G +PS  TYGL+MEVMLAC KYNLVHEFFRK+Q+SSIPNAL Y+VLVNTLWKEGK+DEAV  +++M
Subjt:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM

Query:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKD
        E RGIVGSAALYYD ARCLCSAGRC E L                             Q++KIC+VANKPLVVTYTGLIQAC+DS  +++A YIF+ MK 
Subjt:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKD

Query:  FCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR
         CSPNLVT NI+LK  L+  +FEEARELFQ +SE G +I   SD+  RVLPD YTFNTMLD    +++WDDFGY Y +M  +GYHFN KRHLRM+LEA R
Subjt:  FCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR

Query:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHH----SSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP
        AGK+E++E TW+H+  +N+  P PL+KERF  KL +GD+  A+S ++D +     +++  FS SAW  +L   RF +D+V++L+  V+  L +R++  + 
Subjt:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHH----SSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP

Query:  VFQNLLMSCKEFCRTR
        V  NLL SCK++ +TR
Subjt:  VFQNLLMSCKEFCRTR

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic6.0e-0250Show/hide
Query:  EKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN
        +K F+FKPSFD+Y+++MESV+ +R KK+ D   +LK++E+
Subjt:  EKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN

Q9SH26 Pentatricopeptide repeat-containing protein At1g634001.6e-2325.16Show/hide
Query:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN
        IY+T +D L K R   +ALN+F  M+      P+++ Y S+   L             +  R     +  +  +E+   ++ P++V +NA+++A VK   
Subjt:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN

Query:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL
           A  +  E+ K+ + P   TY  ++       + +   H F   + +   PN +TY  L+N   K  + DE V   + M QRG+VG+   Y       
Subjt:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
          A  C  A M  +++      P ++TY  L+     +  L+ A+ +F ++ +    P + TYNI+++G  +    E+  +LF +LS  G          
Subjt:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTML
          V PD+  +NTM+
Subjt:  DRVLPDIYTFNTML

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial1.0e-2224.84Show/hide
Query:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN
        IY T +D L K +   +ALN+F  M+      P++V Y S+   L   G   +   ++  M            +E+   ++ PD+  ++A+++A VK   
Subjt:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN

Query:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL
           A  +  E+ K+ + PS  TY  ++       + +   + F   V +   P+ +TY  L+    K  + +E +   + M QRG+VG+   Y    + L
Subjt:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
          AG C  A    +++      P ++TY  L+     +  L+ A+ +F ++ +    P + TYNI+++G  +    E+  +LF NLS  G          
Subjt:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTML
          V PD+  +NTM+
Subjt:  DRVLPDIYTFNTML

Arabidopsis top hitse value%identityAlignment
AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein1.9e-21359.42Show/hide
Query:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV
        G  + G    +  ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNWRRV
Subjt:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV

Query:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR
        LQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALN+FHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MRSPPKKKFK   LEKWDPR
Subjt:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR

Query:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM
        L+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G +PS  TYGL+MEVMLAC KYNLVHEFFRK+Q+SSIPNAL Y+VLVNTLWKEGK+DEAV  +++M
Subjt:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM

Query:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKD
        E RGIVGSAALYYD ARCLCSAGRC E L                             Q++KIC+VANKPLVVTYTGLIQAC+DS  +++A YIF+ MK 
Subjt:  EQRGIVGSAALYYDFARCLCSAGRCKEAL----------------------------MQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKD

Query:  FCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR
         CSPNLVT NI+LK  L+  +FEEARELFQ +SE G +I   SD+  RVLPD YTFNTMLD    +++WDDFGY Y +M  +GYHFN KRHLRM+LEA R
Subjt:  FCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGR

Query:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHH----SSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP
        AGK+E++E TW+H+  +N+  P PL+KERF  KL +GD+  A+S ++D +     +++  FS SAW  +L   RF +D+V++L+  V+  L +R++  + 
Subjt:  AGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHH----SSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNP

Query:  VFQNLLMSCKEFCRTR
        V  NLL SCK++ +TR
Subjt:  VFQNLLMSCKEFCRTR

AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein4.2e-0350Show/hide
Query:  EKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN
        +K F+FKPSFD+Y+++MESV+ +R KK+ D   +LK++E+
Subjt:  EKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein3.7e-21762.07Show/hide
Query:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV
        G  + G    +  ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K LNGADI+MPEW F++ +RSAKIRY+D++++R+I  LGKLGNWRRV
Subjt:  GLSKSGKPFLESTEESGLEVERAAFNNFD-ALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRV

Query:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR
        LQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEALN+FHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MRSPPKKKFK   LEKWDPR
Subjt:  LQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPR

Query:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM
        L+PD+V+YNAVLNACV+RK WEGAFWVLQ+LK++G +PS  TYGL+MEVMLAC KYNLVHEFFRK+Q+SSIPNAL Y+VLVNTLWKEGK+DEAV  +++M
Subjt:  LQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNM

Query:  EQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEAREL
        E RGIVGSAALYYD ARCLCSAGRC E L  ++KIC+VANKPLVVTYTGLIQAC+DS  +++A YIF+ MK  CSPNLVT NI+LK  L+  +FEEAREL
Subjt:  EQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEAREL

Query:  FQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKE
        FQ +SE G +I   SD+  RVLPD YTFNTMLD    +++WDDFGY Y +M  +GYHFN KRHLRM+LEA RAGK+E++E TW+H+  +N+  P PL+KE
Subjt:  FQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKE

Query:  RFCMKLARGDYSEALSCISDHH----SSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNPVFQNLLMSCKEFCRTR
        RF  KL +GD+  A+S ++D +     +++  FS SAW  +L   RF +D+V++L+  V+  L +R++  + V  NLL SCK++ +TR
Subjt:  RFCMKLARGDYSEALSCISDHH----SSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLL-TRNDPPNPVFQNLLMSCKEFCRTR

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein4.2e-0350Show/hide
Query:  EKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN
        +K F+FKPSFD+Y+++MESV+ +R KK+ D   +LK++E+
Subjt:  EKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKEN

AT1G63400.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-2425.16Show/hide
Query:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN
        IY+T +D L K R   +ALN+F  M+      P+++ Y S+   L             +  R     +  +  +E+   ++ P++V +NA+++A VK   
Subjt:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN

Query:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL
           A  +  E+ K+ + P   TY  ++       + +   H F   + +   PN +TY  L+N   K  + DE V   + M QRG+VG+   Y       
Subjt:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYN-LVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
          A  C  A M  +++      P ++TY  L+     +  L+ A+ +F ++ +    P + TYNI+++G  +    E+  +LF +LS  G          
Subjt:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTML
          V PD+  +NTM+
Subjt:  DRVLPDIYTFNTML

AT5G16640.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-2424.2Show/hide
Query:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN
        IY T +D L K+++   AL++ + M++     PD+V Y+S+   L  +G   +   ++  M                   + PD+  +NA+++ACVK   
Subjt:  IYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKN

Query:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL
           A    +E+ ++ L P   TY L++  +    + +   E F   V +   P+ +TY +L+N   K  K +  +     M QRG+V +   Y    +  
Subjt:  WEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFR-KVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR
        C AG+   A     ++      P ++TY  L+    D+  ++ A+ I   M K+    ++VTYNI+++G  +     +A +++ +L+  G          
Subjt:  CSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHM-KDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYR

Query:  DRVLPDIYTFNTML
          ++PDI+T+ TM+
Subjt:  DRVLPDIYTFNTML

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.3e-11239.78Show/hide
Query:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ
        E +++L  RL+G +I+   W F RMM  + +++++  +L+++  LG+  +W++   V+ W+   ++ K  + RF+YT  L VLG ARRP EAL IF+ M 
Subjt:  ERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQ

Query:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV
             YPD+ AYH IAVTLGQAG ++EL  VI+ MR  P K  K    + WDP L+PD+V+YNA+LNACV    W+   WV  EL+K GL+P+ +TYGL 
Subjt:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKTGALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLV

Query:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV
        MEVML  GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+++MEQ+G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Subjt:  MEVMLACGKYNLVHEFFRKVQRS-SIPNALTYKVLVNTLWKEGKTDEAVVAIQNMEQRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVAN-KPLV

Query:  VTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF
        +T+TGLI A L+   +   + IF +MKD C PN+ T N++LK    + MF EA+ELF+ +         +S     ++P+ YT++ ML+AS    +W+ F
Subjt:  VTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNISTISDYRDRVLPDIYTFNTMLDASFAEKRWDDF

Query:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHHSSDVHHFSESAWLNLLKEKRFP
         + Y  M L GY  +  +H  M++EA RAGK  LLE  +  +    +   P    E  C   A+GD+  A++ I+    +     SE  W +L +E    
Subjt:  GYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDHHSSDVHHFSESAWLNLLKEKRFP

Query:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC
        +D + Q  +HK+S  L   D    P   NL  S K  C
Subjt:  KDTVIQ-LIHKVSMLLTRND-PPNPVFQNLLMSCKEFC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGGGTATTTATCCAAATCCCCTCTGGTCTTGTCCTTTGTAAAGAAGAAACTCGCTTTTGCCTCTCTGTCGGAGCACCGCCAGGTTAAGTACTATCCGAGTTTGCA
ACGGAGCATTAGGTCGGTGCAAGAATCTAAAATGCCTTTGGAGCCGCAGATAGCCGGAGAAGATAGAACAGGAACAGGGAAGAAGAATGATGTAATCGTGTCGCGAGATG
TAGGGAGAGGTTGCTCCTCCGTCATTCTGGAAAATACTCCTACTAGTGACATACTAAGAGCAAATTCGCATAATTTATGGGTGGCTTCGTTCTTTCCTAGTCCGATTTTT
GGAATTGACTTAAATGTTGGCGACGCAAAGAATAGAGTTTTTCGGAATAGGGGAAATAAATGTGGAGCGATTAAGGCTTCGTCAAAGGGAAAATCTGATATTCGATCGTC
AAATGGGAATCTTCTCGAAAAGGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAGGGTTATGGAGTCCGTTAGAAATAGTAGGTATAAGAAGCAGTCGGACGATC
CTAATAAGCTGAAGATGAAGGAAAATGCGAGTGCAAAGAGTGCTGAAGGCATTTCCAAGTCTAAAATAGATAATGAAAAAAACAAAGCAACTGATGTTCATGGCAATGTT
GATGTAAAGAACATGTTTAAACCTGTTGATCAGAAAGATTTGTTCAATAATGCAGAGAGAATTATACGTAAAAAAGATCTGTCGCGAAACAATTTCGATAACAAAAGGAA
AGGAGTTACAAGATTTAAGGACGAGGTTAAAGGCAAGGTGACCCGTTTTGACTCACAGGTTAATGAAAAACAACATGAAGAGAAAAGGAAAGGACACTTATTTGATTGCA
TTGAGCCAAAAGTAAGAAGGTCGAACAATGAGACACTAGTTCGTTTGAAGGCTAATACATTGGATGTCAAAAGAGAAAGGCAACGAGTATGTGATGAAACTAAGGATGCT
CTGAAGGTCGAGAAATCTGGTGTTCAGCTTGCAAGGAACTATATTCCAGGTGAGAAGGTTGATAGAAAGAAAACTGGGCAGCCCTACCAAGGGTTATCCAAAAGTGGTAA
GCCGTTCCTTGAATCTACTGAAGAGAGTGGCTTGGAGGTAGAACGAGCAGCCTTCAACAATTTTGATGCATTAGACATCATGGATAAACCAAGAGTTTCAAAGATGGAAA
TGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTGCAGACATTGATATGCCCGAGTGGATGTTTGCTCGAATGATGAGGAGTGCAAAGATTAGATATTCAGAT
CACTCAATATTAAGGGTTATTCAAGTTCTGGGTAAGCTAGGAAATTGGAGGAGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACACAAGCT
CAGATTTATATACACAACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAACATATTCCATGCGATGCAGCAACACTTCTCCTCATATCCTGACT
TAGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGACGTGATTGATAGCATGCGGTCTCCTCCAAAGAAGAAGTTTAAAACA
GGGGCACTTGAAAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAGCGAAAGAATTGGGAAGGGGCATTTTGGGTCTT
ACAGGAATTGAAGAAACAAGGTCTACAGCCTTCTACGTCAACATATGGATTGGTCATGGAGGTGATGCTTGCATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAA
AAGTGCAGAGATCTTCCATTCCCAATGCTTTAACGTATAAAGTTCTTGTCAATACACTTTGGAAAGAAGGTAAAACCGATGAGGCTGTAGTGGCCATTCAGAACATGGAA
CAACGAGGGATAGTGGGGTCTGCAGCTCTTTATTACGACTTTGCTCGTTGTCTTTGCAGTGCTGGAAGGTGCAAAGAAGCCCTAATGCAGATTGAGAAGATATGTAAAGT
TGCTAATAAGCCTCTTGTTGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGATTCAAAAACCTTACAAAGTGCAGTATATATATTCAACCACATGAAGGACTTCTGCT
CCCCCAATCTTGTTACTTATAATATATTGCTGAAAGGTTGCCTGGAGCATAGGATGTTCGAAGAAGCTAGAGAGCTGTTCCAGAATTTATCAGAACATGGACGAAATATC
AGCACTATATCTGACTATAGAGATCGAGTATTACCAGATATCTACACGTTCAACACCATGCTAGATGCATCTTTTGCAGAAAAGAGATGGGATGATTTTGGCTATTTCTA
TGACCAAATGTTTCTTTATGGGTATCACTTCAACCCAAAACGTCATCTGCGAATGATATTGGAGGCTGGAAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAAGC
ACCTTGCTATGGCTAACCAGACTCTGCCGCCTCCACTTGTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGGGGCGACTACTCCGAAGCTCTCTCTTGCATTTCCGATCAC
CATAGTAGCGATGTGCATCATTTCTCTGAGTCGGCATGGCTAAATTTACTCAAAGAGAAAAGGTTTCCCAAGGATACTGTTATTCAGTTAATTCATAAGGTTAGCATGCT
TCTTACTAGAAATGACCCACCAAATCCAGTGTTTCAGAATCTGCTAATGAGTTGTAAAGAATTTTGCAGAACTAGAATTACTGTAGCTGACAATAAACTTGAAGACATTG
TTTGTACAGATGAAACCCAGTTTACTGCTGTTATGCATATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGGGTATTTATCCAAATCCCCTCTGGTCTTGTCCTTTGTAAAGAAGAAACTCGCTTTTGCCTCTCTGTCGGAGCACCGCCAGGTTAAGTACTATCCGAGTTTGCA
ACGGAGCATTAGGTCGGTGCAAGAATCTAAAATGCCTTTGGAGCCGCAGATAGCCGGAGAAGATAGAACAGGAACAGGGAAGAAGAATGATGTAATCGTGTCGCGAGATG
TAGGGAGAGGTTGCTCCTCCGTCATTCTGGAAAATACTCCTACTAGTGACATACTAAGAGCAAATTCGCATAATTTATGGGTGGCTTCGTTCTTTCCTAGTCCGATTTTT
GGAATTGACTTAAATGTTGGCGACGCAAAGAATAGAGTTTTTCGGAATAGGGGAAATAAATGTGGAGCGATTAAGGCTTCGTCAAAGGGAAAATCTGATATTCGATCGTC
AAATGGGAATCTTCTCGAAAAGGATTTTCAATTTAAGCCATCGTTCGATGAATATGTGAGGGTTATGGAGTCCGTTAGAAATAGTAGGTATAAGAAGCAGTCGGACGATC
CTAATAAGCTGAAGATGAAGGAAAATGCGAGTGCAAAGAGTGCTGAAGGCATTTCCAAGTCTAAAATAGATAATGAAAAAAACAAAGCAACTGATGTTCATGGCAATGTT
GATGTAAAGAACATGTTTAAACCTGTTGATCAGAAAGATTTGTTCAATAATGCAGAGAGAATTATACGTAAAAAAGATCTGTCGCGAAACAATTTCGATAACAAAAGGAA
AGGAGTTACAAGATTTAAGGACGAGGTTAAAGGCAAGGTGACCCGTTTTGACTCACAGGTTAATGAAAAACAACATGAAGAGAAAAGGAAAGGACACTTATTTGATTGCA
TTGAGCCAAAAGTAAGAAGGTCGAACAATGAGACACTAGTTCGTTTGAAGGCTAATACATTGGATGTCAAAAGAGAAAGGCAACGAGTATGTGATGAAACTAAGGATGCT
CTGAAGGTCGAGAAATCTGGTGTTCAGCTTGCAAGGAACTATATTCCAGGTGAGAAGGTTGATAGAAAGAAAACTGGGCAGCCCTACCAAGGGTTATCCAAAAGTGGTAA
GCCGTTCCTTGAATCTACTGAAGAGAGTGGCTTGGAGGTAGAACGAGCAGCCTTCAACAATTTTGATGCATTAGACATCATGGATAAACCAAGAGTTTCAAAGATGGAAA
TGGAAGAGAGAATCCAGATGCTTTCTAAGAGATTGAATGGTGCAGACATTGATATGCCCGAGTGGATGTTTGCTCGAATGATGAGGAGTGCAAAGATTAGATATTCAGAT
CACTCAATATTAAGGGTTATTCAAGTTCTGGGTAAGCTAGGAAATTGGAGGAGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACACAAGCT
CAGATTTATATACACAACTGCCCTTGATGTACTTGGAAAAGCGAGGAGACCTGTGGAGGCACTCAACATATTCCATGCGATGCAGCAACACTTCTCCTCATATCCTGACT
TAGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGACGTGATTGATAGCATGCGGTCTCCTCCAAAGAAGAAGTTTAAAACA
GGGGCACTTGAAAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAGCGAAAGAATTGGGAAGGGGCATTTTGGGTCTT
ACAGGAATTGAAGAAACAAGGTCTACAGCCTTCTACGTCAACATATGGATTGGTCATGGAGGTGATGCTTGCATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAA
AAGTGCAGAGATCTTCCATTCCCAATGCTTTAACGTATAAAGTTCTTGTCAATACACTTTGGAAAGAAGGTAAAACCGATGAGGCTGTAGTGGCCATTCAGAACATGGAA
CAACGAGGGATAGTGGGGTCTGCAGCTCTTTATTACGACTTTGCTCGTTGTCTTTGCAGTGCTGGAAGGTGCAAAGAAGCCCTAATGCAGATTGAGAAGATATGTAAAGT
TGCTAATAAGCCTCTTGTTGTGACTTACACCGGTTTGATTCAAGCTTGTTTGGATTCAAAAACCTTACAAAGTGCAGTATATATATTCAACCACATGAAGGACTTCTGCT
CCCCCAATCTTGTTACTTATAATATATTGCTGAAAGGTTGCCTGGAGCATAGGATGTTCGAAGAAGCTAGAGAGCTGTTCCAGAATTTATCAGAACATGGACGAAATATC
AGCACTATATCTGACTATAGAGATCGAGTATTACCAGATATCTACACGTTCAACACCATGCTAGATGCATCTTTTGCAGAAAAGAGATGGGATGATTTTGGCTATTTCTA
TGACCAAATGTTTCTTTATGGGTATCACTTCAACCCAAAACGTCATCTGCGAATGATATTGGAGGCTGGAAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAAGC
ACCTTGCTATGGCTAACCAGACTCTGCCGCCTCCACTTGTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGGGGCGACTACTCCGAAGCTCTCTCTTGCATTTCCGATCAC
CATAGTAGCGATGTGCATCATTTCTCTGAGTCGGCATGGCTAAATTTACTCAAAGAGAAAAGGTTTCCCAAGGATACTGTTATTCAGTTAATTCATAAGGTTAGCATGCT
TCTTACTAGAAATGACCCACCAAATCCAGTGTTTCAGAATCTGCTAATGAGTTGTAAAGAATTTTGCAGAACTAGAATTACTGTAGCTGACAATAAACTTGAAGACATTG
TTTGTACAGATGAAACCCAGTTTACTGCTGTTATGCATATTTAG
Protein sequenceShow/hide protein sequence
MGGYLSKSPLVLSFVKKKLAFASLSEHRQVKYYPSLQRSIRSVQESKMPLEPQIAGEDRTGTGKKNDVIVSRDVGRGCSSVILENTPTSDILRANSHNLWVASFFPSPIF
GIDLNVGDAKNRVFRNRGNKCGAIKASSKGKSDIRSSNGNLLEKDFQFKPSFDEYVRVMESVRNSRYKKQSDDPNKLKMKENASAKSAEGISKSKIDNEKNKATDVHGNV
DVKNMFKPVDQKDLFNNAERIIRKKDLSRNNFDNKRKGVTRFKDEVKGKVTRFDSQVNEKQHEEKRKGHLFDCIEPKVRRSNNETLVRLKANTLDVKRERQRVCDETKDA
LKVEKSGVQLARNYIPGEKVDRKKTGQPYQGLSKSGKPFLESTEESGLEVERAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGADIDMPEWMFARMMRSAKIRYSD
HSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNIFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRSPPKKKFKT
GALEKWDPRLQPDIVIYNAVLNACVKRKNWEGAFWVLQELKKQGLQPSTSTYGLVMEVMLACGKYNLVHEFFRKVQRSSIPNALTYKVLVNTLWKEGKTDEAVVAIQNME
QRGIVGSAALYYDFARCLCSAGRCKEALMQIEKICKVANKPLVVTYTGLIQACLDSKTLQSAVYIFNHMKDFCSPNLVTYNILLKGCLEHRMFEEARELFQNLSEHGRNI
STISDYRDRVLPDIYTFNTMLDASFAEKRWDDFGYFYDQMFLYGYHFNPKRHLRMILEAGRAGKDELLETTWKHLAMANQTLPPPLVKERFCMKLARGDYSEALSCISDH
HSSDVHHFSESAWLNLLKEKRFPKDTVIQLIHKVSMLLTRNDPPNPVFQNLLMSCKEFCRTRITVADNKLEDIVCTDETQFTAVMHI