; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G009290 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G009290
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCicolChr01:10727250..10740160
RNA-Seq ExpressionCcUC01G009290
SyntenyCcUC01G009290
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019446.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0084.16Show/hide
Query:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP
        NGFPAL+CTQNSH   GFSFFP S+SG+ LN G  K+RVLRHRGHKCGAIKASS GESDI+L SGNLLE DF FKPSFDEYVRVME+VR+RRYK+Q+DDP
Subjt:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP

Query:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK
        NK  MKENASAKSAESTSIS I       TDVQ ++DVKN    VD +DLF+N+E+IT + DLSGNKFDSKRKGVTRS DE+KGKVTPF SQ +DKQHE+
Subjt:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK

Query:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK
        KRN NWS+ IEPK  RS ++K ++FKANTLDVK E+  V   S MKIS+KIWADDDTK  KDVLK GK+GVQLE NYIPG+KVGRKK EQSYRG SKSGK
Subjt:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK

Query:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ
        +F EF +ESSLEVEHAAFN+FDA DIMDKPRVSKMEMEERIQMLSKRLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQ
Subjt:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ

Query:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY
        MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTGA EKWDPRLQPDIVIY
Subjt:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY

Query:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS
        N+VLNACVKRKN EGAFWVLQELK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAI+ ME+RGIVGS
Subjt:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS

Query:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG
        AALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE+G
Subjt:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG

Query:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR
        RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDF +F+NQMLLYGYHFNPKRHLRMI+EAAR GKDELLETTWKHL+Q DR  PP L+KERFC+ LAR
Subjt:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR

Query:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        GDYS+ALSCIS H SSD HHFS+ AWLNLLKEKRFPKD+VI+LIHK
Subjt:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

XP_008459122.1 PREDICTED: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis melo]0.0e+0086.22Show/hide
Query:  GFPALHCTQNSHNFFGFSFFPCSIS--GTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDD
        GFP LHCT NSH  F  SFFP S+S  GTDLN  D KNRVLRHR HKCG+IKA SNGESDI LP+GNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q D 
Subjt:  GFPALHCTQNSHNFFGFSFFPCSIS--GTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDD

Query:  PNKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHE
        PNKLTMKEN SAKSAESTSISKIDNGKNK TDVQ +V+VKNMFKRVD+KDLFNNTERI   + LSGNKFD + KGVTRSND+VKGK+TPF S  +DKQHE
Subjt:  PNKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHE

Query:  KKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSK
        +K+N NWSS IEPKV RS  EK I+FKAN L+ K+E  RV   + MK SEKIWA  +DD K AKDVLKAGK+G+QLER+Y PG+KVGRKK EQSYRG S 
Subjt:  KKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSK

Query:  SGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIE
        SGK+FLEF +E+SLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNG++IDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIE
Subjt:  SGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIE

Query:  WLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDI
        WLQMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTGALEKWDPRLQPDI
Subjt:  WLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDI

Query:  VIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGI
        VIYN+VLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAIENME RG+
Subjt:  VIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGI

Query:  VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLS
        VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVY+FN MK FCSPNLVTYNILLKGYLEHGMFEEAREL QNLS
Subjt:  VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLS

Query:  EHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMK
        E  +NIS VSDYRDRVLPDIYMFNTMLDASFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+Q DR PPP LLKERFCMK
Subjt:  EHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMK

Query:  LARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        +ARGDY++AL CISNH+S DAHHFSE AWLNLLKEKRFPKDTVI+LIHK
Subjt:  LARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

XP_023519692.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita pepo subsp. pepo]0.0e+0084.75Show/hide
Query:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP
        NGFPALHCTQNSH   GFSFF  S+SG+ LN G  K+RVLRHRGHKCGAIKASS GESDIRL SGNLLENDF FKPSFDEYVRVME+VR+RRYK+Q+DDP
Subjt:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP

Query:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK
        NK  MKENASAKSAESTSIS I       TDVQ ++DVK     VD++DLF+N+ERIT + DLSGNKFDSKRKGVTRS DE+KGKVTPF SQ +DKQH +
Subjt:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK

Query:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK
        KRN NWS+ IEPKV RS ++K ++FKANTLDVK E+  V   S MKISEKIWADDDTKR KDVLK GK+GVQLE NYIPG+KVGRKK EQSYRG SKSGK
Subjt:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK

Query:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ
        QF EF +ESSLEVEHAAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQ
Subjt:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ

Query:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY
        MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTGALEKWDPRLQPDIVIY
Subjt:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY

Query:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS
        N+VLNACVKRKN EGAFWVLQELK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAI+ ME+RGIVGS
Subjt:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS

Query:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG
        AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE+G
Subjt:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG

Query:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR
        RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDF +F+NQMLLYGYHFNPKRHLRMI+EAAR GKDELLETTWKHL+Q DR  PP L+KERFC+ LAR
Subjt:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR

Query:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        GDYS+ALSCIS H SSD HHFS+ AWLNLLKEKRFPKD+VI+LIHK
Subjt:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

XP_031741862.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucumis sativus]0.0e+0087.13Show/hide
Query:  GFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPN
        GFP LHCT NSHN F  SFFP S+SGTD ++ D KNRVLRHR HKCG+IKA SNGESDI LPSGNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q DDPN
Subjt:  GFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPN

Query:  KLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKK
        KLTMKEN SAKSAESTSISKIDNGKNK TDVQ +VDVKNMFKRVD+KDLFNNTERI   +DLSGNKFD +RK VTRSND+VKGK+TPF S  +DKQHE+K
Subjt:  KLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKK

Query:  RNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSG
        RN NWSS IEP+V RS ++K I+FKANTL+VK+E+ RV D + MK SEKIWA  DDD K AK VLKAGK+G+QLER+Y PG+KVGRKK EQSYRG S SG
Subjt:  RNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSG

Query:  KQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWL
        K+FLEF +++SLEVEHAAFNNFDA DIMDKPRVSKMEMEERIQMLSKRLNG++IDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWL
Subjt:  KQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWL

Query:  QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVI
        QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTG LEKWDPRLQPDIVI
Subjt:  QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVI

Query:  YNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVG
        YN+VLNACVKRKNLEGAFWVLQELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAIENME RGIVG
Subjt:  YNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVG

Query:  SAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEH
        SAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMK FCSPNLVTYNILLKGYLEHGMFEEARELFQNLSE 
Subjt:  SAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEH

Query:  GRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLA
         RNIS VSDYRDRVLPDIYMFNTMLDASFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+Q DR PPP LLKERFCMKLA
Subjt:  GRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLA

Query:  RGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        RGDYS+ALS I +H+S DAHHFSE AWLNLLKEKRFP+DTVI+LIHK
Subjt:  RGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

XP_038894404.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 [Benincasa hispida]0.0e+0089.72Show/hide
Query:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP
        NGFPALHCTQNSHNFFGFSFFP S+SG DLN GD K+RVLRHR HKCG+IKASSNGESDIRLPS NLLENDF FKPSFDEYVRVMETVRTRRYK+Q+DDP
Subjt:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP

Query:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK
        NKLTMKENAS KSAE TSISKIDNGKNK TDVQ +VDVKNMFKRVDRKDLFNNTERIT  RDLSGNK DSKRKG++RSNDEVKGKVTPF SQ +DKQHE+
Subjt:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK

Query:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK
        KRN N S+  EPKVPR YNEK INFKANTLD+KRE+ R  + S M+IS KIWA+DDTK AKD+L A K+ VQLERNYI G+KVGRKK EQSYR  SKSGK
Subjt:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK

Query:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ
        +FLEF ++SSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQML KRLNG++IDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ
Subjt:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ

Query:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY
        MRERFKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHF+SYPDLVAYHSIAVTLGQAGYM+ELFDVIDSMR PP KKFKTG LEKWDPRL+PDIVIY
Subjt:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY

Query:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS
        N+VLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS
Subjt:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS

Query:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG
        AALYYDFARCLCSAGRCKEALMQMEKICKVA KPLVVTYTGLIQACLDSKD++SAVYIFNHMKTFCSPNLVTYN+LLKGYLEHGMFEEARELFQNLSEHG
Subjt:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG

Query:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR
        RNIS VSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYF++QMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHL+Q DR PPP LLKERFCMKLAR
Subjt:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR

Query:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        GDYS+ALSCISNHDSSD HHFSE  WLNLLKEKRFPKDTVIQLI+K
Subjt:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

TrEMBL top hitse value%identityAlignment
A0A0A0LVN7 Uncharacterized protein0.0e+0087.13Show/hide
Query:  GFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPN
        GFP LHCT NSHN F  SFFP S+SGTD ++ D KNRVLRHR HKCG+IKA SNGESDI LPSGNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q DDPN
Subjt:  GFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPN

Query:  KLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKK
        KLTMKEN SAKSAESTSISKIDNGKNK TDVQ +VDVKNMFKRVD+KDLFNNTERI   +DLSGNKFD +RK VTRSND+VKGK+TPF S  +DKQHE+K
Subjt:  KLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKK

Query:  RNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSG
        RN NWSS IEP+V RS ++K I+FKANTL+VK+E+ RV D + MK SEKIWA  DDD K AK VLKAGK+G+QLER+Y PG+KVGRKK EQSYRG S SG
Subjt:  RNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSG

Query:  KQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWL
        K+FLEF +++SLEVEHAAFNNFDA DIMDKPRVSKMEMEERIQMLSKRLNG++IDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQ+IEWL
Subjt:  KQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWL

Query:  QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVI
        QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTG LEKWDPRLQPDIVI
Subjt:  QMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVI

Query:  YNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVG
        YN+VLNACVKRKNLEGAFWVLQELKKQ LQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAIENME RGIVG
Subjt:  YNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVG

Query:  SAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEH
        SAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMK FCSPNLVTYNILLKGYLEHGMFEEARELFQNLSE 
Subjt:  SAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEH

Query:  GRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLA
         RNIS VSDYRDRVLPDIYMFNTMLDASFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+Q DR PPP LLKERFCMKLA
Subjt:  GRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLA

Query:  RGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        RGDYS+ALS I +H+S DAHHFSE AWLNLLKEKRFP+DTVI+LIHK
Subjt:  RGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

A0A1S3C8Z0 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0086.22Show/hide
Query:  GFPALHCTQNSHNFFGFSFFPCSIS--GTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDD
        GFP LHCT NSH  F  SFFP S+S  GTDLN  D KNRVLRHR HKCG+IKA SNGESDI LP+GNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q D 
Subjt:  GFPALHCTQNSHNFFGFSFFPCSIS--GTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDD

Query:  PNKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHE
        PNKLTMKEN SAKSAESTSISKIDNGKNK TDVQ +V+VKNMFKRVD+KDLFNNTERI   + LSGNKFD + KGVTRSND+VKGK+TPF S  +DKQHE
Subjt:  PNKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHE

Query:  KKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSK
        +K+N NWSS IEPKV RS  EK I+FKAN L+ K+E  RV   + MK SEKIWA  +DD K AKDVLKAGK+G+QLER+Y PG+KVGRKK EQSYRG S 
Subjt:  KKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSK

Query:  SGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIE
        SGK+FLEF +E+SLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNG++IDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIE
Subjt:  SGKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIE

Query:  WLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDI
        WLQMRERFKSHK RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTGALEKWDPRLQPDI
Subjt:  WLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDI

Query:  VIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGI
        VIYN+VLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAIENME RG+
Subjt:  VIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGI

Query:  VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLS
        VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVY+FN MK FCSPNLVTYNILLKGYLEHGMFEEAREL QNLS
Subjt:  VGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLS

Query:  EHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMK
        E  +NIS VSDYRDRVLPDIYMFNTMLDASFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+Q DR PPP LLKERFCMK
Subjt:  EHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMK

Query:  LARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        +ARGDY++AL CISNH+S DAHHFSE AWLNLLKEKRFPKDTVI+LIHK
Subjt:  LARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

A0A5D3CBM0 Pentatricopeptide repeat-containing protein0.0e+0085.63Show/hide
Query:  GTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLTMKENASAKSAESTSISKIDNGK
        GTDLN  D KNRVLRHR HKCG+IKA SNGESDI LP+GNLLE+DF FKPSFDEYV+VMETVRTRRYK+Q D PNKLTMKEN SAKSAESTSISKIDNGK
Subjt:  GTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLTMKENASAKSAESTSISKIDNGK

Query:  NKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFK
        NK TDVQ +V+VKNMFKRVD+KDLFNNTERI   + LSGNKFD + KGVTRSND+VKGK+TPF S  +DKQHE+K+N NWSS IEPKV RS  EK I+FK
Subjt:  NKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFK

Query:  ANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDAL
        AN L+ K+E  RV   + MK SEKIWA  +DD K AKDVLKAGK+G+QLER+Y PG+KVGRKK EQSYRG S SGK+FLEF +E+SLEVEHAAFNNFDAL
Subjt:  ANTLDVKRENRRVCDRSPMKISEKIWA--DDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFDAL

Query:  DIMDKPRVSKMEMEERIQMLSK-------------RLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLR
        DIMDKPRVSKMEMEERIQMLSK             RLNG++IDMPEWMFSQMMR AKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHK R
Subjt:  DIMDKPRVSKMEMEERIQMLSK-------------RLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLR

Query:  FIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRK
        FIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTGALEKWDPRLQPDIVIYN+VLNACVKRK
Subjt:  FIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRK

Query:  NLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCL
        NLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAIENME RG+VGSAALYYDFARCL
Subjt:  NLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRD
        CSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVY+FN MK FCSPNLVTYNILLKGYLEHGMFEEAREL QNLSE  +NIS VSDYRD
Subjt:  CSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRD

Query:  RVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCIS
        RVLPDIYMFNTMLDASFAEKRWDDF YF+NQM LYGYHFNPKRHLRMILEAAR GKDELLETTWKHL+Q DR PPP LLKERFCMK+ARGDY++AL CIS
Subjt:  RVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCIS

Query:  NHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        NH+S DAHHFSE AWLNLLKEKRFPKDTVI+LIHK
Subjt:  NHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

A0A6J1EH18 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0083.57Show/hide
Query:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP
        NGFPAL+CTQNSH   GFS FP S+SG+ LN G  K+RVLRHRGHKCGAIKASS GESDI+L SGNLLE DF FKPSFDEYVRVME+VR+RRYK+Q+DDP
Subjt:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP

Query:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK
        NK  MKENASAKSAEST IS I       TDVQ ++DVKN    VD +DLF+N+E+IT + DLSGNKFDSKRKGVTRS DE+KGKVTPF SQ +DKQHE+
Subjt:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK

Query:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK
        KRN NWS+ IEPK  RS ++K ++FKANTLDVK E+  V   S MKIS+KIWADDD+K  KDVLK GK+GVQLE NYIPG+KVGRKK EQSYRG SKSGK
Subjt:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK

Query:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ
        +F EF +ESSLEVEHAAFN+ DA DIMDKPRVSKMEMEERIQMLS RLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQ
Subjt:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ

Query:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY
        MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTGA EKWDPRLQPDIVIY
Subjt:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY

Query:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS
        N+VLNACVKRKN EGAFWVLQELK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAI+ ME+RGIVGS
Subjt:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS

Query:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG
        AALYYDFARCLCSAGRC+EALMQMEKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSPNLVT NILLKGYL+HGMF+EA+ELFQN+SE+G
Subjt:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG

Query:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR
        RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDF +F+NQMLLYGYHFNPKRHLRMI+EAAR GKDELLETTWKHL+Q DR  PP L+KERFC+ LAR
Subjt:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR

Query:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        GDYS+ALSCIS H SSD HHFS+ AWLNLLKEKRFPKD+VI+LIHK
Subjt:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

A0A6J1KEH7 pentatricopeptide repeat-containing protein At1g30610, chloroplastic0.0e+0084.04Show/hide
Query:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP
        NGF AL+CTQNSH   G SFFP S+SG+ LN G  K+RVLRHRGHKCGAIKASS GESDI+L SGNLLE DF FKPSFDEYVRVME+VR+RRYK+Q+DDP
Subjt:  NGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDP

Query:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK
        NK  MKENASAKSAESTSIS I       TDVQ ++DVKN    VD +DLF+N+ERIT + DLSGNKFDSKRKGVTRS DE+KGKVTPF SQ +DKQHE+
Subjt:  NKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEK

Query:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK
        KRN NWS+ IEPKV RS ++K ++FKANTLDVK E+  V   S MKISEKIWADDD K  KDVLK GK+GVQL+ NYIPG+KVGRKK EQSYRG SKSGK
Subjt:  KRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKSGK

Query:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ
        +F EF +ESSLEVEHAAFN+ DA DIMDKPRVSKMEMEERIQMLSKRLNG++IDMPEWMF+QMMRSAKIRYSDHSILRVIQVLGKLGNW+RVLQVIEWLQ
Subjt:  QFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQ

Query:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY
        MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMR PP KKFKTGA EKWDPRLQPDIVIY
Subjt:  MRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIY

Query:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS
        N+VLNACVKRKN EGAFWVLQELK+QGLQPST+TYGLVMEVML+CGKYNLVHEFFRKVQKSS+PNALTYKVLVNTLWKEGKTDEAVLAI+ ME+RGIVGS
Subjt:  NSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGS

Query:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG
        AALYYDFARCLCSAGR +EALMQMEKICKVANKPLVVTYTGLIQACLDSK+LQSAVYIFNHMK FCSPNLVT NILLKGYL+HGMF EA+ELFQN+SE+G
Subjt:  AALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHG

Query:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR
        RNIS VSDYRDRVLPDIY FNTMLDASFAEKRWDDF +F+NQMLLYGYHFNPKRHLRMI+EAAR GKDELLETTWKHL+Q DRI PP L+KERFC+ LAR
Subjt:  RNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLAR

Query:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK
        GDYS+ALSCIS H SSD HHFS+ AWLNLLKEKRFPKD+VIQLIHK
Subjt:  GDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHK

SwissProt top hitse value%identityAlignment
Q0WPZ6 Pentatricopeptide repeat-containing protein At2g171408.3e-2326.98Show/hide
Query:  PRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKV-QKSSVPNALTYKVLVNTLWKEGKTDEAVLAI
        P  +P + +YN +L +C+K + +E   W+ +++   G+ P T T+ L++  + +    +   E F ++ +K   PN  T+ +LV    K G TD+ +  +
Subjt:  PRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKV-QKSSVPNALTYKVLVNTLWKEGKTDEAVLAI

Query:  ENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKT-----FCSPNLVTYNILLKGYLEHG
          ME  G++ +  +Y       C  GR  ++   +EK+ +    P +VT+   I A      +  A  IF+ M+         PN +TYN++LKG+ + G
Subjt:  ENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKT-----FCSPNLVTYNILLKGYLEHG

Query:  MFEEARELFQNLSEH
        + E+A+ LF+++ E+
Subjt:  MFEEARELFQNLSEH

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic4.9e-10840.12Show/hide
Query:  ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ
        E +++L  RL+G  I+   W F +MM  + +++++  +L+++  LG+  +W++   V+ W+   ++ K  + RF+YT  L VLG ARRP EAL +F+ M 
Subjt:  ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ

Query:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEK-WDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLV
             YPD+ AYH IAVTLGQAG ++EL  VI+ MR  P K      +K WDP L+PD+V+YN++LNACV     +   WV  EL+K GL+P+ +TYGL 
Subjt:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEK-WDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLV

Query:  MEVMLECGKYNLVHEFFRKVQKS-SVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV
        MEVMLE GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+ +ME++G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Subjt:  MEVMLECGKYNLVHEFFRKVQKS-SVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV

Query:  VTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDF
        +T+TGLI A L+   +   + IF +MK  C PN+ T N++LK Y  + MF EA+ELF+ +         VS     ++P+ Y ++ ML+AS    +W+ F
Subjt:  VTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDF

Query:  GYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKE
         + +  M+L GY  +  +H  M++EA+RAGK  LLE  +  + +   IP P    E  C   A+GD+  A++ I N  +  +   SE  W +L +E
Subjt:  GYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKE

Q9FMD3 Pentatricopeptide repeat-containing protein At5g16640, mitochondrial1.1e-2223.96Show/hide
Query:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNL
        IY T +D L K+++   AL++ + M++     PD+V Y+S+   L  +G   +   ++  M                  + PD+  +N++++ACVK   +
Subjt:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNL

Query:  EGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFR-KVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLC
          A    +E+ ++ L P   TY L++  +    + +   E F   V K   P+ +TY +L+N   K  K +  +     M +RG+V +   Y    +  C
Subjt:  EGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFR-KVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLC

Query:  SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHM-KTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRD
         AG+   A     ++      P ++TY  L+    D+  ++ A+ I   M K     ++VTYNI+++G  + G   +A +++ +L+  G           
Subjt:  SAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHM-KTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRD

Query:  RVLPDIYMFNTML
         ++PDI+ + TM+
Subjt:  RVLPDIYMFNTML

Q9LQ16 Pentatricopeptide repeat-containing protein At1g629106.4e-2323.42Show/hide
Query:  FIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKN
        F +TT +  L    +  EA+ +   M Q     PDLV Y ++   L + G +     ++       KK + G       +++ D+VIYN++++   K K+
Subjt:  FIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKN

Query:  LEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFR-KVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCL
        ++ A  +  E+  +G++P   TY  ++  +   G+++         +++   PN +T+  L++   KEGK  EA    + M +R I      Y       
Subjt:  LEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFR-KVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQMEKI-------------------CK-------------VANKPLV---VTYTGLIQACLDSKDLQSAVYIFNHMKTF-CSPNLVTYNI
        C   R  EA    E +                   CK             ++ + LV   VTYT LI     ++D  +A  +F  M +    PN++TYNI
Subjt:  CSAGRCKEALMQMEKI-------------------CK-------------VANKPLV---VTYTGLIQACLDSKDLQSAVYIFNHMKTF-CSPNLVTYNI

Query:  LLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTW
        LL G  ++G   +A  +F+ L             R  + PDIY +N M++      + +D    F  + L G   N   +  MI    R G  E  ++  
Subjt:  LLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTW

Query:  KHLSQVDRIPPPALLKERFCMKLARGD
        K + +   +P           +L  GD
Subjt:  KHLSQVDRIPPPALLKERFCMKLARGD

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic1.1e-20047.57Show/hide
Query:  GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLT--------MKENASAKSAESTSISKIDNGKNKGTDVQRD
        G    A+K S +GES + +P     +  F  + S  EY R  +T R      + D+ + +          K+   +KS ES+   K  N       + +D
Subjt:  GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLT--------MKENASAKSAESTSISKIDNGKNKGTDVQRD

Query:  VDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRE
           +  + + +            H R           +G+ R +   KG       +    Q   K  R WS   E  VP S +E               
Subjt:  VDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRE

Query:  NRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKPR
         RR   +  M   +++    DT R  +    G  G+ L       E++  +++E       G  + G +  +   +S   +E  AF   D + DI+DKP 
Subjt:  NRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKPR

Query:  VSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEAL
         S++EME+RI+ L+K LNG++I+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWRRVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEAL
Subjt:  VSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEAL

Query:  NVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPS
        NVFHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MR PP KKFK   LEKWDPRL+PD+V+YN+VLNACV+RK  EGAFWVLQ+LK++G +PS
Subjt:  NVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPS

Query:  TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEAL----------
          TYGL+MEVML C KYNLVHEFFRK+QKSS+PNAL Y+VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L          
Subjt:  TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEAL----------

Query:  ------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNI
                           Q++KIC+VANKPLVVTYTGLIQAC+DS ++++A YIF+ MK  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE G +I
Subjt:  ------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNI

Query:  SNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDY
         N SD+  RVLPD Y FNTMLD    +++WDDFGY + +ML +GYHFN KRHLRM+LEA+RAGK+E++E TW+H+ + +RIPP  L+KERF  KL +GD+
Subjt:  SNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDY

Query:  SDALSCISN----HDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLI
          A+S +++     + ++   FS  AW  +L   RF +D+V++L+
Subjt:  SDALSCISN----HDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLI

Arabidopsis top hitse value%identityAlignment
AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein7.8e-20247.57Show/hide
Query:  GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLT--------MKENASAKSAESTSISKIDNGKNKGTDVQRD
        G    A+K S +GES + +P     +  F  + S  EY R  +T R      + D+ + +          K+   +KS ES+   K  N       + +D
Subjt:  GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLT--------MKENASAKSAESTSISKIDNGKNKGTDVQRD

Query:  VDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRE
           +  + + +            H R           +G+ R +   KG       +    Q   K  R WS   E  VP S +E               
Subjt:  VDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRE

Query:  NRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKPR
         RR   +  M   +++    DT R  +    G  G+ L       E++  +++E       G  + G +  +   +S   +E  AF   D + DI+DKP 
Subjt:  NRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKPR

Query:  VSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEAL
         S++EME+RI+ L+K LNG++I+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWRRVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEAL
Subjt:  VSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEAL

Query:  NVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPS
        NVFHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MR PP KKFK   LEKWDPRL+PD+V+YN+VLNACV+RK  EGAFWVLQ+LK++G +PS
Subjt:  NVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPS

Query:  TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEAL----------
          TYGL+MEVML C KYNLVHEFFRK+QKSS+PNAL Y+VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L          
Subjt:  TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEAL----------

Query:  ------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNI
                           Q++KIC+VANKPLVVTYTGLIQAC+DS ++++A YIF+ MK  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE G +I
Subjt:  ------------------MQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNI

Query:  SNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDY
         N SD+  RVLPD Y FNTMLD    +++WDDFGY + +ML +GYHFN KRHLRM+LEA+RAGK+E++E TW+H+ + +RIPP  L+KERF  KL +GD+
Subjt:  SNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDY

Query:  SDALSCISN----HDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLI
          A+S +++     + ++   FS  AW  +L   RF +D+V++L+
Subjt:  SDALSCISN----HDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLI

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein1.5e-20549.08Show/hide
Query:  GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLT--------MKENASAKSAESTSISKIDNGKNKGTDVQRD
        G    A+K S +GES + +P     +  F  + S  EY R  +T R      + D+ + +          K+   +KS ES+   K  N       + +D
Subjt:  GHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSFDEYVRVMETVRTRRYKKQTDDPNKLT--------MKENASAKSAESTSISKIDNGKNKGTDVQRD

Query:  VDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRE
           +  + + +            H R           +G+ R +   KG       +    Q   K  R WS   E  VP S +E               
Subjt:  VDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTPFYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRE

Query:  NRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKPR
         RR   +  M   +++    DT R  +    G  G+ L       E++  +++E       G  + G +  +   +S   +E  AF   D + DI+DKP 
Subjt:  NRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNE---QSYRGPSKSGKQFLEFPQESSLEVEHAAFNNFD-ALDIMDKPR

Query:  VSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEAL
         S++EME+RI+ L+K LNG++I+MPEW FS+ +RSAKIRY+D++++R+I  LGKLGNWRRVLQVIEWLQ ++R+KS+K+R IYTTAL+VLGK+RRPVEAL
Subjt:  VSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEAL

Query:  NVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPS
        NVFHAM    SSYPD+VAY SIAVTLGQAG+++ELF VID+MR PP KKFK   LEKWDPRL+PD+V+YN+VLNACV+RK  EGAFWVLQ+LK++G +PS
Subjt:  NVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPP-KKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPS

Query:  TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVA
          TYGL+MEVML C KYNLVHEFFRK+QKSS+PNAL Y+VLVNTLWKEGK+DEAV  +E+ME RGIVGSAALYYD ARCLCSAGRC E L  ++KIC+VA
Subjt:  TSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVA

Query:  NKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEK
        NKPLVVTYTGLIQAC+DS ++++A YIF+ MK  CSPNLVT NI+LK YL+ G+FEEARELFQ +SE G +I N SD+  RVLPD Y FNTMLD    ++
Subjt:  NKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEK

Query:  RWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCISN----HDSSDAHHFSEPAWL
        +WDDFGY + +ML +GYHFN KRHLRM+LEA+RAGK+E++E TW+H+ + +RIPP  L+KERF  KL +GD+  A+S +++     + ++   FS  AW 
Subjt:  RWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCISN----HDSSDAHHFSEPAWL

Query:  NLLKEKRFPKDTVIQLI
         +L   RF +D+V++L+
Subjt:  NLLKEKRFPKDTVIQLI

AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein4.5e-2423.42Show/hide
Query:  FIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKN
        F +TT +  L    +  EA+ +   M Q     PDLV Y ++   L + G +     ++       KK + G       +++ D+VIYN++++   K K+
Subjt:  FIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKN

Query:  LEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFR-KVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCL
        ++ A  +  E+  +G++P   TY  ++  +   G+++         +++   PN +T+  L++   KEGK  EA    + M +R I      Y       
Subjt:  LEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFR-KVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCL

Query:  CSAGRCKEALMQMEKI-------------------CK-------------VANKPLV---VTYTGLIQACLDSKDLQSAVYIFNHMKTF-CSPNLVTYNI
        C   R  EA    E +                   CK             ++ + LV   VTYT LI     ++D  +A  +F  M +    PN++TYNI
Subjt:  CSAGRCKEALMQMEKI-------------------CK-------------VANKPLV---VTYTGLIQACLDSKDLQSAVYIFNHMKTF-CSPNLVTYNI

Query:  LLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTW
        LL G  ++G   +A  +F+ L             R  + PDIY +N M++      + +D    F  + L G   N   +  MI    R G  E  ++  
Subjt:  LLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTW

Query:  KHLSQVDRIPPPALLKERFCMKLARGD
        K + +   +P           +L  GD
Subjt:  KHLSQVDRIPPPALLKERFCMKLARGD

AT2G17140.1 Pentatricopeptide repeat (PPR) superfamily protein5.9e-2426.98Show/hide
Query:  PRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKV-QKSSVPNALTYKVLVNTLWKEGKTDEAVLAI
        P  +P + +YN +L +C+K + +E   W+ +++   G+ P T T+ L++  + +    +   E F ++ +K   PN  T+ +LV    K G TD+ +  +
Subjt:  PRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKV-QKSSVPNALTYKVLVNTLWKEGKTDEAVLAI

Query:  ENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKT-----FCSPNLVTYNILLKGYLEHG
          ME  G++ +  +Y       C  GR  ++   +EK+ +    P +VT+   I A      +  A  IF+ M+         PN +TYN++LKG+ + G
Subjt:  ENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKT-----FCSPNLVTYNILLKGYLEHG

Query:  MFEEARELFQNLSEH
        + E+A+ LF+++ E+
Subjt:  MFEEARELFQNLSEH

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-10940.12Show/hide
Query:  ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ
        E +++L  RL+G  I+   W F +MM  + +++++  +L+++  LG+  +W++   V+ W+   ++ K  + RF+YT  L VLG ARRP EAL +F+ M 
Subjt:  ERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ

Query:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEK-WDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLV
             YPD+ AYH IAVTLGQAG ++EL  VI+ MR  P K      +K WDP L+PD+V+YN++LNACV     +   WV  EL+K GL+P+ +TYGL 
Subjt:  QHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEK-WDPRLQPDIVIYNSVLNACVKRKNLEGAFWVLQELKKQGLQPSTSTYGLV

Query:  MEVMLECGKYNLVHEFFRKVQKS-SVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV
        MEVMLE GK++ VH+FFRK++ S   P A+TYKVLV  LW+EGK +EAV A+ +ME++G++G+ ++YY+ A CLC+ GR  +A++++ ++ ++ N +PL 
Subjt:  MEVMLECGKYNLVHEFFRKVQKS-SVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICKVAN-KPLV

Query:  VTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDF
        +T+TGLI A L+   +   + IF +MK  C PN+ T N++LK Y  + MF EA+ELF+ +         VS     ++P+ Y ++ ML+AS    +W+ F
Subjt:  VTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDF

Query:  GYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKE
         + +  M+L GY  +  +H  M++EA+RAGK  LLE  +  + +   IP P    E  C   A+GD+  A++ I N  +  +   SE  W +L +E
Subjt:  GYFFNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCCACGTCTTAGTCGCCTACTCCTACACGCTGCCGCCCGTCGCTCTACACACCACCGTCTGCTGCAACCCGCAGTCGCCCGCTCGGCCACTCCTCTGAAGAAATT
CGCTTTCGAAGAAATTCGCTTTCGTCTCTCCATCGGAGCGCCGCCGATAGCCGGAGAAGGTGTAGAAGGTCGAAATAACGGCACGGAACTGCAGTATCAACCAAATTTTC
ACGGACTCTTTGGTCGGTGCGAGAACCTAAAATGGCCGCTGGAGCTGCCGATTGCCGGAGAAGATGTGAACAGTCCTATATTATCACCGCGCGGAATTGCAGAGCTTAAG
TATTATCTGACTTTTCCGCGGAGAATTAGGTCGGTGCAACATGCTAAAATGCCTTATGAGCCGCAGATAGAACAGGCACAGCCGAGAATAATGATAAATGGATTTCCGGC
ACTGCATTGTACCCAGAATTCCCATAATTTTTTTGGGTTTTCGTTCTTTCCTTGTTCAATTTCTGGAACTGACTTAAATATGGGCGACGTGAAGAATAGAGTTTTAAGGC
ACAGGGGACATAAATGTGGAGCAATTAAGGCTTCGTCAAATGGAGAATCTGATATTCGATTGCCAAGTGGGAATCTCCTCGAAAATGATTTTCCATTTAAGCCATCGTTC
GATGAATATGTGAGGGTCATGGAGACTGTTAGAACTAGAAGGTACAAGAAGCAGACAGACGATCCTAATAAACTAACGATGAAGGAAAATGCAAGTGCAAAGAGTGCTGA
GAGCACTTCCATTTCTAAAATAGATAATGGAAAAAACAAAGGGACTGATGTTCAACGTGATGTGGATGTAAAGAACATGTTTAAACGTGTTGATCGTAAAGATTTGTTCA
ATAATACAGAGAGAATTACTCATAGAAGAGATTTGTCAGGAAATAAATTTGATAGCAAAAGGAAAGGTGTTACAAGATCAAATGATGAGGTTAAAGGCAAGGTGACCCCA
TTTTACTCACAGGCTGATGATAAACAACATGAAAAGAAAAGGAATAGAAACTGGTCGAGTGACATTGAGCCAAAAGTACCGAGGTCATACAATGAGAAACTAATTAATTT
TAAGGCTAATACATTGGATGTCAAAAGAGAAAACCGCCGTGTATGTGATAGAAGTCCCATGAAAATATCAGAAAAGATTTGGGCCGATGATGACACTAAACGAGCTAAGG
ATGTTCTCAAGGCTGGAAAATTTGGTGTTCAGCTTGAAAGAAACTATATCCCAGGTGAAAAGGTTGGTAGAAAGAAAAATGAGCAGTCCTACAGAGGGCCGTCCAAAAGT
GGCAAGCAGTTTCTTGAATTTCCTCAAGAGAGTAGCTTGGAGGTAGAACATGCAGCCTTCAACAATTTTGATGCATTAGACATAATGGATAAACCAAGAGTTTCAAAGAT
GGAAATGGAAGAGAGAATCCAGATGCTATCTAAGAGATTGAATGGTTCAAACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATT
CAGATCACTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGGAAATTGGAGGCGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACAT
AAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAGCAAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGCAACACTTTTCCTCATATCC
TGACTTAGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTTTCCTCCAAAGAAGTTTAAAA
CAGGGGCACTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGTTATCTATAATTCGGTTTTAAATGCTTGTGTTAAACGAAAAAATTTGGAAGGGGCATTTTGGGTC
TTGCAGGAATTGAAGAAACAAGGTCTACAGCCTTCGACCTCAACATATGGATTGGTCATGGAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAG
AAAAGTGCAGAAATCTTCCGTTCCTAATGCTTTAACATATAAAGTTCTTGTCAATACACTTTGGAAAGAGGGAAAAACTGATGAGGCTGTGCTGGCCATTGAGAACATGG
AAAGACGAGGGATAGTAGGGTCTGCAGCTCTTTATTACGACTTTGCTCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGATGGAGAAGATATGCAAA
GTTGCTAATAAGCCTCTTGTAGTGACTTACACTGGTTTGATTCAAGCTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATATTCAACCACATGAAGACCTTTTG
CTCTCCCAATCTTGTTACTTATAATATACTGTTGAAAGGTTACTTGGAACATGGGATGTTTGAAGAGGCTAGAGAGCTGTTTCAAAATTTGTCAGAGCATGGACGAAATA
TTAGCAATGTATCTGACTATAGGGATCGAGTATTACCAGATATCTACATGTTCAATACCATGCTAGATGCATCTTTTGCCGAAAAAAGATGGGATGATTTTGGCTATTTC
TTTAACCAGATGCTTCTTTATGGATATCACTTCAACCCGAAACGTCATCTGCGGATGATATTGGAGGCTGCTAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAA
GCACCTATCTCAGGTTGACCGGATTCCGCCACCGGCGCTTCTCAAAGAAAGGTTTTGCATGAAGCTGGCTAGAGGTGACTACTCTGATGCTCTCTCTTGCATTTCAAATC
ACGATAGTAGCGATGCACATCATTTCTCTGAGCCGGCTTGGCTAAATTTACTGAAAGAGAAAAGGTTTCCCAAGGATACTGTCATTCAGTTAATTCATAAGAATGCAGGT
GGGAAGGGTCTCACCAGGCTTCTTGTGAGTGAAAGGGAAAGTTAA
mRNA sequenceShow/hide mRNA sequence
TCTCCTTGTCCTCTGCGTCTTCATCTTCGACTCTCCAATCGCCCATCCATTCCATCTCCACTCCATGAGTCCACGTCTTAGTCGCCTACTCCTACACGCTGCCGCCCGTC
GCTCTACACACCACCGTCTGCTGCAACCCGCAGTCGCCCGCTCGGCCACTCCTCTGAAGAAATTCGCTTTCGAAGAAATTCGCTTTCGTCTCTCCATCGGAGCGCCGCCG
ATAGCCGGAGAAGGTGTAGAAGGTCGAAATAACGGCACGGAACTGCAGTATCAACCAAATTTTCACGGACTCTTTGGTCGGTGCGAGAACCTAAAATGGCCGCTGGAGCT
GCCGATTGCCGGAGAAGATGTGAACAGTCCTATATTATCACCGCGCGGAATTGCAGAGCTTAAGTATTATCTGACTTTTCCGCGGAGAATTAGGTCGGTGCAACATGCTA
AAATGCCTTATGAGCCGCAGATAGAACAGGCACAGCCGAGAATAATGATAAATGGATTTCCGGCACTGCATTGTACCCAGAATTCCCATAATTTTTTTGGGTTTTCGTTC
TTTCCTTGTTCAATTTCTGGAACTGACTTAAATATGGGCGACGTGAAGAATAGAGTTTTAAGGCACAGGGGACATAAATGTGGAGCAATTAAGGCTTCGTCAAATGGAGA
ATCTGATATTCGATTGCCAAGTGGGAATCTCCTCGAAAATGATTTTCCATTTAAGCCATCGTTCGATGAATATGTGAGGGTCATGGAGACTGTTAGAACTAGAAGGTACA
AGAAGCAGACAGACGATCCTAATAAACTAACGATGAAGGAAAATGCAAGTGCAAAGAGTGCTGAGAGCACTTCCATTTCTAAAATAGATAATGGAAAAAACAAAGGGACT
GATGTTCAACGTGATGTGGATGTAAAGAACATGTTTAAACGTGTTGATCGTAAAGATTTGTTCAATAATACAGAGAGAATTACTCATAGAAGAGATTTGTCAGGAAATAA
ATTTGATAGCAAAAGGAAAGGTGTTACAAGATCAAATGATGAGGTTAAAGGCAAGGTGACCCCATTTTACTCACAGGCTGATGATAAACAACATGAAAAGAAAAGGAATA
GAAACTGGTCGAGTGACATTGAGCCAAAAGTACCGAGGTCATACAATGAGAAACTAATTAATTTTAAGGCTAATACATTGGATGTCAAAAGAGAAAACCGCCGTGTATGT
GATAGAAGTCCCATGAAAATATCAGAAAAGATTTGGGCCGATGATGACACTAAACGAGCTAAGGATGTTCTCAAGGCTGGAAAATTTGGTGTTCAGCTTGAAAGAAACTA
TATCCCAGGTGAAAAGGTTGGTAGAAAGAAAAATGAGCAGTCCTACAGAGGGCCGTCCAAAAGTGGCAAGCAGTTTCTTGAATTTCCTCAAGAGAGTAGCTTGGAGGTAG
AACATGCAGCCTTCAACAATTTTGATGCATTAGACATAATGGATAAACCAAGAGTTTCAAAGATGGAAATGGAAGAGAGAATCCAGATGCTATCTAAGAGATTGAATGGT
TCAAACATTGATATGCCTGAGTGGATGTTCTCTCAAATGATGAGGAGTGCAAAGATTAGATATTCAGATCACTCAATATTAAGGGTTATTCAAGTGTTGGGTAAGCTAGG
AAATTGGAGGCGAGTGCTACAAGTCATTGAATGGCTTCAAATGCGTGAACGGTTCAAGTCACATAAGCTGAGATTTATATACACCACTGCCCTTGATGTACTTGGAAAAG
CAAGGAGACCTGTGGAGGCACTCAATGTATTCCATGCAATGCAGCAACACTTTTCCTCATATCCTGACTTAGTAGCATATCATAGTATTGCTGTCACTCTTGGACAAGCA
GGATATATGAGGGAACTCTTTGATGTGATTGATAGCATGCGGTTTCCTCCAAAGAAGTTTAAAACAGGGGCACTTGAGAAGTGGGACCCACGGCTGCAACCTGATATAGT
TATCTATAATTCGGTTTTAAATGCTTGTGTTAAACGAAAAAATTTGGAAGGGGCATTTTGGGTCTTGCAGGAATTGAAGAAACAAGGTCTACAGCCTTCGACCTCAACAT
ATGGATTGGTCATGGAGGTGATGCTTGAATGTGGCAAGTACAACTTAGTTCATGAGTTCTTCAGAAAAGTGCAGAAATCTTCCGTTCCTAATGCTTTAACATATAAAGTT
CTTGTCAATACACTTTGGAAAGAGGGAAAAACTGATGAGGCTGTGCTGGCCATTGAGAACATGGAAAGACGAGGGATAGTAGGGTCTGCAGCTCTTTATTACGACTTTGC
TCGTTGTCTTTGCAGTGCTGGTAGGTGCAAAGAAGCCCTGATGCAGATGGAGAAGATATGCAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACTGGTTTGATTCAAG
CTTGTTTGGACTCAAAAGACTTGCAAAGTGCAGTCTATATATTCAACCACATGAAGACCTTTTGCTCTCCCAATCTTGTTACTTATAATATACTGTTGAAAGGTTACTTG
GAACATGGGATGTTTGAAGAGGCTAGAGAGCTGTTTCAAAATTTGTCAGAGCATGGACGAAATATTAGCAATGTATCTGACTATAGGGATCGAGTATTACCAGATATCTA
CATGTTCAATACCATGCTAGATGCATCTTTTGCCGAAAAAAGATGGGATGATTTTGGCTATTTCTTTAACCAGATGCTTCTTTATGGATATCACTTCAACCCGAAACGTC
ATCTGCGGATGATATTGGAGGCTGCTAGGGCTGGAAAGGATGAGCTACTGGAAACAACATGGAAGCACCTATCTCAGGTTGACCGGATTCCGCCACCGGCGCTTCTCAAA
GAAAGGTTTTGCATGAAGCTGGCTAGAGGTGACTACTCTGATGCTCTCTCTTGCATTTCAAATCACGATAGTAGCGATGCACATCATTTCTCTGAGCCGGCTTGGCTAAA
TTTACTGAAAGAGAAAAGGTTTCCCAAGGATACTGTCATTCAGTTAATTCATAAGAATGCAGGTGGGAAGGGTCTCACCAGGCTTCTTGTGAGTGAAAGGGAAAGTTAAG
GCTTTTAATATTGTGATTGTGATACCAGTGGCTCCATCTGGATTCTCTTCACTGGTGGGATGGGCAGCTCTAAATTTGGCCCCTCTTTTTTCCTTTTTCCTTTGGTTTAT
CTCTCAAAGTTCCAATTGAACAAGTTGAAAGAGGTTCGACTCGAACTTACGACCTCTAATCGGTTAGGACATGTTAGTATGAAATGTAAATTAGAAGAGTTTTACACATG
GTTCTAATAATAGGTTTGTGAAAATTTGGTATAGCTCATGTTTCAATTATACAAAGATGTAAATATCTTTTCTTTTCTAGTTTAACAACATGCATAAGTGAAAGTTTTAA
TCCTTGACCTTTTGGTCCAAG
Protein sequenceShow/hide protein sequence
MSPRLSRLLLHAAARRSTHHRLLQPAVARSATPLKKFAFEEIRFRLSIGAPPIAGEGVEGRNNGTELQYQPNFHGLFGRCENLKWPLELPIAGEDVNSPILSPRGIAELK
YYLTFPRRIRSVQHAKMPYEPQIEQAQPRIMINGFPALHCTQNSHNFFGFSFFPCSISGTDLNMGDVKNRVLRHRGHKCGAIKASSNGESDIRLPSGNLLENDFPFKPSF
DEYVRVMETVRTRRYKKQTDDPNKLTMKENASAKSAESTSISKIDNGKNKGTDVQRDVDVKNMFKRVDRKDLFNNTERITHRRDLSGNKFDSKRKGVTRSNDEVKGKVTP
FYSQADDKQHEKKRNRNWSSDIEPKVPRSYNEKLINFKANTLDVKRENRRVCDRSPMKISEKIWADDDTKRAKDVLKAGKFGVQLERNYIPGEKVGRKKNEQSYRGPSKS
GKQFLEFPQESSLEVEHAAFNNFDALDIMDKPRVSKMEMEERIQMLSKRLNGSNIDMPEWMFSQMMRSAKIRYSDHSILRVIQVLGKLGNWRRVLQVIEWLQMRERFKSH
KLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPDLVAYHSIAVTLGQAGYMRELFDVIDSMRFPPKKFKTGALEKWDPRLQPDIVIYNSVLNACVKRKNLEGAFWV
LQELKKQGLQPSTSTYGLVMEVMLECGKYNLVHEFFRKVQKSSVPNALTYKVLVNTLWKEGKTDEAVLAIENMERRGIVGSAALYYDFARCLCSAGRCKEALMQMEKICK
VANKPLVVTYTGLIQACLDSKDLQSAVYIFNHMKTFCSPNLVTYNILLKGYLEHGMFEEARELFQNLSEHGRNISNVSDYRDRVLPDIYMFNTMLDASFAEKRWDDFGYF
FNQMLLYGYHFNPKRHLRMILEAARAGKDELLETTWKHLSQVDRIPPPALLKERFCMKLARGDYSDALSCISNHDSSDAHHFSEPAWLNLLKEKRFPKDTVIQLIHKNAG
GKGLTRLLVSERES