CuGenDBv2

CaUC04G069010 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC04G069010
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Ciama_Chr04:9128078..9131423
RNA-Seq Expression: CaUC04G069010
Synteny: CaUC04G069010
Gene Ontology terms:
GO:0006520 - cellular amino acid metabolic process (biological process)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016597 - amino acid binding (molecular function)
GO:0016743 - carboxyl- or carbamoyltransferase activity (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR006132 - Aspartate/ornithine carbamoyltransferase, carbamoyl-P binding
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain
IPR036901 - Aspartate/ornithine carbamoyltransferase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600674.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 88.81
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFS++LQSFR +LKTCIAQRDLRTGKSLHALYIKSF+P STY+SNHFILLYSKCRRLSAAR VFDQT +CNIFSFNALI+AYAKESFVEVAR+LFDKM
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
         QPDPVSYNTLIAAYA RGD E AFQLF+EMREA LDMDGFTLSGIITACGD+VALIRQLHALSV AGFD YASVGNALITYYSKNGFLNEAQRIF  MG
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        ED+DEVSWNSMVVAYMQHREGSKAL LY+EMT+RGL+VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGG ML CRKVFD
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EICKPDLVLWNTMISGYSL+EE S+EALECFR+LQGVGH PDDCSLVCVISAC+NMSSPSQGRQVH L FKLDIPSNRISVNNAL+AMYSKCGNLRDARR
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHNTVS+NSMIAGYAQHGMG QSL+LFQRML++GFTPTNITFISVLAACAHTGRV+DGKIYFNMMK KFGIEPEA HFSC+IDLLGRAGKLSEA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPF+PGSI WSALLGACRTHGN+ELA KAANHLLQ++PSNAAPYVMLANIYADNGR ED A+VRKLMRDRGVKKKPGCSWIEV+RR HIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        TSHPMIKKI EYLEEMMRKIKKAGY  DVRS +IG  D  R+ EEELRLGHHSEKLAV+FGLM T E  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

XP_004146400.1 pentatricopeptide repeat-containing protein At3g49710 isoform X1 [Cucumis sativus] | e value: 0.0e+00 | % identity: 90
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MH FSSLL +FRQ LKTCIA RDLRTGKSLHALYIKSF+PTSTYLSNHF+LLYSKCRRLSAAR VFD THDCN+FSFN LISAYAKES+VEVA QLFD+M
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLIAAYA RGDT+PAFQLFLEMREAFLDMDGFTLSGIITACG NV LIRQLHALSVV G DSY SVGNALIT YSKNGFL EA+RIF W+ 
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVD+FTLASVLTAFTNVQDL GGLQFHAKLIKSGYHQN HVGSGLIDLYSKCGGCMLDCRKVFD
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EI  PDLVLWNTMISGYSLYE+LS+EALECFRQLQ VGHRPDDCSLVCVISACSNMSSPSQGRQVHGLA KLDIPSNRISVNNAL+AMYSKCGNLRDA+ 
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHNTVSYNSMIAGYAQHGMG QSLHLFQRMLE+GFTPTNITFISVLAACAHTGRVEDGKIYFNMMK KFGIEPEAGHFSCMIDLLGRAGKLSEA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPFDPG   WSALLGACR HGNVELA+KAAN LLQ+DP NAAPYVMLANIY+DNGRL+DAA+VRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        T HPMIKKIQEYLEEMMRKIKK GYT +VRSA +G  DR  Q EEELRLGHHSEKLAVSFGLMSTRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

XP_008442084.1 PREDICTED: pentatricopeptide repeat-containing protein At3g49710 [Cucumis melo] | e value: 0.0e+00 | % identity: 90.3
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFSSLLQSFR+ILKTCIAQRDLRTGKSLHALYIKSF+PTSTYLSNHF+LLYSKCRRLSAAR VFD THDCN+FSFN LISAYAKES+VEVAR+LFD+M
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLIAAYA  GDT+PAFQLFLEMREAFLDMDGFTLSGIITACG NVALI QLHALSVV G DSY SVGN LIT YSKNGFL EA+RIF W+ 
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVD+FTLASVLTAFTNVQDL GGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVF+
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EIC PDLVLWNTMISGYSLYE+LSNEALECFRQLQ VGHRPDDCSLVCVISACSNMSSPSQGRQVHGLA KLDIPSNRISVNNAL+AMYSKCGNLRDA+ 
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHN VSYNSMIAGYAQHG+G QSLHLFQRMLE+GFTPTNITFISVLAACAHTGRVEDGKIYFNMMK KFGIEPEAGHFSCMIDLL RAGKL+EA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPFDPGS  WSALLGACR HGNVELAVKAAN LLQ+DPSNAAPYVMLANIY+DNGRL+DAA+VRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        T HPMIKKIQEYLEEM+RKIKK GYT +VRSA +GD DR  Q EEELRLG+HSEKLAVSFGLMSTRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

XP_023547019.1 pentatricopeptide repeat-containing protein At3g49710 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 88.66
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFS++LQSFR +LKTCIAQRDLRTGK LHALYIKSF+P STY+SNHFILLYSKCRRLSAAR VFDQT +CNIFSFNALI+AYAKESFVEVAR+LFDKM
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
         QPDP+SYNTLIAAYA RGD + AFQLF+EMREA LDMDGFTLSGIITACGD+VALIRQLHALSV AGFD YASVGNALITYYSKNGFLNEAQRIF  MG
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        ED+DEVSWNSMVVAYMQHREGSKAL LY+EMT+RGL+VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGG ML CRKVFD
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EICKPDLVLWNTMISGYSL+EE S+EALECFR+LQGVGH PDDCSLVCVISAC+NMSSPSQGRQVH L FKLDIPSNRISVNNAL+AMYSKCGNLRDARR
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHNTVS+NSMIAGYAQHGMG QSL+LFQRMLE+GFTPTNITFISVLAACAHTGRV+DGKIYFNMMK KFGIEPEA HFSC+IDLLGRAGKLSEA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPF+PGSI WSALLGACRTHGN+ELA KAANHLLQ++PSNAAPYVMLANIYADNGR ED A+VRKLMRDRGVKKKPGCSWIEV+RR HIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        TSHPMIKKI EYLEEMMRKIKKAGY  DVRS +IG  D  R+ EEELRLGHHSEKLAV+FGLM TRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

XP_038881007.1 pentatricopeptide repeat-containing protein At3g49710 [Benincasa hispida] | e value: 0.0e+00 | % identity: 93.88
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFSS+LQSFRQILKTCIA+RDLRTGKSLHALYIKSF+PTSTYLSNHF+LLYSKCRRLSAAR VFD THDCN FSFNALISAYAKESFVEVARQLFD+M
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLIAAYA RGDTEPAFQLF+EMRE+FL+MDGFTLSGIITACGDNVALIRQLH LSVVAG DSY SVGNALIT+YSKNGFLNEAQRIF WMG
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        EDRD+VSWNSMVVAYMQHR+GSKALELYLEMTVR LIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYH+NCHVGSGLIDLYSKCGGCMLDCRKVFD
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EICKPDLVLWNTMISGYSLYEELS+EALECFRQLQGVGH+PDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNAL+AMYSKCGNLRDARR
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LF+TMPEHNTVSYNSMIAGYAQHGMG QSLHLFQRMLE+GFTPTNITFISVLAACAHTGRVEDGKIYFNMMK KFGIEPEAGHFSCM+DLLGRAGKLSEA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPFDPGSI WSALLGACRTHGNVELAVKAANHLLQ+DPSNAAPYVMLANIYADNGRL+DAA VRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        TSHPMIKKIQEYLEEMMRKIK+AGYTQDVRSA++GDYDRERQ EEELRLGHHSEKLAV+FGLMSTRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
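
The alignment blocks above follow the usual BLAST-style layout, in which the unlabeled middle line shows a letter where the two residues are identical, a `+` where the substitution is scored as conservative, and a space otherwise. As a minimal sketch, assuming that convention holds for this page, the following Python function tallies identities and positives for one block; the function name `midline_stats` and its return keys are hypothetical and not part of CuGenDB.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one BLAST-style alignment block.

    Assumes the midline carries a letter for an identical residue,
    '+' for a conservative substitution, and a space for anything else.
    """
    # Pad or trim the midline so it covers exactly the query columns;
    # trailing spaces are often lost when pages are copied as text.
    midline = midline.ljust(len(query))[: len(query)]

    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    length = len(query)

    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


if __name__ == "__main__":
    # First ten columns of the first GenBank alignment on this page.
    stats = midline_stats("MHQFSSLLQS", "MHQFS++LQS")
    print(stats)
```

Summing the counts over all blocks of one hit and dividing by the total alignment length gives a figure comparable to the % identity column, although the value reported by the database may be computed over a different alignment span.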

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L2J0 DYW_deaminase domain-containing protein | e value: 0.0e+00 | % identity: 90
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MH FSSLL +FRQ LKTCIA RDLRTGKSLHALYIKSF+PTSTYLSNHF+LLYSKCRRLSAAR VFD THDCN+FSFN LISAYAKES+VEVA QLFD+M
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLIAAYA RGDT+PAFQLFLEMREAFLDMDGFTLSGIITACG NV LIRQLHALSVV G DSY SVGNALIT YSKNGFL EA+RIF W+ 
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVD+FTLASVLTAFTNVQDL GGLQFHAKLIKSGYHQN HVGSGLIDLYSKCGGCMLDCRKVFD
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EI  PDLVLWNTMISGYSLYE+LS+EALECFRQLQ VGHRPDDCSLVCVISACSNMSSPSQGRQVHGLA KLDIPSNRISVNNAL+AMYSKCGNLRDA+ 
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHNTVSYNSMIAGYAQHGMG QSLHLFQRMLE+GFTPTNITFISVLAACAHTGRVEDGKIYFNMMK KFGIEPEAGHFSCMIDLLGRAGKLSEA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPFDPG   WSALLGACR HGNVELA+KAAN LLQ+DP NAAPYVMLANIY+DNGRL+DAA+VRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        T HPMIKKIQEYLEEMMRKIKK GYT +VRSA +G  DR  Q EEELRLGHHSEKLAVSFGLMSTRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

A0A1S3B5M8 pentatricopeptide repeat-containing protein At3g49710 | e value: 0.0e+00 | % identity: 90.3
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFSSLLQSFR+ILKTCIAQRDLRTGKSLHALYIKSF+PTSTYLSNHF+LLYSKCRRLSAAR VFD THDCN+FSFN LISAYAKES+VEVAR+LFD+M
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLIAAYA  GDT+PAFQLFLEMREAFLDMDGFTLSGIITACG NVALI QLHALSVV G DSY SVGN LIT YSKNGFL EA+RIF W+ 
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVD+FTLASVLTAFTNVQDL GGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVF+
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EIC PDLVLWNTMISGYSLYE+LSNEALECFRQLQ VGHRPDDCSLVCVISACSNMSSPSQGRQVHGLA KLDIPSNRISVNNAL+AMYSKCGNLRDA+ 
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHN VSYNSMIAGYAQHG+G QSLHLFQRMLE+GFTPTNITFISVLAACAHTGRVEDGKIYFNMMK KFGIEPEAGHFSCMIDLL RAGKL+EA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPFDPGS  WSALLGACR HGNVELAVKAAN LLQ+DPSNAAPYVMLANIY+DNGRL+DAA+VRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        T HPMIKKIQEYLEEM+RKIKK GYT +VRSA +GD DR  Q EEELRLG+HSEKLAVSFGLMSTRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

A0A5A7TIR2 Pentatricopeptide repeat-containing protein | e value: 0.0e+00 | % identity: 90.3
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFSSLLQSFR+ILKTCIAQRDLRTGKSLHALYIKSF+PTSTYLSNHF+LLYSKCRRLSAAR VFD THDCN+FSFN LISAYAKES+VEVAR+LFD+M
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLIAAYA  GDT+PAFQLFLEMREAFLDMDGFTLSGIITACG NVALI QLHALSVV G DSY SVGN LIT YSKNGFL EA+RIF W+ 
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVD+FTLASVLTAFTNVQDL GGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVF+
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EIC PDLVLWNTMISGYSLYE+LSNEALECFRQLQ VGHRPDDCSLVCVISACSNMSSPSQGRQVHGLA KLDIPSNRISVNNAL+AMYSKCGNLRDA+ 
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHN VSYNSMIAGYAQHG+G QSLHLFQRMLE+GFTPTNITFISVLAACAHTGRVEDGKIYFNMMK KFGIEPEAGHFSCMIDLL RAGKL+EA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPFDPGS  WSALLGACR HGNVELAVKAAN LLQ+DPSNAAPYVMLANIY+DNGRL+DAA+VRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        T HPMIKKIQEYLEEM+RKIKK GYT +VRSA +GD DR  Q EEELRLG+HSEKLAVSFGLMSTRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

A0A6J1FSH2 pentatricopeptide repeat-containing protein At3g49710 | e value: 0.0e+00 | % identity: 88.51
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFS++LQSFR +LKTCIAQRDLRTG SLHALYIKSF+P STY SNHFILLYSKCRRLSAAR VFDQT +CNIFSFNALI+AYAKESFVEVAR+LFDKM
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
         QPDPVSYNTLIAAYA RGD E AFQLF+EMREA LDMDGFTLSGIITACGD+VALIRQLHALSV AGFD YASVGNALITYYSKNGFLNEAQRIF  MG
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        ED+DEVSWNSMVVAYMQHREGSKAL LY+EMT+RGL+VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGG ML CRKVFD
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EICKPDLVLWNTMISGYSL+EE S+EALECFR+LQGVGH PDDCSLVCVISAC+NMSSPSQGRQVH L FKLDIPSNRISVNNAL+AMYSKCGNLRDARR
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHNTVS+NS+IAGYAQHGMG QSL+LFQRML++GFTPTNITFISVLAACAHTGRV+DGKIYFNMMK KFGIEPEA HFSC+IDLLGRAGKLSEA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPF+PGSI WSALLGACRTHGN+ELA KAANHLLQ++PSNAAPYVMLANIYADNGR ED A+VRKLMRDRGVKKKPGCSWIEV+RR HIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        TSHPMIKKI EYLEEMMRKIKKAGY  DVRS +IG  +  R+ EEELRLGHHSEKLAV+FGLM TRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

A0A6J1KGE1 pentatricopeptide repeat-containing protein At3g49710-like | e value: 0.0e+00 | % identity: 88.81
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        MHQFS++LQSFR +LKTCIAQRD+RTGKSLHALYIKSF+P STY+SNHFILLYSKCRRLSAAR VFDQT +CNIFSFNALISAYAKESFVEVAR+LFDKM
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
         QPDPVSYNTLIAAYA RGD E AFQLF+EMREA LDMDGFTLSGIITACGD+VALIRQLHALSVVAGFD Y SVGNALITYYSKN FLNEAQRIF  MG
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD
        ED+DEVSWNSMVVAYMQHREGSKALELY+EMT+RGL+VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGG ML CRKVFD
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFD

Query:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR
        EICKPDLVLWNTMISGYSL+EE S+EALECFR+LQGVGH PDDCSLVCVISAC+NMSSPSQGRQVH L FKLDIPSNRISVNNAL+AMYSKCGNLRDARR
Subjt:  EICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARR

Query:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA
        LFDTMPEHNTVS+NSMIAGYAQHGMG QSL+LFQRMLE+GFTPT ITFISVLAACAHTGRV+DGKIYFNMMK KFGIEPEA HFSC+IDLLGRAGKLSEA
Subjt:  LFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-KFGIEPEAGHFSCMIDLLGRAGKLSEA

Query:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED
        ERLIETIPF+PGSILWSALLGACRTHGN+ELA KAANHLLQ++PSNAAPYVMLANIYADNGR ED  +VRKLMRDRGVKKKPGCSWIEV+RR HIFVAED
Subjt:  ERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAED

Query:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP
        TSHPMIKKI EYLEEMMRKIKKAGY  DVRS +I   D  R+ EEELRLGHHSEKLAV+FGLM TRE  P
Subjt:  TSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP

SwissProt top hits (e value, % identity, alignment)
Q9CAA8 Putative pentatricopeptide repeat-containing protein At1g68930 | e value: 1.7e-115 | % identity: 34.36
Query:  LKTCI---AQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQPDPVSYNTL
        +K CI   A+   R  K +H   I++     T+L N+ +  Y+  +  + AR VFD+    N+FS+N L+ AY+K   +      F+K+P  D V++N L
Subjt:  LKTCI---AQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQPDPVSYNTL

Query:  IAAYAHRGDTEPAFQLF-LEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF------------
        I  Y+  G    A + +   MR+   ++   TL  ++     N  V+L +Q+H   +  GF+SY  VG+ L+  Y+  G +++A+++F            
Subjt:  IAAYAHRGDTEPAFQLF-LEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF------------

Query:  -----------------RWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGS
                          + G ++D VSW +M+    Q+    +A+E + EM V+GL +D +   SVL A   +  ++ G Q HA +I++ +  + +VGS
Subjt:  -----------------RWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGS

Query:  GLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRI
         LID+Y KC  C+   + VFD + + ++V W  M+ GY      + EA++ F  +Q  G  PD  +L   ISAC+N+SS  +G Q HG A    +  + +
Subjt:  GLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRI

Query:  SVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMM-KKFGIEP
        +V+N+LV +Y KCG++ D+ RLF+ M   + VS+ +M++ YAQ G   +++ LF +M++ G  P  +T   V++AC+  G VE G+ YF +M  ++GI P
Subjt:  SVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMM-KKFGIEP

Query:  EAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVK
          GH+SCMIDL  R+G+L EA R I  +PF P +I W+ LL ACR  GN+E+   AA  L+++DP + A Y +L++IYA  G+ +  A +R+ MR++ VK
Subjt:  EAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVK

Query:  KKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLM
        K+PG SWI+   ++H F A+D S P + +I   LEE+  KI   GY  D    +   +D E  ++ ++ L +HSE+LA++FGL+
Subjt:  KKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLM

Q9M2Y7 Pentatricopeptide repeat-containing protein At3g49710 | e value: 1.4e-247 | % identity: 62.63
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        M+Q     ++FR +L   +A+RDL TGKSLHALY+KS + +STYLSNHF+ LYSKC RLS AR  F  T + N+FS+N ++ AYAK+S + +ARQLFD++
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLI+ YA   +T  A  LF  MR+   ++DGFTLSG+I AC D V LI+QLH  SV  GFDSY+SV NA +TYYSK G L EA  +F  M 
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGC--MLDCRKV
        E RDEVSWNSM+VAY QH+EG+KAL LY EM  +G  +DMFTLASVL A T++  L GG QFH KLIK+G+HQN HVGSGLID YSKCGGC  M D  KV
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGC--MLDCRKV

Query:  FDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDA
        F EI  PDLV+WNTMISGYS+ EELS EA++ FRQ+Q +GHRPDDCS VCV SACSN+SSPSQ +Q+HGLA K  IPSNRISVNNAL+++Y K GNL+DA
Subjt:  FDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDA

Query:  RRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-FGIEPEAGHFSCMIDLLGRAGKLS
        R +FD MPE N VS+N MI GYAQHG G+++L L+QRML+ G  P  ITF++VL+ACAH G+V++G+ YFN MK+ F IEPEA H+SCMIDLLGRAGKL 
Subjt:  RRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-FGIEPEAGHFSCMIDLLGRAGKLS

Query:  EAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVA
        EAER I+ +P+ PGS+ W+ALLGACR H N+ LA +AAN L+ M P  A PYVMLAN+YAD  + E+ A+VRK MR + ++KKPGCSWIEV ++ H+FVA
Subjt:  EAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVA

Query:  EDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTRE
        ED SHPMI+++ EYLEEMM+K+KK GY  D + A + + D   + +EE+RLGHHSEKLAV+FGLMSTR+
Subjt:  EDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTRE

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g01510 | e value: 6.6e-112 | % identity: 35.75
Query:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACG--DNVALIRQLHALSVVAGFDSYASV
        N L+ +Y +   +++A  LF+++P+ D V++NTLI  Y   G    +  LFL+MR++      FT SG++ A     + AL +QLHALSV  GF   ASV
Subjt:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACG--DNVALIRQLHALSVVAGFDSYASV

Query:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC
        GN ++ +YSK+  + E + +F  M E  D VS+N ++ +Y Q  +   +L  + EM   G     F  A++L+   N+  L  G Q H + + +      
Subjt:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC

Query:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIP
        HVG+ L+D+Y+KC     +   +F  + +   V W  +ISGY + + L    L+ F +++G   R D  +   V+ A ++ +S   G+Q+H    +    
Subjt:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIP

Query:  SNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-F
         N  S  + LV MY+KCG+++DA ++F+ MP+ N VS+N++I+ +A +G G  ++  F +M+E G  P +++ + VL AC+H G VE G  YF  M   +
Subjt:  SNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-F

Query:  GIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDP-SNAAPYVMLANIYADNGRLEDAATVRKLMR
        GI P+  H++CM+DLLGR G+ +EAE+L++ +PF+P  I+WS++L ACR H N  LA +AA  L  M+   +AA YV ++NIYA  G  E    V+K MR
Subjt:  GIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDP-SNAAPYVMLANIYADNGRLEDAATVRKLMR

Query:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP----
        +RG+KK P  SW+EVN +IH+F + D +HP   +I   + E+  +I++ GY  D  S+ + D D + +IE    L +HSE+LAV+F L+ST E  P    
Subjt:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP----

Query:  TDFTASRAGNALVVIIGELLTGAVDAPSLVFFHGF
         +  A R  +A + +I +++   +       FH F
Subjt:  TDFTASRAGNALVVIIGELLTGAVDAPSLVFFHGF

Q9SHZ8 Pentatricopeptide repeat-containing protein At2g22070 | e value: 2.0e-116 | % identity: 33.76
Query:  SSLLQSFRQILKTCIAQRDLR-TGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQP
        S+LL+    +L+  + + + R T + +H   IKS L  S YL N+ + +YSK      AR +FD+      FS+N ++SAY+K   ++   + FD++PQ 
Subjt:  SSLLQSFRQILKTCIAQRDLR-TGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQP

Query:  DPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALI--RQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF-----
        D VS+ T+I  Y + G    A ++  +M +  ++   FTL+ ++ +      +   +++H+  V  G     SV N+L+  Y+K G    A+ +F     
Subjt:  DPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALI--RQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF-----

Query:  ----RW---------MGE------------DRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLI-VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSG
             W         +G+            +RD V+WNSM+  + Q     +AL+++ +M    L+  D FTLASVL+A  N++ L  G Q H+ ++ +G
Subjt:  ----RW---------MGE------------DRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLI-VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSG

Query:  YHQNCHVGSGLIDLYSKCGGC--------------------------------MLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVG
        +  +  V + LI +YS+CGG                                 M   + +F  +   D+V W  MI GY  +     EA+  FR + G G
Subjt:  YHQNCHVGSGLIDLYSKCGGC--------------------------------MLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVG

Query:  HRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKL-DIPSNRISVNNALVAMYSKCGNLRDARRLFDTMP-EHNTVSYNSMIAGYAQHGMGSQSLHLFQRM
         RP+  +L  ++S  S+++S S G+Q+HG A K  +I S  +SV+NAL+ MY+K GN+  A R FD +  E +TVS+ SMI   AQHG   ++L LF+ M
Subjt:  HRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKL-DIPSNRISVNNALVAMYSKCGNLRDARRLFDTMP-EHNTVSYNSMIAGYAQHGMGSQSLHLFQRM

Query:  LELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKKFG-IEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAA
        L  G  P +IT++ V +AC H G V  G+ YF+MMK    I P   H++CM+DL GRAG L EA+  IE +P +P  + W +LL ACR H N++L   AA
Subjt:  LELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKKFG-IEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAA

Query:  NHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGD
          LL ++P N+  Y  LAN+Y+  G+ E+AA +RK M+D  VKK+ G SWIEV  ++H+F  ED +HP   +I   ++++  +IKK GY  D  S     
Subjt:  NHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGD

Query:  YDRERQIEEELRLGHHSEKLAVSFGLMSTREE----IPTDFTASRAGNALVVIIGELLTGAVDAPSLVFFH----GFCT
        +D E +++E++ L HHSEKLA++FGL+ST ++    I  +       +  +  I +L+   +       FH    GFC+
Subjt:  YDRERQIEEELRLGHHSEKLAVSFGLMSTREE----IPTDFTASRAGNALVVIIGELLTGAVDAPSLVFFH----GFCT

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic    2.3e-109    36.38
Query:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASV
        N+L++ Y K   V+ AR++FD+M + D +S+N++I  Y   G  E    +F++M  + +++D  T+  +   C D+  ++L R +H++ V A F      
Subjt:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASV

Query:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC
         N L+  YSK G L+ A+ +FR M  DR  VS+ SM+  Y +     +A++L+ EM   G+  D++T+ +VL      + L  G + H  + ++    + 
Subjt:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC

Query:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFR-QLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDI
         V + L+D+Y+KCG  M +   VF E+   D++ WNT+I GYS     +NEAL  F   L+     PD+ ++ CV+ AC+++S+  +GR++HG   +   
Subjt:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFR-QLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDI

Query:  PSNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-K
         S+R  V N+LV MY+KCG L  A  LFD +   + VS+  MIAGY  HG G +++ LF +M + G     I+F+S+L AC+H+G V++G  +FN+M+ +
Subjt:  PSNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-K

Query:  FGIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMR
          IEP   H++C++D+L R G L +A R IE +P  P + +W ALL  CR H +V+LA K A  + +++P N   YV++ANIYA+  + E    +RK + 
Subjt:  FGIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMR

Query:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMST
         RG++K PGCSWIE+  R++IFVA D+S+P  + I+ +L ++  ++ + GY+   + A I   D E   +EE   G HSEKLA++ G++S+
Subjt:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMST

Arabidopsis top hits    e value    % identity    Alignment
AT1G68930.1 pentatricopeptide (PPR) repeat-containing protein    1.2e-116    34.36
Query:  LKTCI---AQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQPDPVSYNTL
        +K CI   A+   R  K +H   I++     T+L N+ +  Y+  +  + AR VFD+    N+FS+N L+ AY+K   +      F+K+P  D V++N L
Subjt:  LKTCI---AQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQPDPVSYNTL

Query:  IAAYAHRGDTEPAFQLF-LEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF------------
        I  Y+  G    A + +   MR+   ++   TL  ++     N  V+L +Q+H   +  GF+SY  VG+ L+  Y+  G +++A+++F            
Subjt:  IAAYAHRGDTEPAFQLF-LEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF------------

Query:  -----------------RWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGS
                          + G ++D VSW +M+    Q+    +A+E + EM V+GL +D +   SVL A   +  ++ G Q HA +I++ +  + +VGS
Subjt:  -----------------RWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGS

Query:  GLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRI
         LID+Y KC  C+   + VFD + + ++V W  M+ GY      + EA++ F  +Q  G  PD  +L   ISAC+N+SS  +G Q HG A    +  + +
Subjt:  GLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRI

Query:  SVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMM-KKFGIEP
        +V+N+LV +Y KCG++ D+ RLF+ M   + VS+ +M++ YAQ G   +++ LF +M++ G  P  +T   V++AC+  G VE G+ YF +M  ++GI P
Subjt:  SVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMM-KKFGIEP

Query:  EAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVK
          GH+SCMIDL  R+G+L EA R I  +PF P +I W+ LL ACR  GN+E+   AA  L+++DP + A Y +L++IYA  G+ +  A +R+ MR++ VK
Subjt:  EAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVK

Query:  KKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLM
        K+PG SWI+   ++H F A+D S P + +I   LEE+  KI   GY  D    +   +D E  ++ ++ L +HSE+LA++FGL+
Subjt:  KKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLM

AT2G22070.1 pentatricopeptide (PPR) repeat-containing protein    1.4e-117    33.76
Query:  SSLLQSFRQILKTCIAQRDLR-TGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQP
        S+LL+    +L+  + + + R T + +H   IKS L  S YL N+ + +YSK      AR +FD+      FS+N ++SAY+K   ++   + FD++PQ 
Subjt:  SSLLQSFRQILKTCIAQRDLR-TGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQP

Query:  DPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALI--RQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF-----
        D VS+ T+I  Y + G    A ++  +M +  ++   FTL+ ++ +      +   +++H+  V  G     SV N+L+  Y+K G    A+ +F     
Subjt:  DPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALI--RQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIF-----

Query:  ----RW---------MGE------------DRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLI-VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSG
             W         +G+            +RD V+WNSM+  + Q     +AL+++ +M    L+  D FTLASVL+A  N++ L  G Q H+ ++ +G
Subjt:  ----RW---------MGE------------DRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLI-VDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSG

Query:  YHQNCHVGSGLIDLYSKCGGC--------------------------------MLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVG
        +  +  V + LI +YS+CGG                                 M   + +F  +   D+V W  MI GY  +     EA+  FR + G G
Subjt:  YHQNCHVGSGLIDLYSKCGGC--------------------------------MLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVG

Query:  HRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKL-DIPSNRISVNNALVAMYSKCGNLRDARRLFDTMP-EHNTVSYNSMIAGYAQHGMGSQSLHLFQRM
         RP+  +L  ++S  S+++S S G+Q+HG A K  +I S  +SV+NAL+ MY+K GN+  A R FD +  E +TVS+ SMI   AQHG   ++L LF+ M
Subjt:  HRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKL-DIPSNRISVNNALVAMYSKCGNLRDARRLFDTMP-EHNTVSYNSMIAGYAQHGMGSQSLHLFQRM

Query:  LELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKKFG-IEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAA
        L  G  P +IT++ V +AC H G V  G+ YF+MMK    I P   H++CM+DL GRAG L EA+  IE +P +P  + W +LL ACR H N++L   AA
Subjt:  LELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKKFG-IEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAA

Query:  NHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGD
          LL ++P N+  Y  LAN+Y+  G+ E+AA +RK M+D  VKK+ G SWIEV  ++H+F  ED +HP   +I   ++++  +IKK GY  D  S     
Subjt:  NHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGD

Query:  YDRERQIEEELRLGHHSEKLAVSFGLMSTREE----IPTDFTASRAGNALVVIIGELLTGAVDAPSLVFFH----GFCT
        +D E +++E++ L HHSEKLA++FGL+ST ++    I  +       +  +  I +L+   +       FH    GFC+
Subjt:  YDRERQIEEELRLGHHSEKLAVSFGLMSTREE----IPTDFTASRAGNALVVIIGELLTGAVDAPSLVFFH----GFCT

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein    4.7e-113    35.75
Query:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACG--DNVALIRQLHALSVVAGFDSYASV
        N L+ +Y +   +++A  LF+++P+ D V++NTLI  Y   G    +  LFL+MR++      FT SG++ A     + AL +QLHALSV  GF   ASV
Subjt:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACG--DNVALIRQLHALSVVAGFDSYASV

Query:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC
        GN ++ +YSK+  + E + +F  M E  D VS+N ++ +Y Q  +   +L  + EM   G     F  A++L+   N+  L  G Q H + + +      
Subjt:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC

Query:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIP
        HVG+ L+D+Y+KC     +   +F  + +   V W  +ISGY + + L    L+ F +++G   R D  +   V+ A ++ +S   G+Q+H    +    
Subjt:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIP

Query:  SNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-F
         N  S  + LV MY+KCG+++DA ++F+ MP+ N VS+N++I+ +A +G G  ++  F +M+E G  P +++ + VL AC+H G VE G  YF  M   +
Subjt:  SNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-F

Query:  GIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDP-SNAAPYVMLANIYADNGRLEDAATVRKLMR
        GI P+  H++CM+DLLGR G+ +EAE+L++ +PF+P  I+WS++L ACR H N  LA +AA  L  M+   +AA YV ++NIYA  G  E    V+K MR
Subjt:  GIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDP-SNAAPYVMLANIYADNGRLEDAATVRKLMR

Query:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP----
        +RG+KK P  SW+EVN +IH+F + D +HP   +I   + E+  +I++ GY  D  S+ + D D + +IE    L +HSE+LAV+F L+ST E  P    
Subjt:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIP----

Query:  TDFTASRAGNALVVIIGELLTGAVDAPSLVFFHGF
         +  A R  +A + +I +++   +       FH F
Subjt:  TDFTASRAGNALVVIIGELLTGAVDAPSLVFFHGF

AT3G49710.1 Pentatricopeptide repeat (PPR) superfamily protein    1.0e-248    62.63
Query:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM
        M+Q     ++FR +L   +A+RDL TGKSLHALY+KS + +STYLSNHF+ LYSKC RLS AR  F  T + N+FS+N ++ AYAK+S + +ARQLFD++
Subjt:  MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKM

Query:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG
        PQPD VSYNTLI+ YA   +T  A  LF  MR+   ++DGFTLSG+I AC D V LI+QLH  SV  GFDSY+SV NA +TYYSK G L EA  +F  M 
Subjt:  PQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMG

Query:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGC--MLDCRKV
        E RDEVSWNSM+VAY QH+EG+KAL LY EM  +G  +DMFTLASVL A T++  L GG QFH KLIK+G+HQN HVGSGLID YSKCGGC  M D  KV
Subjt:  EDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGC--MLDCRKV

Query:  FDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDA
        F EI  PDLV+WNTMISGYS+ EELS EA++ FRQ+Q +GHRPDDCS VCV SACSN+SSPSQ +Q+HGLA K  IPSNRISVNNAL+++Y K GNL+DA
Subjt:  FDEICKPDLVLWNTMISGYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDA

Query:  RRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-FGIEPEAGHFSCMIDLLGRAGKLS
        R +FD MPE N VS+N MI GYAQHG G+++L L+QRML+ G  P  ITF++VL+ACAH G+V++G+ YFN MK+ F IEPEA H+SCMIDLLGRAGKL 
Subjt:  RRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKK-FGIEPEAGHFSCMIDLLGRAGKLS

Query:  EAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVA
        EAER I+ +P+ PGS+ W+ALLGACR H N+ LA +AAN L+ M P  A PYVMLAN+YAD  + E+ A+VRK MR + ++KKPGCSWIEV ++ H+FVA
Subjt:  EAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVA

Query:  EDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTRE
        ED SHPMI+++ EYLEEMM+K+KK GY  D + A + + D   + +EE+RLGHHSEKLAV+FGLMSTR+
Subjt:  EDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTRE

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein    1.7e-110    36.38
Query:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASV
        N+L++ Y K   V+ AR++FD+M + D +S+N++I  Y   G  E    +F++M  + +++D  T+  +   C D+  ++L R +H++ V A F      
Subjt:  NALISAYAKESFVEVARQLFDKMPQPDPVSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDN--VALIRQLHALSVVAGFDSYASV

Query:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC
         N L+  YSK G L+ A+ +FR M  DR  VS+ SM+  Y +     +A++L+ EM   G+  D++T+ +VL      + L  G + H  + ++    + 
Subjt:  GNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNSMVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNC

Query:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFR-QLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDI
         V + L+D+Y+KCG  M +   VF E+   D++ WNT+I GYS     +NEAL  F   L+     PD+ ++ CV+ AC+++S+  +GR++HG   +   
Subjt:  HVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMISGYSLYEELSNEALECFR-QLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDI

Query:  PSNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-K
         S+R  V N+LV MY+KCG L  A  LFD +   + VS+  MIAGY  HG G +++ LF +M + G     I+F+S+L AC+H+G V++G  +FN+M+ +
Subjt:  PSNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGYAQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMK-K

Query:  FGIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMR
          IEP   H++C++D+L R G L +A R IE +P  P + +W ALL  CR H +V+LA K A  + +++P N   YV++ANIYA+  + E    +RK + 
Subjt:  FGIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTHGNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMR

Query:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMST
         RG++K PGCSWIE+  R++IFVA D+S+P  + I+ +L ++  ++ + GY+   + A I   D E   +EE   G HSEKLA++ G++S+
Subjt:  DRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRSATIGDYDRERQIEEELRLGHHSEKLAVSFGLMST


Sequences
CDS sequence
ATGCATCAATTCTCCTCTCTTCTCCAAAGCTTTCGGCAGATTCTGAAAACGTGTATTGCCCAGAGAGACCTCCGTACTGGAAAGTCCCTCCATGCTCTCTATATC
AAGTCCTTCCTCCCCACATCCACCTACCTCTCCAACCACTTCATTCTTCTTTACTCCAAATGTCGTCGCCTCTCCGCTGCTCGTTGGGTCTTCGATCAAACCCAC
GATTGCAATATCTTTTCCTTCAACGCCTTGATTTCTGCTTACGCGAAAGAATCGTTTGTTGAAGTTGCACGCCAACTGTTTGATAAAATGCCCCAACCAGACCCT
GTTTCGTATAACACTCTCATTGCTGCATATGCACACCGTGGGGACACTGAGCCTGCTTTTCAGTTGTTTCTGGAGATGAGAGAGGCTTTTCTTGACATGGATGGG
TTCACTCTTTCTGGTATAATTACCGCTTGTGGCGATAATGTTGCTTTGATAAGGCAGTTGCACGCACTGAGTGTTGTGGCTGGGTTTGATTCTTATGCGTCTGTT
GGTAATGCACTCATTACATATTACAGCAAAAATGGATTTTTGAATGAGGCCCAGCGGATTTTTCGTTGGATGGGTGAAGATAGAGATGAAGTGTCGTGGAATTCG
ATGGTTGTAGCATATATGCAGCATCGTGAGGGCTCTAAGGCCCTAGAATTGTATTTGGAAATGACTGTGAGGGGCTTGATTGTTGACATGTTTACTTTAGCAAGC
GTCTTGACAGCGTTTACAAATGTGCAGGACTTATCAGGCGGGCTTCAGTTTCATGCCAAATTGATAAAGTCTGGGTACCATCAGAATTGTCATGTGGGCAGTGGT
TTGATTGATTTGTATTCAAAATGTGGTGGTTGTATGTTGGATTGTAGAAAAGTTTTTGATGAAATATGTAAGCCGGATTTGGTCCTTTGGAATACAATGATTTCT
GGATACTCCCTGTATGAGGAGTTATCCAACGAAGCCCTTGAATGTTTTCGGCAGCTGCAAGGTGTTGGCCACCGGCCTGACGATTGCAGCCTCGTTTGCGTAATA
AGTGCCTGTTCAAATATGTCGTCACCCTCTCAAGGACGACAAGTTCATGGATTAGCTTTCAAATTGGACATCCCTTCCAATAGAATATCAGTGAATAATGCTCTG
GTTGCTATGTACTCAAAATGTGGAAATCTGAGGGACGCAAGAAGGTTGTTCGATACAATGCCGGAGCATAATACAGTATCATATAATTCAATGATTGCGGGCTAT
GCACAACATGGGATGGGGTCTCAATCACTTCATCTTTTTCAGAGGATGCTGGAGCTGGGCTTTACCCCTACAAACATAACGTTTATTTCAGTACTTGCCGCATGT
GCACACACCGGAAGAGTTGAGGATGGTAAGATTTACTTCAATATGATGAAGAAGTTTGGCATTGAACCAGAAGCAGGGCACTTCTCATGCATGATAGATCTTTTG
GGTCGAGCAGGCAAGTTAAGTGAGGCAGAGAGGCTGATTGAGACGATCCCTTTTGATCCTGGCTCCATTTTGTGGTCTGCATTACTTGGGGCCTGTCGAACACAT
GGGAACGTGGAGCTAGCTGTCAAAGCAGCAAACCATTTGCTTCAGATGGATCCTTCAAATGCTGCACCTTATGTCATGCTGGCAAATATCTATGCTGACAATGGG
AGATTGGAGGATGCTGCGACTGTAAGAAAACTTATGCGAGACCGAGGCGTGAAGAAGAAACCTGGTTGTAGTTGGATTGAAGTAAACAGGAGAATACACATTTTT
GTGGCCGAAGATACTTCCCACCCAATGATAAAAAAGATTCAGGAATACCTGGAGGAGATGATGAGAAAGATAAAGAAAGCAGGATATACTCAGGACGTGAGGTCA
GCAACGATAGGGGACTACGATAGAGAAAGGCAAATAGAGGAAGAGTTAAGGTTAGGACATCATAGTGAGAAGTTAGCTGTTTCATTTGGTCTCATGTCTACTAGA
GAAGAGATTCCCACAGATTTCACTGCTTCAAGGGCGGGAAATGCTCTTGTGGTGATTATTGGTGAACTTTTAACTGGGGCTGTTGACGCGCCATCCCTTGTTTTC
TTTCATGGCTTTTGTACGAGATTCATAATGGGCTATATGCATCGTAACTTATGTTCATCGTACTCCTGGATTGGATACTCCAAGTCCAAGTTTCTAACAAGTCCT
AAGTTTTCACATAACGAGAATTCATCAAAATGGGAACATATTGAGAAATGCCCAATGAGGAATGAAATGTGGTGCCGAGCTATGGAAATTGAGAATTTACTTTCT
CCTTTTGTTAGCAAAAAGTTTGAGCTTGATGATGTGATTGAAGCACAACAATTTGATAGAGAGATTCTCAATTTTGTTTTTGAAGTTGCACATGATATGGAAAAG
ATTGAAAAAAGTTCATCTAAAAGGCAAATGCTCAAGGGATATTTGATGGCTACCCTGTTCTACGATCCCTCAACTAGAGCTAGGCTTTCATTTGAATCTGCCATG
AAAAGTTTGGGTGAGGAGGTATTGGCCACTGAAAATGCACTTGATTCAGATATCATCAGAACTGTTGAGGGCTCTACTCGGAAAATTGTGATGCGGAATTTTGAA
AGTGGTACTGCTAGGAAAGCTGCTGGAACAGTTAGCATTCCTATTATTAATGCTGGTGATCCTGGAACAACATAA
mRNA sequence
ATGCATCAATTCTCCTCTCTTCTCCAAAGCTTTCGGCAGATTCTGAAAACGTGTATTGCCCAGAGAGACCTCCGTACTGGAAAGTCCCTCCATGCTCTCTATATC
AAGTCCTTCCTCCCCACATCCACCTACCTCTCCAACCACTTCATTCTTCTTTACTCCAAATGTCGTCGCCTCTCCGCTGCTCGTTGGGTCTTCGATCAAACCCAC
GATTGCAATATCTTTTCCTTCAACGCCTTGATTTCTGCTTACGCGAAAGAATCGTTTGTTGAAGTTGCACGCCAACTGTTTGATAAAATGCCCCAACCAGACCCT
GTTTCGTATAACACTCTCATTGCTGCATATGCACACCGTGGGGACACTGAGCCTGCTTTTCAGTTGTTTCTGGAGATGAGAGAGGCTTTTCTTGACATGGATGGG
TTCACTCTTTCTGGTATAATTACCGCTTGTGGCGATAATGTTGCTTTGATAAGGCAGTTGCACGCACTGAGTGTTGTGGCTGGGTTTGATTCTTATGCGTCTGTT
GGTAATGCACTCATTACATATTACAGCAAAAATGGATTTTTGAATGAGGCCCAGCGGATTTTTCGTTGGATGGGTGAAGATAGAGATGAAGTGTCGTGGAATTCG
ATGGTTGTAGCATATATGCAGCATCGTGAGGGCTCTAAGGCCCTAGAATTGTATTTGGAAATGACTGTGAGGGGCTTGATTGTTGACATGTTTACTTTAGCAAGC
GTCTTGACAGCGTTTACAAATGTGCAGGACTTATCAGGCGGGCTTCAGTTTCATGCCAAATTGATAAAGTCTGGGTACCATCAGAATTGTCATGTGGGCAGTGGT
TTGATTGATTTGTATTCAAAATGTGGTGGTTGTATGTTGGATTGTAGAAAAGTTTTTGATGAAATATGTAAGCCGGATTTGGTCCTTTGGAATACAATGATTTCT
GGATACTCCCTGTATGAGGAGTTATCCAACGAAGCCCTTGAATGTTTTCGGCAGCTGCAAGGTGTTGGCCACCGGCCTGACGATTGCAGCCTCGTTTGCGTAATA
AGTGCCTGTTCAAATATGTCGTCACCCTCTCAAGGACGACAAGTTCATGGATTAGCTTTCAAATTGGACATCCCTTCCAATAGAATATCAGTGAATAATGCTCTG
GTTGCTATGTACTCAAAATGTGGAAATCTGAGGGACGCAAGAAGGTTGTTCGATACAATGCCGGAGCATAATACAGTATCATATAATTCAATGATTGCGGGCTAT
GCACAACATGGGATGGGGTCTCAATCACTTCATCTTTTTCAGAGGATGCTGGAGCTGGGCTTTACCCCTACAAACATAACGTTTATTTCAGTACTTGCCGCATGT
GCACACACCGGAAGAGTTGAGGATGGTAAGATTTACTTCAATATGATGAAGAAGTTTGGCATTGAACCAGAAGCAGGGCACTTCTCATGCATGATAGATCTTTTG
GGTCGAGCAGGCAAGTTAAGTGAGGCAGAGAGGCTGATTGAGACGATCCCTTTTGATCCTGGCTCCATTTTGTGGTCTGCATTACTTGGGGCCTGTCGAACACAT
GGGAACGTGGAGCTAGCTGTCAAAGCAGCAAACCATTTGCTTCAGATGGATCCTTCAAATGCTGCACCTTATGTCATGCTGGCAAATATCTATGCTGACAATGGG
AGATTGGAGGATGCTGCGACTGTAAGAAAACTTATGCGAGACCGAGGCGTGAAGAAGAAACCTGGTTGTAGTTGGATTGAAGTAAACAGGAGAATACACATTTTT
GTGGCCGAAGATACTTCCCACCCAATGATAAAAAAGATTCAGGAATACCTGGAGGAGATGATGAGAAAGATAAAGAAAGCAGGATATACTCAGGACGTGAGGTCA
GCAACGATAGGGGACTACGATAGAGAAAGGCAAATAGAGGAAGAGTTAAGGTTAGGACATCATAGTGAGAAGTTAGCTGTTTCATTTGGTCTCATGTCTACTAGA
GAAGAGATTCCCACAGATTTCACTGCTTCAAGGGCGGGAAATGCTCTTGTGGTGATTATTGGTGAACTTTTAACTGGGGCTGTTGACGCGCCATCCCTTGTTTTC
TTTCATGGCTTTTGTACGAGATTCATAATGGGCTATATGCATCGTAACTTATGTTCATCGTACTCCTGGATTGGATACTCCAAGTCCAAGTTTCTAACAAGTCCT
AAGTTTTCACATAACGAGAATTCATCAAAATGGGAACATATTGAGAAATGCCCAATGAGGAATGAAATGTGGTGCCGAGCTATGGAAATTGAGAATTTACTTTCT
CCTTTTGTTAGCAAAAAGTTTGAGCTTGATGATGTGATTGAAGCACAACAATTTGATAGAGAGATTCTCAATTTTGTTTTTGAAGTTGCACATGATATGGAAAAG
ATTGAAAAAAGTTCATCTAAAAGGCAAATGCTCAAGGGATATTTGATGGCTACCCTGTTCTACGATCCCTCAACTAGAGCTAGGCTTTCATTTGAATCTGCCATG
AAAAGTTTGGGTGAGGAGGTATTGGCCACTGAAAATGCACTTGATTCAGATATCATCAGAACTGTTGAGGGCTCTACTCGGAAAATTGTGATGCGGAATTTTGAA
AGTGGTACTGCTAGGAAAGCTGCTGGAACAGTTAGCATTCCTATTATTAATGCTGGTGATCCTGGAACAACATAA
Protein sequence
MHQFSSLLQSFRQILKTCIAQRDLRTGKSLHALYIKSFLPTSTYLSNHFILLYSKCRRLSAARWVFDQTHDCNIFSFNALISAYAKESFVEVARQLFDKMPQPDP
VSYNTLIAAYAHRGDTEPAFQLFLEMREAFLDMDGFTLSGIITACGDNVALIRQLHALSVVAGFDSYASVGNALITYYSKNGFLNEAQRIFRWMGEDRDEVSWNS
MVVAYMQHREGSKALELYLEMTVRGLIVDMFTLASVLTAFTNVQDLSGGLQFHAKLIKSGYHQNCHVGSGLIDLYSKCGGCMLDCRKVFDEICKPDLVLWNTMIS
GYSLYEELSNEALECFRQLQGVGHRPDDCSLVCVISACSNMSSPSQGRQVHGLAFKLDIPSNRISVNNALVAMYSKCGNLRDARRLFDTMPEHNTVSYNSMIAGY
AQHGMGSQSLHLFQRMLELGFTPTNITFISVLAACAHTGRVEDGKIYFNMMKKFGIEPEAGHFSCMIDLLGRAGKLSEAERLIETIPFDPGSILWSALLGACRTH
GNVELAVKAANHLLQMDPSNAAPYVMLANIYADNGRLEDAATVRKLMRDRGVKKKPGCSWIEVNRRIHIFVAEDTSHPMIKKIQEYLEEMMRKIKKAGYTQDVRS
ATIGDYDRERQIEEELRLGHHSEKLAVSFGLMSTREEIPTDFTASRAGNALVVIIGELLTGAVDAPSLVFFHGFCTRFIMGYMHRNLCSSYSWIGYSKSKFLTSP
KFSHNENSSKWEHIEKCPMRNEMWCRAMEIENLLSPFVSKKFELDDVIEAQQFDREILNFVFEVAHDMEKIEKSSSKRQMLKGYLMATLFYDPSTRARLSFESAM
KSLGEEVLATENALDSDIIRTVEGSTRKIVMRNFESGTARKAAGTVSIPIINAGDPGTT