CuGenDBv2

Sgr028999 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028999
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Tetratricopeptide repeat (TPR)-like superfamily protein
Genome location: tig00153210:2434709..2441009
RNA-Seq Expression: Sgr028999
Synteny: Sgr028999
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
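
The genome location above follows a contig:start..end pattern. A minimal Python sketch for splitting it into parts is given here; the names GenomeLocation and parse_location are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Parts of a 'contig:start..end' location string (hypothetical helper type)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive, so both ends count.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'contig:start..end' string; raise ValueError on any other shape."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00153210:2434709..2441009")
print(loc.contig, loc.start, loc.end, loc.span)
```

If the coordinates turn out to be 0-based or half-open, only the span property needs to change.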


Homology
GenBank top hits    e value    %identity    Alignment
KAG7032140.1 putative pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]    3.4e-256    83.05
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPSVSAA+LE ESV+ SFVL QNDPV  ILTGLNSFGFRAY+GGC FRT+V TLSETVVD VL+SL IQNPD AV FF+LLRN+YGFRHS FSQLAVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK RFKELH V+KQL+E+QGSGSA S CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFV+AKMKDLNLQASVPTYNSLLHNLRHTD+MWDI +
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        EI+ SG                  + L   +    DSNEVVGPSIVS NT+MSKFC +GL+DVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLG MSGAWKV+QKMLLKGLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGF+LNIISYSVLLSCLCKVGR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        IEEAL LL+EME LRLKPD IVYSILIHGLCKEGFVQRAYQLYEQM LKR FP+YFAQRAVLLG FENGNISEAR+YFDALT M+LIED+ILYNIMIDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDI+EAMQLY RM ERGITP+V+TFNTLV+GFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

XP_022149078.1 putative pentatricopeptide repeat-containing protein At1g13630 isoform X1 [Momordica charantia]    5.2e-257    84.36
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPS SAA LE ES+  S VL QNDPVR ILTGLN FGF AY GGC FRTIVPTLSETVVDDV+ SL+ +N DFAV FF+LLRNEY FRHSEFSQ AVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK+RFKELHSVMKQLLEDQGSGSAPS CDLLLN+FRNWDSNGVVWD+LAFAYSRH+MIHDALFVIAKMK LNLQAS+PTYNSLLHNLRHTDIMWDIY+
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        EI+VSG          L                  DS+E VGPSIVSFNTIMSKFC +GLIDVA+SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLGLMSGA KVIQKMLLKGLNPDLVTYTILICGHCQ GN+EEALKLRQETLSRGFKLNIISYSVLLSCLCK GR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        IEEALTLLDEME L LKPDFIVYSILIHGLCK+GF QRAYQLYEQMCLKR+FP+YFAQRAVLLGLFENGNI EARKYFDAL+ M+LIEDVILYNI+IDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDIAEAM+LYNRMIERGITPSV+TFNTLVNGFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

XP_022956868.1 putative pentatricopeptide repeat-containing protein At1g13630 isoform X1 [Cucurbita moschata]    3.4e-256    83.05
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPSVSAA+LE ESV+ SFVL QNDPV  ILTGLNSFGFRAY+GGC FRT+V TLSETVVD VL+SL IQNPD AV FF+LLRN+YGFRHS FSQLAVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK RFKELH V+KQL+E+QGSGSA S CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFV+AKMKDLNLQASVPTYNSLLHNLRHTD+MWDI +
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        EI+ SG                  + L   +    DSNEVVGPSIVS NT+MSKFC +GL+DVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLG MSGAWKV+QKMLLKGLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGF+LNIISYSVLLSCLCKVGR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        IEEAL LL+EME LRLKPD IVYSILIHGLCKEGFVQRAYQLYEQM LKR FP+YFAQRAVLLG FENGNISEAR+YFDALT M+LIED+ILYNIMIDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDI+EAMQLY RM ERGITP+V+TFNTLV+GFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

XP_022977564.1 putative pentatricopeptide repeat-containing protein At1g13630 [Cucurbita maxima]    5.4e-254    82.87
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPSVS+A+LE ESV+ SFVL QNDPVR ILTGLNSFGFRAY+GG  F+T+V TLSETVVD VL+SL IQNPD AV FF+LLRN+YGFRHS FSQLAVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK RFKEL  V+KQL+E+QGSGSA S CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTD+MWDI +
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        EI+ SG                  + L   +    DSNEVVGPSIVS NT+MSKFC +GL+DVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLG MSGAWKV+QKMLLKGLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGF+LNIISYSVLLSCLCKVGR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        I+EAL LL+EME LRLKPD IVYSILIHGLCKEGFVQRAYQLYEQM LKR FP+YFAQRAVLLG FENGNISEARKYFDALT M+LIEDVILYNIMIDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDI+EAMQLY RM ERGITP+V+TFNTLV+GFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

XP_023536817.1 putative pentatricopeptide repeat-containing protein At1g13630 [Cucurbita pepo subsp. pepo]    1.2e-256    83.24
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPSVSAA+LE ESV+ SFVL QNDPVR ILTGLNSFGFRAY+GGC FRT+V TLSETVVD VL+SL IQNPD AV FF+LLRN+YGFRHS FSQLAVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK RFKELH V+KQL+E+QGSGSA S CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTD+MWDI +
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        E++ SG                  + L   +    DSNEVVGPSIVS NT+MSKFC +GL+DVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLG MSGAWKV+QKMLLKGLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGF+LNIISYSVLLSCLCKVGR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        IEEAL LL+EME LRLKPD IVYSILIHGLCKEGFVQRAYQLYEQM LKR FP+YF+QRAVLLG FENGNISEARKYFDALT M+LIEDVIL+NIMIDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDI+EAMQLY RM ERGITP+V+TFNTLV+GFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
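
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, "+" for a conservative substitution, and a space for a mismatch or gap. The following Python sketch counts those marks for a single block. The function name block_identity is hypothetical, and the result covers only the block passed in, so it will not generally equal the whole-alignment %identity reported for a hit.

```python
from typing import Tuple


def block_identity(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, positives, columns) for one alignment block.

    query   -- the residues after the 'Query:' label (gaps written as '-')
    midline -- the unlabeled line printed under the query
    """
    columns = len(query)
    # Scraped text may lose trailing spaces, so pad the midline to the block width.
    padded = midline.ljust(columns)[:columns]
    identical = sum(ch.isalpha() for ch in padded)
    conservative = padded.count("+")
    # BLAST-style "positives" include identical columns plus conservative ones.
    return identical, identical + conservative, columns


# Last block of the first GenBank hit, copied from the alignment above.
query = "VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK"
midline = "VRLGDI+EAMQLY RM ERGITP+V+TFNTLV+GFC+"
identical, positives, columns = block_identity(query, midline)
print(f"{identical}/{columns} identical, {positives}/{columns} positives")
```

Summing the three counts over every block of a hit gives alignment-wide totals, provided each midline is sliced to start at the same column as its query residues.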

TrEMBL top hits    e value    %identity    Alignment
A0A1S3CNC6 putative pentatricopeptide repeat-containing protein At1g13630    3.3e-249    81.31
Query:  SVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILA
        SVSAA+LE  +V+TSF  DQND VR ILTGLNS GFRAY+GGC FRT+V TLSETVVD VLDSLR   PD AV FF+LL NEYGFRHS FSQ  VSHILA
Subjt:  SVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILA

Query:  GKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEI
        G+ RFKELHSV+K L+E+QG GSA + CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFV AKMKDLNLQASVPTYNSLLHNLRHTDI+WD+Y+EI
Subjt:  GKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEI

Query:  RVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEF
        +VSG          L                  DSNEVVGPS VS NTIMSKFC +GLIDVARSFFCL+VK+GLL DS+SYNIL+HGLCVAGSMDEALEF
Subjt:  RVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEF

Query:  TDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIE
        TDDMEKHGVEPD+VTYNTLAKGFLLLGLMSGA KV+QKMLL+GLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIE
Subjt:  TDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIE

Query:  EALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVR
        EALTL DEME L LKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKR+FP YFAQRAVLLGLF+NGNISEARKYFD L +M+LIEDV+LYNIMIDGYVR
Subjt:  EALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVR

Query:  LGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        LGDIAEAMQLY  MIERGITPSV+TFNTL+NGFC+
Subjt:  LGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

A0A5D3DRM7 Putative pentatricopeptide repeat-containing protein    3.3e-249    81.31
Query:  SVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILA
        SVSAA+LE  +V+TSF  DQND VR ILTGLNS GFRAY+GGC FRT+V TLSETVVD VLDSLR   PD AV FF+LL NEYGFRHS FSQ  VSHILA
Subjt:  SVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILA

Query:  GKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEI
        G+ RFKELHSV+K L+E+QG GSA + CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFV AKMKDLNLQASVPTYNSLLHNLRHTDI+WD+Y+EI
Subjt:  GKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEI

Query:  RVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEF
        +VSG          L                  DSNEVVGPS VS NTIMSKFC +GLIDVARSFFCL+VK+GLL DS+SYNIL+HGLCVAGSMDEALEF
Subjt:  RVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEF

Query:  TDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIE
        TDDMEKHGVEPD+VTYNTLAKGFLLLGLMSGA KV+QKMLL+GLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIE
Subjt:  TDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIE

Query:  EALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVR
        EALTL DEME L LKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKR+FP YFAQRAVLLGLF+NGNISEARKYFD L +M+LIEDV+LYNIMIDGYVR
Subjt:  EALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVR

Query:  LGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        LGDIAEAMQLY  MIERGITPSV+TFNTL+NGFC+
Subjt:  LGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

A0A6J1D5Y2 putative pentatricopeptide repeat-containing protein At1g13630 isoform X1    2.5e-257    84.36
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPS SAA LE ES+  S VL QNDPVR ILTGLN FGF AY GGC FRTIVPTLSETVVDDV+ SL+ +N DFAV FF+LLRNEY FRHSEFSQ AVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK+RFKELHSVMKQLLEDQGSGSAPS CDLLLN+FRNWDSNGVVWD+LAFAYSRH+MIHDALFVIAKMK LNLQAS+PTYNSLLHNLRHTDIMWDIY+
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        EI+VSG          L                  DS+E VGPSIVSFNTIMSKFC +GLIDVA+SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGPLGVNILPPYLYMAF--------------DSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLGLMSGA KVIQKMLLKGLNPDLVTYTILICGHCQ GN+EEALKLRQETLSRGFKLNIISYSVLLSCLCK GR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        IEEALTLLDEME L LKPDFIVYSILIHGLCK+GF QRAYQLYEQMCLKR+FP+YFAQRAVLLGLFENGNI EARKYFDAL+ M+LIEDVILYNI+IDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDIAEAM+LYNRMIERGITPSV+TFNTLVNGFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

A0A6J1H0A7 putative pentatricopeptide repeat-containing protein At1g13630 isoform X1    1.6e-256    83.05
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPSVSAA+LE ESV+ SFVL QNDPV  ILTGLNSFGFRAY+GGC FRT+V TLSETVVD VL+SL IQNPD AV FF+LLRN+YGFRHS FSQLAVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK RFKELH V+KQL+E+QGSGSA S CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFV+AKMKDLNLQASVPTYNSLLHNLRHTD+MWDI +
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        EI+ SG                  + L   +    DSNEVVGPSIVS NT+MSKFC +GL+DVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLG MSGAWKV+QKMLLKGLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGF+LNIISYSVLLSCLCKVGR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        IEEAL LL+EME LRLKPD IVYSILIHGLCKEGFVQRAYQLYEQM LKR FP+YFAQRAVLLG FENGNISEAR+YFDALT M+LIED+ILYNIMIDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDI+EAMQLY RM ERGITP+V+TFNTLV+GFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

A0A6J1IIW4 putative pentatricopeptide repeat-containing protein At1g13630    2.6e-254    82.87
Query:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI
        RPSVS+A+LE ESV+ SFVL QNDPVR ILTGLNSFGFRAY+GG  F+T+V TLSETVVD VL+SL IQNPD AV FF+LLRN+YGFRHS FSQLAVSHI
Subjt:  RPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHI

Query:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD
        LAGK RFKEL  V+KQL+E+QGSGSA S CDLLLN+FRNWDSNGVVWD+LAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTD+MWDI +
Subjt:  LAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYD

Query:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
        EI+ SG                  + L   +    DSNEVVGPSIVS NT+MSKFC +GL+DVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL
Subjt:  EIRVSGP--------------LGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEAL

Query:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR
        EFTDDMEKHGVEPD+VTYNTLAKGFLLLG MSGAWKV+QKMLLKGLNPD+VTYTILICGHCQMGNIEEALKLRQETLSRGF+LNIISYSVLLSCLCKVGR
Subjt:  EFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGR

Query:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY
        I+EAL LL+EME LRLKPD IVYSILIHGLCKEGFVQRAYQLYEQM LKR FP+YFAQRAVLLG FENGNISEARKYFDALT M+LIEDVILYNIMIDGY
Subjt:  IEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGY

Query:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        VRLGDI+EAMQLY RM ERGITP+V+TFNTLV+GFC+
Subjt:  VRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

SwissProt top hits    e value    %identity    Alignment
O04491 Putative pentatricopeptide repeat-containing protein At1g09680    8.0e-51    28.48
Query:  RTIVPTLSETVVDDVLDSLRIQNPDFAV-DFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVV
        R ++P+LS   V D+++   +  P  ++  FF  + ++ GFR +  +   ++  LA  + F E  S+++ ++  +G  SA S   + L + R     G +
Subjt:  RTIVPTLSETVVDDVLDSLRIQNPDFAV-DFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVV

Query:  WDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLH---NLRHTDIMWDIYDEIRVSG-PLGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFC
         D L   Y+    I DA+      +       +    +LL     L  T  +W  Y EI  +G PL V +                     FN +M+KFC
Subjt:  WDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLH---NLRHTDIMWDIYDEIRVSG-PLGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFC

Query:  NIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTIL
          G I  A+  F  + K  L P   S+N LI+G C  G++DE       MEK    PD+ TY+ L         M GA  +  +M  +GL P+ V +T L
Subjt:  NIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTIL

Query:  ICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYF
        I GH + G I+   +  Q+ LS+G + +I+ Y+ L++  CK G +  A  ++D M    L+PD I Y+ LI G C+ G V+ A ++ ++M    +  D  
Subjt:  ICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYF

Query:  AQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
           A++ G+ + G + +A +    + +  +  D + Y +M+D + + GD     +L   M   G  PSV+T+N L+NG CK
Subjt:  AQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial    3.6e-51    28.45
Query:  VDFFFLLRNEYGFRHSEFSQLA-VSHILAGKKRFKELHSVMKQLLED---QGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKM
        +DFF   R+    R S    L  V H+    K  K   S++    E      + S     DLL+  +++W S+  V+D+         ++ +A  V  KM
Subjt:  VDFFFLLRNEYGFRHSEFSQLA-VSHILAGKKRFKELHSVMKQLLED---QGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKM

Query:  KDLNLQASVPTYNSLLHNLRH-------TDIMWDIYDEIRVS-GPLGVNILPPYLYMAFDSNEV-----------VGPSIVSFNTIMSKFCNIGLIDVAR
         +  L  SV + N  L  L           I++  + E+ V       NI+  ++       E              P ++S++T+++ +C  G +D   
Subjt:  KDLNLQASVPTYNSLLHNLRH-------TDIMWDIYDEIRVS-GPLGVNILPPYLYMAFDSNEV-----------VGPSIVSFNTIMSKFCNIGLIDVAR

Query:  SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGN
            +M + GL P+SY Y  +I  LC    + EA E   +M + G+ PD V Y TL  GF   G +  A K   +M  + + PD++TYT +I G CQ+G+
Subjt:  SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGN

Query:  IEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGL
        + EA KL  E   +G + + ++++ L++  CK G +++A  + + M      P+ + Y+ LI GLCKEG +  A +L  +M    L P+ F   +++ GL
Subjt:  IEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGL

Query:  FENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC
         ++GNI EA K         L  D + Y  ++D Y + G++ +A ++   M+ +G+ P+++TFN L+NGFC
Subjt:  FENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC

Q76C99 Protein Rf1, mitochondrial    7.5e-49    27.75
Query:  AVDFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAP---SCCDLLLNQFRNWDSNGVVWDILAFAYSR-HEMIHDALFVIAK
        A+D       E G   + FS   +   L  + R +E   ++  + +D+G GS P   S   ++   F+  DS+         AYS  HEM+         
Subjt:  AVDFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAP---SCCDLLLNQFRNWDSNGVVWDILAFAYSR-HEMIHDALFVIAK

Query:  MKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEIRVSGPLGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSY
          D  +   V TYNS++  L     M    + +      GV                  P  +++N+I+  +C+ G    A  F   M  +G+ PD  +Y
Subjt:  MKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEIRVSGPLGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSY

Query:  NILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKL
        ++L+  LC  G   EA +  D M K G++P+I TY TL +G+   G +     ++  M+  G++PD   ++ILIC + + G +++A+ +  +   +G   
Subjt:  NILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKL

Query:  NIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQ
        N ++Y  ++  LCK GR+E+A+   ++M    L P  IVY+ LIHGLC     +RA +L  +M  + +  +     +++    + G + E+ K F+ + +
Subjt:  NIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQ

Query:  MNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
        + +  +VI YN +I+GY   G + EAM+L + M+  G+ P+ +T++TL+NG+CK
Subjt:  MNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

Q9LFC5 Pentatricopeptide repeat-containing protein At5g01110    4.4e-49    26.81
Query:  FRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLL
        F+H+  S  A+ HIL    R  +  S + +++   G  S     + L + F N  SN  V+D+L   Y +   + +A      ++      S+   N+L+
Subjt:  FRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLL

Query:  HNLRH---TDIMWDIYDEIRVSGPLGVNILPPYL----------------YMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSY
         +L      ++ W +Y EI  SG +G+N+    +                +++    + V P IV++NT++S + + GL++ A      M   G  P  Y
Subjt:  HNLRH---TDIMWDIYDEIRVSGPLGVNILPPYL----------------YMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSY

Query:  SYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGF
        +YN +I+GLC  G  + A E   +M + G+ PD  TY +L       G +    KV   M  + + PDLV ++ ++    + GN+++AL         G 
Subjt:  SYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGF

Query:  KLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDAL
          + + Y++L+   C+ G I  A+ L +EM       D + Y+ ++HGLCK   +  A +L+ +M  + LFPD +    ++ G  + GN+  A + F  +
Subjt:  KLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDAL

Query:  TQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC
         +  +  DV+ YN ++DG+ ++GDI  A +++  M+ + I P+ I+++ LVN  C
Subjt:  TQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC

Q9LMY5 Putative pentatricopeptide repeat-containing protein At1g13630    9.5e-145    49.07
Query:  RPSVSAAQLEVESV-STSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSH
        + S S A+++ ES+ +T+   D     + IL G+   GFR ++ G +FR +V  L    V++++D L  ++ D +V FF  LR+ Y FRHS FS L VSH
Subjt:  RPSVSAAQLEVESV-STSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSH

Query:  ILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIY
        +LAG++RFKEL  +++QLL+++G     + C+LL N FR W+S G+VWD+L F  SR  M+ D+L+++ KMKD NL  S  +YNS+L++ R TD MWD+Y
Subjt:  ILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIY

Query:  DEIR----------VSGPLGVNIL-PPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEF
         EI+          V G      L    L++     + +GPS+VSFN+IMS +C +G +D+A+SFFC ++K GL+P  YS+NILI+GLC+ GS+ EALE 
Subjt:  DEIR----------VSGPLGVNIL-PPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEF

Query:  TDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLN-IISYSVLLSCLCKVGRI
          DM KHGVEPD VTYN LAKGF LLG++SGAW+VI+ ML KGL+PD++TYTIL+CG CQ+GNI+  L L ++ LSRGF+LN II  SV+LS LCK GRI
Subjt:  TDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLN-IISYSVLLSCLCKVGRI

Query:  EEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYV
        +EAL+L ++M+   L PD + YSI+IHGLCK G    A  LY++MC KR+ P+     A+LLGL + G + EAR   D+L       D++LYNI+IDGY 
Subjt:  EEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYV

Query:  RLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCKGQ
        + G I EA++L+  +IE GITPSV TFN+L+ G+CK Q
Subjt:  RLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCKGQ

Arabidopsis top hits    e value    %identity    Alignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein    2.5e-52    28.45
Query:  VDFFFLLRNEYGFRHSEFSQLA-VSHILAGKKRFKELHSVMKQLLED---QGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKM
        +DFF   R+    R S    L  V H+    K  K   S++    E      + S     DLL+  +++W S+  V+D+         ++ +A  V  KM
Subjt:  VDFFFLLRNEYGFRHSEFSQLA-VSHILAGKKRFKELHSVMKQLLED---QGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKM

Query:  KDLNLQASVPTYNSLLHNLRH-------TDIMWDIYDEIRVS-GPLGVNILPPYLYMAFDSNEV-----------VGPSIVSFNTIMSKFCNIGLIDVAR
         +  L  SV + N  L  L           I++  + E+ V       NI+  ++       E              P ++S++T+++ +C  G +D   
Subjt:  KDLNLQASVPTYNSLLHNLRH-------TDIMWDIYDEIRVS-GPLGVNILPPYLYMAFDSNEV-----------VGPSIVSFNTIMSKFCNIGLIDVAR

Query:  SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGN
            +M + GL P+SY Y  +I  LC    + EA E   +M + G+ PD V Y TL  GF   G +  A K   +M  + + PD++TYT +I G CQ+G+
Subjt:  SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGN

Query:  IEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGL
        + EA KL  E   +G + + ++++ L++  CK G +++A  + + M      P+ + Y+ LI GLCKEG +  A +L  +M    L P+ F   +++ GL
Subjt:  IEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGL

Query:  FENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC
         ++GNI EA K         L  D + Y  ++D Y + G++ +A ++   M+ +G+ P+++TFN L+NGFC
Subjt:  FENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein    2.5e-52    28.45
Query:  VDFFFLLRNEYGFRHSEFSQLA-VSHILAGKKRFKELHSVMKQLLED---QGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKM
        +DFF   R+    R S    L  V H+    K  K   S++    E      + S     DLL+  +++W S+  V+D+         ++ +A  V  KM
Subjt:  VDFFFLLRNEYGFRHSEFSQLA-VSHILAGKKRFKELHSVMKQLLED---QGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKM

Query:  KDLNLQASVPTYNSLLHNLRH-------TDIMWDIYDEIRVS-GPLGVNILPPYLYMAFDSNEV-----------VGPSIVSFNTIMSKFCNIGLIDVAR
         +  L  SV + N  L  L           I++  + E+ V       NI+  ++       E              P ++S++T+++ +C  G +D   
Subjt:  KDLNLQASVPTYNSLLHNLRH-------TDIMWDIYDEIRVS-GPLGVNILPPYLYMAFDSNEV-----------VGPSIVSFNTIMSKFCNIGLIDVAR

Query:  SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGN
            +M + GL P+SY Y  +I  LC    + EA E   +M + G+ PD V Y TL  GF   G +  A K   +M  + + PD++TYT +I G CQ+G+
Subjt:  SFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGN

Query:  IEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGL
        + EA KL  E   +G + + ++++ L++  CK G +++A  + + M      P+ + Y+ LI GLCKEG +  A +L  +M    L P+ F   +++ GL
Subjt:  IEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGL

Query:  FENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC
         ++GNI EA K         L  D + Y  ++D Y + G++ +A ++   M+ +G+ P+++TFN L+NGFC
Subjt:  FENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFC

AT1G09680.1 Pentatricopeptide repeat (PPR) superfamily protein    5.7e-52    28.48
Query:  RTIVPTLSETVVDDVLDSLRIQNPDFAV-DFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVV
        R ++P+LS   V D+++   +  P  ++  FF  + ++ GFR +  +   ++  LA  + F E  S+++ ++  +G  SA S   + L + R     G +
Subjt:  RTIVPTLSETVVDDVLDSLRIQNPDFAV-DFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVV

Query:  WDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLH---NLRHTDIMWDIYDEIRVSG-PLGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFC
         D L   Y+    I DA+      +       +    +LL     L  T  +W  Y EI  +G PL V +                     FN +M+KFC
Subjt:  WDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLH---NLRHTDIMWDIYDEIRVSG-PLGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFC

Query:  NIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTIL
          G I  A+  F  + K  L P   S+N LI+G C  G++DE       MEK    PD+ TY+ L         M GA  +  +M  +GL P+ V +T L
Subjt:  NIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTIL

Query:  ICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYF
        I GH + G I+   +  Q+ LS+G + +I+ Y+ L++  CK G +  A  ++D M    L+PD I Y+ LI G C+ G V+ A ++ ++M    +  D  
Subjt:  ICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYF

Query:  AQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK
           A++ G+ + G + +A +    + +  +  D + Y +M+D + + GD     +L   M   G  PSV+T+N L+NG CK
Subjt:  AQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCK

AT1G13630.1 Tetratricopeptide repeat (TPR)-like superfamily protein    2.2e-141    48.76
Query:  STSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVM
        +T+   D     + IL G+   GFR ++ G +FR +V  L    V++++D L  ++ D +V FF  LR+ Y FRHS FS L VSH+LAG++RFKEL  ++
Subjt:  STSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVM

Query:  KQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEIR----------V
        +QLL+++G+             FR W+S G+VWD+L F  SR  M+ D+L+++ KMKD NL  S  +YNS+L++ R TD MWD+Y EI+          V
Subjt:  KQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEIR----------V

Query:  SGPLGVNIL-PPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVT
         G      L    L++     + +GPS+VSFN+IMS +C +G +D+A+SFFC ++K GL+P  YS+NILI+GLC+ GS+ EALE   DM KHGVEPD VT
Subjt:  SGPLGVNIL-PPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVT

Query:  YNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLN-IISYSVLLSCLCKVGRIEEALTLLDEMEILRL
        YN LAKGF LLG++SGAW+VI+ ML KGL+PD++TYTIL+CG CQ+GNI+  L L ++ LSRGF+LN II  SV+LS LCK GRI+EAL+L ++M+   L
Subjt:  YNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLN-IISYSVLLSCLCKVGRIEEALTLLDEMEILRL

Query:  KPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRM
         PD + YSI+IHGLCK G    A  LY++MC KR+ P+     A+LLGL + G + EAR   D+L       D++LYNI+IDGY + G I EA++L+  +
Subjt:  KPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRM

Query:  IERGITPSVITFNTLVNGFCKGQ
        IE GITPSV TFN+L+ G+CK Q
Subjt:  IERGITPSVITFNTLVNGFCKGQ

AT1G13630.2 Tetratricopeptide repeat (TPR)-like superfamily protein    2.2e-141    48.76
Query:  STSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVM
        +T+   D     + IL G+   GFR ++ G +FR +V  L    V++++D L  ++ D +V FF  LR+ Y FRHS FS L VSH+LAG++RFKEL  ++
Subjt:  STSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQLAVSHILAGKKRFKELHSVM

Query:  KQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEIR----------V
        +QLL+++G+             FR W+S G+VWD+L F  SR  M+ D+L+++ KMKD NL  S  +YNS+L++ R TD MWD+Y EI+          V
Subjt:  KQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEIR----------V

Query:  SGPLGVNIL-PPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVT
         G      L    L++     + +GPS+VSFN+IMS +C +G +D+A+SFFC ++K GL+P  YS+NILI+GLC+ GS+ EALE   DM KHGVEPD VT
Subjt:  SGPLGVNIL-PPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVT

Query:  YNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLN-IISYSVLLSCLCKVGRIEEALTLLDEMEILRL
        YN LAKGF LLG++SGAW+VI+ ML KGL+PD++TYTIL+CG CQ+GNI+  L L ++ LSRGF+LN II  SV+LS LCK GRI+EAL+L ++M+   L
Subjt:  YNTLAKGFLLLGLMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLN-IISYSVLLSCLCKVGRIEEALTLLDEMEILRL

Query:  KPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRM
         PD + YSI+IHGLCK G    A  LY++MC KR+ P+     A+LLGL + G + EAR   D+L       D++LYNI+IDGY + G I EA++L+  +
Subjt:  KPDFIVYSILIHGLCKEGFVQRAYQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRM

Query:  IERGITPSVITFNTLVNGFCKGQ
        IE GITPSV TFN+L+ G+CK Q
Subjt:  IERGITPSVITFNTLVNGFCKGQ


Sequences
CDS sequence
ATGATCCATGGACTTGAATCTCAGTCCAATTTGAAGGATCAAGCTTGGAGTGCCATCATGAAACTCAATAGAAAGGCCTTCCTTCGTCGACATTTTCTCGAGGCAGGGAA
AGATATAAGGAGTTGGGTCGAGGAGTCGGTAAGAGAAGAGTCATTTGAATTTTGGGACAAAGACAATGATAGTGACGATAAAGAGAATGATGAACTGCTTCTTTGGAGTC
CAATTGGGCTACAGAGAGATGCCAATTGCCAAATAAAACGCTGCGTTTCGGCCCTTTTGCGAAGTGTCCCAGATTGCTGGCTATCAATGGCTGAAAAACGGCAGCTTCCA
GCACCAGCAGTGGCCGCTGCCCTTCAGAATTGCCGCGAAAGCCAGCGGCCTTCAGTTTCTGCCGCCCAGCTCGAAGTGGAATCCGTCAGTACCTCCTTCGTTCTGGACCA
GAACGACCCAGTTCGTGCAATTCTTACGGGGTTAAATTCCTTTGGTTTTAGAGCATATATTGGTGGATGTTACTTTCGAACTATAGTTCCTACTTTGAGTGAAACTGTAG
TGGATGACGTTCTTGATAGCTTGAGAATTCAGAATCCTGATTTTGCTGTGGATTTTTTCTTTCTGTTAAGAAATGAGTACGGATTTCGGCATTCCGAGTTCTCACAGCTT
GCTGTTTCTCATATTTTAGCGGGTAAAAAACGATTCAAGGAGTTGCATTCCGTTATGAAGCAATTGCTTGAGGACCAAGGGTCTGGTTCTGCACCTTCCTGTTGTGACCT
GCTCTTGAACCAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATTTTGGCATTTGCGTACTCTAGACATGAAATGATTCATGATGCCCTCTTTGTCATCGCAA
AGATGAAGGATCTAAATTTACAGGCTTCAGTTCCAACTTATAACTCTCTATTGCACAACTTGAGGCACACTGATATTATGTGGGATATATACGATGAAATCAGAGTTAGT
GGGCCCCTCGGAGTGAATATACTACCTCCATACTTATACATGGCCTTTGACAGCAATGAAGTAGTTGGACCTTCCATTGTTTCTTTCAATACCATTATGTCAAAGTTTTG
CAATATTGGGCTAATAGATGTTGCAAGGTCATTTTTTTGTTTGATGGTCAAGAATGGACTTCTTCCTGATTCGTACAGTTATAATATTCTTATTCATGGGTTATGTGTAG
CAGGTTCCATGGATGAAGCACTAGAGTTCACAGATGACATGGAGAAGCATGGTGTGGAACCTGATATAGTGACATACAACACCCTTGCCAAAGGGTTTCTCTTACTTGGT
TTAATGAGTGGGGCCTGGAAAGTCATTCAGAAAATGTTGCTAAAAGGTCTAAATCCAGATCTTGTCACATATACAATACTGATATGTGGACATTGTCAGATGGGAAACAT
TGAGGAAGCGCTTAAGCTGCGACAAGAAACCCTTTCAAGGGGGTTTAAGTTGAATATCATTTCCTACAGTGTGCTACTTAGTTGCTTGTGTAAAGTTGGACGAATAGAAG
AAGCATTGACATTGCTCGATGAAATGGAAATTCTACGTTTGAAACCTGATTTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCT
TACCAACTATATGAACAAATGTGCTTGAAGAGACTTTTTCCAGACTACTTTGCTCAACGTGCTGTACTTTTGGGTTTATTTGAGAATGGAAATATATCTGAGGCAAGAAA
ATATTTTGATGCTTTGACTCAAATGAATCTGATTGAGGATGTTATATTGTATAATATTATGATCGATGGTTATGTAAGACTAGGTGATATTGCTGAGGCTATGCAGTTAT
ATAACAGAATGATTGAAAGAGGGATTACTCCAAGTGTTATCACCTTCAACACTCTTGTTAATGGATTCTGCAAAGGGCAAACTTAG
mRNA sequence
ATGATCCATGGACTTGAATCTCAGTCCAATTTGAAGGATCAAGCTTGGAGTGCCATCATGAAACTCAATAGAAAGGCCTTCCTTCGTCGACATTTTCTCGAGGCAGGGAA
AGATATAAGGAGTTGGGTCGAGGAGTCGGTAAGAGAAGAGTCATTTGAATTTTGGGACAAAGACAATGATAGTGACGATAAAGAGAATGATGAACTGCTTCTTTGGAGTC
CAATTGGGCTACAGAGAGATGCCAATTGCCAAATAAAACGCTGCGTTTCGGCCCTTTTGCGAAGTGTCCCAGATTGCTGGCTATCAATGGCTGAAAAACGGCAGCTTCCA
GCACCAGCAGTGGCCGCTGCCCTTCAGAATTGCCGCGAAAGCCAGCGGCCTTCAGTTTCTGCCGCCCAGCTCGAAGTGGAATCCGTCAGTACCTCCTTCGTTCTGGACCA
GAACGACCCAGTTCGTGCAATTCTTACGGGGTTAAATTCCTTTGGTTTTAGAGCATATATTGGTGGATGTTACTTTCGAACTATAGTTCCTACTTTGAGTGAAACTGTAG
TGGATGACGTTCTTGATAGCTTGAGAATTCAGAATCCTGATTTTGCTGTGGATTTTTTCTTTCTGTTAAGAAATGAGTACGGATTTCGGCATTCCGAGTTCTCACAGCTT
GCTGTTTCTCATATTTTAGCGGGTAAAAAACGATTCAAGGAGTTGCATTCCGTTATGAAGCAATTGCTTGAGGACCAAGGGTCTGGTTCTGCACCTTCCTGTTGTGACCT
GCTCTTGAACCAATTCAGGAATTGGGATTCAAATGGTGTGGTATGGGATATTTTGGCATTTGCGTACTCTAGACATGAAATGATTCATGATGCCCTCTTTGTCATCGCAA
AGATGAAGGATCTAAATTTACAGGCTTCAGTTCCAACTTATAACTCTCTATTGCACAACTTGAGGCACACTGATATTATGTGGGATATATACGATGAAATCAGAGTTAGT
GGGCCCCTCGGAGTGAATATACTACCTCCATACTTATACATGGCCTTTGACAGCAATGAAGTAGTTGGACCTTCCATTGTTTCTTTCAATACCATTATGTCAAAGTTTTG
CAATATTGGGCTAATAGATGTTGCAAGGTCATTTTTTTGTTTGATGGTCAAGAATGGACTTCTTCCTGATTCGTACAGTTATAATATTCTTATTCATGGGTTATGTGTAG
CAGGTTCCATGGATGAAGCACTAGAGTTCACAGATGACATGGAGAAGCATGGTGTGGAACCTGATATAGTGACATACAACACCCTTGCCAAAGGGTTTCTCTTACTTGGT
TTAATGAGTGGGGCCTGGAAAGTCATTCAGAAAATGTTGCTAAAAGGTCTAAATCCAGATCTTGTCACATATACAATACTGATATGTGGACATTGTCAGATGGGAAACAT
TGAGGAAGCGCTTAAGCTGCGACAAGAAACCCTTTCAAGGGGGTTTAAGTTGAATATCATTTCCTACAGTGTGCTACTTAGTTGCTTGTGTAAAGTTGGACGAATAGAAG
AAGCATTGACATTGCTCGATGAAATGGAAATTCTACGTTTGAAACCTGATTTTATAGTATATTCAATCCTCATTCATGGCCTCTGCAAGGAAGGGTTTGTACAAAGGGCT
TACCAACTATATGAACAAATGTGCTTGAAGAGACTTTTTCCAGACTACTTTGCTCAACGTGCTGTACTTTTGGGTTTATTTGAGAATGGAAATATATCTGAGGCAAGAAA
ATATTTTGATGCTTTGACTCAAATGAATCTGATTGAGGATGTTATATTGTATAATATTATGATCGATGGTTATGTAAGACTAGGTGATATTGCTGAGGCTATGCAGTTAT
ATAACAGAATGATTGAAAGAGGGATTACTCCAAGTGTTATCACCTTCAACACTCTTGTTAATGGATTCTGCAAAGGGCAAACTTAG
Protein sequence
MIHGLESQSNLKDQAWSAIMKLNRKAFLRRHFLEAGKDIRSWVEESVREESFEFWDKDNDSDDKENDELLLWSPIGLQRDANCQIKRCVSALLRSVPDCWLSMAEKRQLP
APAVAAALQNCRESQRPSVSAAQLEVESVSTSFVLDQNDPVRAILTGLNSFGFRAYIGGCYFRTIVPTLSETVVDDVLDSLRIQNPDFAVDFFFLLRNEYGFRHSEFSQL
AVSHILAGKKRFKELHSVMKQLLEDQGSGSAPSCCDLLLNQFRNWDSNGVVWDILAFAYSRHEMIHDALFVIAKMKDLNLQASVPTYNSLLHNLRHTDIMWDIYDEIRVS
GPLGVNILPPYLYMAFDSNEVVGPSIVSFNTIMSKFCNIGLIDVARSFFCLMVKNGLLPDSYSYNILIHGLCVAGSMDEALEFTDDMEKHGVEPDIVTYNTLAKGFLLLG
LMSGAWKVIQKMLLKGLNPDLVTYTILICGHCQMGNIEEALKLRQETLSRGFKLNIISYSVLLSCLCKVGRIEEALTLLDEMEILRLKPDFIVYSILIHGLCKEGFVQRA
YQLYEQMCLKRLFPDYFAQRAVLLGLFENGNISEARKYFDALTQMNLIEDVILYNIMIDGYVRLGDIAEAMQLYNRMIERGITPSVITFNTLVNGFCKGQT