CuGenDBv2

ClCG09G018170 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G018170
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Pentatricopeptide repeat-containing protein
Genome location: CG_Chr09:35175677..35177782
RNA-Seq Expression: ClCG09G018170
Synteny: ClCG09G018170
Gene Ontology terms:
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR032867 - DYW domain


Homology
GenBank top hits (columns: hit | e value | % identity | alignment)
KAA0039490.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 91.77
Query:  MKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDAL
        MK S LSN RK R S C +KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET + KSSVGICSQIEKLVLCK+YRDALEMFEIFELE GFHVGNST+DAL
Subjt:  MKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDAL

Query:  INACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLAT
        INACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDEMPERNAVSW+TII GYVDSGNYVEAFRLFILMWEE Y CGPRTLAT
Subjt:  INACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLAT

Query:  MIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFS
        MIRASAGLE+IF GRQLHSCAIKAGLG+DIFVSCALIDMYSKCGSLEDAHCVFD+MPDKTIVGWNSIIA YA HGYSEEALDLY++M  SG+KMDHFTFS
Subjt:  MIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFS

Query:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTF
        IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAI+MFEKMLREGM+PNHVTF
Subjt:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTF

Query:  LAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLS
        LAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQP  NMWAALLRACRV+GNLELGKFAAEKLYGMEP KLS
Subjt:  LAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLS

Query:  NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYH
        NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH Q+EKVVGKVDELMLK+SKLGYVP EQNFMLPDVDEHEEK +MYH
Subjt:  NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYH

Query:  SEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
        SEKLAIAYGLLNTLE+TPLQIVQSHRIC DCHSVIKLIAM+TKREIV+RDASRFHHFRDG+CSCGDYW
Subjt:  SEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

XP_004148701.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus] | e value: 0.0e+00 | % identity: 90.62
Query:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQ
        ME+PL RYQNYVYD LQCNSTS+FS+R+SDS+LF K S LSN RK R S C IKCSSFEQGL  RPRPQPKPSK+D   RKETPLKET + KSSVGICSQ
Subjt:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQ

Query:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT
        IEKLVLCKKYRDALEMFEIFELE GFHVG ST+DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDEMP RNAVSW 
Subjt:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT

Query:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW
        TII GYVDSGNYVEAFRLFILM EE+Y CGPRT ATMIRASAGLE+IFPGRQLHSCAIKAGLG+DIFVSCALIDMYSKCGSLEDAHCVFD+MPDKTIVGW
Subjt:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW

Query:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        NSIIA YA HGYSEEALDLY++MRDSG+KMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN+IS
Subjt:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA
        WNALIAGYGNHG GEEAI+MFEKMLREGM+PNHVTFLAVLSACSISGLFERGWEIFQSMTRDHK+KPRAMH+ACMIELLGREGLLDEAYALIRKAPFQP 
Subjt:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA

Query:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGK
         NMWAALLRACRV+GNLELGKFAAEKLYGMEP KLSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH QIEKVVGK
Subjt:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGK

Query:  VDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSC
        VDELML +SKLGYVP EQNFMLPDVDE+EEK +MYHSEKLAIAYGLLNTLE+TPLQIVQSHRIC DCHSVIKLIAM+TKREIV+RDASRFHHFRDGSCSC
Subjt:  VDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSC

Query:  GDYW
        GDYW
Subjt:  GDYW

XP_008459324.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo] | e value: 0.0e+00 | % identity: 90.88
Query:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIE
        ME+PL RYQNYVYD LQC ST YFS+R+SDS LFMK S LSN RK R S C +KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET + KSSVGICSQIE
Subjt:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIE

Query:  KLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTI
        KLVLCK+YRDALEMFEIFELE GFHVGNST+DALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDEMPERNAVSW+TI
Subjt:  KLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTI

Query:  IWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNS
        I GYVDSGNYVEAFRLFILMWEE Y CGPRTLATMIRASAGLE+IF GRQLHSCAIKAGLG+DIFVSCALIDMYSKCGSLEDAHCVFD+MPDKTIVGWNS
Subjt:  IWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNS

Query:  IIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN
        IIA YA HGYSEEALDLY++M  SG+KMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN
Subjt:  IIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN

Query:  ALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPN
        ALIAGYGNHGRGEEAI+MFEKMLREGM+PNHVTFLAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQP  N
Subjt:  ALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPN

Query:  MWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVD
        MWAALLRACRV+GNLELGKFAAEKLYGMEP KLSNYIVLLNIYN+SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH Q+EKVVGKVD
Subjt:  MWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVD

Query:  ELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGD
        ELMLK+SKLGYVP EQNFMLPDVDEHEEK +MYHSEKLAIAYGLLNTLE+TPLQIVQSHRIC DCHSVIKLIAM+TKREIV+RDASRFHHFRDG+CSCGD
Subjt:  ELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGD

Query:  YW
        YW
Subjt:  YW

XP_022133879.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia] | e value: 0.0e+00 | % identity: 86.1
Query:  MEVPLFRYQNYVYDPLQCNST----SYFSVRFSDSELFMKRSLL------SNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TRIS
        MEVPL RYQNYVYD LQC+ST    SY  VRF+DS+LF KRSLL      SN RK R S C IKCSS EQGLRPRP+P+PSK+D DVRK T   E TRI 
Subjt:  MEVPLFRYQNYVYDPLQCNST----SYFSVRFSDSELFMKRSLL------SNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TRIS

Query:  KSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEM
        KS VGICSQIEKLVLCKKYRDALEMFEIFELEGG+ +GNST+DALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+NRILLMHVKCGMMIDACRLFDEM
Subjt:  KSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDD
        PERNAVSW+TII GYVDSGNY+EAFRLFI+MWEE    GPRT A MIRASAGLELIFPGRQLHSCAIKAG+G+DIFVSCALIDMYSKCGSLEDAHCVFD+
Subjt:  PERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDD

Query:  MPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD
        MPDKTIVGWNSIIA YA HGYSEEALDL Y+MRDSGIKMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FD
Subjt:  MPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD

Query:  RMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYAL
        RMS +N+ISWNALIAGYGNHGRGEEAI+MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHKIKPRAMH+ACMIELLGREGLLDEAYAL
Subjt:  RMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYAL

Query:  IRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH
        IR APF+P  NMWAALLRACRV+ NLELGK AAE LYGMEP KLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+FLSGDKHH
Subjt:  IRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH

Query:  AQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHH
        A+IEKVV KVDE+MLK+SKLGYV EQNF+LPDVDE EEK  MYHSEKLAIAYGLL+TL++TPLQIVQSHRICGDCHS IKLIA++T+REIVVRDASRFHH
Subjt:  AQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        FRDGSCSCGDYW
Subjt:  FRDGSCSCGDYW

XP_038890388.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 90.74
Query:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIE
        ME+PL  YQNY+YD +QCNSTSY S+RFS  +LF +R  L N RK R SL  IKCSSFEQGLRPRPQPKPSK+DP V K TPLKET + +SSVGICSQIE
Subjt:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIE

Query:  KLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTI
        KLVLCKKYRDALEMFEIFELEGGFH GN+T DALINAC+ LKSIRGVK+LCNYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNAVSW TI
Subjt:  KLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTI

Query:  IWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNS
        I G+VDSGNYVEAFRLFILMWEEYY CGPRT ATMIRASAGLELIFPGRQLHSCAIKA LG+DIFVSCALIDMYSKCGSLEDAHCVFD+MPDKTIVGWNS
Subjt:  IWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNS

Query:  IIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN
        IIA YA HGYSEEALDLYY+MRDSGIKMDHFTFSIIIRICSRLASVA AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN+ISWN
Subjt:  IIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN

Query:  ALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPN
        ALIAGYGNHGRG EAI+MFEKMLREG IPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQP  N
Subjt:  ALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPN

Query:  MWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVD
        MWAALLRACRV+GNLELGKFAAEKLYGMEP KLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH QIEKVVGKVD
Subjt:  MWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVD

Query:  ELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGD
        ELMLK+SKLGYVP EQNFMLPDVDEHEEK QMYHSEKLAIAYGLLNTLEQTPLQIVQSHRIC DCH VIKLIAM+TKREIV+RDASRFHHFRDGSCSCGD
Subjt:  ELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGD

Query:  YW
        YW
Subjt:  YW

TrEMBL top hits (columns: hit | e value | % identity | alignment)
A0A0A0KXD9 DYW_deaminase domain-containing protein | e value: 0.0e+00 | % identity: 90.62
Query:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQ
        ME+PL RYQNYVYD LQCNSTS+FS+R+SDS+LF K S LSN RK R S C IKCSSFEQGL  RPRPQPKPSK+D   RKETPLKET + KSSVGICSQ
Subjt:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQ

Query:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT
        IEKLVLCKKYRDALEMFEIFELE GFHVG ST+DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDEMP RNAVSW 
Subjt:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT

Query:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW
        TII GYVDSGNYVEAFRLFILM EE+Y CGPRT ATMIRASAGLE+IFPGRQLHSCAIKAGLG+DIFVSCALIDMYSKCGSLEDAHCVFD+MPDKTIVGW
Subjt:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW

Query:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        NSIIA YA HGYSEEALDLY++MRDSG+KMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN+IS
Subjt:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA
        WNALIAGYGNHG GEEAI+MFEKMLREGM+PNHVTFLAVLSACSISGLFERGWEIFQSMTRDHK+KPRAMH+ACMIELLGREGLLDEAYALIRKAPFQP 
Subjt:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA

Query:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGK
         NMWAALLRACRV+GNLELGKFAAEKLYGMEP KLSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH QIEKVVGK
Subjt:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGK

Query:  VDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSC
        VDELML +SKLGYVP EQNFMLPDVDE+EEK +MYHSEKLAIAYGLLNTLE+TPLQIVQSHRIC DCHSVIKLIAM+TKREIV+RDASRFHHFRDGSCSC
Subjt:  VDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSC

Query:  GDYW
        GDYW
Subjt:  GDYW

A0A1S3C9W7 pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e value: 0.0e+00 | % identity: 90.88
Query:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIE
        ME+PL RYQNYVYD LQC ST YFS+R+SDS LFMK S LSN RK R S C +KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET + KSSVGICSQIE
Subjt:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIE

Query:  KLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTI
        KLVLCK+YRDALEMFEIFELE GFHVGNST+DALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDEMPERNAVSW+TI
Subjt:  KLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTI

Query:  IWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNS
        I GYVDSGNYVEAFRLFILMWEE Y CGPRTLATMIRASAGLE+IF GRQLHSCAIKAGLG+DIFVSCALIDMYSKCGSLEDAHCVFD+MPDKTIVGWNS
Subjt:  IWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNS

Query:  IIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN
        IIA YA HGYSEEALDLY++M  SG+KMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN
Subjt:  IIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWN

Query:  ALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPN
        ALIAGYGNHGRGEEAI+MFEKMLREGM+PNHVTFLAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQP  N
Subjt:  ALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPN

Query:  MWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVD
        MWAALLRACRV+GNLELGKFAAEKLYGMEP KLSNYIVLLNIYN+SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH Q+EKVVGKVD
Subjt:  MWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVD

Query:  ELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGD
        ELMLK+SKLGYVP EQNFMLPDVDEHEEK +MYHSEKLAIAYGLLNTLE+TPLQIVQSHRIC DCHSVIKLIAM+TKREIV+RDASRFHHFRDG+CSCGD
Subjt:  ELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGD

Query:  YW
        YW
Subjt:  YW

A0A5A7T8C6 Pentatricopeptide repeat-containing protein | e value: 0.0e+00 | % identity: 91.77
Query:  MKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDAL
        MK S LSN RK R S C +KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET + KSSVGICSQIEKLVLCK+YRDALEMFEIFELE GFHVGNST+DAL
Subjt:  MKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDAL

Query:  INACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLAT
        INACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFDEMPERNAVSW+TII GYVDSGNYVEAFRLFILMWEE Y CGPRTLAT
Subjt:  INACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLAT

Query:  MIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFS
        MIRASAGLE+IF GRQLHSCAIKAGLG+DIFVSCALIDMYSKCGSLEDAHCVFD+MPDKTIVGWNSIIA YA HGYSEEALDLY++M  SG+KMDHFTFS
Subjt:  MIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFS

Query:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTF
        IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAI+MFEKMLREGM+PNHVTF
Subjt:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTF

Query:  LAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLS
        LAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQP  NMWAALLRACRV+GNLELGKFAAEKLYGMEP KLS
Subjt:  LAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLS

Query:  NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYH
        NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH Q+EKVVGKVDELMLK+SKLGYVP EQNFMLPDVDEHEEK +MYH
Subjt:  NYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVP-EQNFMLPDVDEHEEKTQMYH

Query:  SEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
        SEKLAIAYGLLNTLE+TPLQIVQSHRIC DCHSVIKLIAM+TKREIV+RDASRFHHFRDG+CSCGDYW
Subjt:  SEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

A0A6J1BWH3 pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e value: 0.0e+00 | % identity: 86.1
Query:  MEVPLFRYQNYVYDPLQCNST----SYFSVRFSDSELFMKRSLL------SNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TRIS
        MEVPL RYQNYVYD LQC+ST    SY  VRF+DS+LF KRSLL      SN RK R S C IKCSS EQGLRPRP+P+PSK+D DVRK T   E TRI 
Subjt:  MEVPLFRYQNYVYDPLQCNST----SYFSVRFSDSELFMKRSLL------SNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TRIS

Query:  KSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEM
        KS VGICSQIEKLVLCKKYRDALEMFEIFELEGG+ +GNST+DALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+NRILLMHVKCGMMIDACRLFDEM
Subjt:  KSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEM

Query:  PERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDD
        PERNAVSW+TII GYVDSGNY+EAFRLFI+MWEE    GPRT A MIRASAGLELIFPGRQLHSCAIKAG+G+DIFVSCALIDMYSKCGSLEDAHCVFD+
Subjt:  PERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDD

Query:  MPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD
        MPDKTIVGWNSIIA YA HGYSEEALDL Y+MRDSGIKMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+FD
Subjt:  MPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFD

Query:  RMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYAL
        RMS +N+ISWNALIAGYGNHGRGEEAI+MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHKIKPRAMH+ACMIELLGREGLLDEAYAL
Subjt:  RMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYAL

Query:  IRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH
        IR APF+P  NMWAALLRACRV+ NLELGK AAE LYGMEP KLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+FLSGDKHH
Subjt:  IRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHH

Query:  AQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHH
        A+IEKVV KVDE+MLK+SKLGYV EQNF+LPDVDE EEK  MYHSEKLAIAYGLL+TL++TPLQIVQSHRICGDCHS IKLIA++T+REIVVRDASRFHH
Subjt:  AQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        FRDGSCSCGDYW
Subjt:  FRDGSCSCGDYW

A0A6J1JGW0 pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e value: 0.0e+00 | % identity: 84.11
Query:  MEVPLFRYQNYVYDPLQ----CNSTSYFSVRFSDSELFMKRSLL------SNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISK
        MEVPL  YQNYV+D L+     +STSYFS  FS SELF  RSLL      SN RK R S C +KCSS EQGLRPR +PKPSKVD DVRK TP KETRI+K
Subjt:  MEVPLFRYQNYVYDPLQ----CNSTSYFSVRFSDSELFMKRSLL------SNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISK

Query:  SSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMP
        SSV IC  IEKLVLC K+RDALEMFEI ELEGG+ VGNSTFDALI ACIGLKSIRG KRLC YM+DNG EPDQY+ NRILLMHV+CGMMIDA +LFDEMP
Subjt:  SSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDM
        ERNAVSW TII GYVDSGNY EAFRLFI+MWEEY  C PRT AT+IRASAGLELIFPG+QLHSCA+KAG+G+DIFVSCALIDMYSKCG LEDAHCVFD+M
Subjt:  ERNAVSWTTIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDM

Query:  PDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIA YA HG+SEEAL+LY++MRDSG+K+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDR
Subjt:  PDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALI
        MSC+N+ISWNALIAGYGNHGRGEEAIE+FE+MLREGM+PNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIK RAMHY CMIELLGREGLLDEAYALI
Subjt:  MSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALI

Query:  RKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHA
        RKAPFQP  NMWAALLRACRV+ NLELGK+AAEKLYGMEP KL NYIVLLNIY SSGKLKEAADVV+TLKRKGL MLPACSWIEV +QPHAFLSGDKHH 
Subjt:  RKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHA

Query:  QIEKVVGKVDELMLKVSKLGYVPEQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHF
        +IEKVV KVDELML++SKLGYVPEQN +LPDVD HEEK Q+YHSEKLAIAYGL+NTL+QTPLQIVQ HR+CGDCHSVIKLIAM+TKREIVVRDASRFHHF
Subjt:  QIEKVVGKVDELMLKVSKLGYVPEQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
        RDG CSCGDYW
Subjt:  RDGSCSCGDYW

SwissProt top hits (columns: hit | e value | % identity | alignment)
Q9FK33 Pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e value: 1.2e-249 | % identity: 59.32
Query:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRISKSSVGICSQ
        ME+PL RYQ+   D ++ +S+       +   L   R      R+ +     + CSS  QGL+P+P  +P+P +++    K+  L +T+ISKS V ICSQ
Subjt:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRISKSSVGICSQ

Query:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT
        IEKLVLC ++R+A E+FEI E+   F VG ST+DAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFDE+PERN  S+ 
Subjt:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT

Query:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW
        +II G+V+ GNYVEAF LF +MWEE   C   T A M+RASAGL  I+ G+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT V W
Subjt:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW

Query:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        N++IA YA HGYSEEAL L Y MRDSG+ +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +N+IS
Subjt:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA
        WNAL+ GY NHGRG +A+++FEKM+   + PNHVTFLAVLSAC+ SGL E+GWEIF SM+  H IKPRAMHYACMIELLGR+GLLDEA A IR+AP +  
Subjt:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA

Query:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIE----K
         NMWAALL ACR+  NLELG+  AEKLYGM P KL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+FLSGD+  +  E    +
Subjt:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIE----K

Query:  VVGKVDELMLKVSKLGYVPEQNFMLPDVDE-HEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDG
        +  KVDELM ++S+ GY  E+  +LPDVDE  EE+   YHSEKLAIAYGL+NT E  PLQI Q+HRIC +CH V++ I+++T RE+VVRDASRFHHF++G
Subjt:  VVGKVDELMLKVSKLGYVPEQNFMLPDVDE-HEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDG

Query:  SCSCGDYW
         CSCG YW
Subjt:  SCSCGDYW

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial | e value: 2.4e-133 | % identity: 38.46
Query:  ELEGGFHVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLF
        +LEG +   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+WTT+I GY       +A   F
Subjt:  ELEGGFHVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLF

Query:  ILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDL
          M    Y+    TL+++I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IA +A    +E+AL+L
Subjt:  ILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDL

Query:  YYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIE
        +  M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+ 
Subjt:  YYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLEL
         FE+M R G+ PN ++FL+VL+ACS SGL + GW  ++ M +D  I P A HY  +++LLGR G L+ A   I + P +P   +W ALL ACR++ N EL
Subjt:  MFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLEL

Query:  GKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNF
        G +AAE ++ ++P     +++L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H Q E++  K +E++ K+ +LGYVP+ + 
Subjt:  GKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNF

Query:  MLPDVDEHEEKTQM-YHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
        ++  VD+ E +  + YHSEK+A+A+ LLNT   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+DG+CSC DYW
Subjt:  MLPDVDEHEEKTQM-YHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | e value: 2.1e-126 | % identity: 37.86
Query:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD--EMPERNAVSWTTIIWGYV
        ++DAL M+   +L       + TF  L+ AC GL  ++  + +   +   GF+ D +++N ++ ++ KC  +  A  +F+   +PER  VSWT I+  Y 
Subjt:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD--EMPERNAVSWTTIIWGYV

Query:  DSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAY
         +G  +EA  +F  M +         L +++ A   L+ +  GR +H+  +K GL  +  +  +L  MY+KCG +  A  +FD M    ++ WN++I+ Y
Subjt:  DSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAY

Query:  AFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAG
        A +GY+ EA+D++++M +  ++ D  + +  I  C+++ S+ +A+  +  + R+ +  DV  ++AL+D ++K G V+ AR VFDR   R+V+ W+A+I G
Subjt:  AFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAG

Query:  YGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAAL
        YG HGR  EAI ++  M R G+ PN VTFL +L AC+ SG+   GW  F  M  DHKI P+  HYAC+I+LLGR G LD+AY +I+  P QP   +W AL
Subjt:  YGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAAL

Query:  LRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLK
        L AC+ + ++ELG++AA++L+ ++P    +Y+ L N+Y ++      A+V   +K KGL     CSW+EV  +  AF  GDK H + E++  +V+ +  +
Subjt:  LRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLK

Query:  VSKLGYVPEQNFMLPDV-DEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
        + + G+V  ++  L D+ DE  E+T   HSE++AIAYGL++T + TPL+I ++ R C +CH+  KLI+ L  REIVVRD +RFHHF+DG CSCGDYW
Subjt:  VSKLGYVPEQNFMLPDV-DEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g01510 | e value: 1.3e-126 | % identity: 37.92
Query:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDS
        Y +++ +F +   + G    + TF  ++ A +GL      ++L    V  GF  D  + N+IL  + K   +++   LFDEMPE + VS+  +I  Y  +
Subjt:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDS

Query:  GNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAF
          Y  +   F  M    +       ATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T V W ++I+ Y  
Subjt:  GNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAF

Query:  HGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYG
         G     L L+ KMR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  RN +SWNALI+ + 
Subjt:  HGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ+M+  + I P+  HYACM++LLGR G   EA  L+ + PF+P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLR

Query:  ACRVYGNLELGKFAAEKLYGMEPVK-LSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKV
        ACR++ N  L + AAEKL+ ME ++  + Y+ + NIY ++G+ ++  DV + ++ +G++ +PA SW+EVN++ H F S D+ H   +++V K++EL  ++
Subjt:  ACRVYGNLELGKFAAEKLYGMEPVK-LSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKV

Query:  SKLGYVPEQNFMLPDVDEHEE-KTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
         + GY P+ + ++ DVDE  + ++  YHSE+LA+A+ L++T E  P+ ++++ R C DCH+ IKLI+ + KREI VRD SRFHHF +G CSCGDYW
Subjt:  SKLGYVPEQNFMLPDVDEHEE-KTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial (e value: 3.2e-130; %identity: 39.48)
Query:  GFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWE
        G    ++T+  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SWTT+I  Y     + +A  L +LM  
Subjt:  GFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWE

Query:  EYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMR
        +       T ++++R+  G+  +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFD+M     + WNSII  +A +  S+ AL+L+ +M+
Subjt:  EYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMR

Query:  DSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKM
         +G   +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+++FE+M
Subjt:  DSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKM

Query:  LREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAA
           G  PN++T + VL ACS +GL E GW  F+SM + + I P   HY CMI+LLG+ G LD+A  L+ +   +P    W  LL ACRV  N+ L ++AA
Subjt:  LREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAA

Query:  EKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDV
        +K+  ++P     Y +L NIY +S K     ++   ++ +G++  P CSWIEVN Q HAF+ GD  H QI +V  K+++L+ +++ +GYVPE NF+L D+
Subjt:  EKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDV

Query:  D-EHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
        + E  E +  +HSEKLA+A+GL+    +  ++I ++ RICGDCH   KL + L  R IV+RD  R+HHF+DG CSCGDYW
Subjt:  D-EHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

Arabidopsis top hits (e value, %identity, alignment)
AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.3e-131; %identity: 39.48)
Query:  GFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWE
        G    ++T+  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SWTT+I  Y     + +A  L +LM  
Subjt:  GFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILMWE

Query:  EYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMR
        +       T ++++R+  G+  +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFD+M     + WNSII  +A +  S+ AL+L+ +M+
Subjt:  EYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMR

Query:  DSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKM
         +G   +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+++FE+M
Subjt:  DSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKM

Query:  LREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAA
           G  PN++T + VL ACS +GL E GW  F+SM + + I P   HY CMI+LLG+ G LD+A  L+ +   +P    W  LL ACRV  N+ L ++AA
Subjt:  LREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAA

Query:  EKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDV
        +K+  ++P     Y +L NIY +S K     ++   ++ +G++  P CSWIEVN Q HAF+ GD  H QI +V  K+++L+ +++ +GYVPE NF+L D+
Subjt:  EKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDV

Query:  D-EHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
        + E  E +  +HSEKLA+A+GL+    +  ++I ++ RICGDCH   KL + L  R IV+RD  R+HHF+DG CSCGDYW
Subjt:  D-EHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.9e-128; %identity: 37.92)
Query:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDS
        Y +++ +F +   + G    + TF  ++ A +GL      ++L    V  GF  D  + N+IL  + K   +++   LFDEMPE + VS+  +I  Y  +
Subjt:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDS

Query:  GNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAF
          Y  +   F  M    +       ATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T V W ++I+ Y  
Subjt:  GNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAF

Query:  HGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYG
         G     L L+ KMR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  RN +SWNALI+ + 
Subjt:  HGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ+M+  + I P+  HYACM++LLGR G   EA  L+ + PF+P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLR

Query:  ACRVYGNLELGKFAAEKLYGMEPVK-LSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKV
        ACR++ N  L + AAEKL+ ME ++  + Y+ + NIY ++G+ ++  DV + ++ +G++ +PA SW+EVN++ H F S D+ H   +++V K++EL  ++
Subjt:  ACRVYGNLELGKFAAEKLYGMEPVK-LSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKV

Query:  SKLGYVPEQNFMLPDVDEHEE-KTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
         + GY P+ + ++ DVDE  + ++  YHSE+LA+A+ L++T E  P+ ++++ R C DCH+ IKLI+ + KREI VRD SRFHHF +G CSCGDYW
Subjt:  SKLGYVPEQNFMLPDVDEHEE-KTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

AT3G12770.1 mitochondrial editing factor 22 (e value: 1.5e-127; %identity: 37.86)
Query:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD--EMPERNAVSWTTIIWGYV
        ++DAL M+   +L       + TF  L+ AC GL  ++  + +   +   GF+ D +++N ++ ++ KC  +  A  +F+   +PER  VSWT I+  Y 
Subjt:  YRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD--EMPERNAVSWTTIIWGYV

Query:  DSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAY
         +G  +EA  +F  M +         L +++ A   L+ +  GR +H+  +K GL  +  +  +L  MY+KCG +  A  +FD M    ++ WN++I+ Y
Subjt:  DSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAY

Query:  AFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAG
        A +GY+ EA+D++++M +  ++ D  + +  I  C+++ S+ +A+  +  + R+ +  DV  ++AL+D ++K G V+ AR VFDR   R+V+ W+A+I G
Subjt:  AFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAG

Query:  YGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAAL
        YG HGR  EAI ++  M R G+ PN VTFL +L AC+ SG+   GW  F  M  DHKI P+  HYAC+I+LLGR G LD+AY +I+  P QP   +W AL
Subjt:  YGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAAL

Query:  LRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLK
        L AC+ + ++ELG++AA++L+ ++P    +Y+ L N+Y ++      A+V   +K KGL     CSW+EV  +  AF  GDK H + E++  +V+ +  +
Subjt:  LRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLK

Query:  VSKLGYVPEQNFMLPDV-DEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW
        + + G+V  ++  L D+ DE  E+T   HSE++AIAYGL++T + TPL+I ++ R C +CH+  KLI+ L  REIVVRD +RFHHF+DG CSCGDYW
Subjt:  VSKLGYVPEQNFMLPDV-DEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 4.7e-129; %identity: 37.89)
Query:  ELEGGFHVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLF
        +LEG +   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+WTT+I GY       +A   F
Subjt:  ELEGGFHVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLF

Query:  ILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDL
          M    Y+    TL+++I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IA +A    +E+AL+L
Subjt:  ILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDL

Query:  YYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIE
        +  M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+ 
Subjt:  YYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLEL
         FE+M R G+ PN ++FL+VL+ACS SGL + GW  ++ M +D  I P A HY  +++LLGR G L+ A   I + P +P   +W ALL ACR++ N EL
Subjt:  MFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLEL

Query:  GKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNF
        G +AAE ++ ++P     +++L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H Q E++  K +E++ K+ +LGYVP+ + 
Subjt:  GKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNF

Query:  MLPDVDEHEEKTQM-YHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGS
        ++  VD+ E +  + YHSEK+A+A+ LLNT   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+D S
Subjt:  MLPDVDEHEEKTQM-YHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGS

AT5G50390.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 8.3e-251; %identity: 59.32)
Query:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRISKSSVGICSQ
        ME+PL RYQ+   D ++ +S+       +   L   R      R+ +     + CSS  QGL+P+P  +P+P +++    K+  L +T+ISKS V ICSQ
Subjt:  MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRISKSSVGICSQ

Query:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT
        IEKLVLC ++R+A E+FEI E+   F VG ST+DAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFDE+PERN  S+ 
Subjt:  IEKLVLCKKYRDALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWT

Query:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW
        +II G+V+ GNYVEAF LF +MWEE   C   T A M+RASAGL  I+ G+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT V W
Subjt:  TIIWGYVDSGNYVEAFRLFILMWEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGW

Query:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        N++IA YA HGYSEEAL L Y MRDSG+ +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +N+IS
Subjt:  NSIIAAYAFHGYSEEALDLYYKMRDSGIKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA
        WNAL+ GY NHGRG +A+++FEKM+   + PNHVTFLAVLSAC+ SGL E+GWEIF SM+  H IKPRAMHYACMIELLGR+GLLDEA A IR+AP +  
Subjt:  WNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPA

Query:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIE----K
         NMWAALL ACR+  NLELG+  AEKLYGM P KL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+FLSGD+  +  E    +
Subjt:  PNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIE----K

Query:  VVGKVDELMLKVSKLGYVPEQNFMLPDVDE-HEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDG
        +  KVDELM ++S+ GY  E+  +LPDVDE  EE+   YHSEKLAIAYGL+NT E  PLQI Q+HRIC +CH V++ I+++T RE+VVRDASRFHHF++G
Subjt:  VVGKVDELMLKVSKLGYVPEQNFMLPDVDE-HEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRICGDCHSVIKLIAMLTKREIVVRDASRFHHFRDG

Query:  SCSCGDYW
         CSCG YW
Subjt:  SCSCGDYW


Sequences
CDS sequence
ATGGAAGTCCCTCTCTTCCGCTATCAAAACTATGTTTATGATCCGCTTCAATGTAACTCCACTTCCTACTTCTCCGTGCGTTTCTCAGATTCCGAGCTTTTTATGAAGAG
ATCTTTGCTTTCTAATACAAGAAAACCCCGTAAATCACTTTGTTCGATCAAGTGCTCTTCGTTTGAACAAGGGCTACGCCCACGGCCCCAACCTAAACCTTCCAAAGTTG
ATCCGGACGTTCGTAAAGAAACCCCTTTGAAGGAGACCCGTATTAGTAAATCCAGTGTAGGGATCTGTAGCCAGATAGAGAAGCTGGTTTTGTGTAAAAAATATCGAGAT
GCACTTGAGATGTTTGAAATTTTTGAACTGGAAGGTGGTTTTCATGTTGGTAACAGCACGTTCGATGCGTTGATTAATGCGTGTATTGGGTTGAAGTCTATACGAGGCGT
GAAGAGGTTGTGTAATTACATGGTTGATAATGGATTTGAACCCGATCAGTACATGAGGAACAGGATTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTA
GATTGTTCGATGAAATGCCTGAAAGGAATGCAGTTTCGTGGACTACTATAATTTGGGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTTAGATTGTTCATTTTGATG
TGGGAAGAGTATTATGCTTGTGGGCCTCGTACCTTAGCCACAATGATACGGGCGTCGGCTGGTTTGGAACTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGATAAA
GGCAGGTTTGGGACGGGACATTTTTGTTTCTTGTGCGCTGATTGATATGTACAGCAAGTGTGGAAGCCTCGAAGATGCTCATTGTGTTTTTGATGATATGCCCGATAAGA
CGATAGTTGGATGGAATTCAATTATAGCTGCTTACGCATTCCATGGCTACAGTGAGGAAGCTCTGGATCTATATTACAAGATGCGTGACTCCGGTATTAAAATGGACCAT
TTCACCTTTTCTATAATTATAAGAATATGCTCGAGATTGGCCTCGGTAGCACGTGCTAAGCAAGCGCATGCGAGTTTAGTTCGAAATGGCTTTGGGTTAGATGTAGTAGC
TAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGACATGTTTTTGACAGGATGTCCTGTAGAAACGTAATATCATGGAATGCTTTGATTG
CTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTCGAGAAGATGCTTCGGGAAGGCATGATACCAAACCATGTGACGTTTCTTGCTGTTTTATCTGCT
TGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCGATGACTAGAGATCACAAGATTAAACCGCGTGCTATGCATTATGCGTGCATGATTGAATTGCT
AGGTCGAGAAGGGCTCCTTGATGAAGCCTATGCTCTTATAAGGAAAGCTCCATTTCAACCCGCACCAAATATGTGGGCTGCCTTACTTAGAGCTTGTAGAGTTTATGGAA
ATCTAGAACTTGGGAAGTTTGCTGCTGAAAAACTTTATGGGATGGAACCCGTGAAGCTTAGTAATTATATTGTGCTTTTAAACATATACAACAGTTCTGGTAAGTTAAAG
GAAGCAGCTGATGTTGTTCAGACATTGAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATTGAAGTTAATAACCAACCCCATGCATTCCTGTCTGGGGATAA
ACACCATGCCCAAATAGAAAAAGTTGTGGGAAAAGTGGATGAATTAATGTTGAAGGTCTCAAAGCTTGGTTATGTACCTGAACAGAACTTCATGCTTCCAGATGTAGATG
AACATGAAGAAAAGACACAGATGTACCACAGTGAGAAATTGGCAATAGCTTATGGATTATTAAATACTTTAGAGCAAACGCCATTGCAGATTGTACAGAGCCATCGCATT
TGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGTTAACCAAACGTGAAATTGTGGTCAGAGATGCTAGCAGATTCCATCATTTCAGAGATGGGAGTTGCTCTTG
TGGAGACTATTGGTGA
mRNA sequence
ATGGAAGTCCCTCTCTTCCGCTATCAAAACTATGTTTATGATCCGCTTCAATGTAACTCCACTTCCTACTTCTCCGTGCGTTTCTCAGATTCCGAGCTTTTTATGAAGAG
ATCTTTGCTTTCTAATACAAGAAAACCCCGTAAATCACTTTGTTCGATCAAGTGCTCTTCGTTTGAACAAGGGCTACGCCCACGGCCCCAACCTAAACCTTCCAAAGTTG
ATCCGGACGTTCGTAAAGAAACCCCTTTGAAGGAGACCCGTATTAGTAAATCCAGTGTAGGGATCTGTAGCCAGATAGAGAAGCTGGTTTTGTGTAAAAAATATCGAGAT
GCACTTGAGATGTTTGAAATTTTTGAACTGGAAGGTGGTTTTCATGTTGGTAACAGCACGTTCGATGCGTTGATTAATGCGTGTATTGGGTTGAAGTCTATACGAGGCGT
GAAGAGGTTGTGTAATTACATGGTTGATAATGGATTTGAACCCGATCAGTACATGAGGAACAGGATTCTACTTATGCATGTGAAATGTGGGATGATGATTGATGCTTGTA
GATTGTTCGATGAAATGCCTGAAAGGAATGCAGTTTCGTGGACTACTATAATTTGGGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTTAGATTGTTCATTTTGATG
TGGGAAGAGTATTATGCTTGTGGGCCTCGTACCTTAGCCACAATGATACGGGCGTCGGCTGGTTTGGAACTTATTTTTCCTGGTAGGCAATTGCATTCATGTGCGATAAA
GGCAGGTTTGGGACGGGACATTTTTGTTTCTTGTGCGCTGATTGATATGTACAGCAAGTGTGGAAGCCTCGAAGATGCTCATTGTGTTTTTGATGATATGCCCGATAAGA
CGATAGTTGGATGGAATTCAATTATAGCTGCTTACGCATTCCATGGCTACAGTGAGGAAGCTCTGGATCTATATTACAAGATGCGTGACTCCGGTATTAAAATGGACCAT
TTCACCTTTTCTATAATTATAAGAATATGCTCGAGATTGGCCTCGGTAGCACGTGCTAAGCAAGCGCATGCGAGTTTAGTTCGAAATGGCTTTGGGTTAGATGTAGTAGC
TAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGACATGTTTTTGACAGGATGTCCTGTAGAAACGTAATATCATGGAATGCTTTGATTG
CTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTCGAGAAGATGCTTCGGGAAGGCATGATACCAAACCATGTGACGTTTCTTGCTGTTTTATCTGCT
TGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCGATGACTAGAGATCACAAGATTAAACCGCGTGCTATGCATTATGCGTGCATGATTGAATTGCT
AGGTCGAGAAGGGCTCCTTGATGAAGCCTATGCTCTTATAAGGAAAGCTCCATTTCAACCCGCACCAAATATGTGGGCTGCCTTACTTAGAGCTTGTAGAGTTTATGGAA
ATCTAGAACTTGGGAAGTTTGCTGCTGAAAAACTTTATGGGATGGAACCCGTGAAGCTTAGTAATTATATTGTGCTTTTAAACATATACAACAGTTCTGGTAAGTTAAAG
GAAGCAGCTGATGTTGTTCAGACATTGAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATTGAAGTTAATAACCAACCCCATGCATTCCTGTCTGGGGATAA
ACACCATGCCCAAATAGAAAAAGTTGTGGGAAAAGTGGATGAATTAATGTTGAAGGTCTCAAAGCTTGGTTATGTACCTGAACAGAACTTCATGCTTCCAGATGTAGATG
AACATGAAGAAAAGACACAGATGTACCACAGTGAGAAATTGGCAATAGCTTATGGATTATTAAATACTTTAGAGCAAACGCCATTGCAGATTGTACAGAGCCATCGCATT
TGCGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGTTAACCAAACGTGAAATTGTGGTCAGAGATGCTAGCAGATTCCATCATTTCAGAGATGGGAGTTGCTCTTG
TGGAGACTATTGGTGA
Protein sequence
MEVPLFRYQNYVYDPLQCNSTSYFSVRFSDSELFMKRSLLSNTRKPRKSLCSIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRISKSSVGICSQIEKLVLCKKYRD
ALEMFEIFELEGGFHVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDEMPERNAVSWTTIIWGYVDSGNYVEAFRLFILM
WEEYYACGPRTLATMIRASAGLELIFPGRQLHSCAIKAGLGRDIFVSCALIDMYSKCGSLEDAHCVFDDMPDKTIVGWNSIIAAYAFHGYSEEALDLYYKMRDSGIKMDH
FTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIEMFEKMLREGMIPNHVTFLAVLSA
CSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPAPNMWAALLRACRVYGNLELGKFAAEKLYGMEPVKLSNYIVLLNIYNSSGKLK
EAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHAQIEKVVGKVDELMLKVSKLGYVPEQNFMLPDVDEHEEKTQMYHSEKLAIAYGLLNTLEQTPLQIVQSHRI
CGDCHSVIKLIAMLTKREIVVRDASRFHHFRDGSCSCGDYW