; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0240071 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0240071
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCMiso1.1chr09:2180540..2184757
RNA-Seq ExpressionCmc09g0240071
SyntenyCmc09g0240071
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039490.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0095.96Show/hide
Query:  MKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDAL
        MKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDAL
Subjt:  MKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDAL

Query:  INACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT
        INACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT
Subjt:  INACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT

Query:  MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFS
        MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFS
Subjt:  MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFS

Query:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTF
        IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTF
Subjt:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTF

Query:  LAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG----------------
        LAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG                
Subjt:  LAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG----------------

Query:  ----------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYH
                  +SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYH
Subjt:  ----------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYH

Query:  SEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
        SEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
Subjt:  SEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

XP_004148701.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus]0.0e+0091.08Show/hide
Query:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGL--RPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGIC
        MNMELPLSRYQNYVYDRLQC ST +FSLRYSDS LF KTSFLSN RK RNSFCW+KCSSFEQGL  RPRPQPKPSKLDVG RKE PLKET V+KSSVGIC
Subjt:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGL--RPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGIC

Query:  SQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVS
        SQIEKLVLCK+YRDALEMFEIFELEDGFHVG STYDALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP RNAVS
Subjt:  SQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVS

Query:  WSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIV
        W TIISGYVDSGNYVEAFRLFILM EE Y CGPRT ATMIRASAGLEIIF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIV
Subjt:  WSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIV

Query:  GWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNV
        GWNSIIAGYALHGYSEEALDLYHEM  SGVKMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN+
Subjt:  GWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNV

Query:  ISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQ
        ISWNALIAGYGNHG GEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKV+PRAMHFACMIELLGREGLLDEAYALIRKAPFQ
Subjt:  ISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQ

Query:  PTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVV
        PTANMWAALLRACRVHGNLELG                          +SGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQ+EKVV
Subjt:  PTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVV

Query:  GKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNC
        GKVDELML ISKLGYVPEEQNFMLPDVDE+EEKIRMYHSEKLAIAYGLLNTLE+TPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDG+C
Subjt:  GKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNC

Query:  SCGDYW
        SCGDYW
Subjt:  SCGDYW

XP_008459324.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo]0.0e+0096.16Show/hide
Query:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ
        MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ
Subjt:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ

Query:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS
        IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS
Subjt:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS

Query:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
        TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
Subjt:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW

Query:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
Subjt:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT
        WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT
Subjt:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT

Query:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK
        ANMWAALLRACRVHGNLELG                           SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK
Subjt:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK

Query:  VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC
        VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC
Subjt:  VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC

Query:  GDYW
        GDYW
Subjt:  GDYW

XP_022133879.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia]0.0e+0081.54Show/hide
Query:  MNMELPLSRYQNYVYDRLQCYST----PYFSLRYSDSHLFMKTSFL------SNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETP-
        M ME+PL RYQNYVYDRLQC ST     Y  +R++DS LF K S L      SNRRK RNSFCW+KCSS EQGLRPRP+P+PSK+D  VRK     ET  
Subjt:  MNMELPLSRYQNYVYDRLQCYST----PYFSLRYSDSHLFMKTSFL------SNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETP-

Query:  VRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD
        +RKS VGICSQIEKLVLCK+YRDALEMFEIFELE G+ +GNSTYDALINACIGLKSIRGVKRL NYM+DNGFEPDQYM+NR+LLMHVKCGMMIDACRLFD
Subjt:  VRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSWSTIISGYVDSGNY+EAFRLFI+MWEE    GPRT A MIRASAGLE+IF GRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV
        DEMPDKTIVGWNSIIAGYALHGYSEEALDL +EM  SG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV

Query:  FDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAY
        FDRMS +N+ISWNALIAGYGNHGRGEEAI MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHK++PRAMHFACMIELLGREGLLDEAY
Subjt:  FDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAY

Query:  ALIRKAPFQPTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK
        ALIR APF+PTANMWAALLRACRVH NLELG                          +SGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+FLSGDK
Subjt:  ALIRKAPFQPTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK

Query:  HHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASR
        HH ++EKVV KVDE+MLKISKLGYV  EQNF+LPDVDE EEKI MYHSEKLAIAYGLL+TL++TPLQIVQSHRIC DCHS IKLIA+IT+REIV+RDASR
Subjt:  HHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASR

Query:  FHHFRDGNCSCGDYW
        FHHFRDG+CSCGDYW
Subjt:  FHHFRDGNCSCGDYW

XP_038890388.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida]0.0e+0087.5Show/hide
Query:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ
        MNME+PLS YQNY+YDR+QC ST Y SLR+S   LF +  FL NRRKCRNS  W+KCSSFEQGLRPRPQPKPSKLD GV K  PLKET V +SSVGICSQ
Subjt:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ

Query:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS
        IEKLVLCK+YRDALEMFEIFELE GFH GN+T DALINAC+ LKSIRGVK+L NYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD+MPERNAVSW+
Subjt:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS

Query:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
        TIISG+VDSGNYVEAFRLFILMWEE Y CGPRT ATMIRASAGLE+IF GRQLHSCAIKA LGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
Subjt:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW

Query:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        NSIIAGYALHGYSEEALDLY+EM  SG+KMDHFTFSIIIRICSRLASVA AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN+IS
Subjt:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT
        WNALIAGYGNHGRG EAIDMFEKMLREG +PNHVTFLAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQPT
Subjt:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT

Query:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK
        ANMWAALLRACRVHGNLELG                          +SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQ+EKVVGK
Subjt:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK

Query:  VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC
        VDELMLKISKLGYVPEEQNFMLPDVDEHEEKI+MYHSEKLAIAYGLLNTLE+TPLQIVQSHRICSDCH VIKLIAMITKREIVIRDASRFHHFRDG+CSC
Subjt:  VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC

Query:  GDYW
        GDYW
Subjt:  GDYW

TrEMBL top hitse value%identityAlignment
A0A0A0KXD9 DYW_deaminase domain-containing protein0.0e+0091.08Show/hide
Query:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGL--RPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGIC
        MNMELPLSRYQNYVYDRLQC ST +FSLRYSDS LF KTSFLSN RK RNSFCW+KCSSFEQGL  RPRPQPKPSKLDVG RKE PLKET V+KSSVGIC
Subjt:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGL--RPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGIC

Query:  SQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVS
        SQIEKLVLCK+YRDALEMFEIFELEDGFHVG STYDALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP RNAVS
Subjt:  SQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVS

Query:  WSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIV
        W TIISGYVDSGNYVEAFRLFILM EE Y CGPRT ATMIRASAGLEIIF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIV
Subjt:  WSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIV

Query:  GWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNV
        GWNSIIAGYALHGYSEEALDLYHEM  SGVKMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRN+
Subjt:  GWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNV

Query:  ISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQ
        ISWNALIAGYGNHG GEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKV+PRAMHFACMIELLGREGLLDEAYALIRKAPFQ
Subjt:  ISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQ

Query:  PTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVV
        PTANMWAALLRACRVHGNLELG                          +SGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQ+EKVV
Subjt:  PTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVV

Query:  GKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNC
        GKVDELML ISKLGYVPEEQNFMLPDVDE+EEKIRMYHSEKLAIAYGLLNTLE+TPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDG+C
Subjt:  GKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNC

Query:  SCGDYW
        SCGDYW
Subjt:  SCGDYW

A0A1S3C9W7 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0096.16Show/hide
Query:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ
        MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ
Subjt:  MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ

Query:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS
        IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS
Subjt:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS

Query:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
        TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
Subjt:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW

Query:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
Subjt:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT
        WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT
Subjt:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT

Query:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK
        ANMWAALLRACRVHGNLELG                           SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK
Subjt:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGK

Query:  VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC
        VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC
Subjt:  VDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSC

Query:  GDYW
        GDYW
Subjt:  GDYW

A0A5A7T8C6 Pentatricopeptide repeat-containing protein0.0e+0095.96Show/hide
Query:  MKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDAL
        MKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDAL
Subjt:  MKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDAL

Query:  INACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT
        INACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT
Subjt:  INACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLAT

Query:  MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFS
        MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFS
Subjt:  MIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFS

Query:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTF
        IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTF
Subjt:  IIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTF

Query:  LAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG----------------
        LAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG                
Subjt:  LAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG----------------

Query:  ----------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYH
                  +SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYH
Subjt:  ----------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYH

Query:  SEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
        SEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
Subjt:  SEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

A0A6J1BWH3 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0081.54Show/hide
Query:  MNMELPLSRYQNYVYDRLQCYST----PYFSLRYSDSHLFMKTSFL------SNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETP-
        M ME+PL RYQNYVYDRLQC ST     Y  +R++DS LF K S L      SNRRK RNSFCW+KCSS EQGLRPRP+P+PSK+D  VRK     ET  
Subjt:  MNMELPLSRYQNYVYDRLQCYST----PYFSLRYSDSHLFMKTSFL------SNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETP-

Query:  VRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD
        +RKS VGICSQIEKLVLCK+YRDALEMFEIFELE G+ +GNSTYDALINACIGLKSIRGVKRL NYM+DNGFEPDQYM+NR+LLMHVKCGMMIDACRLFD
Subjt:  VRKSSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD

Query:  EMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF
        EMPERNAVSWSTIISGYVDSGNY+EAFRLFI+MWEE    GPRT A MIRASAGLE+IF GRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  EMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV
        DEMPDKTIVGWNSIIAGYALHGYSEEALDL +EM  SG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV

Query:  FDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAY
        FDRMS +N+ISWNALIAGYGNHGRGEEAI MFE+MLREGM PNHVTFLAVLSACSISGLFERGWEIFQS+T DHK++PRAMHFACMIELLGREGLLDEAY
Subjt:  FDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAY

Query:  ALIRKAPFQPTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK
        ALIR APF+PTANMWAALLRACRVH NLELG                          +SGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+FLSGDK
Subjt:  ALIRKAPFQPTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK

Query:  HHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASR
        HH ++EKVV KVDE+MLKISKLGYV  EQNF+LPDVDE EEKI MYHSEKLAIAYGLL+TL++TPLQIVQSHRIC DCHS IKLIA+IT+REIV+RDASR
Subjt:  HHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASR

Query:  FHHFRDGNCSCGDYW
        FHHFRDG+CSCGDYW
Subjt:  FHHFRDGNCSCGDYW

A0A6J1JGW0 pentatricopeptide repeat-containing protein At5g50390, chloroplastic0.0e+0079.07Show/hide
Query:  MELPLSRYQNYVYDRLQ----CYSTPYFSLRYSDSHLFMKTSFL------SNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRK
        ME+PL  YQNYV+D L+      ST YFS  +S S LF   S L      SNRRK RNSFCWVKCSS EQGLRPR +PKPSK+D  VRK  P KET + K
Subjt:  MELPLSRYQNYVYDRLQ----CYSTPYFSLRYSDSHLFMKTSFL------SNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRK

Query:  SSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP
        SSV IC  IEKLVLC ++RDALEMFEI ELE G+ VGNST+DALI ACIGLKSIRG KRL  YM+DNG EPDQY+ NR+LLMHV+CGMMIDA +LFDEMP
Subjt:  SSVGICSQIEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMP

Query:  ERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSW+TIISGYVDSGNY EAFRLFI+MWEE  GC PRT AT+IRASAGLE+IF G+QLHSCA+KAG+GQDIFVSCALIDMYSKCG LEDAHCVFDEM
Subjt:  ERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIAGYALHG+SEEAL+LY +M  SGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDR
Subjt:  PDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALI
        MSC+N+ISWNALIAGYGNHGRGEEAI++FE+MLREGM+PNHVTFLAVLSACSISGLFERGWEIFQSMTRDHK++ RAMH+ CMIELLGREGLLDEAYALI
Subjt:  MSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALI

Query:  RKAPFQPTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHV
        RKAPFQPTANMWAALLRACRVH NLELG                          +SGKLKEAADVV+TLKRKGL MLPACSWIEV +QPHAFLSGDKHH 
Subjt:  RKAPFQPTANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHV

Query:  QLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHH
        ++EKVV KVDELML+ISKLGYVP EQN +LPDVD HEEKI++YHSEKLAIAYGL+NTL++TPLQIVQ HR+C DCHSVIKLIAMITKREIV+RDASRFHH
Subjt:  QLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        FRDG CSCGDYW
Subjt:  FRDGNCSCGDYW

SwissProt top hitse value%identityAlignment
Q9FK33 Pentatricopeptide repeat-containing protein At5g50390, chloroplastic1.7e-23757.26Show/hide
Query:  MELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRP--QPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ
        ME+PLSRYQ+   D ++  S+       +   L     F    R+ +N F  + CSS  QGL+P+P  +P+P +++V   K+  L +T + KS V ICSQ
Subjt:  MELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRP--QPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ

Query:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS
        IEKLVLC ++R+A E+FEI E+   F VG STYDAL+ ACI LKSIR VKR++ +M+ NGFEP+QYM NR+LLMHVKCGM+IDA RLFDE+PERN  S+ 
Subjt:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS

Query:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
        +IISG+V+ GNYVEAF LF +MWEE   C   T A M+RASAGL  I+VG+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT V W
Subjt:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW

Query:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        N++IAGYALHGYSEEAL L ++M  SGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +N+IS
Subjt:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT
        WNAL+ GY NHGRG +A+ +FEKM+   + PNHVTFLAVLSAC+ SGL E+GWEIF SM+  H ++PRAMH+ACMIELLGR+GLLDEA A IR+AP + T
Subjt:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT

Query:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK----HHVQLEK
         NMWAALL ACR+  NLELG                          + GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+FLSGD+    +     +
Subjt:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK----HHVQLEK

Query:  VVGKVDELMLKISKLGYVPEEQNFMLPDVDE-HEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRD
        +  KVDELM +IS+ GY  EEQ+ +LPDVDE  EE++  YHSEKLAIAYGL+NT E  PLQI Q+HRIC +CH V++ I+++T RE+V+RDASRFHHF++
Subjt:  VVGKVDELMLKISKLGYVPEEQNFMLPDVDE-HEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRD

Query:  GNCSCGDYW
        G CSCG YW
Subjt:  GNCSCGDYW

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial9.3e-12737.87Show/hide
Query:  YDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPR
        Y+ L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+W+T+ISGY       +A   F  M    Y     
Subjt:  YDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPR

Query:  TLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDH
        TL+++I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL+L+  M R G +  H
Subjt:  TLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDH

Query:  FTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPN
        F+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+  FE+M R G+ PN
Subjt:  FTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPN

Query:  HVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG------------
         ++FL+VL+ACS SGL + GW  ++ M +D  V P A H+  +++LLGR G L+ A   I + P +PTA +W ALL ACR+H N ELG            
Subjt:  HVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG------------

Query:  --------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKI
                      + G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H Q E++  K +E++ KI +LGYVP+  + ++  VD+ E ++
Subjt:  --------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKI

Query:  RM-YHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
         + YHSEK+A+A+ LLNT   + + I ++ R+C DCH+ IKL + +  REI++RD +RFHHF+DGNCSC DYW
Subjt:  RM-YHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127701.3e-11736.44Show/hide
Query:  YRDALEMFEIFEL----EDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD--EMPERNAVSWSTII
        ++DAL M+   +L     D F     T+  L+ AC GL  ++  + +   +   GF+ D +++N ++ ++ KC  +  A  +F+   +PER  VSW+ I+
Subjt:  YRDALEMFEIFEL----EDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD--EMPERNAVSWSTII

Query:  SGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSI
        S Y  +G  +EA  +F  M +         L +++ A   L+ +  GR +H+  +K GL  +  +  +L  MY+KCG +  A  +FD+M    ++ WN++
Subjt:  SGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSI

Query:  IAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNA
        I+GYA +GY+ EA+D++HEM    V+ D  + +  I  C+++ S+ +A+  +  + R+ +  DV  ++AL+D ++K G V+ AR VFDR   R+V+ W+A
Subjt:  IAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNA

Query:  LIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANM
        +I GYG HGR  EAI ++  M R G+ PN VTFL +L AC+ SG+   GW  F  M  DHK+ P+  H+AC+I+LLGR G LD+AY +I+  P QP   +
Subjt:  LIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANM

Query:  WAALLRACRVHGNLELG-------------NSGKLKE-------------AADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDE
        W ALL AC+ H ++ELG             N+G   +              A+V   +K KGL     CSW+EV  +  AF  GDK H + E++  +V+ 
Subjt:  WAALLRACRVHGNLELG-------------NSGKLKE-------------AADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDE

Query:  LMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDY
        +  ++ + G+V  +   +    DE  E+    HSE++AIAYGL++T + TPL+I ++ R C +CH+  KLI+ +  REIV+RD +RFHHF+DG CSCGDY
Subjt:  LMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDY

Query:  W
        W
Subjt:  W

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233302.5e-11935.78Show/hide
Query:  NSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ + +++ +C  +  +R  + +  ++V  G + D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS++TII+GY  SG Y +A R+   M          TL++++   +    +  G+++H   I+ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  + WNS++AGY  +G   EAL L+ +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F+AVL+ACS  GL +  W  F SMT+ + +     H+A + +LLGR G L+EAY  I 
Subjt:  SCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIR

Query:  KAPFQPTANMWAALLRACRVHGNLEL--------------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQ
        K   +PT ++W+ LL +C VH NLEL                           ++G+ KE A +   +++KGLR  PACSWIE+ N+ H F+SGD+ H  
Subjt:  KAPFQPTANMWAALLRACRVHGNLEL--------------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQ

Query:  LEKVVGKVDELMLKISKLGYVPEEQNFMLPDVD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHH
        ++K+   +  +M ++ K GYV +    +L DVD EH+ ++   HSE+LA+A+G++NT   T +++ ++ RIC+DCH  IK I+ IT+REI++RD SRFHH
Subjt:  LEKVVGKVDELMLKISKLGYVPEEQNFMLPDVD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        F  GNCSCGDYW
Subjt:  FRDGNCSCGDYW

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial3.2e-11938.04Show/hide
Query:  GFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWE
        G    ++TY  LI  CI  +++     +  ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW+T+IS Y     + +A  L +LM  
Subjt:  GFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWE

Query:  ESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMC
        ++      T ++++R+  G+  +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFDEM     + WNSII G+A +  S+ AL+L+  M 
Subjt:  ESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMC

Query:  RSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKM
        R+G   +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+ +FE+M
Subjt:  RSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKM

Query:  LREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLEL-----
           G  PN++T + VL ACS +GL E GW  F+SM + + + P   H+ CMI+LLG+ G LD+A  L+ +   +P A  W  LL ACRV  N+ L     
Subjt:  LREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLEL-----

Query:  ---------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPD
                              NS K     ++   ++ +G++  P CSWIEVN Q HAF+ GD  H Q+ +V  K+++L+ +++ +GYVP E NF+L D
Subjt:  ---------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPD

Query:  VD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
        ++ E  E    +HSEKLA+A+GL+       ++I ++ RIC DCH   KL + +  R IVIRD  R+HHF+DG CSCGDYW
Subjt:  VD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

Arabidopsis top hitse value%identityAlignment
AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-12038.04Show/hide
Query:  GFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWE
        G    ++TY  LI  CI  +++     +  ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW+T+IS Y     + +A  L +LM  
Subjt:  GFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWE

Query:  ESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMC
        ++      T ++++R+  G+  +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFDEM     + WNSII G+A +  S+ AL+L+  M 
Subjt:  ESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMC

Query:  RSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKM
        R+G   +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+ +FE+M
Subjt:  RSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKM

Query:  LREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLEL-----
           G  PN++T + VL ACS +GL E GW  F+SM + + + P   H+ CMI+LLG+ G LD+A  L+ +   +P A  W  LL ACRV  N+ L     
Subjt:  LREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLEL-----

Query:  ---------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPD
                              NS K     ++   ++ +G++  P CSWIEVN Q HAF+ GD  H Q+ +V  K+++L+ +++ +GYVP E NF+L D
Subjt:  ---------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPD

Query:  VD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW
        ++ E  E    +HSEKLA+A+GL+       ++I ++ RIC DCH   KL + +  R IVIRD  R+HHF+DG CSCGDYW
Subjt:  VD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDYW

AT3G12770.1 mitochondrial editing factor 229.6e-11936.44Show/hide
Query:  YRDALEMFEIFEL----EDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD--EMPERNAVSWSTII
        ++DAL M+   +L     D F     T+  L+ AC GL  ++  + +   +   GF+ D +++N ++ ++ KC  +  A  +F+   +PER  VSW+ I+
Subjt:  YRDALEMFEIFEL----EDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFD--EMPERNAVSWSTII

Query:  SGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSI
        S Y  +G  +EA  +F  M +         L +++ A   L+ +  GR +H+  +K GL  +  +  +L  MY+KCG +  A  +FD+M    ++ WN++
Subjt:  SGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSI

Query:  IAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNA
        I+GYA +GY+ EA+D++HEM    V+ D  + +  I  C+++ S+ +A+  +  + R+ +  DV  ++AL+D ++K G V+ AR VFDR   R+V+ W+A
Subjt:  IAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNA

Query:  LIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANM
        +I GYG HGR  EAI ++  M R G+ PN VTFL +L AC+ SG+   GW  F  M  DHK+ P+  H+AC+I+LLGR G LD+AY +I+  P QP   +
Subjt:  LIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANM

Query:  WAALLRACRVHGNLELG-------------NSGKLKE-------------AADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDE
        W ALL AC+ H ++ELG             N+G   +              A+V   +K KGL     CSW+EV  +  AF  GDK H + E++  +V+ 
Subjt:  WAALLRACRVHGNLELG-------------NSGKLKE-------------AADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDE

Query:  LMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDY
        +  ++ + G+V  +   +    DE  E+    HSE++AIAYGL++T + TPL+I ++ R C +CH+  KLI+ +  REIV+RD +RFHHF+DG CSCGDY
Subjt:  LMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGNCSCGDY

Query:  W
        W
Subjt:  W

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-12035.78Show/hide
Query:  NSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------
        ++ + +++ +C  +  +R  + +  ++V  G + D Y  N ++ M+ K    G  I    +FDEMP+R                                
Subjt:  NSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVK---CGMMIDACRLFDEMPER--------------------------------

Query:  -NAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS++TII+GY  SG Y +A R+   M          TL++++   +    +  G+++H   I+ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  + WNS++AGY  +G   EAL L+ +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M R+G+ PN V F+AVL+ACS  GL +  W  F SMT+ + +     H+A + +LLGR G L+EAY  I 
Subjt:  SCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIR

Query:  KAPFQPTANMWAALLRACRVHGNLEL--------------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQ
        K   +PT ++W+ LL +C VH NLEL                           ++G+ KE A +   +++KGLR  PACSWIE+ N+ H F+SGD+ H  
Subjt:  KAPFQPTANMWAALLRACRVHGNLEL--------------------------GNSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQ

Query:  LEKVVGKVDELMLKISKLGYVPEEQNFMLPDVD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHH
        ++K+   +  +M ++ K GYV +    +L DVD EH+ ++   HSE+LA+A+G++NT   T +++ ++ RIC+DCH  IK I+ IT+REI++RD SRFHH
Subjt:  LEKVVGKVDELMLKISKLGYVPEEQNFMLPDVD-EHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHH

Query:  FRDGNCSCGDYW
        F  GNCSCGDYW
Subjt:  FRDGNCSCGDYW

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-12136.93Show/hide
Query:  YDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPR
        Y+ L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F++MP+R+ V+W+T+ISGY       +A   F  M    Y     
Subjt:  YDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFILMWEESYGCGPR

Query:  TLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDH
        TL+++I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL+L+  M R G +  H
Subjt:  TLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKMDH

Query:  FTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPN
        F+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+  FE+M R G+ PN
Subjt:  FTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPN

Query:  HVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG------------
         ++FL+VL+ACS SGL + GW  ++ M +D  V P A H+  +++LLGR G L+ A   I + P +PTA +W ALL ACR+H N ELG            
Subjt:  HVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELG------------

Query:  --------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKI
                      + G+  +AA V + +K  G++  PACSW+E+ N  H F++ D+ H Q E++  K +E++ KI +LGYVP+  + ++  VD+ E ++
Subjt:  --------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKI

Query:  RM-YHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGN
         + YHSEK+A+A+ LLNT   + + I ++ R+C DCH+ IKL + +  REI++RD +RFHHF+D +
Subjt:  RM-YHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRDGN

AT5G50390.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.2e-23857.26Show/hide
Query:  MELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRP--QPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ
        ME+PLSRYQ+   D ++  S+       +   L     F    R+ +N F  + CSS  QGL+P+P  +P+P +++V   K+  L +T + KS V ICSQ
Subjt:  MELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRP--QPKPSKLDVGVRKEAPLKETPVRKSSVGICSQ

Query:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS
        IEKLVLC ++R+A E+FEI E+   F VG STYDAL+ ACI LKSIR VKR++ +M+ NGFEP+QYM NR+LLMHVKCGM+IDA RLFDE+PERN  S+ 
Subjt:  IEKLVLCKQYRDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWS

Query:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW
        +IISG+V+ GNYVEAF LF +MWEE   C   T A M+RASAGL  I+VG+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT V W
Subjt:  TIISGYVDSGNYVEAFRLFILMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGW

Query:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS
        N++IAGYALHGYSEEAL L ++M  SGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +N+IS
Subjt:  NSIIAGYALHGYSEEALDLYHEMCRSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVIS

Query:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT
        WNAL+ GY NHGRG +A+ +FEKM+   + PNHVTFLAVLSAC+ SGL E+GWEIF SM+  H ++PRAMH+ACMIELLGR+GLLDEA A IR+AP + T
Subjt:  WNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVLSACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPT

Query:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK----HHVQLEK
         NMWAALL ACR+  NLELG                          + GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+FLSGD+    +     +
Subjt:  ANMWAALLRACRVHGNLELG--------------------------NSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFLSGDK----HHVQLEK

Query:  VVGKVDELMLKISKLGYVPEEQNFMLPDVDE-HEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRD
        +  KVDELM +IS+ GY  EEQ+ +LPDVDE  EE++  YHSEKLAIAYGL+NT E  PLQI Q+HRIC +CH V++ I+++T RE+V+RDASRFHHF++
Subjt:  VVGKVDELMLKISKLGYVPEEQNFMLPDVDE-HEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRDASRFHHFRD

Query:  GNCSCGDYW
        G CSCG YW
Subjt:  GNCSCGDYW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACATGGAACTCCCTCTCTCCCGCTATCAAAACTATGTTTATGATCGCCTTCAATGTTACTCCACTCCCTACTTCTCCCTGCGTTACTCAGATTCCCACCTTTTTAT
GAAAACTTCTTTTCTTTCTAATCGAAGAAAATGCCGTAATTCATTTTGTTGGGTTAAGTGTTCTTCGTTTGAACAAGGGCTACGCCCACGGCCTCAGCCTAAACCTTCGA
AACTTGATGTGGGTGTTCGTAAAGAAGCTCCTTTGAAGGAGACCCCTGTTAGGAAATCCAGTGTTGGGATCTGTAGTCAAATAGAGAAGTTGGTTTTGTGTAAACAGTAT
CGAGATGCACTTGAGATGTTTGAAATTTTTGAACTTGAAGATGGTTTTCACGTTGGTAACAGTACGTACGATGCGTTGATTAATGCATGTATTGGGTTGAAGTCTATAAG
AGGAGTGAAGAGGTTGTTTAATTACATGGTTGATAATGGATTTGAACCTGATCAGTACATGAGGAACAGGGTTCTACTTATGCATGTGAAATGTGGTATGATGATTGATG
CTTGTAGACTGTTCGACGAAATGCCTGAGAGGAATGCGGTTTCGTGGAGTACTATAATTTCCGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTTAGATTGTTCATT
TTGATGTGGGAGGAGTCTTATGGTTGTGGGCCTCGTACCCTTGCTACAATGATACGGGCATCAGCTGGTTTGGAAATTATTTTTGTTGGTAGGCAATTGCATTCATGTGC
AATAAAAGCAGGACTAGGACAGGACATTTTTGTTTCCTGTGCGTTGATTGATATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGCGTTTTTGATGAGATGCCTG
ATAAGACAATAGTTGGATGGAATTCAATTATAGCTGGTTACGCACTCCATGGCTACAGTGAAGAAGCTCTGGATCTATACCATGAGATGTGTCGCTCTGGTGTTAAGATG
GACCATTTCACTTTTTCTATAATTATAAGAATATGCTCGAGATTGGCATCGGTAGCACGTGCTAAGCAAGCACATGCGAGTTTAGTTCGTAATGGCTTTGGGTTAGATGT
AGTAGCTAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGACATGTTTTTGACAGGATGTCTTGTAGAAACGTAATATCATGGAATGCCT
TGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAAGCCATTGATATGTTTGAGAAGATGCTTCGGGAAGGCATGATGCCAAACCATGTGACATTTCTTGCTGTTTTA
TCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCAATGACTAGAGATCACAAAGTTAGACCACGTGCTATGCATTTTGCATGCATGATTGA
ATTGCTAGGTCGAGAAGGGCTGTTAGACGAAGCATATGCCCTTATAAGGAAAGCTCCATTTCAACCAACAGCAAATATGTGGGCTGCATTGCTTAGAGCTTGTAGAGTTC
ATGGAAACCTAGAACTTGGTAATTCTGGTAAGTTAAAGGAAGCAGCTGATGTTGTTCAGACATTAAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATAGAA
GTTAACAACCAGCCGCATGCATTCCTATCTGGGGATAAACACCATGTCCAATTAGAAAAAGTAGTGGGAAAAGTGGATGAATTAATGTTGAAGATCTCAAAGCTTGGTTA
TGTACCTGAAGAACAGAACTTCATGCTTCCAGATGTTGATGAACACGAAGAAAAGATACGGATGTACCACAGTGAGAAATTAGCAATAGCTTATGGTCTTCTAAACACTT
TAGAACGAACGCCATTGCAGATTGTGCAAAGCCATCGCATTTGCAGTGACTGCCATTCTGTGATTAAACTGATTGCTATGATTACCAAACGTGAAATTGTGATCAGAGAT
GCTAGCCGATTTCATCATTTCAGAGATGGGAATTGTTCTTGTGGAGACTATTGGTGA
mRNA sequenceShow/hide mRNA sequence
AGGAAGTTGGAACTACCAACAAGAAAACCAGCTCTAGGGCGGAGTACTCGATGCTCTTCCTCGTTCTTCAATCCACATTTCTTCTTCTTCCTCTTCATCATTCTGTTTTT
TATTACAATTTCATCTCTTAATCCTTCCAATTCATGAATGTCAGTGTAGCCACCACCTTCCTAATCTCTCCAGGTAACTTCCTCCACTCCCCACCATTCTAGATCTCACA
AATCTATTTCAATTTATGAACATGGAACTCCCTCTCTCCCGCTATCAAAACTATGTTTATGATCGCCTTCAATGTTACTCCACTCCCTACTTCTCCCTGCGTTACTCAGA
TTCCCACCTTTTTATGAAAACTTCTTTTCTTTCTAATCGAAGAAAATGCCGTAATTCATTTTGTTGGGTTAAGTGTTCTTCGTTTGAACAAGGGCTACGCCCACGGCCTC
AGCCTAAACCTTCGAAACTTGATGTGGGTGTTCGTAAAGAAGCTCCTTTGAAGGAGACCCCTGTTAGGAAATCCAGTGTTGGGATCTGTAGTCAAATAGAGAAGTTGGTT
TTGTGTAAACAGTATCGAGATGCACTTGAGATGTTTGAAATTTTTGAACTTGAAGATGGTTTTCACGTTGGTAACAGTACGTACGATGCGTTGATTAATGCATGTATTGG
GTTGAAGTCTATAAGAGGAGTGAAGAGGTTGTTTAATTACATGGTTGATAATGGATTTGAACCTGATCAGTACATGAGGAACAGGGTTCTACTTATGCATGTGAAATGTG
GTATGATGATTGATGCTTGTAGACTGTTCGACGAAATGCCTGAGAGGAATGCGGTTTCGTGGAGTACTATAATTTCCGGGTATGTAGACTCTGGAAATTATGTTGAAGCG
TTTAGATTGTTCATTTTGATGTGGGAGGAGTCTTATGGTTGTGGGCCTCGTACCCTTGCTACAATGATACGGGCATCAGCTGGTTTGGAAATTATTTTTGTTGGTAGGCA
ATTGCATTCATGTGCAATAAAAGCAGGACTAGGACAGGACATTTTTGTTTCCTGTGCGTTGATTGATATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGCGTTT
TTGATGAGATGCCTGATAAGACAATAGTTGGATGGAATTCAATTATAGCTGGTTACGCACTCCATGGCTACAGTGAAGAAGCTCTGGATCTATACCATGAGATGTGTCGC
TCTGGTGTTAAGATGGACCATTTCACTTTTTCTATAATTATAAGAATATGCTCGAGATTGGCATCGGTAGCACGTGCTAAGCAAGCACATGCGAGTTTAGTTCGTAATGG
CTTTGGGTTAGATGTAGTAGCTAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGACATGTTTTTGACAGGATGTCTTGTAGAAACGTAA
TATCATGGAATGCCTTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAAGCCATTGATATGTTTGAGAAGATGCTTCGGGAAGGCATGATGCCAAACCATGTGACA
TTTCTTGCTGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCAATGACTAGAGATCACAAAGTTAGACCACGTGCTATGCATTT
TGCATGCATGATTGAATTGCTAGGTCGAGAAGGGCTGTTAGACGAAGCATATGCCCTTATAAGGAAAGCTCCATTTCAACCAACAGCAAATATGTGGGCTGCATTGCTTA
GAGCTTGTAGAGTTCATGGAAACCTAGAACTTGGTAATTCTGGTAAGTTAAAGGAAGCAGCTGATGTTGTTCAGACATTAAAAAGAAAGGGCTTAAGAATGCTTCCAGCA
TGCAGTTGGATAGAAGTTAACAACCAGCCGCATGCATTCCTATCTGGGGATAAACACCATGTCCAATTAGAAAAAGTAGTGGGAAAAGTGGATGAATTAATGTTGAAGAT
CTCAAAGCTTGGTTATGTACCTGAAGAACAGAACTTCATGCTTCCAGATGTTGATGAACACGAAGAAAAGATACGGATGTACCACAGTGAGAAATTAGCAATAGCTTATG
GTCTTCTAAACACTTTAGAACGAACGCCATTGCAGATTGTGCAAAGCCATCGCATTTGCAGTGACTGCCATTCTGTGATTAAACTGATTGCTATGATTACCAAACGTGAA
ATTGTGATCAGAGATGCTAGCCGATTTCATCATTTCAGAGATGGGAATTGTTCTTGTGGAGACTATTGGTGATGCTGGTTTTTGGCCTGTTCTAATTAACGATTTCGTCA
AGCACATGAACAGGTGGTCAGGATCTAATAAAGTCTCAAAGTTATGTGTAAGTGGAGACTCAGAATCACAATCAGGATTAAAATCTTTCCCTGGTTTGAAGAGGAAATTA
CCTGAGGACATGCAAATGGATGACATTGTGTCTCTGAGTGTTCCTAGAACTGGCGCCATGGAGTTGAGTCCTGTTACGAGCGACAAGAAAAGTAGATTTATGCTTGCAGG
CCAGTGAACTTGGAGATAAACTTACCATCATGTACTGCAGATGAGAACGTAACTATGGTTGGACATAATAGTACATTAGGTTCCAAATCTCCATGTGCTGTTGAAGGTTG
TCTGTCAAAAGGTTTTGCTGGAGTTTTCCCTGACACATTTGTTCATAGGATTGGACTGACCAAGAAAGGAAAGCGTCAATTCACATGAGCACAAGCGATTTATCCCTTGT
GATACGTTTACAGGTTGTATGATCTACAATGTGGTTATATCTTCATTTCTGCTGTGGTACAGCATTATTAATTAGACAAGTTGATGATTTGGCAGCAAACTAGAACAAAG
GATTGATTATGTTTCTATTGAGGGAAAGAGAAATTATTAGTCTGTTTCAATTCAAAGAACGAATAAGAGTGTACTGGTTTGTGCTTTCTTCAACCCTTCTTATCGTTCTA
TTTGGCTTTAAATGATATGGTTCAATCTGGAGATTGTGTTAAGCTGTCTTCATTTGTCTACTCTATCTTCATTTCAACTG
Protein sequenceShow/hide protein sequence
MNMELPLSRYQNYVYDRLQCYSTPYFSLRYSDSHLFMKTSFLSNRRKCRNSFCWVKCSSFEQGLRPRPQPKPSKLDVGVRKEAPLKETPVRKSSVGICSQIEKLVLCKQY
RDALEMFEIFELEDGFHVGNSTYDALINACIGLKSIRGVKRLFNYMVDNGFEPDQYMRNRVLLMHVKCGMMIDACRLFDEMPERNAVSWSTIISGYVDSGNYVEAFRLFI
LMWEESYGCGPRTLATMIRASAGLEIIFVGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYHEMCRSGVKM
DHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSCRNVISWNALIAGYGNHGRGEEAIDMFEKMLREGMMPNHVTFLAVL
SACSISGLFERGWEIFQSMTRDHKVRPRAMHFACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVHGNLELGNSGKLKEAADVVQTLKRKGLRMLPACSWIE
VNNQPHAFLSGDKHHVQLEKVVGKVDELMLKISKLGYVPEEQNFMLPDVDEHEEKIRMYHSEKLAIAYGLLNTLERTPLQIVQSHRICSDCHSVIKLIAMITKREIVIRD
ASRFHHFRDGNCSCGDYW