; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr004209 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr004209
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00002668:48187..50301
RNA-Seq ExpressionSgr004209
SyntenySgr004209
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.93Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP
        MAFQL H PST FTDH    NSLT   K +LCKSS R FKLNPIP  SK FLQITNVS QEYAP+ET+NPSP +D+ SK+PDGKS SSSK+ VWVNP SP
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE
        RASKLRKQSYEARYASL +ISESL+SCNPCE+DVADVLK   + ILEQDA+ VLNNMSNS TALL L+YFQ VL+SSKQA+ YNVTLKVFRK RD EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE

Query:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMP++DCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID  TFST+IKIHGVAGN
Subjt:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNS+L AMGRAKRPWQIKTIYKEM +NGFSPSW TYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG
        ADVGY+NEA+E+F+DMK+SGTCSPDSWTFSSMITIYSCSG VS AEEMLNEM+EAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG

Query:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERAN KLGFVVKLLLGE+D EGDF+TEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL++QIYTDLQSR
Subjt:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYS KGL+SVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVA
        ELVA
Subjt:  ELVA

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0088.62Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLC+SP T FT+HH L NSLT QRK +L  SS  FKL+PIP  SK FLQITNVSLQE+AP++TQN  P  D+ SKYPD KS SSS S VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL R+SESL+S NPCE DVADVLKV GNNILE+DA++VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDP TFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN +LDAMGRAKRPWQIKTIYKEMI+NGFSPSW TYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSC GKVS AEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPKGEL KLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0089.19Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSP T FT HH L NSLT QRK +L  SS  FKLNPIP  S  FLQITN+SLQE++P+ET N  P +D+ SKY D KS SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL RISESL+SCNPCE DVADVLKV GNNILEQDAV+VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDP TFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNS+LDAMGRAKRPWQIKTIYKEMI++GFSPSW TYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKNSGTCSPDSWTFSSMITIYSCSGKVS AEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK E+SKLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0091.61Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSPST F+DHHPL NSL SQ +I+L KSSHRFKLNP P  SKT L+ITNVSLQEYA +E QNP P +D+ SKYPDGKS+SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLR QSYEARYASLTRISESL+SCNPCEEDVADVLK  G+NILEQDAV VLNNMSNSSTALLALQ FQKVL+SSK+AILYNVTLKV RKSRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMP++DCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDP TFSTLIKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNS+LDAMGRAKRPWQIKTIYKEMI+NGFSPSW TYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIFEDMK+SG CSPDSWTFSSMITIYSCSGKVS AEEMLNEM+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF+RLVELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK ELSKLIDCVERANSKLG+VVKLLLGEQDKEGD +TEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIYT LQS S
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0091.61Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSPST FTDHH L NSLTSQRK +LC SS  FKLNPIP  SK FLQITNVSLQEYAP+ET NPSP  D+ SKYPDGKSSSSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASLTRISESL+SCNPC+EDVADVLK  G+NIL+QDAV+VLNNMSNS TALLAL+YFQ VL+SSKQAI YNVTLKVFRK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LF+EML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMP++DCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDP TFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNS+LDAMGRAKRPWQIKTIYKEMI+NGFSPSW TYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+ EAVE+F+DMK+SGTCSPDSWTFSSMITIYSCSGKVS AEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK ELSKLIDCV RAN KLGFVVKLL+GEQDKEGDF+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQ+Y DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0088.62Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLC+SP T FT+HH L NSLT QRK +L  SS  FKL+PIP  SK FLQITNVSLQE+AP++TQN  P  D+ SKYPD KS SSS S VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL R+SESL+S NPCE DVADVLKV GNNILE+DA++VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDP TFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN +LDAMGRAKRPWQIKTIYKEMI+NGFSPSW TYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSC GKVS AEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPKGEL KLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0089.19Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSP T FT HH L NSLT QRK +L  SS  FKLNPIP  S  FLQITN+SLQE++P+ET N  P +D+ SKY D KS SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL RISESL+SCNPCE DVADVLKV GNNILEQDAV+VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDP TFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNS+LDAMGRAKRPWQIKTIYKEMI++GFSPSW TYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKNSGTCSPDSWTFSSMITIYSCSGKVS AEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK E+SKLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0089.19Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSP T FT HH L NSLT QRK +L  SS  FKLNPIP  S  FLQITN+SLQE++P+ET N  P +D+ SKY D KS SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL RISESL+SCNPCE DVADVLKV GNNILEQDAV+VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDP TFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNS+LDAMGRAKRPWQIKTIYKEMI++GFSPSW TYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKNSGTCSPDSWTFSSMITIYSCSGKVS AEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK E+SKLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.61Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSPST F+DHHPL NSL SQ +I+L KSSHRFKLNP P  SKT L+ITNVSLQEYA +E QNP P +D+ SKYPDGKS+SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLR QSYEARYASLTRISESL+SCNPCEEDVADVLK  G+NILEQDAV VLNNMSNSSTALLALQ FQKVL+SSK+AILYNVTLKV RKSRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY
        LFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMP++DCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDP TFSTLIKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNS+LDAMGRAKRPWQIKTIYKEMI+NGFSPSW TYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIFEDMK+SG CSPDSWTFSSMITIYSCSGKVS AEEMLNEM+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF+RLVELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK ELSKLIDCVERANSKLG+VVKLLLGEQDKEGD +TEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIYT LQS S
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYS KGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0086.65Show/hide
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP
        MAFQL H PST FTDH    NSLT   K +LCKS  R FKLNPIP  SK FLQITNVS QEYAP+ET+NPSP +D+ SK+PDGKS SSSK+ VWVNP SP
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE
        RASKLRKQSYEARYASL +ISESL+SCNPCE DVADVLK   + ILEQDA+ VLNNMSNS TALL  +YFQ VL+SSKQA+ YNVTLKVFRK RD EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE

Query:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMP++DCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID  TFST+IKIHGVAGN
Subjt:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNS+L AMGRAKRPWQIKTIYKEM +NGFSPSW TYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG
        ADVGY+NEA+E+F+DMK+SGTCSPDSWTFSSMITIYSCSG VS AEEMLNEM+EAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG

Query:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERAN KLGFVVKLLLGE+D EGDF+TEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL++QIYTDLQSR
Subjt:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYS KGL+SVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVA
        ELVA
Subjt:  ELVA

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic2.2e-20854.29Show/hide
Query:  LCHSPSTLFTD--HHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDG--KSSSSSKSYVWVNPRSPR
        LC SPS+L     H P+  S                  NP   K+ +     +VS+QE  P + Q+PSPP D     P+G   SSSS+  ++WVNP SPR
Subjt:  LCHSPSTLFTD--HHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDG--KSSSSSKSYVWVNPRSPR

Query:  ASKL-RKQSYEARYASLTRISESLESCNPCEEDVADVLKVT-GNNILEQDAVIVLNN--MSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDME
        A+ + R ++   R A L   + +L +C   E  V   L+        EQDAVIVLN    + + TA+LAL++F    +  K+ ILYNV LK+ RK R   
Subjt:  ASKL-RKQSYEARYASLTRISESLESCNPCEEDVADVLKVT-GNNILEQDAVIVLNN--MSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDME

Query:  GAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGV
          E L+ EML+ GV+PDN TFST+ISCAR C L +KAVEWF+KMP + C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DPV  ST+IK+H  
Subjt:  GAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGV

Query:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL
        +GN+DG LNV+EEMKAIG++PNLV+YN+MLDAMGRA RPW +KTI++EM+     PS  TY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL
Subjt:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL

Query:  AMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPD
        +MCAD+GY++EA EIF DMK S      PDSW++SSM+T+YS +  V  AE +LNEMVEAGF PNIFVLTSLI+CYGK  R DDVVR+F  L +LG+ PD
Subjt:  AMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPD

Query:  DRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYT
        DRFCGCLL+V   TP  EL K+I C+ER+N +LG VVKLL+     E  F+  A EL       V+  YCNCL+DLCVNL+ ++KAC LLD    L IY 
Subjt:  DRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYT

Query:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWL
        ++Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G E LPPLLGI+TG GK+ YS +GLA++FE+HLKEL APFHEAP+K GWFLTT VAAK WL
Subjt:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWL

Query:  ESRGSPELVAV
        ES+ + ELV V
Subjt:  ESRGSPELVAV

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic1.7e-21157.39Show/hide
Query:  QNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL-RKQSYEARYASLTRISESLESCNPCEEDVADVLKVT-GNNILEQDAVIVLNNMS-NSSTAL
        Q+P PP  D +  P G+SS++S+ YVWVNP SPRA+ L R ++   R A L   + +L +C   E  VA  L+        EQDAVIVLN  S   +  +
Subjt:  QNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL-RKQSYEARYASLTRISESLESCNPCEEDVADVLKVT-GNNILEQDAVIVLNNMS-NSSTAL

Query:  LALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAG
        LAL +F +     K+ ILYNV LK  RK R    AE L++EML+ GV+PDN TFST+ISCAR C +P KAVEWFEKMP++ C+PD +TYSA+IDAYGRAG
Subjt:  LALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAG

Query:  NVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRA
        + + A  LYDRAR E W++DPV  +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN++LDAMGRA RPW +KTI++E++     P+  TY  LL A
Subjt:  NVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRA

Query:  YGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIF
        Y RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY+ EA EIF DMK S      PDSW++SSM+T+YSC+G V+GAE +LNEMVEAGF PNIF
Subjt:  YGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIF

Query:  VLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRK
        +LTSLI+CYGKA R DDVVR+F  L +LG+TPDDRFCGCLL V   TP  EL K+I C++R++++LG VV+LL+         +  A EL       VR 
Subjt:  VLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRK

Query:  AYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVF
         YCNCL+DL VNL  ++KAC LLD+ L L IY+++Q+R+ TQWSL+L+GLS+GAALT LHVW++DL   L++G+ELPPLLGI+TG GK+ YS KGLA+VF
Subjt:  AYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVF

Query:  ESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAV
        ESHLKEL APFHEAP+K GWFLTT VAA+ WLE++ S ELVAV
Subjt:  ESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAV

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic1.0e-26965.8Show/hide
Query:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL
        LC SPS+L  D  PL N L+   K +       +  N     S+  LQ T+VS+QE  P+  ++     D     P     ++SKSYVWVNP+SPRAS+L
Subjt:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE
        R++SY++RY+SL +++ESL++C P E DV DV+   G  + EQDAV+ LNNM+N  TA L L    + ++ S++ ILYNVT+KVFRKS+D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE

Query:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM ++ C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID VTFSTLI+I+GV+GNYDGCL
Subjt:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN ++D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV
        ++EA EIF+DMKN  TC PDSWTFSS+IT+Y+CSG+VS AE  L +M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV

Query:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ
        +TQTP  E+ KLI CVE+A  KLG VVK+L+ EQ+ +EG FK EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYS KGLA+VFESHLKEL+APFHEAP+KVGWFLTT VAAK+WLESR S
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic4.9e-13138.35Show/hide
Query:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA
        A  +C +P    T  H LF       K SL + S   KLN      K    +    +    P  ++   P      +    +  S  KS VWVNP  P+ 
Subjt:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA

Query:  SKLRKQSYEARYASLTRISESLESCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN
        S L          SL R   S  S NP  +D+ A  LK+  +   E+   + L        N  N+   L +L+ +QK       ++S      + I YN
Subjt:  SKLRKQSYEARYASLTRISESLESCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN

Query:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID
        VT+K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D
Subjt:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID

Query:  PVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEK
         + FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN++L+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K
Subjt:  PVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEK

Query:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF
           ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK   A E+  EM++AG   N+   T L+QC GKAKR+DDVV  F
Subjt:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF

Query:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE
        +  ++ G+ PDDR CGCLL+V+      E   K++ C+ERAN KL   V L++ E+ +    K E   + +    + R+ +CNCLID+C   +  ++A E
Subjt:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE

Query:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWF
        LL LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++LSAPF ++ ++ G F
Subjt:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWF

Query:  LTTKVAAKSWLESRGSP
        + TK    SWLES+  P
Subjt:  LTTKVAAKSWLESRGSP

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic1.1e-4826.1Show/hide
Query:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN
        ++F ++ R+  Q   I +N  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN

Query:  VDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAY
         D A +L+   R     +D V+++TL+ I+   G  +  L++  EM ++GIK ++V YN++L   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSGAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV + ++M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSGAEEMLNEMV

Query:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE
        E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E  S L++ +   ++K+ G V  LL+G+++  
Subjt:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE

Query:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL
         +   +A  LF  V+        A+ N L D+  +     +  EL+ L G + Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G EL
Subjt:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL

Query:  PPLLGINTGHGKH-KYSGKG-LASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        P +L I TG GKH K  G G L    E  L+ + APFH +   +G F ++     +WL    + +L+
Subjt:  PPLLGINTGHGKH-KYSGKG-LASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Arabidopsis top hitse value%identityAlignment
AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein9.2e-4022.29Show/hide
Query:  QRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNV
        Q G K D  T++T++          +  +  ++M    C P+ VTY+ +I +YGRA  +  A +++++ +      D VT+ TLI IH  AG  D  +++
Subjt:  QRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNV

Query:  YEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYIN
        Y+ M+  G+ P+   Y+ +++ +G+A        ++ EM+  G +P+  T+  ++  + +AR  E AL +Y++M+  G Q + + Y+ ++ +    G++ 
Subjt:  YEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYIN

Query:  EAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVIT
        EA  +F +M+      PD   +  ++ ++  +G V  A +    M++AG  PN+    SL+  + +  R+ +       ++ LGL P  +    LL+  T
Subjt:  EAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVIT

Query:  QTPKGELSKLIDCVERANSKLGFVVKLLL-----------------GEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLT
                       R+N  +GF  +L+                   +  K  D  +   +       + ++   + ++D      L ++A  + ++   
Subjt:  QTPKGELSKLIDCVERANSKLGFVVKLLL-----------------GEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLT

Query:  LQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKH-KYSGKGLA-SVFESHLKELSAPFHEAPEKVGWFLTTK
          +Y D L+ +S + W + L  +S G A+ AL   +    K +    + P  + I TG G+  + +G  +     E  L   + PF       G F+ + 
Subjt:  LQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKH-KYSGKGLA-SVFESHLKELSAPFHEAPEKVGWFLTTK

Query:  VAAKSWL
           K+WL
Subjt:  VAAKSWL

AT2G31400.1 genomes uncoupled 17.5e-5026.1Show/hide
Query:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN
        ++F ++ R+  Q   I +N  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN

Query:  VDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAY
         D A +L+   R     +D V+++TL+ I+   G  +  L++  EM ++GIK ++V YN++L   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSGAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV + ++M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSGAEEMLNEMV

Query:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE
        E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E  S L++ +   ++K+ G V  LL+G+++  
Subjt:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE

Query:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL
         +   +A  LF  V+        A+ N L D+  +     +  EL+ L G + Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G EL
Subjt:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL

Query:  PPLLGINTGHGKH-KYSGKG-LASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        P +L I TG GKH K  G G L    E  L+ + APFH +   +G F ++     +WL    + +L+
Subjt:  PPLLGINTGHGKH-KYSGKG-LASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein7.3e-27165.8Show/hide
Query:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL
        LC SPS+L  D  PL N L+   K +       +  N     S+  LQ T+VS+QE  P+  ++     D     P     ++SKSYVWVNP+SPRAS+L
Subjt:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE
        R++SY++RY+SL +++ESL++C P E DV DV+   G  + EQDAV+ LNNM+N  TA L L    + ++ S++ ILYNVT+KVFRKS+D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE

Query:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM ++ C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID VTFSTLI+I+GV+GNYDGCL
Subjt:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN ++D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV
        ++EA EIF+DMKN  TC PDSWTFSS+IT+Y+CSG+VS AE  L +M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV

Query:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ
        +TQTP  E+ KLI CVE+A  KLG VVK+L+ EQ+ +EG FK EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYS KGLA+VFESHLKEL+APFHEAP+KVGWFLTT VAAK+WLESR S
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-4128.49Show/hide
Query:  ALLALQYFQKVLRSSKQAILYN----VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMID
        AL A  +F K  +   Q++L N    + + +  K   +  A  +F+ + + G   D  +++++IS         +AV  F+KM    C P  +TY+ +++
Subjt:  ALLALQYFQKVLRSSKQAILYN----VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMID

Query:  AYGRAGNV-DMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTT
         +G+ G   +   SL ++ +++    D  T++TLI        +     V+EEMKA G   + V YN++LD  G++ RP +   +  EM+ NGFSPS  T
Subjt:  AYGRAGNV-DMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTT

Query:  YASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGF
        Y SL+ AY R    ++A+ +  +M EKG + +V  Y TLL+     G +  A+ IFE+M+N+G C P+  TF++ I +Y   GK +   ++ +E+   G 
Subjt:  YASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGF

Query:  DPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQ
         P+I    +L+  +G+     +V   F  +   G  P+      L++  ++
Subjt:  DPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQ

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein3.5e-13238.35Show/hide
Query:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA
        A  +C +P    T  H LF       K SL + S   KLN      K    +    +    P  ++   P      +    +  S  KS VWVNP  P+ 
Subjt:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA

Query:  SKLRKQSYEARYASLTRISESLESCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN
        S L          SL R   S  S NP  +D+ A  LK+  +   E+   + L        N  N+   L +L+ +QK       ++S      + I YN
Subjt:  SKLRKQSYEARYASLTRISESLESCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN

Query:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID
        VT+K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D
Subjt:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID

Query:  PVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEK
         + FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN++L+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K
Subjt:  PVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEK

Query:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF
           ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK   A E+  EM++AG   N+   T L+QC GKAKR+DDVV  F
Subjt:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF

Query:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE
        +  ++ G+ PDDR CGCLL+V+      E   K++ C+ERAN KL   V L++ E+ +    K E   + +    + R+ +CNCLID+C   +  ++A E
Subjt:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE

Query:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWF
        LL LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++LSAPF ++ ++ G F
Subjt:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASVFESHLKELSAPFHEAPEKVGWF

Query:  LTTKVAAKSWLESRGSP
        + TK    SWLES+  P
Subjt:  LTTKVAAKSWLESRGSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACCCTCTTCACCGACCACCATCCTCTCTTCAATTCTCTTACCTCTCAACGCAAAATAAGTCTCTGCAAGTCTTCTCACCG
TTTCAAGCTTAATCCTATACCACTGAAGTCAAAAACTTTTCTCCAGATAACCAATGTTTCGTTACAGGAATATGCTCCTGAAGAAACCCAGAATCCTAGCCCCCCTGAGG
ATGATAGGTCGAAATACCCAGATGGGAAATCTAGTTCCTCATCCAAAAGCTACGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCTAAACTTCGGAAGCAATCGTACGAA
GCCAGGTATGCTTCACTTACGAGAATATCAGAGTCTTTGGAGTCTTGTAATCCATGTGAGGAAGATGTTGCGGACGTCTTGAAGGTGACAGGCAATAACATTCTAGAACA
AGACGCTGTTATAGTGCTGAATAACATGTCGAATTCCAGTACTGCGTTGCTTGCTCTTCAGTACTTTCAGAAGGTATTGAGATCAAGTAAACAGGCAATTCTTTACAATG
TGACCCTGAAGGTGTTTAGGAAGTCTAGAGATATGGAGGGAGCAGAGAAACTGTTTGACGAAATGCTTCAGAGAGGAGTTAAACCTGATAATGTGACATTCTCTACAATA
ATTAGCTGTGCTAGGTTATGTTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAAAAGATGCCAAATTATGACTGTAATCCGGACGATGTCACTTACTCTGCAATGATAGA
TGCCTATGGACGTGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGACCGTGCAAGAACAGAAAACTGGCGTATTGATCCTGTGACATTCTCAACATTGATCAAAATTC
ATGGAGTGGCTGGAAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAAGCTATAGGCATCAAGCCAAACTTGGTTATATATAACAGCATGCTGGATGCTATGGGC
AGGGCTAAAAGACCATGGCAGATCAAGACCATTTACAAAGAGATGATTCAAAATGGGTTTTCACCGAGTTGGACAACTTATGCTTCTCTTTTACGTGCTTATGGGAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGATTGCAGTTAAATGTAATTCTCTACAATACACTTTTAGCCATGTGTGCTGATGTTGGCT
ACATTAATGAAGCTGTTGAAATTTTTGAAGATATGAAGAATTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATTACCATTTATTCCTGCAGTGGAAAA
GTATCAGGGGCGGAGGAAATGTTGAACGAGATGGTGGAAGCTGGTTTTGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTTGTGAGGACATTCAATCGACTGGTTGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACACCAAAAGGGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATTCAAAACTTGGTTTTGTGGTTAAGCTTTTGCTAGGGGAACAAGATAAGGAAGGAGATTTCAAAACTGAAGCCTCAGAA
CTATTTAGTGTTGTTAGTGCTGATGTGAGAAAAGCTTATTGCAATTGCCTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCCTGTGAACTCTTGGATTTGGG
GCTTACGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTCAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACATGTTTGGA
TAAATGACTTAACAAAGGTACTCGAATCTGGGGAGGAACTTCCACCGTTACTTGGAATAAATACTGGGCATGGAAAACACAAATATTCTGGCAAGGGTTTGGCAAGTGTC
TTTGAGTCTCATTTAAAGGAATTAAGTGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACCACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAGG
TTCACCTGAATTAGTTGCAGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACCCTCTTCACCGACCACCATCCTCTCTTCAATTCTCTTACCTCTCAACGCAAAATAAGTCTCTGCAAGTCTTCTCACCG
TTTCAAGCTTAATCCTATACCACTGAAGTCAAAAACTTTTCTCCAGATAACCAATGTTTCGTTACAGGAATATGCTCCTGAAGAAACCCAGAATCCTAGCCCCCCTGAGG
ATGATAGGTCGAAATACCCAGATGGGAAATCTAGTTCCTCATCCAAAAGCTACGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCTAAACTTCGGAAGCAATCGTACGAA
GCCAGGTATGCTTCACTTACGAGAATATCAGAGTCTTTGGAGTCTTGTAATCCATGTGAGGAAGATGTTGCGGACGTCTTGAAGGTGACAGGCAATAACATTCTAGAACA
AGACGCTGTTATAGTGCTGAATAACATGTCGAATTCCAGTACTGCGTTGCTTGCTCTTCAGTACTTTCAGAAGGTATTGAGATCAAGTAAACAGGCAATTCTTTACAATG
TGACCCTGAAGGTGTTTAGGAAGTCTAGAGATATGGAGGGAGCAGAGAAACTGTTTGACGAAATGCTTCAGAGAGGAGTTAAACCTGATAATGTGACATTCTCTACAATA
ATTAGCTGTGCTAGGTTATGTTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAAAAGATGCCAAATTATGACTGTAATCCGGACGATGTCACTTACTCTGCAATGATAGA
TGCCTATGGACGTGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGACCGTGCAAGAACAGAAAACTGGCGTATTGATCCTGTGACATTCTCAACATTGATCAAAATTC
ATGGAGTGGCTGGAAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAAGCTATAGGCATCAAGCCAAACTTGGTTATATATAACAGCATGCTGGATGCTATGGGC
AGGGCTAAAAGACCATGGCAGATCAAGACCATTTACAAAGAGATGATTCAAAATGGGTTTTCACCGAGTTGGACAACTTATGCTTCTCTTTTACGTGCTTATGGGAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGATTGCAGTTAAATGTAATTCTCTACAATACACTTTTAGCCATGTGTGCTGATGTTGGCT
ACATTAATGAAGCTGTTGAAATTTTTGAAGATATGAAGAATTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATTACCATTTATTCCTGCAGTGGAAAA
GTATCAGGGGCGGAGGAAATGTTGAACGAGATGGTGGAAGCTGGTTTTGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTTGTGAGGACATTCAATCGACTGGTTGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACACCAAAAGGGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATTCAAAACTTGGTTTTGTGGTTAAGCTTTTGCTAGGGGAACAAGATAAGGAAGGAGATTTCAAAACTGAAGCCTCAGAA
CTATTTAGTGTTGTTAGTGCTGATGTGAGAAAAGCTTATTGCAATTGCCTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCCTGTGAACTCTTGGATTTGGG
GCTTACGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTCAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACATGTTTGGA
TAAATGACTTAACAAAGGTACTCGAATCTGGGGAGGAACTTCCACCGTTACTTGGAATAAATACTGGGCATGGAAAACACAAATATTCTGGCAAGGGTTTGGCAAGTGTC
TTTGAGTCTCATTTAAAGGAATTAAGTGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACCACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAGG
TTCACCTGAATTAGTTGCAGTATAG
Protein sequenceShow/hide protein sequence
MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKLRKQSYE
ARYASLTRISESLESCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTI
ISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPVTFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSMLDAMG
RAKRPWQIKTIYKEMIQNGFSPSWTTYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGK
VSGAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASE
LFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSGKGLASV
FESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAV