CuGenDBv2

Sgr020881 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020881
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153574:883459..885573
RNA-Seq Expression: Sgr020881
Synteny: Sgr020881
Gene Ontology terms:
GO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology
GenBank top hits (e value, % identity, alignment)
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] (e value: 0.0e+00; % identity: 87.36)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP
        MAFQL H PST FTDH    NSLT   K +LCKSS R FKLNPIP  SK FLQITNVS QEYAP+ET+NPSP +D+ SK+PDGKS SSSK+ VWVNP SP
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE
        RASKLRKQSYEARYASL +ISESLDSCNPCE+DVADVLK   + ILEQDA+ VLNNMSNS TALL L+YFQ VL+SSKQA+ YNVTLKVFRK RD EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE

Query:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMP++DCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIK NL IYNS+L AMGRAKRPWQIKTIYKEM +NGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG
        ADVGY+NEA+E+F+DMK+SGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG

Query:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERAN KLGFVVKLLLGE+D EGDF+TEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL++QIYTDLQSR
Subjt:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVA
        ELVA
Subjt:  ELVA

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus] (e value: 0.0e+00; % identity: 89.19)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLC+SP T FT+HH L NSLT QRK +L  SS  FKL+PIP  SK FLQITNVSLQE+AP++TQN  P  D+ SKYPD KS SSS S VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL R+SESLDS NPCE DVADVLKV GNNILE+DA++VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYN +LDAMGRAKRPWQIKTIYKEMI+NGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPKGEL KLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo] (e value: 0.0e+00; % identity: 89.76)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSP T FT HH L NSLT QRK +L  SS  FKLNPIP  S  FLQITN+SLQE++P+ET N  P +D+ SKY D KS SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKV GNNILEQDAV+VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYNS+LDAMGRAKRPWQIKTIYKEMI++GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK E+SKLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia] (e value: 0.0e+00; % identity: 92.18)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSPST F+DHHPL NSL SQ +I+L KSSHRFKLNP P  SKT L+ITNVSLQEYA +E QNP P +D+ SKYPDGKS+SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLR QSYEARYASLTRISESLDSCNPCEEDVADVLK  G+NILEQDAV VLNNMSNSSTALLALQ FQKVL+SSK+AILYNVTLKV RKSRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMP++DCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYNS+LDAMGRAKRPWQIKTIYKEMI+NGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIFEDMK+SG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF+RLVELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK ELSKLIDCVERANSKLG+VVKLLLGEQDKEGD +TEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIYT LQS S
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida] (e value: 0.0e+00; % identity: 92.18)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSPST FTDHH L NSLTSQRK +LC SS  FKLNPIP  SK FLQITNVSLQEYAP+ET NPSP  D+ SKYPDGKSSSSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASLTRISESLDSCNPC+EDVADVLK  G+NIL+QDAV+VLNNMSNS TALLAL+YFQ VL+SSKQAI YNVTLKVFRK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMP++DCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYNS+LDAMGRAKRPWQIKTIYKEMI+NGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+ EAVE+F+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK ELSKLIDCV RAN KLGFVVKLL+GEQDKEGDF+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQ+Y DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LVP1 Smr domain-containing protein (e value: 0.0e+00; % identity: 89.19)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLC+SP T FT+HH L NSLT QRK +L  SS  FKL+PIP  SK FLQITNVSLQE+AP++TQN  P  D+ SKYPD KS SSS S VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL R+SESLDS NPCE DVADVLKV GNNILE+DA++VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYN +LDAMGRAKRPWQIKTIYKEMI+NGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPKGEL KLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic (e value: 0.0e+00; % identity: 89.76)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSP T FT HH L NSLT QRK +L  SS  FKLNPIP  S  FLQITN+SLQE++P+ET N  P +D+ SKY D KS SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKV GNNILEQDAV+VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYNS+LDAMGRAKRPWQIKTIYKEMI++GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK E+SKLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A5A7TLM5 Pentatricopeptide repeat-containing protein (e value: 0.0e+00; % identity: 89.76)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSP T FT HH L NSLT QRK +L  SS  FKLNPIP  S  FLQITN+SLQE++P+ET N  P +D+ SKY D KS SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKV GNNILEQDAV+VLNNMSNS TALLAL+YFQ +L+SSKQ I YNVTLKVFRK RDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMP++DCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYNS+LDAMGRAKRPWQIKTIYKEMI++GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK E+SKLIDCV RAN KLGFVV+LLLGEQDKEG+F+TEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic (e value: 0.0e+00; % identity: 92.18)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR
        MAFQLCHSPST F+DHHPL NSL SQ +I+L KSSHRFKLNP P  SKT L+ITNVSLQEYA +E QNP P +D+ SKYPDGKS+SSSKS VWVNPRSPR
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK
        ASKLR QSYEARYASLTRISESLDSCNPCEEDVADVLK  G+NILEQDAV VLNNMSNSSTALLALQ FQKVL+SSK+AILYNVTLKV RKSRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEK

Query:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMP++DCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIK NLVIYNS+LDAMGRAKRPWQIKTIYKEMI+NGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC
        DVGY+NEAVEIFEDMK+SG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF+RLVELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGC

Query:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS
        LLNVITQTPK ELSKLIDCVERANSKLG+VVKLLLGEQDKEGD +TEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIYT LQS S
Subjt:  LLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVA
        LVA
Subjt:  LVA

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic (e value: 0.0e+00; % identity: 87.07)
Query:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP
        MAFQL H PST FTDH    NSLT   K +LCKS  R FKLNPIP  SK FLQITNVS QEYAP+ET+NPSP +D+ SK+PDGKS SSSK+ VWVNP SP
Subjt:  MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHR-FKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE
        RASKLRKQSYEARYASL +ISESLDSCNPCE DVADVLK   + ILEQDA+ VLNNMSNS TALL  +YFQ VL+SSKQA+ YNVTLKVFRK RD EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAE

Query:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMP++DCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIK NL IYNS+L AMGRAKRPWQIKTIYKEM +NGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG
        ADVGY+NEA+E+F+DMK+SGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCG

Query:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERAN KLGFVVKLLLGE+D EGDF+TEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL++QIYTDLQSR
Subjt:  CLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKEL+APFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVA
        ELVA
Subjt:  ELVA

SwissProt top hits (e value, % identity, alignment)
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic (e value: 3.8e-208; % identity: 54.29)
Query:  LCHSPSTLFTD--HHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDG--KSSSSSKSYVWVNPRSPR
        LC SPS+L     H P+  S                  NP   K+ +     +VS+QE  P + Q+PSPP D     P+G   SSSS+  ++WVNP SPR
Subjt:  LCHSPSTLFTD--HHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDG--KSSSSSKSYVWVNPRSPR

Query:  ASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVT-GNNILEQDAVIVLNN--MSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDME
        A+ + R ++   R A L   + +L +C   E  V   L+        EQDAVIVLN    + + TA+LAL++F    +  K+ ILYNV LK+ RK R   
Subjt:  ASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVT-GNNILEQDAVIVLNN--MSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDME

Query:  GAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGV
          E L+ EML+ GV+PDN TFST+ISCAR C L +KAVEWF+KMP + C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  
Subjt:  GAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGV

Query:  AGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL
        +GN+DG LNV+EEMKAIG++ NLV+YN+MLDAMGRA RPW +KTI++EM+     PS ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL
Subjt:  AGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL

Query:  AMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPD
        +MCAD+GY++EA EIF DMK S      PDSW++SSM+T+YS +  V  AE +LNEMVEAGF PNIFVLTSLI+CYGK  R DDVVR+F  L +LG+ PD
Subjt:  AMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPD

Query:  DRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYT
        DRFCGCLL+V   TP  EL K+I C+ER+N +LG VVKLL+     E  F+  A EL       V+  YCNCL+DLCVNL+ ++KAC LLD    L IY 
Subjt:  DRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYT

Query:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWL
        ++Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G E LPPLLGI+TG GK+ YSD+GLA++FE+HLKEL APFHEAP+K GWFLTT VAAK WL
Subjt:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWL

Query:  ESRGSPELVAV
        ES+ + ELV V
Subjt:  ESRGSPELVAV

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic (e value: 3.5e-209; % identity: 57.08)
Query:  QNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVT-GNNILEQDAVIVLNNMS-NSSTAL
        Q+P PP  D +  P G+SS++S+ YVWVNP SPRA+ L R ++   R A L   + +L +C   E  VA  L+        EQDAVIVLN  S   +  +
Subjt:  QNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVT-GNNILEQDAVIVLNNMS-NSSTAL

Query:  LALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAG
        LAL +F +     K+ ILYNV LK  RK R    AE L++EML+ GV+PDN TFST+ISCAR C +P KAVEWFEKMP++ C+PD +TYSA+IDAYGRAG
Subjt:  LALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAG

Query:  NVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRA
        + + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+K NLV+YN++LDAMGRA RPW +KTI++E++     P+ ATY  LL A
Subjt:  NVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRA

Query:  YGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIF
        Y RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY+ EA EIF DMK S      PDSW++SSM+T+YSC+G V+ AE +LNEMVEAGF PNIF
Subjt:  YGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIF

Query:  VLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRK
        +LTSLI+CYGKA R DDVVR+F  L +LG+TPDDRFCGCLL V   TP  EL K+I C++R++++LG VV+LL+         +  A EL       VR 
Subjt:  VLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRK

Query:  AYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVF
         YCNCL+DL VNL  ++KAC LLD+ L L IY+++Q+R+ TQWSL+L+GLS+GAALT LHVW++DL   L++G+ELPPLLGI+TG GK+ YS KGLA+VF
Subjt:  AYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVF

Query:  ESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAV
        ESHLKEL APFHEAP+K GWFLTT VAA+ WLE++ S ELVAV
Subjt:  ESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAV

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic (e value: 3.5e-270; % identity: 65.95)
Query:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL
        LC SPS+L  D  PL N L+   K +       +  N     S+  LQ T+VS+QE  P+  ++     D     P     ++SKSYVWVNP+SPRAS+L
Subjt:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAV+ LNNM+N  TA L L    + ++ S++ ILYNVT+KVFRKS+D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE

Query:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM ++ C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+K NLVIYN ++D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV
        ++EA EIF+DMKN  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV

Query:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ
        +TQTP  E+ KLI CVE+A  KLG VVK+L+ EQ+ +EG FK EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKEL+APFHEAP+KVGWFLTT VAAK+WLESR S
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic (e value: 1.6e-129; % identity: 38.21)
Query:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA
        A  +C +P    T  H LF       K SL + S   KLN      K    +    +    P  ++   P      +    +  S  KS VWVNP  P+ 
Subjt:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA

Query:  SKLRKQSYEARYASLTRISESLDSCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN
        S L          SL R   S  S NP  +D+ A  LK+  +   E+   + L        N  N+   L +L+ +QK       ++S      + I YN
Subjt:  SKLRKQSYEARYASLTRISESLDSCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN

Query:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID
        VT+K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D
Subjt:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID

Query:  PATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEK
           FS L K+ G AG+YDG   V +EMK++ +K N+V+YN++L+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K
Subjt:  PATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEK

Query:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF
           ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+QC GKAKR+DDVV  F
Subjt:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF

Query:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE
        +  ++ G+ PDDR CGCLL+V+      E   K++ C+ERAN KL   V L++ E+ +    K E   + +    + R+ +CNCLID+C   +  ++A E
Subjt:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE

Query:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWF
        LL LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++LSAPF ++ ++ G F
Subjt:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWF

Query:  LTTKVAAKSWLESRGSP
        + TK    SWLES+  P
Subjt:  LTTKVAAKSWLESRGSP

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic (e value: 1.8e-48; % identity: 25.57)
Query:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN
        ++F ++ R+  Q   I +N  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN

Query:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAY
         D A +L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK+++V YN++L   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV + ++M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE
        E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E  S L++ +   ++K+ G V  LL+G+++  
Subjt:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE

Query:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL
         +   +A  LF  V+        A+ N L D+  +     +  EL+ L G + Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G EL
Subjt:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL

Query:  PPLLGINTGHGKHK--YSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        P +L I TG GKH     D  L    E  L+ + APFH +   +G F ++     +WL    + +L+
Subjt:  PPLLGINTGHGKHK--YSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Arabidopsis top hits (e value, % identity, alignment)
AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.6e-38; % identity: 21.7)
Query:  QRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNV
        Q G K D  T++T++          +  +  ++M    C P+ VTY+ +I +YGRA  +  A +++++ +      D  T+ TLI IH  AG  D  +++
Subjt:  QRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNV

Query:  YEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYIN
        Y+ M+  G+  +   Y+ +++ +G+A        ++ EM+  G +P+  T+  ++  + +AR  E AL +Y++M+  G Q + + Y+ ++ +    G++ 
Subjt:  YEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYIN

Query:  EAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVIT
        EA  +F +M+      PD   +  ++ ++  +G V +A +    M++AG  PN+    SL+  + +  R+ +       ++ LGL P  +    LL+  T
Subjt:  EAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVIT

Query:  QTPKGELSKLIDCVERANSKLGFVVKLLL-----------------GEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLT
                       R+N  +GF  +L+                   +  K  D  +   +       + ++   + ++D      L ++A  + ++   
Subjt:  QTPKGELSKLIDCVERANSKLGFVVKLLL-----------------GEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLT

Query:  LQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTK
          +Y D L+ +S + W + L  +S G A+ AL   +    K +    + P  + I TG G+         +    E  L   + PF       G F+ + 
Subjt:  LQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTK

Query:  VAAKSWL
           K+WL
Subjt:  VAAKSWL

AT2G31400.1 genomes uncoupled 1 (e value: 1.3e-49; % identity: 25.57)
Query:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN
        ++F ++ R+  Q   I +N  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  QYFQKVLRSSKQ--AILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGN

Query:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAY
         D A +L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK+++V YN++L   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV + ++M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE
        E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E  S L++ +   ++K+ G V  LL+G+++  
Subjt:  EAGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKL-GFVVKLLLGEQDKE

Query:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL
         +   +A  LF  V+        A+ N L D+  +     +  EL+ L G + Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G EL
Subjt:  GDFKTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEEL

Query:  PPLLGINTGHGKHK--YSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        P +L I TG GKH     D  L    E  L+ + APFH +   +G F ++     +WL    + +L+
Subjt:  PPLLGINTGHGKHK--YSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein (e value: 2.5e-271; % identity: 65.95)
Query:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL
        LC SPS+L  D  PL N L+   K +       +  N     S+  LQ T+VS+QE  P+  ++     D     P     ++SKSYVWVNP+SPRAS+L
Subjt:  LCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAV+ LNNM+N  TA L L    + ++ S++ ILYNVT+KVFRKS+D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDE

Query:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM ++ C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+K NLVIYN ++D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV
        ++EA EIF+DMKN  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNV

Query:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ
        +TQTP  E+ KLI CVE+A  KLG VVK+L+ EQ+ +EG FK EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQD-KEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKEL+APFHEAP+KVGWFLTT VAAK+WLESR S
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGS

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 5.8e-42; % identity: 28.77)
Query:  ALLALQYFQKVLRSSKQAILYN----VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMID
        AL A  +F K  +   Q++L N    + + +  K   +  A  +F+ + + G   D  +++++IS         +AV  F+KM    C P  +TY+ +++
Subjt:  ALLALQYFQKVLRSSKQAILYN----VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMID

Query:  AYGRAGNV-DMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWAT
         +G+ G   +   SL ++ +++    D  T++TLI        +     V+EEMKA G   + V YN++LD  G++ RP +   +  EM+ NGFSPS  T
Subjt:  AYGRAGNV-DMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWAT

Query:  YASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGF
        Y SL+ AY R    ++A+ +  +M EKG + +V  Y TLL+     G +  A+ IFE+M+N+G C P+  TF++ I +Y   GK +E  ++ +E+   G 
Subjt:  YASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGF

Query:  DPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQ
         P+I    +L+  +G+     +V   F  +   G  P+      L++  ++
Subjt:  DPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQ

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein (e value: 1.1e-130; % identity: 38.21)
Query:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA
        A  +C +P    T  H LF       K SL + S   KLN      K    +    +    P  ++   P      +    +  S  KS VWVNP  P+ 
Subjt:  AFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRA

Query:  SKLRKQSYEARYASLTRISESLDSCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN
        S L          SL R   S  S NP  +D+ A  LK+  +   E+   + L        N  N+   L +L+ +QK       ++S      + I YN
Subjt:  SKLRKQSYEARYASLTRISESLDSCNPCEEDV-ADVLKVTGNNILEQDAVIVL-------NNMSNSSTALLALQYFQKV------LRSSK----QAILYN

Query:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID
        VT+K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D
Subjt:  VTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRID

Query:  PATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEK
           FS L K+ G AG+YDG   V +EMK++ +K N+V+YN++L+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K
Subjt:  PATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMGRAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEK

Query:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF
           ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+QC GKAKR+DDVV  F
Subjt:  GLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF

Query:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE
        +  ++ G+ PDDR CGCLL+V+      E   K++ C+ERAN KL   V L++ E+ +    K E   + +    + R+ +CNCLID+C   +  ++A E
Subjt:  NRLVELGLTPDDRFCGCLLNVITQTPKGE-LSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACE

Query:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWF
        LL LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++LSAPF ++ ++ G F
Subjt:  LLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELSAPFHEAPEKVGWF

Query:  LTTKVAAKSWLESRGSP
        + TK    SWLES+  P
Subjt:  LTTKVAAKSWLESRGSP


Sequences
CDS sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACCCTCTTCACCGACCACCATCCTCTCTTCAATTCTCTTACCTCTCAACGCAAAATAAGTCTCTGCAAGTCTTCTCACCG
TTTCAAGCTTAATCCTATACCACTGAAGTCAAAAACTTTTCTCCAGATAACCAATGTTTCGTTACAGGAATATGCTCCTGAAGAAACCCAGAATCCTAGCCCCCCTGAGG
ATGATAGGTCGAAATACCCAGATGGGAAATCTAGTTCCTCATCCAAAAGCTACGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCTAAACTTCGGAAGCAATCGTACGAA
GCCAGGTATGCTTCACTTACGAGAATATCAGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCGGACGTCTTGAAGGTGACAGGCAATAACATTCTAGAACA
AGACGCTGTTATAGTGCTGAATAACATGTCAAATTCCAGTACTGCGTTGCTTGCTCTTCAGTACTTTCAGAAGGTATTGAGATCAAGTAAACAGGCAATTCTTTACAATG
TGACCCTGAAGGTGTTTAGGAAGTCTAGAGATATGGAGGGAGCAGAGAAACTGTTTGACGAAATGCTTCAGAGAGGAGTTAAACCTGATAATGTGACATTCTCTACAATA
ATTAGCTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAAAAGATGCCAAATTATGACTGTAATCCGGACGATGTCACTTACTCTGCAATGATAGA
TGCCTATGGACGTGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGACCGTGCAAGAACAGAAAACTGGCGTATTGATCCTGCGACATTCTCAACATTGATCAAAATTC
ATGGAGTGGCTGGAAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAAGCTATAGGCATCAAGCGAAACTTGGTTATATATAACAGCATGCTGGATGCTATGGGC
AGGGCTAAAAGACCATGGCAGATCAAGACCATTTACAAAGAGATGATTCAAAATGGGTTTTCACCGAGTTGGGCAACTTATGCTTCTCTTTTACGTGCTTATGGGAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGATTGCAGTTAAATGTAATTCTCTACAATACACTTTTAGCCATGTGTGCTGATGTTGGCT
ACATTAATGAAGCTGTTGAAATTTTTGAAGATATGAAGAATTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATTACCATTTATTCCTGCAGTGGAAAA
GTATCAGAGGCGGAGGAAATGTTGAACGAGATGGTGGAAGCTGGTTTTGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTTGTGAGGACATTCAATCGACTGGTTGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACACCAAAAGGGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATTCAAAACTTGGTTTTGTGGTTAAGCTTTTGCTAGGGGAACAAGATAAGGAAGGAGATTTCAAAACTGAAGCCTCAGAA
CTATTTAGTGTTGTTAGTGCTGATGTGAGAAAAGCTTATTGCAATTGCCTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCCTGTGAACTCTTGGATTTGGG
GCTTACGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTCAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACATGTTTGGA
TAAATGACTTAACAAAGGTACTCGAATCTGGGGAGGAACTTCCACCGTTACTTGGAATAAATACTGGGCATGGAAAACACAAATATTCTGACAAGGGTTTGGCAAGTGTC
TTTGAGTCTCATTTAAAGGAATTAAGTGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACCACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAGG
TTCACCTGAATTAGTTGCAGTATAG
mRNA sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACCCTCTTCACCGACCACCATCCTCTCTTCAATTCTCTTACCTCTCAACGCAAAATAAGTCTCTGCAAGTCTTCTCACCG
TTTCAAGCTTAATCCTATACCACTGAAGTCAAAAACTTTTCTCCAGATAACCAATGTTTCGTTACAGGAATATGCTCCTGAAGAAACCCAGAATCCTAGCCCCCCTGAGG
ATGATAGGTCGAAATACCCAGATGGGAAATCTAGTTCCTCATCCAAAAGCTACGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCTAAACTTCGGAAGCAATCGTACGAA
GCCAGGTATGCTTCACTTACGAGAATATCAGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCGGACGTCTTGAAGGTGACAGGCAATAACATTCTAGAACA
AGACGCTGTTATAGTGCTGAATAACATGTCAAATTCCAGTACTGCGTTGCTTGCTCTTCAGTACTTTCAGAAGGTATTGAGATCAAGTAAACAGGCAATTCTTTACAATG
TGACCCTGAAGGTGTTTAGGAAGTCTAGAGATATGGAGGGAGCAGAGAAACTGTTTGACGAAATGCTTCAGAGAGGAGTTAAACCTGATAATGTGACATTCTCTACAATA
ATTAGCTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAAAAGATGCCAAATTATGACTGTAATCCGGACGATGTCACTTACTCTGCAATGATAGA
TGCCTATGGACGTGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGACCGTGCAAGAACAGAAAACTGGCGTATTGATCCTGCGACATTCTCAACATTGATCAAAATTC
ATGGAGTGGCTGGAAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAAGCTATAGGCATCAAGCGAAACTTGGTTATATATAACAGCATGCTGGATGCTATGGGC
AGGGCTAAAAGACCATGGCAGATCAAGACCATTTACAAAGAGATGATTCAAAATGGGTTTTCACCGAGTTGGGCAACTTATGCTTCTCTTTTACGTGCTTATGGGAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGATTGCAGTTAAATGTAATTCTCTACAATACACTTTTAGCCATGTGTGCTGATGTTGGCT
ACATTAATGAAGCTGTTGAAATTTTTGAAGATATGAAGAATTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATTACCATTTATTCCTGCAGTGGAAAA
GTATCAGAGGCGGAGGAAATGTTGAACGAGATGGTGGAAGCTGGTTTTGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTTGTGAGGACATTCAATCGACTGGTTGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACACCAAAAGGGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATTCAAAACTTGGTTTTGTGGTTAAGCTTTTGCTAGGGGAACAAGATAAGGAAGGAGATTTCAAAACTGAAGCCTCAGAA
CTATTTAGTGTTGTTAGTGCTGATGTGAGAAAAGCTTATTGCAATTGCCTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCCTGTGAACTCTTGGATTTGGG
GCTTACGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTCAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACATGTTTGGA
TAAATGACTTAACAAAGGTACTCGAATCTGGGGAGGAACTTCCACCGTTACTTGGAATAAATACTGGGCATGGAAAACACAAATATTCTGACAAGGGTTTGGCAAGTGTC
TTTGAGTCTCATTTAAAGGAATTAAGTGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACCACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAGG
TTCACCTGAATTAGTTGCAGTATAG
Protein sequence
MAFQLCHSPSTLFTDHHPLFNSLTSQRKISLCKSSHRFKLNPIPLKSKTFLQITNVSLQEYAPEETQNPSPPEDDRSKYPDGKSSSSSKSYVWVNPRSPRASKLRKQSYE
ARYASLTRISESLDSCNPCEEDVADVLKVTGNNILEQDAVIVLNNMSNSSTALLALQYFQKVLRSSKQAILYNVTLKVFRKSRDMEGAEKLFDEMLQRGVKPDNVTFSTI
ISCARLCSLPNKAVEWFEKMPNYDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKRNLVIYNSMLDAMG
RAKRPWQIKTIYKEMIQNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKNSGTCSPDSWTFSSMITIYSCSGK
VSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLVELGLTPDDRFCGCLLNVITQTPKGELSKLIDCVERANSKLGFVVKLLLGEQDKEGDFKTEASE
LFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV
FESHLKELSAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAV