; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002892 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002892
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold359:499329..501440
RNA-Seq ExpressionMS002892
SyntenyMS002892
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0086.67Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHR-FKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSP
        MAFQL H PSTFF+DH    NSL   ++ TL KSS R FKLNP P+HSK  L+ITNVS QEYA QE +NP P+ DE SK+PDGKS SSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHR-FKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSP

Query:  RASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAE
        RASKLR QSYEARYASL +ISESLDSCNPCE+DVADVLK + S ILEQDA+ VLNNMSNS TALL L+ FQ VLKSSK+A+ YNVTLKV RK RD EGAE
Subjt:  RASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAE

Query:  KLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCG
        ADVGYVNEA+E+F+DMKSSG CSPDSWTFSSMITIYSCSG VSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGK KRVDDVVRTFDRL+ELGLTPDDRFCG
Subjt:  ADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQST
        CLLNVITQTPK ELSKLIDCVERAN KLG+VVKLLLGE+D EGD RTEASEL SVVS DVRKAYCNCLIDLCVNLDLL+KACELLDLGL++QIYT LQS 
Subjt:  CLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQST

Query:  SPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSL+LKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0087.64Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLC+SP TFF++H  LSNSL  Q + TL  SS  FKL+P P HSK  L+ITNVSLQE+A Q+ QN IP+ DE SKYPD KS SSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLR QSYEARYASL R+SESLDS NPCE DVADVLK +G+NILE+DA+ VLNNMSNS TALLAL+ FQ +LKSSK+ I YNVTLKV RK RDMEGAEK
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFST+ISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYVNEAVEIF+DMKSSG CSPDSWTFSSMITIYSC GKVSEAEEMLN+M+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPK EL KLIDCV RAN KLG+VV+LLLGEQDKEG+ RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0088.21Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLCHSP TFF+ H  LSNSL  Q + TL  SS  FKLNP P HS   L+ITN+SLQE++ QE  N IP+ DE SKY D KS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLR QSYEARYASL RISESLDSCNPCE DVADVLK +G+NILEQDAV VLNNMSNS TALLAL+ FQ +LKSSK+ I YNVTLKV RK RDMEGAE+
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFST+ISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYVNEAVEIF+DMK+SG CSPDSWTFSSMITIYSCSGKVSEAEEMLN+M+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPKEE+SKLIDCV RAN KLG+VV+LLLGEQDKEG+ RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELL+LGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0099.86Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFSDH PLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0091.05Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DH  LSNSL SQ + TL  SS  FKLNP P HSK  L+ITNVSLQEYA QE  NP P+ DE SKYPDGKS+SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLR QSYEARYASLTRISESLDSCNPC+EDVADVLK +GSNIL+QDAV VLNNMSNS TALLAL+ FQ VLKSSK+AI YNVTLKV RK RDMEGAEK
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EMLKRGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYV EAVE+F+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF RL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPKEELSKLIDCV RAN KLG+VVKLL+GEQDKEGD RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQ+Y  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0087.64Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLC+SP TFF++H  LSNSL  Q + TL  SS  FKL+P P HSK  L+ITNVSLQE+A Q+ QN IP+ DE SKYPD KS SSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLR QSYEARYASL R+SESLDS NPCE DVADVLK +G+NILE+DA+ VLNNMSNS TALLAL+ FQ +LKSSK+ I YNVTLKV RK RDMEGAEK
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFST+ISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYVNEAVEIF+DMKSSG CSPDSWTFSSMITIYSC GKVSEAEEMLN+M+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPK EL KLIDCV RAN KLG+VV+LLLGEQDKEG+ RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0088.21Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLCHSP TFF+ H  LSNSL  Q + TL  SS  FKLNP P HS   L+ITN+SLQE++ QE  N IP+ DE SKY D KS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLR QSYEARYASL RISESLDSCNPCE DVADVLK +G+NILEQDAV VLNNMSNS TALLAL+ FQ +LKSSK+ I YNVTLKV RK RDMEGAE+
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFST+ISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYVNEAVEIF+DMK+SG CSPDSWTFSSMITIYSCSGKVSEAEEMLN+M+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPKEE+SKLIDCV RAN KLG+VV+LLLGEQDKEG+ RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELL+LGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0088.21Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLCHSP TFF+ H  LSNSL  Q + TL  SS  FKLNP P HS   L+ITN+SLQE++ QE  N IP+ DE SKY D KS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLR QSYEARYASL RISESLDSCNPCE DVADVLK +G+NILEQDAV VLNNMSNS TALLAL+ FQ +LKSSK+ I YNVTLKV RK RDMEGAE+
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFST+ISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYVNEAVEIF+DMK+SG CSPDSWTFSSMITIYSCSGKVSEAEEMLN+M+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPKEE+SKLIDCV RAN KLG+VV+LLLGEQDKEG+ RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELL+LGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0099.86Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFSDH PLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPR

Query:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
        ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK
Subjt:  ASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
        DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC
Subjt:  DVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
        LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS
Subjt:  LLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTS

Query:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0086.38Show/hide
Query:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHR-FKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSP
        MAFQL H PSTFF+DH    NSL   ++ TL KS  R FKLNP P+HSK  L+ITNVS QEYA QE +NP P+ DE SK+PDGKS SSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHR-FKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSP

Query:  RASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAE
        RASKLR QSYEARYASL +ISESLDSCNPCE DVADVLK + S ILEQDA+ VLNNMSNS TALL  + FQ VLKSSK+A+ YNVTLKV RK RD EGAE
Subjt:  RASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAE

Query:  KLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCG
        ADVGYVNEA+E+F+DMKSSG CSPDSWTFSSMITIYSCSG VSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGK KRVDDVVRTFDRL+ELGLTPDDRFCG
Subjt:  ADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQST
        CLLNVITQTPK ELSKLIDCVERAN KLG+VVKLLLGE+D EGD RTEASEL SVVS DVRKAYCNCLIDLCVNLDLL+KACELLDLGL++QIYT LQS 
Subjt:  CLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQST

Query:  SPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSL+LKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic3.2e-20755.29Show/hide
Query:  LCHSPSTFFSD--HLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSS--VWVNPRSPR
        LC SPS+      H P+S S N ++             +P   H         VS+QE   Q  Q+P P  D +   P+G   SSS ++  +WVNP SPR
Subjt:  LCHSPSTFFSD--HLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSS--VWVNPRSPR

Query:  ASKL-RNQSYEARYASLTRISESLDSCNPCEEDVADVLK-GMGSNILEQDAVAVLNN--MSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDME
        A+ + R ++   R A L   + +L +C   E  V   L+        EQDAV VLN    + + TA+LAL+ F    K  KK ILYNV LK+LRK R   
Subjt:  ASKL-RNQSYEARYASLTRISESLDSCNPCEEDVADVLK-GMGSNILEQDAVAVLNN--MSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDME

Query:  GAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGV
          E L+ EML+ GV+PDN TFSTVISCAR C L +KAVEWF+KMP F C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  
Subjt:  GAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGV

Query:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL
        +GN+DG LNV+EEMKAIG++PNLV+YN++LDAMGRA RPW +KTI++EM+     PS ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL
Subjt:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL

Query:  AMCADVGYVNEAVEIFEDMKSS-GACS-PDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPD
        +MCAD+GYV+EA EIF DMK+S GA S PDSW++SSM+T+YS +  V  AE +LNEM+EAGF PNIFVLTSLI+CYGK  R DDVVR+F  L +LG+ PD
Subjt:  AMCADVGYVNEAVEIFEDMKSS-GACS-PDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPD

Query:  DRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYT
        DRFCGCLL+V   TP EEL K+I C+ER+N +LG VVKLL+     E   R  A ELL      V+  YCNCL+DLCVNL+ +EKAC LLD    L IY 
Subjt:  DRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYT

Query:  GLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
         +Q+ + TQWSLHL+GLS+GAALT LHVW+NDL   L++G E LPPLLGI+TG GK+ YSD+GLA++FE+HLKEL+APFHEAP+K GWFLTT VAAK WL
Subjt:  GLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

Query:  ESRGSPELV
        ES+ + ELV
Subjt:  ESRGSPELV

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic2.9e-20857.03Show/hide
Query:  AAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKL-RNQSYEARYASLTRISESLDSCNPCEEDVADVLK-GMGSNILEQDAVAVLNNMS-N
        A    Q+P P   +++  P   SN+S    VWVNP SPRA+ L R ++   R A L   + +L +C   E  VA  L+        EQDAV VLN  S  
Subjt:  AAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKL-RNQSYEARYASLTRISESLDSCNPCEEDVADVLK-GMGSNILEQDAVAVLNNMS-N

Query:  SSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDA
         +  +LAL  F +  +  K+ ILYNV LK LRK R    AE L++EML+ GV+PDN TFSTVISCAR C +P KAVEWFEKMP F C+PD +TYSA+IDA
Subjt:  SSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDA

Query:  YGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYA
        YGRAG+ + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN++LDAMGRA RPW +KTI++E++     P+ ATY 
Subjt:  YGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYA

Query:  SLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSS--GACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGF
         LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GYV EA EIF DMK+S      PDSW++SSM+T+YSC+G V+ AE +LNEM+EAGF
Subjt:  SLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSS--GACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGF

Query:  DPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVS
         PNIF+LTSLI+CYGK  R DDVVR+F  L +LG+TPDDRFCGCLL V   TP +EL K+I C++R++++LG VV+LL+        LR  A ELL    
Subjt:  DPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVS

Query:  ADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG
          VR  YCNCL+DL VNL  +EKAC LLD+ L L IY+ +Q+ + TQWSLHL+GLS+GAALT LHVW++DL   L++G+ELPPLLGI+TG GK+ YS KG
Subjt:  ADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG

Query:  LASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVA
        LA+VFESHLKEL+APFHEAP+K GWFLTT VAA+ WLE++ S ELVA
Subjt:  LASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVA

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic1.8e-27466.52Show/hide
Query:  LCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKL
        LC SPS+   D LPL N L+   + T R     +  N +  HS+  L+ T+VS+QE   Q  ++ +   D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKL

Query:  RNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDE
        R +SY++RY+SL +++ESLD+C P E DV DV+ G G  + EQDAV  LNNM+N  TA L L    + +K S++ ILYNVT+KV RKS+D+E +EKLFDE
Subjt:  RNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDE

Query:  MLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+T+ISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  VNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNV
        V+EA EIF+DMK+   C PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGK K+VDDVVRTFD+++ELG+TPDDRFCGCLLNV
Subjt:  VNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQD-KEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQ
        +TQTP EE+ KLI CVE+A  KLG VVK+L+ EQ+ +EG  + EASEL+  + +DV+KAY NCLIDLCVNL+ LE+ACE+L LGL   IYTGLQS S TQ
Subjt:  ITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQD-KEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQ

Query:  WSLHLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        WSLHLK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLHLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Query:  AA
        +A
Subjt:  AA

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic1.5e-13240.82Show/hide
Query:  SSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQ--------DAVAVLNNMSNSSTALLALQMFQKV------
        S   SVWVNP  P+ S L          SL R   S  S NP  +D+      + S+I  +        D +    N  N+   L +L+ +QK       
Subjt:  SSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQ--------DAVAVLNNMSNSSTALLALQMFQKV------

Query:  LKSSK----KAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMA
        +KS      + I YNVT+K LR  R  +  E++  EM+K GV+ DN+T+ST+I+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+  
Subjt:  LKSSK----KAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMA

Query:  FSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRAR
         SLY+RA    W+ D   FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR
Subjt:  FSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRAR

Query:  YGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQ
        +  DAL +++EMK K   ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+Q
Subjt:  YGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQ

Query:  CYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCL
        C GK KR+DDVV  FD  ++ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +   ++ E   +++    + R+ +CNCL
Subjt:  CYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCL

Query:  IDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKE
        ID+C   +  E+A ELL LG    +Y GL + +  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++
Subjt:  IDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKE

Query:  LNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        L+APF ++ ++ G F+ TK    SWLES+  P
Subjt:  LNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic2.4e-4825.32Show/hide
Query:  ILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTEN
        I +N  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG  D A +L+   R   
Subjt:  ILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTEN

Query:  WRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKE
          +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y +    ++A+ +++E
Subjt:  WRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKE

Query:  MKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTS-----LIQCYGK--
         K  GL+ +V+LY+ L+      G V  AV + ++M   G  SP+  T++S+I  +  S  +  + +  N          +  LT      +IQ +G+  
Subjt:  MKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTS-----LIQCYGK--

Query:  --------------GKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANSKL-GYVVKLLLGEQDKEGDLRTEASELLSVVS
                       + +  ++  F ++ +L + P+      +LN  ++    E+ S L++ +   ++K+ G V  LL+G+++          + ++ + 
Subjt:  --------------GKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANSKL-GYVVKLLLGEQDKEGDLRTEASELLSVVS

Query:  ADVRKAYCNCLIDLCVNLDLLEKACELLDL-GLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YS
             A+ N L D+  +    ++  EL+ L G + Q++  + S S     L L  +S GAA   +H W+ ++  ++  G ELP +L I TG GKH     
Subjt:  ADVRKAYCNCLIDLCVNLDLLEKACELLDL-GLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YS

Query:  DKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        D  L    E  L+ ++APFH +   +G F ++     +WL    + +L+
Subjt:  DKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein6.0e-3923.98Show/hide
Query:  PTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSC------NPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALL
        P  D +   P G SNSS +       + P  + L ++    +Y +   I E++ S        P  E   + L+ +G  I    A  VL  M++   AL 
Subjt:  PTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSC------NPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALL

Query:  ALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
             ++          Y   +  L +++      KL DEM++ G +P+ VT++ +I      +  N+A+  F +M    C PD VTY  +ID + +AG 
Subjt:  ALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
        +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+       +Y++M   GF P   TY+ ++   
Subjt:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLT
        G   Y E+A  V+ EM++K    +  +Y  L+ +    G V +A + ++ M  +G   P+  T +S+++ +    K++EA E+L  M+  G  P++   T
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLT

Query:  SLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYC
         L+ C   G+   D+                 FCG L+                     +    +++K+     D E ++R  A+  L ++ ++ R++  
Subjt:  SLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYC

Query:  NCLIDLCVNLDLLEKACELLDLGLTLQIYT-------GLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDK
          L+D  V  D L K+ +  + G   ++          L+  S + W ++L  +S G A+TAL   +    K + +    P  + I TG G+        
Subjt:  NCLIDLCVNLDLLEKACELLDLGLTLQIYT-------GLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDK

Query:  GLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
         +    E  L    +PF       G F+ +      WL
Subjt:  GLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein6.0e-3923.98Show/hide
Query:  PTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSC------NPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALL
        P  D +   P G SNSS +       + P  + L ++    +Y +   I E++ S        P  E   + L+ +G  I    A  VL  M++   AL 
Subjt:  PTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSC------NPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALL

Query:  ALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
             ++          Y   +  L +++      KL DEM++ G +P+ VT++ +I      +  N+A+  F +M    C PD VTY  +ID + +AG 
Subjt:  ALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
        +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+       +Y++M   GF P   TY+ ++   
Subjt:  VDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLT
        G   Y E+A  V+ EM++K    +  +Y  L+ +    G V +A + ++ M  +G   P+  T +S+++ +    K++EA E+L  M+  G  P++   T
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLT

Query:  SLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYC
         L+ C   G+   D+                 FCG L+                     +    +++K+     D E ++R  A+  L ++ ++ R++  
Subjt:  SLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYC

Query:  NCLIDLCVNLDLLEKACELLDLGLTLQIYT-------GLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDK
          L+D  V  D L K+ +  + G   ++          L+  S + W ++L  +S G A+TAL   +    K + +    P  + I TG G+        
Subjt:  NCLIDLCVNLDLLEKACELLDLGLTLQIYT-------GLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDK

Query:  GLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
         +    E  L    +PF       G F+ +      WL
Subjt:  GLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

AT2G31400.1 genomes uncoupled 11.7e-4925.32Show/hide
Query:  ILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTEN
        I +N  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG  D A +L+   R   
Subjt:  ILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTEN

Query:  WRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKE
          +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y +    ++A+ +++E
Subjt:  WRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKE

Query:  MKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTS-----LIQCYGK--
         K  GL+ +V+LY+ L+      G V  AV + ++M   G  SP+  T++S+I  +  S  +  + +  N          +  LT      +IQ +G+  
Subjt:  MKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTS-----LIQCYGK--

Query:  --------------GKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANSKL-GYVVKLLLGEQDKEGDLRTEASELLSVVS
                       + +  ++  F ++ +L + P+      +LN  ++    E+ S L++ +   ++K+ G V  LL+G+++          + ++ + 
Subjt:  --------------GKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANSKL-GYVVKLLLGEQDKEGDLRTEASELLSVVS

Query:  ADVRKAYCNCLIDLCVNLDLLEKACELLDL-GLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YS
             A+ N L D+  +    ++  EL+ L G + Q++  + S S     L L  +S GAA   +H W+ ++  ++  G ELP +L I TG GKH     
Subjt:  ADVRKAYCNCLIDLCVNLDLLEKACELLDL-GLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YS

Query:  DKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        D  L    E  L+ ++APFH +   +G F ++     +WL    + +L+
Subjt:  DKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein1.3e-27566.52Show/hide
Query:  LCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKL
        LC SPS+   D LPL N L+   + T R     +  N +  HS+  L+ T+VS+QE   Q  ++ +   D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKL

Query:  RNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDE
        R +SY++RY+SL +++ESLD+C P E DV DV+ G G  + EQDAV  LNNM+N  TA L L    + +K S++ ILYNVT+KV RKS+D+E +EKLFDE
Subjt:  RNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDE

Query:  MLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+T+ISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  VNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNV
        V+EA EIF+DMK+   C PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGK K+VDDVVRTFD+++ELG+TPDDRFCGCLLNV
Subjt:  VNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQD-KEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQ
        +TQTP EE+ KLI CVE+A  KLG VVK+L+ EQ+ +EG  + EASEL+  + +DV+KAY NCLIDLCVNL+ LE+ACE+L LGL   IYTGLQS S TQ
Subjt:  ITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQD-KEGDLRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQ

Query:  WSLHLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        WSLHLK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLHLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Query:  AA
        +A
Subjt:  AA

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein1.1e-13340.82Show/hide
Query:  SSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQ--------DAVAVLNNMSNSSTALLALQMFQKV------
        S   SVWVNP  P+ S L          SL R   S  S NP  +D+      + S+I  +        D +    N  N+   L +L+ +QK       
Subjt:  SSKSSVWVNPRSPRASKLRNQSYEARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQ--------DAVAVLNNMSNSSTALLALQMFQKV------

Query:  LKSSK----KAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMA
        +KS      + I YNVT+K LR  R  +  E++  EM+K GV+ DN+T+ST+I+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+  
Subjt:  LKSSK----KAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTVISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMA

Query:  FSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRAR
         SLY+RA    W+ D   FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR
Subjt:  FSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRAR

Query:  YGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQ
        +  DAL +++EMK K   ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+Q
Subjt:  YGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGKVSEAEEMLNEMMEAGFDPNIFVLTSLIQ

Query:  CYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCL
        C GK KR+DDVV  FD  ++ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +   ++ E   +++    + R+ +CNCL
Subjt:  CYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASELLSVVSADVRKAYCNCL

Query:  IDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKE
        ID+C   +  E+A ELL LG    +Y GL + +  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++
Subjt:  IDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKE

Query:  LNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        L+APF ++ ++ G F+ TK    SWLES+  P
Subjt:  LNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCAGCTCTGCCACTCACCGTCCACCTTCTTCTCCGACCACCTTCCTCTCTCCAATTCTTTAAACTCTCAACACAGAATAACTCTACGCAAGTCTTCTCACCG
TTTCAAGCTCAATCCCACACCTCACCACTCAAAAACATGTCTCCGGATAACCAATGTCTCCTTACAGGAATACGCTGCTCAAGAAGCCCAAAATCCAATTCCCACTCAGG
ATGAAAGCTCGAAATACCCGGATGGGAAATCTAATTCCTCGTCCAAAAGCTCCGTCTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAAACTCCGCAACCAATCTTACGAA
GCCAGGTATGCTTCTCTTACGAGAATTTCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTGTTGAAGGGGATGGGTAGTAACATTTTAGAACA
GGACGCTGTTGCGGTGCTGAATAACATGTCGAATTCCAGTACTGCGTTGCTTGCTCTTCAGATGTTTCAGAAGGTGTTGAAATCAAGTAAAAAGGCGATTCTTTACAATG
TAACGCTGAAGGTGCTTAGGAAGTCGAGAGATATGGAAGGTGCAGAGAAACTGTTCGACGAAATGCTTAAGAGAGGAGTTAAGCCTGATAACGTGACATTTTCTACAGTA
ATTAGTTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAAAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATTGA
TGCCTACGGGCGGGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGATCGTGCAAGAACTGAAAACTGGCGTATTGATCCTGCGACATTCTCGACATTGATCAAAATTC
ATGGAGTGGCTGGGAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCTAACTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATTTACAAAGAGATGATTAAAAACGGGTTTTCACCAAGCTGGGCAACTTATGCTTCGCTTTTACGTGCCTATGGGAGGGC
CAGATATGGTGAGGATGCCCTCCTTGTGTACAAGGAGATGAAGGAAAAAGGATTGCAGTTAAATGTAATTCTCTACAACACACTTTTAGCTATGTGTGCTGATGTTGGCT
ACGTTAATGAAGCTGTTGAAATTTTTGAAGACATGAAGAGCTCTGGGGCATGCTCACCTGACAGTTGGACTTTTTCTTCCATGATTACCATATATTCCTGCAGTGGAAAA
GTATCAGAGGCGGAGGAAATGTTGAACGAGATGATGGAAGCCGGTTTTGACCCTAATATCTTTGTCTTGACTTCACTAATCCAGTGTTATGGGAAAGGCAAACGCGTTGA
TGATGTTGTGAGAACATTCGATCGACTGGTAGAGCTGGGTTTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATCACCCAAACACCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAACTCGAAACTCGGTTATGTGGTTAAGCTTTTGCTAGGGGAACAAGACAAGGAAGGAGACCTCAGAACTGAAGCCTCAGAA
CTACTTAGTGTTGTTAGTGCTGATGTGAGAAAAGCCTATTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTAGAGAAGGCATGCGAGCTCCTGGATTTGGG
GCTTACGCTTCAGATATACACAGGTTTGCAGTCCACGTCTCCAACTCAGTGGTCTCTACATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGA
TAAATGACTTAACAAAGGTGCTTGAATCTGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAGTATTCTGATAAAGGTTTGGCGAGTGTC
TTTGAGTCACATTTGAAGGAATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACAACTAAAGTGGCTGCAAAATCATGGTTGGAATCTAGAGG
TTCTCCTGAATTAGTTGCAGCA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCCAGCTCTGCCACTCACCGTCCACCTTCTTCTCCGACCACCTTCCTCTCTCCAATTCTTTAAACTCTCAACACAGAATAACTCTACGCAAGTCTTCTCACCG
TTTCAAGCTCAATCCCACACCTCACCACTCAAAAACATGTCTCCGGATAACCAATGTCTCCTTACAGGAATACGCTGCTCAAGAAGCCCAAAATCCAATTCCCACTCAGG
ATGAAAGCTCGAAATACCCGGATGGGAAATCTAATTCCTCGTCCAAAAGCTCCGTCTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAAACTCCGCAACCAATCTTACGAA
GCCAGGTATGCTTCTCTTACGAGAATTTCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTGTTGAAGGGGATGGGTAGTAACATTTTAGAACA
GGACGCTGTTGCGGTGCTGAATAACATGTCGAATTCCAGTACTGCGTTGCTTGCTCTTCAGATGTTTCAGAAGGTGTTGAAATCAAGTAAAAAGGCGATTCTTTACAATG
TAACGCTGAAGGTGCTTAGGAAGTCGAGAGATATGGAAGGTGCAGAGAAACTGTTCGACGAAATGCTTAAGAGAGGAGTTAAGCCTGATAACGTGACATTTTCTACAGTA
ATTAGTTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAAAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATTGA
TGCCTACGGGCGGGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGATCGTGCAAGAACTGAAAACTGGCGTATTGATCCTGCGACATTCTCGACATTGATCAAAATTC
ATGGAGTGGCTGGGAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCTAACTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATTTACAAAGAGATGATTAAAAACGGGTTTTCACCAAGCTGGGCAACTTATGCTTCGCTTTTACGTGCCTATGGGAGGGC
CAGATATGGTGAGGATGCCCTCCTTGTGTACAAGGAGATGAAGGAAAAAGGATTGCAGTTAAATGTAATTCTCTACAACACACTTTTAGCTATGTGTGCTGATGTTGGCT
ACGTTAATGAAGCTGTTGAAATTTTTGAAGACATGAAGAGCTCTGGGGCATGCTCACCTGACAGTTGGACTTTTTCTTCCATGATTACCATATATTCCTGCAGTGGAAAA
GTATCAGAGGCGGAGGAAATGTTGAACGAGATGATGGAAGCCGGTTTTGACCCTAATATCTTTGTCTTGACTTCACTAATCCAGTGTTATGGGAAAGGCAAACGCGTTGA
TGATGTTGTGAGAACATTCGATCGACTGGTAGAGCTGGGTTTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATCACCCAAACACCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAACTCGAAACTCGGTTATGTGGTTAAGCTTTTGCTAGGGGAACAAGACAAGGAAGGAGACCTCAGAACTGAAGCCTCAGAA
CTACTTAGTGTTGTTAGTGCTGATGTGAGAAAAGCCTATTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTAGAGAAGGCATGCGAGCTCCTGGATTTGGG
GCTTACGCTTCAGATATACACAGGTTTGCAGTCCACGTCTCCAACTCAGTGGTCTCTACATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGA
TAAATGACTTAACAAAGGTGCTTGAATCTGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAGTATTCTGATAAAGGTTTGGCGAGTGTC
TTTGAGTCACATTTGAAGGAATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACAACTAAAGTGGCTGCAAAATCATGGTTGGAATCTAGAGG
TTCTCCTGAATTAGTTGCAGCA
Protein sequenceShow/hide protein sequence
MAFQLCHSPSTFFSDHLPLSNSLNSQHRITLRKSSHRFKLNPTPHHSKTCLRITNVSLQEYAAQEAQNPIPTQDESSKYPDGKSNSSSKSSVWVNPRSPRASKLRNQSYE
ARYASLTRISESLDSCNPCEEDVADVLKGMGSNILEQDAVAVLNNMSNSSTALLALQMFQKVLKSSKKAILYNVTLKVLRKSRDMEGAEKLFDEMLKRGVKPDNVTFSTV
ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMG
RAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEIFEDMKSSGACSPDSWTFSSMITIYSCSGK
VSEAEEMLNEMMEAGFDPNIFVLTSLIQCYGKGKRVDDVVRTFDRLVELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANSKLGYVVKLLLGEQDKEGDLRTEASE
LLSVVSADVRKAYCNCLIDLCVNLDLLEKACELLDLGLTLQIYTGLQSTSPTQWSLHLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV
FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAA