; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G11200 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G11200
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr01:16401395..16403860
RNA-Seq ExpressionClc01G11200
SyntenyClc01G11200
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.93Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTL-YNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NSLT   KTTL  +SSR+FKLNPIP HSKPFLQITNVS QEYAPQET NP+PSDDEISK+PDGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTL-YNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP

Query:  RASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAE
        RASKLRKQSYEARY SL +ISESLDSCNPCE+DVADVLK I S ILEQDA+ VLNNMSNSQTAL+ LRYFQDVLKSSKQ +FYNVTLKVFRKCRD EGAE
Subjt:  RASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAE

Query:  KLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN
        KLF+EMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID +TFSTMIKIHGVAGN
Subjt:  KLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCG
        ADVGYVNEA+EVF+DMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGKAK VDDVVRTF+RLLELGLTPDDRFCG
Subjt:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLGFVV+LLLGE++ EGDFRTEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELL+LGL++QIY DLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKG++SVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR SP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0092.9Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH LSNSLT QRKTTL NSS LFKL+PIPRHSKPFLQITNVSLQE+APQ+T N  PS DEISKYPD KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARY SL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTAL+ALRYFQD+LKSSKQTIFYNVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAK VDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLGFVVELLLGEQ+KEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR+SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0094.32Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HHSLSNSLT QRKTTL NSS LFKLNPIPRHS PFLQITN+SLQE++PQETHN  PSDDEISKY D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARY SL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTAL+ALRYFQD+LKSSKQTIFYNVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAK VDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLGFVVELLLGEQ+KEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR+SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0090.62Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH LSNSL SQ + TL  SS  FKLNP P HSK  L+ITNVSLQEYA QE  NP P+ DE SKYPDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLR QSYEARY SLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TAL+AL+ FQ VLKSSK+ I YNVTLKV RK RDMEGAEK
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LF+EML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYVNEAVE+F+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGK K VDDVVRTF+RL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLG+VV+LLLGEQ+KEGD RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELL+LGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0096.31Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTL NSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNP+PS+DEISKYPDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARY SLTRISESLDSCNPC+EDVADVLK IGSNIL+QDAVVVLNNMSNSQTAL+ALRYFQDVLKSSKQ IFYNVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYV EAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAK VDDVVRTF+RL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPKEELSKLIDCV RANPKLGFVV+LL+GEQ+KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQ+YKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR+SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0092.9Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH LSNSLT QRKTTL NSS LFKL+PIPRHSKPFLQITNVSLQE+APQ+T N  PS DEISKYPD KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARY SL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTAL+ALRYFQD+LKSSKQTIFYNVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAK VDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLGFVVELLLGEQ+KEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR+SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0094.32Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HHSLSNSLT QRKTTL NSS LFKLNPIPRHS PFLQITN+SLQE++PQETHN  PSDDEISKY D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARY SL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTAL+ALRYFQD+LKSSKQTIFYNVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAK VDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLGFVVELLLGEQ+KEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR+SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0094.32Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HHSLSNSLT QRKTTL NSS LFKLNPIPRHS PFLQITN+SLQE++PQETHN  PSDDEISKY D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARY SL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTAL+ALRYFQD+LKSSKQTIFYNVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAK VDDVVRTFN+L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLGFVVELLLGEQ+KEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR+SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0090.62Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH LSNSL SQ + TL  SS  FKLNP P HSK  L+ITNVSLQEYA QE  NP P+ DE SKYPDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK
        ASKLR QSYEARY SLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TAL+AL+ FQ VLKSSK+ I YNVTLKV RK RDMEGAEK
Subjt:  ASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LF+EML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC
        DVGYVNEAVE+F+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGK K VDDVVRTF+RL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLG+VV+LLLGEQ+KEGD RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELL+LGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKG+ASVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0089.79Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNS-SRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NSLT   KTTL  S SR+FKLNPIP HSKPFLQITNVS QEYAPQET NP+PSDDEISK+PDGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNS-SRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP

Query:  RASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAE
        RASKLRKQSYEARY SL +ISESLDSCNPCE DVADVLK I S ILEQDA+ VLNNMSNSQTAL+  RYFQDVLKSSKQ +FYNVTLKVFRKCRD EGAE
Subjt:  RASKLRKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAE

Query:  KLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN
        KLF+EMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVDMAFSLYDRARTENWRID +TFSTMIKIHGVAGN
Subjt:  KLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCG
        ADVGYVNEA+EVF+DMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGKAK VDDVVRTF+RLLELGLTPDDRFCG
Subjt:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLGFVV+LLLGE++ EGDFRTEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELL+LGL++QIY DLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKG++SVFESHLKEL APFHEAP+KVGWFLTTKVAAKSWLESR SP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic2.7e-20653.46Show/hide
Query:  LCHSPSTFFTD--HHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSS--VWVNPRSPR
        LC SPS+      H  +S S   +  +           +P+  H         VS+QE  PQ      PSD      P+G   SSS ++  +WVNP SPR
Subjt:  LCHSPSTFFTD--HHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSS--VWVNPRSPR

Query:  ASKL-RKQSYEARYTSLTRISESLDSCNPCEEDVADVLK-VIGSNILEQDAVVVLNN--MSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDME
        A+ + R ++   R   L   + +L +C   E  V   L+        EQDAV+VLN    + ++TA++ALR+F    K  K+ I YNV LK+ RK R   
Subjt:  ASKL-RKQSYEARYTSLTRISESLDSCNPCEEDVADVLK-VIGSNILEQDAVVVLNN--MSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDME

Query:  GAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGV
          E L+ EML  GV+PDN TFST+ISCAR C L +KAVEWF+KMP F C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  
Subjt:  GAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGV

Query:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL
        +GN+DG LNV+EEMKAIG++PNLV+YN++LDAMGRA RPW +KTI++EM+     PS ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL
Subjt:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLL

Query:  AMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPD
        +MCAD+GYV+EA E+F+DMK+S      PDSW++SSM+T+YS +  V  AE +LNEMVEAGF PNIFVLTSLI+CYGK    DDVVR+F  L +LG+ PD
Subjt:  AMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPD

Query:  DRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYK
        DRFCGCLL+V   TP EEL K+I C+ER+N +LG VV+LL+     E  FR  A EL       V+  YCNCL+DLCVNL+ ++KAC LL+    L IY 
Subjt:  DRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYK

Query:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWL
        ++Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G E LPPLLGI+TG GK+ YSD+G+A++FE+HLKEL APFHEAPDK GWFLTT VAAK WL
Subjt:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWL

Query:  ESRASPELV
        ES+A+ ELV
Subjt:  ESRASPELV

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic1.6e-20654.6Show/hide
Query:  RLFKLNPIPRHSKPFLQITNVSLQEYAPQETH-NPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL-RKQSYEARYTSLTRISESLDSCNPCEEDV
        R   L+  P++  P      VS+Q+  P  +  NP+P          G+S ++S+  VWVNP SPRA+ L R ++   R   L   + +L +C   E  V
Subjt:  RLFKLNPIPRHSKPFLQITNVSLQEYAPQETH-NPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL-RKQSYEARYTSLTRISESLDSCNPCEEDV

Query:  ADVLK-VIGSNILEQDAVVVLNNMSNSQTALI-ALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPN
        A  L+        EQDAV+VLN  S    A++ AL +F    +  K+ I YNV LK  RK R    AE L+EEML  GV+PDN TFST+ISCAR C +P 
Subjt:  ADVLK-VIGSNILEQDAVVVLNNMSNSQTALI-ALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPN

Query:  KAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGR
        KAVEWFEKMP F C+PD +TYSA+IDAYGRAG+ + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN++LDAMGR
Subjt:  KAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGR

Query:  AKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFS
        A RPW +KTI++E++     P+ ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GYV EA E+F+DMK+S      PDSW++S
Subjt:  AKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFS

Query:  SMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGF
        SM+T+YSC+G V+ AE +LNEMVEAGF PNIF+LTSLI+CYGKA   DDVVR+F  L +LG+TPDDRFCGCLL V   TP +EL K+I C++R++ +LG 
Subjt:  SMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGF

Query:  VVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTK
        VV LL+         R  A EL       VR  YCNCL+DL VNL  ++KAC LL++ L L IY ++Q+R+ TQWSL+L+GLS+GAALT LHVW++DL  
Subjt:  VVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTK

Query:  VLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELVA
         L++G+ELPPLLGI+TG GK+ YS KG+A+VFESHLKEL APFHEAPDK GWFLTT VAA+ WLE++ S ELVA
Subjt:  VLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELVA

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic1.4e-27165.95Show/hide
Query:  LCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL
        LC SPS+   D   L N L+   K+T  +    +  N    HS+  LQ T+VS+QE  PQ   +     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL

Query:  RKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA + L    + +K S++ I YNVT+KVFRK +D+E +EKLF+E
Subjt:  RKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEE

Query:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL
        MLERG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID  TFST+I+I+GV+GNYDGCL
Subjt:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNV
        V+EA E+FQDMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGKAK VDDVVRTF+++LELG+TPDDRFCGCLLNV
Subjt:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGFVVELLLGEQE-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ  +EG F+ EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L+LGL   IY  LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGFVVELLLGEQE-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKG+A+VFESHLKEL APFHEAPDKVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV

Query:  AA
        +A
Subjt:  AA

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic2.2e-13137.62Show/hide
Query:  AFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLN--------PIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVW
        A  +C +P    T  HSL        K +L+  SR  KLN        P     +P    T    ++  P           +I   P          SVW
Subjt:  AFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLN--------PIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVW

Query:  VNPRSPRASKLRKQ-------SYEARYTSLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVT
        VNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       +TIFYNVT
Subjt:  VNPRSPRASKLRKQ-------SYEARYTSLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVT

Query:  LKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPA
        +K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D  
Subjt:  LKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPA

Query:  TFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL
         FS + K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K  
Subjt:  TFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL

Query:  QLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNR
         ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+QC GKAK +DDVV  F+ 
Subjt:  QLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNR

Query:  LLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL
         ++ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   + +    + R+ +CNCLID+C   +  ++A ELL
Subjt:  LLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL

Query:  ELGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLT
         LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +G+A+ F  HL++L APF ++ D+ G F+ 
Subjt:  ELGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLT

Query:  TKVAAKSWLESRASP
        TK    SWLES+  P
Subjt:  TKVAAKSWLESRASP

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic5.2e-4825.21Show/hide
Query:  NILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQ--TIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKM
        N++  +AV+        +   +A ++F ++ ++  Q   I +N  L V  +    E A  LF+EM  R ++ D  +++T++         + A E   +M
Subjt:  NILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQ--TIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKM

Query:  PSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKT
        P     P+ V+YS +ID + +AG  D A +L+   R     +D  +++T++ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K 
Subjt:  PSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKT

Query:  IYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI------
        ++ EM +    P+  TY++L+  Y +    ++A+ +++E K  GL+ +V+LY+ L+      G V  AV +  +M   G  SP+  T++S+I        
Subjt:  IYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI------

Query:  ------YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLI---------QCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPK-EELSKLID
              YS  G +  +   L+ + E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E+ S L++
Subjt:  ------YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLI---------QCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPK-EELSKLID

Query:  CVERANPKL-GFVVELLLGEQEKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLEL-GLTLQIYKDLQSRSPTQWSLYLKGLSL
         +   + K+ G V  LL+G++E   +   +A  LF  V+        A+ N L D+  +     +  EL+ L G + Q+++++ S S     L L  +S 
Subjt:  CVERANPKL-GFVVELLLGEQEKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLEL-GLTLQIYKDLQSRSPTQWSLYLKGLSL

Query:  GAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV
        GAA   +H W+ ++  ++  G ELP +L I TG GKH     D  +    E  L+ + APFH +   +G F ++     +WL   A+ +L+
Subjt:  GAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein1.3e-3823.63Show/hide
Query:  DVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAV
        + L+ +G  I    A  VL  M++   AL    + +           Y   +    + +      KL +EM+  G +P+ VT++ +I      +  N+A+
Subjt:  DVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F +M    C PD VTY  +ID + +AG +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+ 
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI
              +Y++M   GF P   TY+ ++   G   Y E+A  V+ EM++K    +  +Y  L+ +    G V +A + +Q M  +G   P+  T +S+++ 
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCY--GKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVE
        +    K++EA E+L  M+  G  P++   T L+ C   G++K            L++G      FCG L+                     +P   F+++
Subjt:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCY--GKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVE

Query:  LLLGEQEKEGDFRTEASELFSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT
        +     + E + R  A+    ++ ++ R   +   + ++D        ++A  + E+     ++ D L+ +S + W + L  +S G A+TAL   +    
Subjt:  LLLGEQEKEGDFRTEASELFSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT

Query:  KVLESGEELPPLLGINTGHGKHK--YSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWL
        K + +    P  + I TG G+         +    E  L    +PF       G F+ +      WL
Subjt:  KVLESGEELPPLLGINTGHGKHK--YSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWL

AT2G31400.1 genomes uncoupled 13.7e-4925.21Show/hide
Query:  NILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQ--TIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKM
        N++  +AV+        +   +A ++F ++ ++  Q   I +N  L V  +    E A  LF+EM  R ++ D  +++T++         + A E   +M
Subjt:  NILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQ--TIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKM

Query:  PSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKT
        P     P+ V+YS +ID + +AG  D A +L+   R     +D  +++T++ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K 
Subjt:  PSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKT

Query:  IYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI------
        ++ EM +    P+  TY++L+  Y +    ++A+ +++E K  GL+ +V+LY+ L+      G V  AV +  +M   G  SP+  T++S+I        
Subjt:  IYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI------

Query:  ------YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLI---------QCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPK-EELSKLID
              YS  G +  +   L+ + E   +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E+ S L++
Subjt:  ------YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLI---------QCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPK-EELSKLID

Query:  CVERANPKL-GFVVELLLGEQEKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLEL-GLTLQIYKDLQSRSPTQWSLYLKGLSL
         +   + K+ G V  LL+G++E   +   +A  LF  V+        A+ N L D+  +     +  EL+ L G + Q+++++ S S     L L  +S 
Subjt:  CVERANPKL-GFVVELLLGEQEKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVNLDLLDKACELLEL-GLTLQIYKDLQSRSPTQWSLYLKGLSL

Query:  GAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV
        GAA   +H W+ ++  ++  G ELP +L I TG GKH     D  +    E  L+ + APFH +   +G F ++     +WL   A+ +L+
Subjt:  GAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein1.0e-27265.95Show/hide
Query:  LCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL
        LC SPS+   D   L N L+   K+T  +    +  N    HS+  LQ T+VS+QE  PQ   +     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL

Query:  RKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA + L    + +K S++ I YNVT+KVFRK +D+E +EKLF+E
Subjt:  RKQSYEARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEE

Query:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL
        MLERG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVDMA SLYDRARTE WRID  TFST+I+I+GV+GNYDGCL
Subjt:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNV
        V+EA E+FQDMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGKAK VDDVVRTF+++LELG+TPDDRFCGCLLNV
Subjt:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGFVVELLLGEQE-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ  +EG F+ EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L+LGL   IY  LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGFVVELLLGEQE-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKG+A+VFESHLKEL APFHEAPDKVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELV

Query:  AA
        +A
Subjt:  AA

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-4028.57Show/hide
Query:  IALRYFQDVLKSSK-QTIFYNVTLKVFRKCRDMEG----AEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDA
        +ALR F   +K    Q++  N  + +       EG    A  +F  + E G   D  +++++IS         +AV  F+KM    C P  +TY+ +++ 
Subjt:  IALRYFQDVLKSSK-QTIFYNVTLKVFRKCRDMEG----AEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDA

Query:  YGRAGNV-DMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATY
        +G+ G   +   SL ++ +++    D  T++T+I        +     V+EEMKA G   + V YN+LLD  G++ RP +   +  EM+ NGFSPS  TY
Subjt:  YGRAGNV-DMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATY

Query:  ASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFD
         SL+ AY R    ++A+ +  +M EKG + +V  Y TLL+     G V  A+ +F++M+++G C P+  TF++ I +Y   GK +E  ++ +E+   G  
Subjt:  ASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFD

Query:  PNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQ
        P+I    +L+  +G+     +V   F  +   G  P+      L++  ++
Subjt:  PNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQ

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein1.6e-13237.62Show/hide
Query:  AFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLN--------PIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVW
        A  +C +P    T  HSL        K +L+  SR  KLN        P     +P    T    ++  P           +I   P          SVW
Subjt:  AFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLN--------PIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVW

Query:  VNPRSPRASKLRKQ-------SYEARYTSLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVT
        VNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       +TIFYNVT
Subjt:  VNPRSPRASKLRKQ-------SYEARYTSLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVT

Query:  LKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPA
        +K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D  
Subjt:  LKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPA

Query:  TFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL
         FS + K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K  
Subjt:  TFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL

Query:  QLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNR
         ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+QC GKAK +DDVV  F+ 
Subjt:  QLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNR

Query:  LLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL
         ++ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   + +    + R+ +CNCLID+C   +  ++A ELL
Subjt:  LLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL

Query:  ELGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLT
         LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +G+A+ F  HL++L APF ++ D+ G F+ 
Subjt:  ELGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASVFESHLKELKAPFHEAPDKVGWFLT

Query:  TKVAAKSWLESRASP
        TK    SWLES+  P
Subjt:  TKVAAKSWLESRASP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATTCCAGCTCTGCCATTCTCCCTCCACCTTCTTCACCGACCACCATTCCCTCTCCAATTCTCTCACTTCTCAACGTAAAACAACTCTCTACAACTCCTCTCGCCT
TTTCAAGCTCAATCCCATTCCTCGTCACTCAAAACCATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCACAATCCAACCCCCTCTGATG
ATGAAATCTCCAAATACCCAGATGGGAAATCCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTGAATCCTAGAAGCCCCAGAGCTTCCAAGCTTCGGAAGCAATCGTACGAA
GCCAGGTATACTTCTCTTACGAGAATATCGGAGTCCTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAGCAACATTTTGGAACA
GGATGCTGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGATTGCTCTTCGCTACTTCCAGGATGTGTTGAAATCAAGTAAACAGACTATATTTTATAATG
TGACATTGAAGGTGTTTAGGAAGTGTAGAGATATGGAGGGTGCAGAGAAACTGTTCGAAGAAATGCTTGAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATT
ATTAGTTGTGCTAGGTTGTGTTCGTTACCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTTACTTACTCCGCGATGATAGA
TGCCTATGGACGAGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGACCGTGCAAGAACAGAAAACTGGCGTATCGATCCTGCAACATTCTCAACAATGATCAAAATTC
ATGGAGTGGCTGGGAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACAGCTTGTTGGATGCTATGGGT
AGGGCTAAAAGACCTTGGCAGATCAAGACCATTTACAAAGAGATGATTAAAAATGGATTTTCACCAAGTTGGGCAACTTATGCTTCACTTTTACGTGCCTATGGAAGAGC
CAGGTATGGTGAGGATGCTCTCCTTGTTTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAATGTAATTCTCTACAATACGCTTTTAGCTATGTGTGCTGATGTTGGCT
ACGTTAATGAGGCTGTTGAAGTTTTTCAAGATATGAAAAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCCTGCAGTGGAAAA
GTATCAGAGGCCGAAGAAATGTTGAATGAGATGGTGGAAGCCGGTTTCGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGCTACGGGAAAGCCAAACATGTCGA
TGATGTAGTGAGGACATTCAATCGATTGCTAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTA
GTAAGCTGATTGATTGCGTTGAGAGAGCTAATCCAAAACTGGGTTTTGTGGTTGAACTCTTGCTAGGGGAGCAAGAAAAGGAAGGAGATTTCAGAACTGAAGCCTCAGAA
CTCTTTAGTGTTGTCAGTGCTGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTAAATTTAGATCTTTTGGATAAGGCATGTGAACTACTGGAGTTGGG
GCTTACACTTCAGATATACAAAGATTTGCAATCCAGGTCTCCAACTCAGTGGTCTCTTTATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGA
TAAATGACTTAACAAAGGTACTTGAATCCGGGGAGGAACTTCCGCCATTACTTGGAATAAATACTGGACATGGAAAACACAAATATTCAGATAAGGGTATGGCAAGCGTC
TTTGAATCACATTTAAAGGAATTAAAGGCTCCATTCCATGAGGCTCCAGACAAGGTCGGGTGGTTTTTGACGACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAGC
TTCACCTGAATTAGTTGCAGCATAG
mRNA sequenceShow/hide mRNA sequence
ATTTTTGTGACAAAAACACCCTTCAATTTGAAAGAGGATATTTTAATGTAAAGGCCTGCCAGCCGATAAGGCGGGGACGAGCCACTCAACCGGAAGCCCAGAAGAACTAA
CCGGAAAGGGCCACACAATCACCGGAATGGCATTCCAGCTCTGCCATTCTCCCTCCACCTTCTTCACCGACCACCATTCCCTCTCCAATTCTCTCACTTCTCAACGTAAA
ACAACTCTCTACAACTCCTCTCGCCTTTTCAAGCTCAATCCCATTCCTCGTCACTCAAAACCATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGA
AACCCACAATCCAACCCCCTCTGATGATGAAATCTCCAAATACCCAGATGGGAAATCCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTGAATCCTAGAAGCCCCAGAGCTT
CCAAGCTTCGGAAGCAATCGTACGAAGCCAGGTATACTTCTCTTACGAGAATATCGGAGTCCTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAG
GTGATAGGTAGCAACATTTTGGAACAGGATGCTGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGATTGCTCTTCGCTACTTCCAGGATGTGTTGAAATC
AAGTAAACAGACTATATTTTATAATGTGACATTGAAGGTGTTTAGGAAGTGTAGAGATATGGAGGGTGCAGAGAAACTGTTCGAAGAAATGCTTGAGAGAGGAGTTAAGC
CTGATAATGTGACATTTTCTACAATTATTAGTTGTGCTAGGTTGTGTTCGTTACCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGAT
GATGTTACTTACTCCGCGATGATAGATGCCTATGGACGAGCTGGTAATGTTGACATGGCTTTCAGCTTGTATGACCGTGCAAGAACAGAAAACTGGCGTATCGATCCTGC
AACATTCTCAACAATGATCAAAATTCATGGAGTGGCTGGGAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATAT
ATAACAGCTTGTTGGATGCTATGGGTAGGGCTAAAAGACCTTGGCAGATCAAGACCATTTACAAAGAGATGATTAAAAATGGATTTTCACCAAGTTGGGCAACTTATGCT
TCACTTTTACGTGCCTATGGAAGAGCCAGGTATGGTGAGGATGCTCTCCTTGTTTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAATGTAATTCTCTACAATACGCT
TTTAGCTATGTGTGCTGATGTTGGCTACGTTAATGAGGCTGTTGAAGTTTTTCAAGATATGAAAAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGA
TCACCATATATTCCTGCAGTGGAAAAGTATCAGAGGCCGAAGAAATGTTGAATGAGATGGTGGAAGCCGGTTTCGACCCTAATATCTTTGTCTTGACATCACTAATCCAG
TGCTACGGGAAAGCCAAACATGTCGATGATGTAGTGAGGACATTCAATCGATTGCTAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAAT
TACCCAGACGCCAAAAGAGGAACTTAGTAAGCTGATTGATTGCGTTGAGAGAGCTAATCCAAAACTGGGTTTTGTGGTTGAACTCTTGCTAGGGGAGCAAGAAAAGGAAG
GAGATTTCAGAACTGAAGCCTCAGAACTCTTTAGTGTTGTCAGTGCTGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTAAATTTAGATCTTTTGGAT
AAGGCATGTGAACTACTGGAGTTGGGGCTTACACTTCAGATATACAAAGATTTGCAATCCAGGTCTCCAACTCAGTGGTCTCTTTATCTTAAGGGTCTTTCTCTTGGGGC
TGCTCTCACTGCATTACACGTTTGGATAAATGACTTAACAAAGGTACTTGAATCCGGGGAGGAACTTCCGCCATTACTTGGAATAAATACTGGACATGGAAAACACAAAT
ATTCAGATAAGGGTATGGCAAGCGTCTTTGAATCACATTTAAAGGAATTAAAGGCTCCATTCCATGAGGCTCCAGACAAGGTCGGGTGGTTTTTGACGACTAAAGTGGCA
GCAAAATCATGGTTGGAGTCTAGAGCTTCACCTGAATTAGTTGCAGCATAGGTTGTTCATGAAAGGTCTCATAATTCTGTAGTGATATCTTTTATTTAAACCCTTTTTTT
TCTCCTTTATTTTAATCGGCCATATATTTCTTGAAAGAGAACTTTATGTTAAAGAATGCAATGAAGAGTTCTGCTATCTTTGTTCTTCTAATGTATTATAATTCATTGGA
ATGTCACATTGTCATTCGAACACTCTAGTTTCTTGCTTACTGCTAA
Protein sequenceShow/hide protein sequence
MAFQLCHSPSTFFTDHHSLSNSLTSQRKTTLYNSSRLFKLNPIPRHSKPFLQITNVSLQEYAPQETHNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKLRKQSYE
ARYTSLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALIALRYFQDVLKSSKQTIFYNVTLKVFRKCRDMEGAEKLFEEMLERGVKPDNVTFSTI
ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDMAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMG
RAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGK
VSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKHVDDVVRTFNRLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVELLLGEQEKEGDFRTEASE
LFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLELGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGMASV
FESHLKELKAPFHEAPDKVGWFLTTKVAAKSWLESRASPELVAA