; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G011550 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G011550
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr02:13930770..13935811
RNA-Seq ExpressionLsi02G011550
SyntenyLsi02G011550
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0090.21Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLC-NSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NSLT   KTTLC +SSR+FKLNPIP HSKPFLQITNVS Q+YA QET+NP+PSDDEISK+PDGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLC-NSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAE
        RASKLRKQSYEARYASL +ISESLDSCNPCE+DVADVLK I S ILEQDA+ VLNNMSNSQTALL LRYFQDVLKSSKQA+FYNVTLKVFRKCRD EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAE

Query:  KLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGN
        KLF+EML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AFSLYDRARTENWRID +TFSTMIKIHGVAGN
Subjt:  KLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY R+RY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCG
        ADVGYVNEA+EVF+DMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF++LLELGLTPDDRFCG
Subjt:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR
        CLLNVITQTPK ELSKLIDCV RANPKLG VV+LLLGE+D EGDFRTEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL++QIY DLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0093.89Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH LSNSLTPQRKTTL NSS LFKL+PIPRHSKPFLQITNVSLQ++A Q+TQN  PS DEISKYPD KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTALLALRYFQD+LKSSKQ IFYNVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPK EL KLIDCVVRANPKLG VVELLLGEQDKEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0094.74Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH LSNSLTPQRKTTL NSS LFKLNPIPRHS PFLQITN+SLQ+++ QET N  PSDDEISKY D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQD+LKSSKQ IFYNVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPKEE+SKLIDCVVRANPKLG VVELLLGEQDKEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0091.34Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH LSNSL  Q + TL  SS  FKLNP P HSK  L+ITNVSLQ+YA QE QNP P+ DE SKYPDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ VLKSSK+AI YNVTLKV RK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LF+EMLKRGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYVNEAVE+F+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPKEELSKLIDCV RAN KLG VV+LLLGEQDKEGD RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0096.88Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFTDHH LSNSLT QRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQ+YA QET NP+PS+DEISKYPDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASLTRISESLDSCNPC+EDVADVLK IGSNIL+QDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYV EAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPKEELSKLIDCVVRANPKLG VV+LL+GEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQ+YKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0093.89Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH LSNSLTPQRKTTL NSS LFKL+PIPRHSKPFLQITNVSLQ++A Q+TQN  PS DEISKYPD KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTALLALRYFQD+LKSSKQ IFYNVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPK EL KLIDCVVRANPKLG VVELLLGEQDKEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0094.74Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH LSNSLTPQRKTTL NSS LFKLNPIPRHS PFLQITN+SLQ+++ QET N  PSDDEISKY D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQD+LKSSKQ IFYNVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPKEE+SKLIDCVVRANPKLG VVELLLGEQDKEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0094.74Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH LSNSLTPQRKTTL NSS LFKLNPIPRHS PFLQITN+SLQ+++ QET N  PSDDEISKY D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQD+LKSSKQ IFYNVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LFEEML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYVNEAVE+FQDMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQL+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPKEE+SKLIDCVVRANPKLG VVELLLGEQDKEG+FRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGLTLQIYKDLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.34Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH LSNSL  Q + TL  SS  FKLNP P HSK  L+ITNVSLQ+YA QE QNP P+ DE SKYPDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK
        ASKLR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ VLKSSK+AI YNVTLKV RK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEK

Query:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
        LF+EMLKRGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC
        DVGYVNEAVE+F+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGK KRVDDVVRTF++L+ELGLTPDDRFCGC
Subjt:  DVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS
        LLNVITQTPKEELSKLIDCV RAN KLG VV+LLLGEQDKEGD RTEASEL SVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGLTLQIY  LQS S
Subjt:  LLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0090.07Show/hide
Query:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNS-SRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NSLT   KTTLC S SR+FKLNPIP HSKPFLQITNVS Q+YA QET+NP+PSDDEISK+PDGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNS-SRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAE
        RASKLRKQSYEARYASL +ISESLDSCNPCE DVADVLK I S ILEQDA+ VLNNMSNSQTALL  RYFQDVLKSSKQA+FYNVTLKVFRKCRD EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAE

Query:  KLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGN
        KLF+EML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AFSLYDRARTENWRID +TFSTMIKIHGVAGN
Subjt:  KLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY R+RY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCG
        ADVGYVNEA+EVF+DMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+EAGFDPNIFVLTSLIQCYGKAKRVDDVVRTF++LLELGLTPDDRFCG
Subjt:  ADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR
        CLLNVITQTPK ELSKLIDCV RANPKLG VV+LLLGE+D EGDFRTEASELFSVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL++QIY DLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic4.4e-20753.6Show/hide
Query:  LCHSPSTFFTD--HHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSS--VWVNPRSPR
        LC SPS+      H  +S S  P+  +           +P+  H         VS+Q+   Q  Q+P+P  D     P+G   SSS ++  +WVNP SPR
Subjt:  LCHSPSTFFTD--HHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSS--VWVNPRSPR

Query:  ASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-VIGSNILEQDAVVVLNN--MSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDME
        A+ + R ++   R A L   + +L +C   E  V   L+        EQDAV+VLN    + ++TA+LALR+F    K  K+ I YNV LK+ RK R   
Subjt:  ASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-VIGSNILEQDAVVVLNN--MSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDME

Query:  GAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGV
          E L+ EML+ GV+PDN TFST+ISCAR C L +KAVEWF+KMP F C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  
Subjt:  GAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGV

Query:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLL
        +GN+DG LNV+EEMKAIG++PNLV+YN++LDAMGRA RPW +KTI++EM+     PS ATY  LL AY R+RYGEDA+ VY+ MK++ + ++V+LYN LL
Subjt:  AGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLL

Query:  AMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPD
        +MCAD+GYV+EA E+F+DMK+S      PDSW++SSM+T+YS +  V  AE +LNEMVEAGF PNIFVLTSLI+CYGK  R DDVVR+F  L +LG+ PD
Subjt:  AMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPD

Query:  DRFCGCLLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYK
        DRFCGCLL+V   TP EEL K+I C+ R+N +LG VV+LL+     E  FR  A EL       V+  YCNCL+DLCVNL+ ++KAC LLD    L IY 
Subjt:  DRFCGCLLNVITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYK

Query:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
        ++Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G E LPPLLGI+TG GK+ YSD+GLA++FE+HLKEL+APFHEAP+K GWFLTT VAAK WL
Subjt:  DLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

Query:  ESRSSPELV
        ES+++ ELV
Subjt:  ESRSSPELV

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic1.5e-20755.42Show/hide
Query:  RLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL-RKQSYEARYASLTRISESLDSCNPCEEDVA
        R   L+  P++  P      VS+QD        P PSD   S    G+S ++S+  VWVNP SPRA+ L R ++   R A L   + +L +C   E  VA
Subjt:  RLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL-RKQSYEARYASLTRISESLDSCNPCEEDVA

Query:  DVLK-VIGSNILEQDAVVVLNNMSNSQTA-LLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNK
          L+        EQDAV+VLN  S    A +LAL +F    +  K+ I YNV LK  RK R    AE L+EEML+ GV+PDN TFST+ISCAR C +P K
Subjt:  DVLK-VIGSNILEQDAVVVLNNMSNSQTA-LLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNK

Query:  AVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRA
        AVEWFEKMP F C+PD +TYSA+IDAYGRAG+ + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN++LDAMGRA
Subjt:  AVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRA

Query:  KRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFSS
         RPW +KTI++E++     P+ ATY  LL AY R+RYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GYV EA E+F+DMK+S      PDSW++SS
Subjt:  KRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSS--GTCSPDSWTFSS

Query:  MITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVVRANPKLGLV
        M+T+YSC+G V+ AE +LNEMVEAGF PNIF+LTSLI+CYGKA R DDVVR+F  L +LG+TPDDRFCGCLL V   TP +EL K+I C+ R++ +LG V
Subjt:  MITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVVRANPKLGLV

Query:  VELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKV
        V LL+         R  A EL       VR  YCNCL+DL VNL  ++KAC LLD+ L L IY ++Q+R+ TQWSL+L+GLS+GAALT LHVW++DL   
Subjt:  VELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKV

Query:  LESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVA
        L++G+ELPPLLGI+TG GK+ YS KGLA+VFESHLKEL+APFHEAP+K GWFLTT VAA+ WLE++ S ELVA
Subjt:  LESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVA

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic6.3e-27065.53Show/hide
Query:  LCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL
        LC SPS+   D   L N L+   K+T  +    +  N    HS+  LQ T+VS+Q+   Q  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    + +K S++ I YNVT+KVFRK +D+E +EKLF+E
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEE

Query:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A SLYDRARTE WRID  TFST+I+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGR+RYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNV
        V+EA E+FQDMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF+Q+LELG+TPDDRFCGCLLNV
Subjt:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQD-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ
        +TQTP EE+ KLI CV +A PKLG VV++L+ EQ+ +EG F+ EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IY  LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQD-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELV

Query:  AA
        +A
Subjt:  AA

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic4.4e-13037.67Show/hide
Query:  TVTGMAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGS----------
        TV   A  +C +P    T  H L        K +L   SR  KLN           I+  SL+   T E +  T     +S+     S +          
Subjt:  TVTGMAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGS----------

Query:  SSKSSVWVNPRSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQ
        S   SVWVNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       +
Subjt:  SSKSSVWVNPRSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQ

Query:  AIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTE
         IFYNVT+K  R  R  +  E++  EM+K GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA   
Subjt:  AIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTE

Query:  NWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYK
         W+ D   FS + K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG++R+  DAL +++
Subjt:  NWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYK

Query:  EMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDD
        EMK K   ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+QC GKAKR+DD
Subjt:  EMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDD

Query:  VVRTFNQLLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLL
        VV  F+  ++ G+ PDDR CGCLL+V+      E+  K++ C+ RAN KL   V L++ E+ +    + E   + +    + R+ +CNCLID+C   +  
Subjt:  VVRTFNQLLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLL

Query:  DKACELLDLGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE
        ++A ELL LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ +
Subjt:  DKACELLDLGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE

Query:  KVGWFLTTKVAAKSWLESRSSP
        + G F+ TK    SWLES+  P
Subjt:  KVGWFLTTKVAAKSWLESRSSP

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic4.6e-4724.84Show/hide
Query:  VLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRK-CRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAV
        ++   G + L ++A+ V N+M         LR            + YN  +    K   + +   K F+EM + GV+PD +TF+++++      L   A 
Subjt:  VLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRK-CRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F++M +     D  +Y+ ++DA  + G +DLAF +  +   +    +  ++ST+I     AG +D  LN++ EM+ +GI  + V YN+LL    +  R
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI
          +   I +EM   G      TY +LL  YG+    ++   V+ EMK + +  N++ Y+TL+   +  G   EA+E+F++ KS+G    D   +S++I  
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVD---------------------------DVVRTFNQLLELGLTPDDRFC-------GC
           +G V  A  +++EM + G  PN+    S+I  +G++  +D                            V++ F QL         + C        C
Subjt:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVD---------------------------DVVRTFNQLLELGLTPDDRFC-------GC

Query:  LLNVITQTPKEEL-------SKLIDCVVRA-----------------NPKLGLVVELLLGEQDKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVN
        +L V  +  + E+       S +++   R                  N   G+V  LL+G+++   +   +A  LF  V+        A+ N L D+  +
Subjt:  LLNVITQTPKEEL-------SKLIDCVVRA-----------------NPKLGLVVELLLGEQDKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVN

Query:  LDLLDKACELLDL-GLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNA
             +  EL+ L G + Q+++++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +L I TG GKH     D  L    E  L+ ++A
Subjt:  LDLLDKACELLDL-GLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNA

Query:  PFHEAPEKVGWFLTTKVAAKSWLESRSSPELV
        PFH +   +G F ++     +WL   ++ +L+
Subjt:  PFHEAPEKVGWFLTTKVAAKSWLESRSSPELV

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-3823.46Show/hide
Query:  DVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAV
        + L+ +G  I    A  VL  M++   AL    + +           Y   +    + +      KL +EM++ G +P+ VT++ +I      +  N+A+
Subjt:  DVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F +M    C PD VTY  +ID + +AG +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+ 
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI
              +Y++M   GF P   TY+ ++   G   Y E+A  V+ EM++K    +  +Y  L+ +    G V +A + +Q M  +G   P+  T +S+++ 
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVVRANPKLGLVVE
        +    K++EA E+L  M+  G  P++   T L+ C   G++K            L++G      FCG L+                     +P    +++
Subjt:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVVRANPKLGLVVE

Query:  LLLGEQDKEGDFRTEASELFSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT
        +     D E + R  A+    ++ ++ R   +   + ++D        ++A  + ++     ++ D L+ +S + W + L  +S G A+TAL   +    
Subjt:  LLLGEQDKEGDFRTEASELFSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT

Query:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
        K + +    P  + I TG G+         +    E  L    +PF       G F+ +      WL
Subjt:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

AT2G31400.1 genomes uncoupled 13.3e-4824.84Show/hide
Query:  VLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRK-CRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAV
        ++   G + L ++A+ V N+M         LR            + YN  +    K   + +   K F+EM + GV+PD +TF+++++      L   A 
Subjt:  VLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRK-CRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F++M +     D  +Y+ ++DA  + G +DLAF +  +   +    +  ++ST+I     AG +D  LN++ EM+ +GI  + V YN+LL    +  R
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI
          +   I +EM   G      TY +LL  YG+    ++   V+ EMK + +  N++ Y+TL+   +  G   EA+E+F++ KS+G    D   +S++I  
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVD---------------------------DVVRTFNQLLELGLTPDDRFC-------GC
           +G V  A  +++EM + G  PN+    S+I  +G++  +D                            V++ F QL         + C        C
Subjt:  YSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVD---------------------------DVVRTFNQLLELGLTPDDRFC-------GC

Query:  LLNVITQTPKEEL-------SKLIDCVVRA-----------------NPKLGLVVELLLGEQDKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVN
        +L V  +  + E+       S +++   R                  N   G+V  LL+G+++   +   +A  LF  V+        A+ N L D+  +
Subjt:  LLNVITQTPKEEL-------SKLIDCVVRA-----------------NPKLGLVVELLLGEQDKEGDFRTEASELFSVVS---ADVRKAYCNCLIDLCVN

Query:  LDLLDKACELLDL-GLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNA
             +  EL+ L G + Q+++++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +L I TG GKH     D  L    E  L+ ++A
Subjt:  LDLLDKACELLDL-GLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNA

Query:  PFHEAPEKVGWFLTTKVAAKSWLESRSSPELV
        PFH +   +G F ++     +WL   ++ +L+
Subjt:  PFHEAPEKVGWFLTTKVAAKSWLESRSSPELV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein4.4e-27165.53Show/hide
Query:  LCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL
        LC SPS+   D   L N L+   K+T  +    +  N    HS+  LQ T+VS+Q+   Q  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGSSSKSSVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    + +K S++ I YNVT+KVFRK +D+E +EKLF+E
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFRKCRDMEGAEKLFEE

Query:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A SLYDRARTE WRID  TFST+I+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGR+RYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNV
        V+EA E+FQDMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M EAGF+P +FVLTS+IQCYGKAK+VDDVVRTF+Q+LELG+TPDDRFCGCLLNV
Subjt:  VNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQD-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ
        +TQTP EE+ KLI CV +A PKLG VV++L+ EQ+ +EG F+ EASEL   + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IY  LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVVRANPKLGLVVELLLGEQD-KEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELV

Query:  AA
        +A
Subjt:  AA

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein6.6e-4128.57Show/hide
Query:  LALRYFQDVLKSSK-QAIFYNVTLKVFRKCRDMEG----AEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDA
        LALR F   +K    Q++  N  + +       EG    A  +F  + + G   D  +++++IS         +AV  F+KM    C P  +TY+ +++ 
Subjt:  LALRYFQDVLKSSK-QAIFYNVTLKVFRKCRDMEG----AEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDA

Query:  YGRAGNV-DLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATY
        +G+ G   +   SL ++ +++    D  T++T+I        +     V+EEMKA G   + V YN+LLD  G++ RP +   +  EM+ NGFSPS  TY
Subjt:  YGRAGNV-DLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATY

Query:  ASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFD
         SL+ AY R    ++A+ +  +M EKG + +V  Y TLL+     G V  A+ +F++M+++G C P+  TF++ I +Y   GK +E  ++ +E+   G  
Subjt:  ASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFD

Query:  PNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNVITQ
        P+I    +L+  +G+     +V   F ++   G  P+      L++  ++
Subjt:  PNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNVITQ

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein3.1e-13137.67Show/hide
Query:  TVTGMAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGS----------
        TV   A  +C +P    T  H L        K +L   SR  KLN           I+  SL+   T E +  T     +S+     S +          
Subjt:  TVTGMAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYPDGKSGS----------

Query:  SSKSSVWVNPRSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQ
        S   SVWVNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       +
Subjt:  SSKSSVWVNPRSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQ

Query:  AIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTE
         IFYNVT+K  R  R  +  E++  EM+K GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA   
Subjt:  AIFYNVTLKVFRKCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTE

Query:  NWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYK
         W+ D   FS + K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG++R+  DAL +++
Subjt:  NWRIDPATFSTMIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYK

Query:  EMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDD
        EMK K   ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM++AG   N+   T L+QC GKAKR+DD
Subjt:  EMKEKGLQLNVILYNTLLAMCADVGYVNEAVEVFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDD

Query:  VVRTFNQLLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLL
        VV  F+  ++ G+ PDDR CGCLL+V+      E+  K++ C+ RAN KL   V L++ E+ +    + E   + +    + R+ +CNCLID+C   +  
Subjt:  VVRTFNQLLELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVVRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLL

Query:  DKACELLDLGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE
        ++A ELL LG    +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ +
Subjt:  DKACELLDLGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE

Query:  KVGWFLTTKVAAKSWLESRSSP
        + G F+ TK    SWLES+  P
Subjt:  KVGWFLTTKVAAKSWLESRSSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGACGCCTGCAGCCGATAAGGCGGGGACGAGCCACTCAACCGGAAGCCCAGAAGCACTAACCGGAGAGGGACACACAGTCACCGGAATGGCATTCCAGCTTTGCCA
TTCGCCGTCCACCTTCTTCACCGACCACCATTGTCTTTCCAATTCTCTCACTCCTCAACGTAAAACAACTCTCTGCAACTCCTCTCGCCTTTTCAAGCTCAATCCCATTC
CTCGTCACTCAAAACCATTCCTCCAAATTACCAATGTCTCGCTACAGGATTACGCTACTCAAGAAACCCAGAATCCAACCCCCTCCGATGATGAAATCTCTAAATACCCA
GATGGGAAATCTGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCTAGAAGCCCCAGAGCTTCCAAACTTCGGAAGCAATCGTACGAAGCCAGGTATGCTTCTCTAAC
GAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAGTAACATTTTAGAACAGGACGCTGTTGTAGTGCTGA
ATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGTTACTTCCAGGATGTGTTGAAATCAAGTAAACAGGCGATTTTTTATAATGTGACATTGAAGGTGTTTAGG
AAGTGCAGAGATATGGAGGGTGCGGAGAAACTGTTCGAAGAAATGCTTAAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATTATTAGTTGTGCTAGGTTGTG
TTCCTTACCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCCGATGATGTTACTTACTCTGCGATGATTGATGCCTATGGACGTGCTGGTA
ATGTTGACCTGGCTTTCAGTTTGTATGACCGTGCACGAACAGAAAACTGGCGTATTGATCCTGCGACATTCTCGACAATGATCAAAATTCATGGAGTGGCTGGGAACTAT
GATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACAGCTTGCTGGATGCTATGGGTAGGGCTAAAAGACCCTGGCA
GATCAAGACCATTTACAAAGAGATGATTAAGAATGGATTTTCACCAAGTTGGGCAACTTATGCTTCTCTTTTACGTGCCTATGGAAGATCCAGGTATGGTGAGGATGCTC
TCCTTGTTTACAAGGAGATGAAGGAAAAGGGGCTGCAGTTAAATGTAATTCTGTACAATACGCTTTTAGCTATGTGTGCTGATGTTGGCTACGTTAATGAGGCTGTTGAG
GTTTTTCAAGATATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCCTGCAGTGGAAAAGTATCAGAGGCTGAAGAAAT
GTTGAATGAGATGGTGGAAGCCGGTTTTGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGCTACGGGAAAGCCAAACGTGTTGATGATGTAGTGAGGACATTCA
ATCAATTGCTAGAGTTGGGATTAACTCCAGACGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTAGTAAGCTGATTGATTGTGTT
GTGAGAGCTAATCCAAAACTCGGGCTTGTGGTTGAACTCTTGCTAGGGGAGCAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCAGAACTCTTTAGTGTTGTCAGTGC
TGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTAAATTTAGATCTTTTGGATAAGGCATGTGAACTACTGGATTTGGGGCTTACACTTCAGATATATA
AAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTTTATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGATAAATGACTTAACAAAGGTA
CTCGAATCCGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAATATTCAGATAAGGGTCTGGCAAGCGTCTTTGAATCACATTTAAAGGA
ATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACGACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAAGTTCACCTGAATTAGTTGCAG
CATAG
mRNA sequenceShow/hide mRNA sequence
ATGACGACGCCTGCAGCCGATAAGGCGGGGACGAGCCACTCAACCGGAAGCCCAGAAGCACTAACCGGAGAGGGACACACAGTCACCGGAATGGCATTCCAGCTTTGCCA
TTCGCCGTCCACCTTCTTCACCGACCACCATTGTCTTTCCAATTCTCTCACTCCTCAACGTAAAACAACTCTCTGCAACTCCTCTCGCCTTTTCAAGCTCAATCCCATTC
CTCGTCACTCAAAACCATTCCTCCAAATTACCAATGTCTCGCTACAGGATTACGCTACTCAAGAAACCCAGAATCCAACCCCCTCCGATGATGAAATCTCTAAATACCCA
GATGGGAAATCTGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCTAGAAGCCCCAGAGCTTCCAAACTTCGGAAGCAATCGTACGAAGCCAGGTATGCTTCTCTAAC
GAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAGTAACATTTTAGAACAGGACGCTGTTGTAGTGCTGA
ATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGTTACTTCCAGGATGTGTTGAAATCAAGTAAACAGGCGATTTTTTATAATGTGACATTGAAGGTGTTTAGG
AAGTGCAGAGATATGGAGGGTGCGGAGAAACTGTTCGAAGAAATGCTTAAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATTATTAGTTGTGCTAGGTTGTG
TTCCTTACCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCCGATGATGTTACTTACTCTGCGATGATTGATGCCTATGGACGTGCTGGTA
ATGTTGACCTGGCTTTCAGTTTGTATGACCGTGCACGAACAGAAAACTGGCGTATTGATCCTGCGACATTCTCGACAATGATCAAAATTCATGGAGTGGCTGGGAACTAT
GATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACAGCTTGCTGGATGCTATGGGTAGGGCTAAAAGACCCTGGCA
GATCAAGACCATTTACAAAGAGATGATTAAGAATGGATTTTCACCAAGTTGGGCAACTTATGCTTCTCTTTTACGTGCCTATGGAAGATCCAGGTATGGTGAGGATGCTC
TCCTTGTTTACAAGGAGATGAAGGAAAAGGGGCTGCAGTTAAATGTAATTCTGTACAATACGCTTTTAGCTATGTGTGCTGATGTTGGCTACGTTAATGAGGCTGTTGAG
GTTTTTCAAGATATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCCTGCAGTGGAAAAGTATCAGAGGCTGAAGAAAT
GTTGAATGAGATGGTGGAAGCCGGTTTTGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGCTACGGGAAAGCCAAACGTGTTGATGATGTAGTGAGGACATTCA
ATCAATTGCTAGAGTTGGGATTAACTCCAGACGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTAGTAAGCTGATTGATTGTGTT
GTGAGAGCTAATCCAAAACTCGGGCTTGTGGTTGAACTCTTGCTAGGGGAGCAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCAGAACTCTTTAGTGTTGTCAGTGC
TGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTAAATTTAGATCTTTTGGATAAGGCATGTGAACTACTGGATTTGGGGCTTACACTTCAGATATATA
AAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTTTATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGATAAATGACTTAACAAAGGTA
CTCGAATCCGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAATATTCAGATAAGGGTCTGGCAAGCGTCTTTGAATCACATTTAAAGGA
ATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTCGGGTGGTTTTTGACGACTAAAGTGGCAGCAAAATCATGGTTGGAGTCTAGAAGTTCACCTGAATTAGTTGCAG
CATAG
Protein sequenceShow/hide protein sequence
MTTPAADKAGTSHSTGSPEALTGEGHTVTGMAFQLCHSPSTFFTDHHCLSNSLTPQRKTTLCNSSRLFKLNPIPRHSKPFLQITNVSLQDYATQETQNPTPSDDEISKYP
DGKSGSSSKSSVWVNPRSPRASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQDVLKSSKQAIFYNVTLKVFR
KCRDMEGAEKLFEEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTMIKIHGVAGNY
DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYVNEAVE
VFQDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVEAGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNQLLELGLTPDDRFCGCLLNVITQTPKEELSKLIDCV
VRANPKLGLVVELLLGEQDKEGDFRTEASELFSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLTLQIYKDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKV
LESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRSSPELVAA