; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025610 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025610
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr10:16292699..16294813
RNA-Seq ExpressionLag0025610
SyntenyLag0025610
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0089.5Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHR-FKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NS     K TLCK S R FKLNPIP HSK FLQITNVS QEYAPQET+NPSPSDDEISK PDGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHR-FKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSP

Query:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE
        RAS+LRKQSYEARYASL +ISESLDSCNPCE+DVADVLK I S ILEQDA+ VLNNMSNSQTALL LRYFQ+VLKSSK+A+ +NVTLKVFRKCRD EGAE
Subjt:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AF LYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCG
        ADVGY+NEA+E+FKDMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLG+VV+LLLGE+D EGDFRTEASEL SVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL +QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0090.48Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH+L NS   QRK TL   S  FKL+PIPRHSK FLQITNVSLQE+APQ+TQN  PS DEISK PD KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL R+SESLDS NPCE DVADVLK+IG+NILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0091.34Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRK TL   S  FKLNPIPRHS  FLQITN+SLQE++PQET N  PSDDEISK  D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLK+IG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0092.47Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH L NS NSQ +ITL K SHRFKLNP P HSKT L+ITNVSLQEYA QE QNP P+ DE SK PDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK+AIL+NVTLKV RK RDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVVRTF+RL+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLGYVV+LLLGEQDKEGD RTEASELLSVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGL LQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0093.89Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFTDHH L NS  SQRK TLC  S  FKLNPIPRHSK FLQITNVSLQEYAPQET NPSPS+DEISK PDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASLTRISESLDSCNPC+EDVADVLK IGSNIL+QDAVVVLNNMSNSQTALLALRYFQ+VLKSSK+AI +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+ EAVE+F+DMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RLIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCV RANPKLG+VV+LL+GEQDKEGDFRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQ+Y DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0090.48Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH+L NS   QRK TL   S  FKL+PIPRHSK FLQITNVSLQE+APQ+TQN  PS DEISK PD KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL R+SESLDS NPCE DVADVLK+IG+NILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.34Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRK TL   S  FKLNPIPRHS  FLQITN+SLQE++PQET N  PSDDEISK  D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLK+IG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0091.34Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRK TL   S  FKLNPIPRHS  FLQITN+SLQE++PQET N  PSDDEISK  D KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLK+IG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0092.47Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH L NS NSQ +ITL K SHRFKLNP P HSKT L+ITNVSLQEYA QE QNP P+ DE SK PDGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK+AIL+NVTLKV RK RDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEML+RGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVVRTF+RL+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLGYVV+LLLGEQDKEGD RTEASELLSVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGL LQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVAA
        LVAA
Subjt:  LVAA

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0089.36Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCK-FSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NS     K TLCK FS  FKLNPIP HSK FLQITNVS QEYAPQET+NPSPSDDEISK PDGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCK-FSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSP

Query:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE
        RAS+LRKQSYEARYASL +ISESLDSCNPCE DVADVLK I S ILEQDA+ VLNNMSNSQTALL  RYFQ+VLKSSK+A+ +NVTLKVFRKCRD EGAE
Subjt:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AF LYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCG
        ADVGY+NEA+E+FKDMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLG+VV+LLLGE+D EGDFRTEASEL SVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL +QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVAA
        ELVAA
Subjt:  ELVAA

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic4.5e-20954.6Show/hide
Query:  LCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSS--VWVNPRSPRAS
        LC SPS+           S   R I+   F+ +   +P+  H         VS+QE  PQ  Q+PSP  D    NP+G   SSS ++  +WVNP SPRA+
Subjt:  LCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSS--VWVNPRSPRAS

Query:  RL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-MIGSNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGA
         + R ++   R A L   + +L +C   E  V   L+        EQDAV+VLN    + ++TA+LALR+F    K  K+ IL+NV LK+ RK R     
Subjt:  RL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-MIGSNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGA

Query:  EKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAG
        E L+ EML  GV+PDN TFST+ISCAR C L +KAVEWF+KMP F C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  +G
Subjt:  EKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAG

Query:  NYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAM
        N+DG LNV+EEMKAIG++PNLV+YN++LDAMGRA RPW +KTI++EM+     PS ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL+M
Subjt:  NYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAM

Query:  CADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDR
        CAD+GY++EA EIF+DMK+S      PDSW++SSM+T+YS +  V  AE +LNEMVE+GF PNIFVLTSLI+CYGK  R DDVVR+F  L +LG+ PDDR
Subjt:  CADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDR

Query:  FCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDL
        FCGCLL+V   TP EEL K+I C+ER+N +LG VV+LL+     E  FR  A ELL      V+  YCNCL+DLCVNL+ ++KAC LLD    L IY ++
Subjt:  FCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDL

Query:  QSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLES
        Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G E LPPLLGI+TG GK+ YSD+GLA++FE+HLKEL+APFHEAP+K GWFLTT VAAK WLES
Subjt:  QSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLES

Query:  RGSPELV
        + + ELV
Subjt:  RGSPELV

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic2.2e-20857.01Show/hide
Query:  QNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPRASRL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-MIGSNILEQDAVVVLNNMSNSQTA-L
        Q+P P   + + +P G+S ++S+  VWVNP SPRA+ L R ++   R A L   + +L +C   E  VA  L+        EQDAV+VLN  S    A +
Subjt:  QNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPRASRL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-MIGSNILEQDAVVVLNNMSNSQTA-L

Query:  LALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAG
        LAL +F    +  KE IL+NV LK  RK R    AE L++EML  GV+PDN TFST+ISCAR C +P KAVEWFEKMP F C+PD +TYSA+IDAYGRAG
Subjt:  LALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAG

Query:  NVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRA
        + + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN++LDAMGRA RPW +KTI++E++     P+ ATY  LL A
Subjt:  NVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRA

Query:  YGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIF
        Y RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY+ EA EIF+DMK+S      PDSW++SSM+T+YSC+G V+ AE +LNEMVE+GF PNIF
Subjt:  YGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIF

Query:  VLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRK
        +LTSLI+CYGKA R DDVVR+F  L +LG+TPDDRFCGCLL V   TP +EL K+I C++R++ +LG VVRLL+         R  A ELL      VR 
Subjt:  VLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRK

Query:  AYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVF
         YCNCL+DL VNL  ++KAC LLD+ L L IY+++Q+R+ TQWSL+L+GLS+GAALT LHVW++DL   L++G+ELPPLLGI+TG GK+ YS KGLA+VF
Subjt:  AYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVF

Query:  ESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVA
        ESHLKEL+APFHEAP+K GWFLTT VAA+ WLE++ S ELVA
Subjt:  ESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVA

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic1.8e-27466.38Show/hide
Query:  LCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPRASRL
        LC SPS+   D   LCN  +   K T   F   +  N    HS+  LQ T+VS+QE  PQ  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPRASRL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+E IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        MLERG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A  LYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVVRTF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Query:  AA
        +A
Subjt:  AA

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic4.2e-13040.03Show/hide
Query:  SSKSSVWVNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKE
        S   SVWVNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       E
Subjt:  SSKSSVWVNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKE

Query:  AILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTE
         I +NVT+K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+    LY+RA   
Subjt:  AILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTE

Query:  NWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYK
         W+ D   FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++
Subjt:  NWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYK

Query:  EMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDD
        EMK K   ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DD
Subjt:  EMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDD

Query:  VVRTFNRLIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLL
        VV  F+  I+ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + R+ +CNCLID+C   +  
Subjt:  VVRTFNRLIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLL

Query:  DKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE
        ++A ELL LG +  +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ +
Subjt:  DKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE

Query:  KVGWFLTTKVAAKSWLESRGSP
        + G F+ TK    SWLES+  P
Subjt:  KVGWFLTTKVAAKSWLESRGSP

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic9.9e-4724.1Show/hide
Query:  EISKNPDGKSGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADVLKM-----IGSNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A R+ + ++   Y +      +L S    +   E+   V        +  N++  +AV+        +   +A 
Subjt:  EISKNPDGKSGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADVLKM-----IGSNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++    + I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
         D A  L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV +  +M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE
        E+  +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE

Query:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL
                + ++ +      A+ N L D+  +     +  EL+ L G   Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +
Subjt:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL

Query:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        L I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL    + +L+
Subjt:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-3923.1Show/hide
Query:  DVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAV
        + L+ +G  I    A  VL  M++   AL    + +       +   +   +    + +      KL DEM+  G +P+ VT++ +I      +  N+A+
Subjt:  DVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F +M    C PD VTY  +ID + +AG +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+ 
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI
              +Y++M   GF P   TY+ ++   G   Y E+A  V+ EM++K    +  +Y  L+ +    G + +A + ++ M  +G   P+  T +S+++ 
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR
        +    K++EA E+L  M+  G  P++   T L+ C   G++K            +++G      FCG L+                     +P   ++++
Subjt:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR

Query:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT
        +     D E + R  A+  L ++ ++ R   +   + ++D        ++A  + ++     ++ D L+ +S + W + L  +S G A+TAL   +    
Subjt:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT

Query:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
        K + +    P  + I TG G+         +    E  L    +PF       G F+ +      WL
Subjt:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein1.6e-3923.1Show/hide
Query:  DVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAV
        + L+ +G  I    A  VL  M++   AL    + +       +   +   +    + +      KL DEM+  G +P+ VT++ +I      +  N+A+
Subjt:  DVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F +M    C PD VTY  +ID + +AG +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+ 
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI
              +Y++M   GF P   TY+ ++   G   Y E+A  V+ EM++K    +  +Y  L+ +    G + +A + ++ M  +G   P+  T +S+++ 
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR
        +    K++EA E+L  M+  G  P++   T L+ C   G++K            +++G      FCG L+                     +P   ++++
Subjt:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR

Query:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT
        +     D E + R  A+  L ++ ++ R   +   + ++D        ++A  + ++     ++ D L+ +S + W + L  +S G A+TAL   +    
Subjt:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT

Query:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
        K + +    P  + I TG G+         +    E  L    +PF       G F+ +      WL
Subjt:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

AT2G31400.1 genomes uncoupled 17.0e-4824.1Show/hide
Query:  EISKNPDGKSGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADVLKM-----IGSNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A R+ + ++   Y +      +L S    +   E+   V        +  N++  +AV+        +   +A 
Subjt:  EISKNPDGKSGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADVLKM-----IGSNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++    + I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
         D A  L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV +  +M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE
        E+  +  I +   L           C    + +  ++  F ++ +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE

Query:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL
                + ++ +      A+ N L D+  +     +  EL+ L G   Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +
Subjt:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL

Query:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        L I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL    + +L+
Subjt:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein1.3e-27566.38Show/hide
Query:  LCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPRASRL
        LC SPS+   D   LCN  +   K T   F   +  N    HS+  LQ T+VS+QE  PQ  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPRASRL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+E IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        MLERG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A  LYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVVRTF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Query:  AA
        +A
Subjt:  AA

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein3.0e-13140.03Show/hide
Query:  SSKSSVWVNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKE
        S   SVWVNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       E
Subjt:  SSKSSVWVNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKE

Query:  AILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTE
         I +NVT+K  R  R  +  E++  EM++ GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+    LY+RA   
Subjt:  AILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTE

Query:  NWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYK
         W+ D   FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++
Subjt:  NWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYK

Query:  EMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDD
        EMK K   ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DD
Subjt:  EMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDD

Query:  VVRTFNRLIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLL
        VV  F+  I+ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + R+ +CNCLID+C   +  
Subjt:  VVRTFNRLIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLL

Query:  DKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE
        ++A ELL LG +  +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ +
Subjt:  DKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPE

Query:  KVGWFLTTKVAAKSWLESRGSP
        + G F+ TK    SWLES+  P
Subjt:  KVGWFLTTKVAAKSWLESRGSP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACATTCTTCACCGACCACCATTACCTATGCAATTCTTCCAATTCTCAACGCAAAATAACTCTCTGCAAGTTCTCTCACCG
TTTCAAGCTCAATCCGATACCTCGCCACTCAAAAACATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCAAGCCCCTCTGATG
ATGAAATTTCGAAAAATCCAGATGGGAAATCCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAGACTTCGGAAGCAATCTTACGAG
GCCAGGTATGCTTCTCTTACGAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGATGATAGGCAGTAACATTTTAGAACA
GGACGCAGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGGTACTTTCAGGAGGTGTTGAAATCAAGTAAAGAGGCAATTCTTTTTAATG
TGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGACGAAATGCTTGAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATT
ATTAGTTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTCGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATCGA
TGCCTATGGACGTGCTGGTAATGTTGACTTGGCTTTTGGCTTGTATGACCGTGCAAGAACGGAAAACTGGCGTATTGATCCTGCGACATTCTCAACGTTGATCAAAATTC
ATGGAGTGGCTGGGAATTATGATGGGTGTTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAATTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATTTACAAAGAGATGATTAAAAATGGTTTTTCACCAAGTTGGGCAACTTATGCTTCTCTTTTACGTGCCTACGGAAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGACTACAGTTAAATGTAATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCT
ATATCAATGAAGCCGTTGAAATTTTTAAAGACATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCCTGCAGTGGAAAA
GTATCGGAGGCGGAGGAAATGTTAAACGAGATGGTGGAATCCGGTTTTGATCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTAGTGAGGACATTTAATCGACTGATAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATCCGAAACTCGGTTATGTGGTTAGACTTTTGCTAGGGGAACAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCTGAA
TTACTTAGTGTTGTCAGTGCTGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCATGCGAACTCCTGGATTTGGG
GCTTATGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGA
TAAATGACTTAACGAAGGTACTTGAATCCGGGGAGGAACTTCCACCATTACTTGGAATTAATACTGGACATGGAAAACACAAATATTCTGATAAGGGTTTAGCAAGCGTT
TTTGAATCACATTTAAAGGAACTAAATGCTCCATTCCATGAGGCTCCCGAAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCTGCAAAATCATGGTTGGAGTCTAGAGG
TTCACCTGAATTAGTTGCAGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACATTCTTCACCGACCACCATTACCTATGCAATTCTTCCAATTCTCAACGCAAAATAACTCTCTGCAAGTTCTCTCACCG
TTTCAAGCTCAATCCGATACCTCGCCACTCAAAAACATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCAAGCCCCTCTGATG
ATGAAATTTCGAAAAATCCAGATGGGAAATCCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAGACTTCGGAAGCAATCTTACGAG
GCCAGGTATGCTTCTCTTACGAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGATGATAGGCAGTAACATTTTAGAACA
GGACGCAGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGGTACTTTCAGGAGGTGTTGAAATCAAGTAAAGAGGCAATTCTTTTTAATG
TGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGACGAAATGCTTGAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATT
ATTAGTTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTCGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATCGA
TGCCTATGGACGTGCTGGTAATGTTGACTTGGCTTTTGGCTTGTATGACCGTGCAAGAACGGAAAACTGGCGTATTGATCCTGCGACATTCTCAACGTTGATCAAAATTC
ATGGAGTGGCTGGGAATTATGATGGGTGTTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAATTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATTTACAAAGAGATGATTAAAAATGGTTTTTCACCAAGTTGGGCAACTTATGCTTCTCTTTTACGTGCCTACGGAAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGACTACAGTTAAATGTAATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCT
ATATCAATGAAGCCGTTGAAATTTTTAAAGACATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCCTGCAGTGGAAAA
GTATCGGAGGCGGAGGAAATGTTAAACGAGATGGTGGAATCCGGTTTTGATCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTAGTGAGGACATTTAATCGACTGATAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATCCGAAACTCGGTTATGTGGTTAGACTTTTGCTAGGGGAACAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCTGAA
TTACTTAGTGTTGTCAGTGCTGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCATGCGAACTCCTGGATTTGGG
GCTTATGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGA
TAAATGACTTAACGAAGGTACTTGAATCCGGGGAGGAACTTCCACCATTACTTGGAATTAATACTGGACATGGAAAACACAAATATTCTGATAAGGGTTTAGCAAGCGTT
TTTGAATCACATTTAAAGGAACTAAATGCTCCATTCCATGAGGCTCCCGAAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCTGCAAAATCATGGTTGGAGTCTAGAGG
TTCACCTGAATTAGTTGCAGCGTAG
Protein sequenceShow/hide protein sequence
MAFQLCHSPSTFFTDHHYLCNSSNSQRKITLCKFSHRFKLNPIPRHSKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKSGSSSKSSVWVNPRSPRASRLRKQSYE
ARYASLTRISESLDSCNPCEEDVADVLKMIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLERGVKPDNVTFSTI
ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMG
RAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGK
VSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASE
LLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV
FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVAA