; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019921 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019921
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold5:32569859..32574411
RNA-Seq ExpressionSpg019921
SyntenySpg019921
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.6Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NS     KTTLCK S R FKLNPIP H K FLQITNVS QEYAPQET+NPSPSDDEISK PDGK GSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP

Query:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE
        RAS+LRKQSYEARYASL +ISESLDSCNPCE+DVADVLK I S ILEQDA+ VLNNMSNSQTALL LRYFQ+VLKSSK+A+ +NVTLKVFRKCRD EGAE
Subjt:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AF LYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNG SPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG
        ADVGY+NEA+E+FKDMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+ L+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLG+VV+LLLGE+D EGDFRTEASEL SVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL +QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE

Query:  EL
        EL
Subjt:  EL

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0090.01Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH+L NS   QRKTTL   S  FKL+PIPRH K FLQITNVSLQE+APQ+TQN  PS DEISK PD K GSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNG SPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0090.87Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRKTTL   S  FKLNPIPRH   FLQITN+SLQE++PQET N  PSDDEISK  D K GSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+G SPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0091.58Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH L NS NSQ + TL K SHRFKLNP P H KT L+ITNVSLQEYA QE QNP P+ DE SK PDGK  SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK+AIL+NVTLKV RK RDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNG SPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVVRTF+ L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLGYVV+LLLGEQDKEGD RTEASELLSVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGL LQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0093.3Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFTDHH L NS  SQRKTTLC  S  FKLNPIPRH K FLQITNVSLQEYAPQET NPSPS+DEISK PDGK  SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASLTRISESLDSCNPC+EDVADVLK IGSNIL+QDAVVVLNNMSNSQTALLALRYFQ+VLKSSK+AI +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNG SPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+ EAVE+F+DMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+ LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCV RANPKLG+VV+LL+GEQDKEGDFRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQ+Y DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0090.01Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH+L NS   QRKTTL   S  FKL+PIPRH K FLQITNVSLQE+APQ+TQN  PS DEISK PD K GSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNG SPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0090.87Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRKTTL   S  FKLNPIPRH   FLQITN+SLQE++PQET N  PSDDEISK  D K GSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+G SPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0090.87Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRKTTL   S  FKLNPIPRH   FLQITN+SLQE++PQET N  PSDDEISK  D K GSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+G SPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.58Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH L NS NSQ + TL K SHRFKLNP P H KT L+ITNVSLQEYA QE QNP P+ DE SK PDGK  SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        AS+LR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK+AIL+NVTLKV RK RDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNG SPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVVRTF+ L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLGYVV+LLLGEQDKEGD RTEASELLSVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGL LQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0088.46Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCK-FSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NS     KTTLCK FS  FKLNPIP H K FLQITNVS QEYAPQET+NPSPSDDEISK PDGK GSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCK-FSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP

Query:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE
        RAS+LRKQSYEARYASL +ISESLDSCNPCE DVADVLK I S ILEQDA+ VLNNMSNSQTALL  RYFQ+VLKSSK+A+ +NVTLKVFRKCRD EGAE
Subjt:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AF LYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNG SPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG
        ADVGY+NEA+E+FKDMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+ L+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLG+VV+LLLGE+D EGDFRTEASEL SVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL +QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE

Query:  EL
        EL
Subjt:  EL

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic1.0e-20956.37Show/hide
Query:  PRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSS--VWVNPRSPRASRL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-V
        P++P + +   +VS+QE  PQ  Q+PSP  D    NP+G   SSS ++  +WVNP SPRA+ + R ++   R A L   + +L +C   E  V   L+  
Subjt:  PRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSS--VWVNPRSPRASRL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-V

Query:  IGSNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWF
              EQDAV+VLN    + ++TA+LALR+F    K  K+ IL+NV LK+ RK R     E L+ EML+ GV+PDN TFST+ISCAR C L +KAVEWF
Subjt:  IGSNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWF

Query:  EKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQ
        +KMP F C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  +GN+DG LNV+EEMKAIG++PNLV+YN++LDAMGRA RPW 
Subjt:  EKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQ

Query:  IKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIY
        +KTI++EM+   + PS ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY++EA EIF+DMK+S      PDSW++SSM+T+Y
Subjt:  IKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIY

Query:  SCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLL
        S +  V  AE +LNEMVE+GF PNIFVLTSLI+CYGK  R DDVVR+F +L +LG+ PDDRFCGCLL+V   TP EEL K+I C+ER+N +LG VV+LL+
Subjt:  SCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLL

Query:  GEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGE
             E  FR  A ELL      V+  YCNCL+DLCVNL+ ++KAC LLD    L IY ++Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G 
Subjt:  GEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGE

Query:  E-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL
        E LPPLLGI+TG GK+ YSD+GLA++FE+HLKEL+APFHEAP+K GWFLTT VAAK WLES+   EL
Subjt:  E-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic4.4e-20854.79Show/hide
Query:  TTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL-RKQSYEARYASLTRISESLDSC
        ++L  + HR   L+  P++P        VS+Q+        P PSD     NP     S++   VWVNP SPRA+ L R ++   R A L   + +L +C
Subjt:  TTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL-RKQSYEARYASLTRISESLDSC

Query:  NPCEEDVADVLK-VIGSNILEQDAVVVLNNMSNSQTA-LLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCA
           E  VA  L+        EQDAV+VLN  S    A +LAL +F    +  KE IL+NV LK  RK R    AE L++EML+ GV+PDN TFST+ISCA
Subjt:  NPCEEDVADVLK-VIGSNILEQDAVVVLNNMSNSQTA-LLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCA

Query:  RLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNS
        R C +P KAVEWFEKMP F C+PD +TYSA+IDAYGRAG+ + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN+
Subjt:  RLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNS

Query:  LLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCS
        +LDAMGRA RPW +KTI++E++     P+ ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY+ EA EIF+DMK+S      
Subjt:  LLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCS

Query:  PDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVER
        PDSW++SSM+T+YSC+G V+ AE +LNEMVE+GF PNIF+LTSLI+CYGKA R DDVVR+F +L +LG+TPDDRFCGCLL V   TP +EL K+I C++R
Subjt:  PDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVER

Query:  ANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHV
        ++ +LG VVRLL+         R  A ELL      VR  YCNCL+DL VNL  ++KAC LLD+ L L IY+++Q+R+ TQWSL+L+GLS+GAALT LHV
Subjt:  ANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHV

Query:  WINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL
        W++DL   L++G+ELPPLLGI+TG GK+ YS KGLA+VFESHLKEL+APFHEAP+K GWFLTT VAA+ WLE++   EL
Subjt:  WINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic3.3e-27266.28Show/hide
Query:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL
        LC SPS+   D   LCN  +   K+T   F   +  N    H +  LQ T+VS+QE  PQ  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+E IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A  LYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NG +P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVVRTF+ ++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic8.2e-13038.2Show/hide
Query:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW
        A  +C +P    T  H L        K +L + S   KLN      + PKT          E  P  T+ PS S+        +   +     S   SVW
Subjt:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW

Query:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVT
        VNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       E I +NVT
Subjt:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVT

Query:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA
        +K  R  R  +  E++  EM+K GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+    LY+RA    W+ D  
Subjt:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA

Query:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL
         FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ GL+P+  T  +L++ YG+AR+  DAL +++EMK K  
Subjt:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL

Query:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL
         ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DDVV  F+L
Subjt:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL

Query:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL
         I+ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + R+ +CNCLID+C   +  ++A ELL
Subjt:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL

Query:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT
         LG +  +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ ++ G F+ 
Subjt:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT

Query:  TKVAAKSWLESR
        TK    SWLES+
Subjt:  TKVAAKSWLESR

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic1.5e-4624.33Show/hide
Query:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A R+ + ++   Y +      +L S    +   E+   V   +K  G   N++  +AV+        +   +A 
Subjt:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++    + I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAY
         D A  L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +  + P+  TY++L+  Y
Subjt:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV +  +M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE
        E+  +  I +   L           C    + +  ++  F  + +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE

Query:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL
                + ++ +      A+ N L D+  +     +  EL+ L G   Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +
Subjt:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL

Query:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL
        L I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL      +L  L + +
Subjt:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL

Arabidopsis top hitse value%identityAlignment
AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-3922.44Show/hide
Query:  LKR--GVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGC
        LKR  G K D  T++T++          +  +  ++M    C P+ VTY+ +I +YGRA  +  A  ++++ +      D  T+ TLI IH  AG  D  
Subjt:  LKR--GVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGC

Query:  LNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVG
        +++Y+ M+  G+ P+   Y+ +++ +G+A        ++ EM+  G +P+  T+  ++  + +AR  E AL +Y++M+  G Q + + Y+ ++ +    G
Subjt:  LNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVG

Query:  YINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLN
        ++ EA  +F +M+      PD   +  ++ ++  +G V +A +    M+++G  PN+    SL+  + +  R+ +       ++ LGL P  +    LL+
Subjt:  YINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLN

Query:  VITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGD-FRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQS
          T             +   +    ++  L +     +G   R   S  L  + ++ R   +   + ++D      L ++A  + ++     +Y D L+ 
Subjt:  VITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGD-FRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQS

Query:  RSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR
        +S + W + L  +S G A+ AL   +    K +    + P  + I TG G+         +    E  L   N PF       G F+ +    K+WL   
Subjt:  RSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR

Query:  GLEELRPL
         +E +  L
Subjt:  GLEELRPL

AT2G31400.1 genomes uncoupled 11.0e-4724.33Show/hide
Query:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A R+ + ++   Y +      +L S    +   E+   V   +K  G   N++  +AV+        +   +A 
Subjt:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++    + I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAY
         D A  L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +  + P+  TY++L+  Y
Subjt:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV +  +M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE
        E+  +  I +   L           C    + +  ++  F  + +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE

Query:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL
                + ++ +      A+ N L D+  +     +  EL+ L G   Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +
Subjt:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL

Query:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL
        L I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL      +L  L + +
Subjt:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein2.3e-27366.28Show/hide
Query:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL
        LC SPS+   D   LCN  +   K+T   F   +  N    H +  LQ T+VS+QE  PQ  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+E IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A  LYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NG +P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVVRTF+ ++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein3.1e-3928.24Show/hide
Query:  FQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNV-DL
        +Q +L +S  AI+ ++  K  R    +  A  +F+ + + G   D  +++++IS         +AV  F+KM    C P  +TY+ +++ +G+ G   + 
Subjt:  FQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNV-DL

Query:  AFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRA
           L ++ +++    D  T++TLI        +     V+EEMKA G   + V YN+LLD  G++ RP +   +  EM+ NG SPS  TY SL+ AY R 
Subjt:  AFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRA

Query:  RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLI
           ++A+ +  +M EKG + +V  Y TLL+     G +  A+ IF++M+++G C P+  TF++ I +Y   GK +E  ++ +E+   G  P+I    +L+
Subjt:  RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLI

Query:  QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQ
          +G+     +V   F  +   G  P+      L++  ++
Subjt:  QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQ

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein5.8e-13138.2Show/hide
Query:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW
        A  +C +P    T  H L        K +L + S   KLN      + PKT          E  P  T+ PS S+        +   +     S   SVW
Subjt:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW

Query:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVT
        VNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +       E I +NVT
Subjt:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVT

Query:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA
        +K  R  R  +  E++  EM+K GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+    LY+RA    W+ D  
Subjt:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA

Query:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL
         FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ GL+P+  T  +L++ YG+AR+  DAL +++EMK K  
Subjt:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL

Query:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL
         ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DDVV  F+L
Subjt:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL

Query:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL
         I+ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + R+ +CNCLID+C   +  ++A ELL
Subjt:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL

Query:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT
         LG +  +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ ++ G F+ 
Subjt:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT

Query:  TKVAAKSWLESR
        TK    SWLES+
Subjt:  TKVAAKSWLESR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACATTCTTCACCGACCACCATTACCTATGCAATTCTTCCAATTCTCAACGCAAAACAACTCTCTGCAAGTTCTCTCACCG
TTTCAAGCTCAATCCCATACCTCGCCACCCAAAAACATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCAAGCCCCTCTGATG
ATGAAATCTCCAAAAATCCAGATGGGAAATTCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAGACTTCGGAAGCAATCTTACGAG
GCCAGGTATGCTTCTCTTACGAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAGTAACATTTTAGAACA
GGACGCAGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGGTACTTTCAGGAGGTGTTGAAATCAAGTAAAGAGGCAATTCTTTTTAATG
TGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGACGAAATGCTTAAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATT
ATTAGTTGTGCTAGGTTGTGCTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATCGA
TGCCTATGGACGTGCTGGTAATGTTGACTTGGCTTTCGGCTTGTATGACCGTGCAAGAACGGAAAACTGGCGTATTGATCCTGCAACATTCTCAACGTTGATCAAAATTC
ATGGAGTGGCTGGGAATTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAATTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATATACAAAGAGATGATTAAAAATGGGTTGTCACCAAGTTGGGCAACTTATGCTTCCCTTTTACGTGCCTACGGAAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAATGTTATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCT
ATATCAATGAAGCCGTTGAAATTTTTAAAGACATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCTTGCAGTGGAAAA
GTATCAGAGGCAGAGGAAATGTTAAACGAGATGGTGGAATCCGGTTTTGATCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTAGTGAGGACATTTAATCTACTGATAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATCCGAAACTCGGTTATGTGGTTAGACTTTTGCTAGGGGAACAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCTGAA
TTACTTAGTGTTGTCAGTGCCGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCATGCGAACTCCTGGATTTGGG
GCTTATGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTTAAGGGCCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGA
TAAATGACTTAACGAAGGTACTTGAATCCGGGGAGGAACTTCCACCATTACTTGGAATTAATACTGGACATGGAAAACACAAATATTCTGATAAGGGTTTAGCAAGCGTT
TTTGAATCACATTTAAAGGAATTAAATGCTCCATTCCATGAGGCTCCCGAAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCTGCAAAATCATGGTTGGAGTCTAGAGG
ATTAGAAGAGTTAAGGCCTCTTGCAGAGGAATTGTTACTTGTTAGGCTCTTGAGTCCTAACAATTTTGTATCCAGCTTTGTCTTCAGTGTGTCCGTAGGAAAGAAGGTGG
ATGGTAGGTTTGAAGAAGTGGAACTGCCATCAGGAATGTTGGAAGAAGTTCAGAGGATACTGGTGATAGAGAAGACCCTAGTGGCCTCGATGGAGGGGTGTCACGCAGTT
GAAATTTCCCAAAGGTGTGTAATACGGATGGCCATGGGACTATGTAGTTCTGGCACTGGAGATCGACTCTCGAAAACTTGCTATCAAGGTCGTGAAGAAGCAGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACATTCTTCACCGACCACCATTACCTATGCAATTCTTCCAATTCTCAACGCAAAACAACTCTCTGCAAGTTCTCTCACCG
TTTCAAGCTCAATCCCATACCTCGCCACCCAAAAACATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCAAGCCCCTCTGATG
ATGAAATCTCCAAAAATCCAGATGGGAAATTCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAGACTTCGGAAGCAATCTTACGAG
GCCAGGTATGCTTCTCTTACGAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAGTAACATTTTAGAACA
GGACGCAGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGGTACTTTCAGGAGGTGTTGAAATCAAGTAAAGAGGCAATTCTTTTTAATG
TGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGACGAAATGCTTAAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATT
ATTAGTTGTGCTAGGTTGTGCTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATCGA
TGCCTATGGACGTGCTGGTAATGTTGACTTGGCTTTCGGCTTGTATGACCGTGCAAGAACGGAAAACTGGCGTATTGATCCTGCAACATTCTCAACGTTGATCAAAATTC
ATGGAGTGGCTGGGAATTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAATTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATATACAAAGAGATGATTAAAAATGGGTTGTCACCAAGTTGGGCAACTTATGCTTCCCTTTTACGTGCCTACGGAAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAATGTTATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCT
ATATCAATGAAGCCGTTGAAATTTTTAAAGACATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCTTGCAGTGGAAAA
GTATCAGAGGCAGAGGAAATGTTAAACGAGATGGTGGAATCCGGTTTTGATCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTAGTGAGGACATTTAATCTACTGATAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATCCGAAACTCGGTTATGTGGTTAGACTTTTGCTAGGGGAACAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCTGAA
TTACTTAGTGTTGTCAGTGCCGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCATGCGAACTCCTGGATTTGGG
GCTTATGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTTAAGGGCCTTTCTCTTGGGGCTGCTCTCACTGCATTACACGTTTGGA
TAAATGACTTAACGAAGGTACTTGAATCCGGGGAGGAACTTCCACCATTACTTGGAATTAATACTGGACATGGAAAACACAAATATTCTGATAAGGGTTTAGCAAGCGTT
TTTGAATCACATTTAAAGGAATTAAATGCTCCATTCCATGAGGCTCCCGAAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCTGCAAAATCATGGTTGGAGTCTAGAGG
ATTAGAAGAGTTAAGGCCTCTTGCAGAGGAATTGTTACTTGTTAGGCTCTTGAGTCCTAACAATTTTGTATCCAGCTTTGTCTTCAGTGTGTCCGTAGGAAAGAAGGTGG
ATGGTAGGTTTGAAGAAGTGGAACTGCCATCAGGAATGTTGGAAGAAGTTCAGAGGATACTGGTGATAGAGAAGACCCTAGTGGCCTCGATGGAGGGGTGTCACGCAGTT
GAAATTTCCCAAAGGTGTGTAATACGGATGGCCATGGGACTATGTAGTTCTGGCACTGGAGATCGACTCTCGAAAACTTGCTATCAAGGTCGTGAAGAAGCAGTGTAA
Protein sequenceShow/hide protein sequence
MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYE
ARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTI
ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMG
RAKRPWQIKTIYKEMIKNGLSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGK
VSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASE
LLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV
FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEELLLVRLLSPNNFVSSFVFSVSVGKKVDGRFEEVELPSGMLEEVQRILVIEKTLVASMEGCHAV
EISQRCVIRMAMGLCSSGTGDRLSKTCYQGREEAV