; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg000046 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg000046
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold5:26412278..26416829
RNA-Seq ExpressionSpg000046
SyntenySpg000046
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.75Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NS     KTTLCK S R FKLNPIP H K FLQITNVS QEYAPQET+NPSPSDDEISK PDGK GSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP

Query:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAE
        RAS+LRKQSYEARYASL +ISESLDSCNPCE+DVADVLK I S ILEQDA+ VLNNMSNSQTALL LRYFQ+VLKSSK A+ +NVTLKVFRKCRD EGAE
Subjt:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AF LYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG
        ADVGY+NEA+E+FKDMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+ L+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLG+VV+LLLGE+D EGDFRTEASEL SVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL +QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE

Query:  EL
        EL
Subjt:  EL

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0090.16Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH+L NS   QRKTTL   S  FKL+PIPRH K FLQITNVSLQE+APQ+TQN  PS DEISK PD K GSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK  I +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0091.01Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRKTTL   S  FKLNPIPRH   FLQITN+SLQE++PQET N  PSDDEISK  D K GSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK  I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0091.73Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH L NS NSQ + TL K SHRFKLNP P H KT L+ITNVSLQEYA QE QNP P+ DE SK PDGK  SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK AIL+NVTLKV RK RDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVVRTF+ L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLGYVV+LLLGEQDKEGD RTEASELLSVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGL LQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0093.44Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFTDHH L NS  SQRKTTLC  S  FKLNPIPRH K FLQITNVSLQEYAPQET NPSPS+DEISK PDGK  SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASLTRISESLDSCNPC+EDVADVLK IGSNIL+QDAVVVLNNMSNSQTALLALRYFQ+VLKSSK AI +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+ EAVE+F+DMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+ LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCV RANPKLG+VV+LL+GEQDKEGDFRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQ+Y DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0090.16Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH+L NS   QRKTTL   S  FKL+PIPRH K FLQITNVSLQE+APQ+TQN  PS DEISK PD K GSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL R+SESLDS NPCE DVADVLKVIG+NILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK  I +NVTLKVFRKCRDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.01Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRKTTL   S  FKLNPIPRH   FLQITN+SLQE++PQET N  PSDDEISK  D K GSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK  I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0091.01Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HH L NS   QRKTTL   S  FKLNPIPRH   FLQITN+SLQE++PQET N  PSDDEISK  D K GSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LRKQSYEARYASL RISESLDSCNPCE DVADVLKVIG+NILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK  I +NVTLKVFRKCRDMEGAE+
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RGVKPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AF LYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGRARYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK+SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVVRTFN LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLG+VV LLLGEQDKEG+FRTEASEL SVVSADVRKAYCNCLIDLCVNLDLLDKACELL+LGL LQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR   E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.73Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH L NS NSQ + TL K SHRFKLNP P H KT L+ITNVSLQEYA QE QNP P+ DE SK PDGK  SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPR

Query:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK
        AS+LR QSYEARYASLTRISESLDSCNPCEEDVADVLK +GSNILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK AIL+NVTLKV RK RDMEGAEK
Subjt:  ASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRGVKPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AF LYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMKSSG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVVRTF+ L+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLGYVV+LLLGEQDKEGD RTEASELLSVVSADVRKAYCNCLIDLCVNLDLL+KACELLDLGL LQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  E
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEE

Query:  L
        L
Subjt:  L

A0A6J1EHV4 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0088.6Show/hide
Query:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCK-FSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH    NS     KTTLCK FS  FKLNPIP H K FLQITNVS QEYAPQET+NPSPSDDEISK PDGK GSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCK-FSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSP

Query:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAE
        RAS+LRKQSYEARYASL +ISESLDSCNPCE DVADVLK I S ILEQDA+ VLNNMSNSQTALL  RYFQ+VLKSSK A+ +NVTLKVFRKCRD EGAE
Subjt:  RASRLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AF LYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY RARY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG
        ADVGY+NEA+E+FKDMKSSGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVVRTF+ L+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLG+VV+LLLGE+D EGDFRTEASEL SVVS DVRKAYCNCLIDLCVNLDLLDKACELLDLGL +QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRG  
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLE

Query:  EL
        EL
Subjt:  EL

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic2.3e-20956.37Show/hide
Query:  PRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSS--VWVNPRSPRASRL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-V
        P++P + +   +VS+QE  PQ  Q+PSP  D    NP+G   SSS ++  +WVNP SPRA+ + R ++   R A L   + +L +C   E  V   L+  
Subjt:  PRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSS--VWVNPRSPRASRL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-V

Query:  IGSNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWF
              EQDAV+VLN    + ++TA+LALR+F    K  K  IL+NV LK+ RK R     E L+ EML+ GV+PDN TFST+ISCAR C L +KAVEWF
Subjt:  IGSNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWF

Query:  EKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQ
        +KMP F C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  +GN+DG LNV+EEMKAIG++PNLV+YN++LDAMGRA RPW 
Subjt:  EKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQ

Query:  IKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIY
        +KTI++EM+     PS ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY++EA EIF+DMK+S      PDSW++SSM+T+Y
Subjt:  IKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCSPDSWTFSSMITIY

Query:  SCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLL
        S +  V  AE +LNEMVE+GF PNIFVLTSLI+CYGK  R DDVVR+F +L +LG+ PDDRFCGCLL+V   TP EEL K+I C+ER+N +LG VV+LL+
Subjt:  SCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLL

Query:  GEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGE
             E  FR  A ELL      V+  YCNCL+DLCVNL+ ++KAC LLD    L IY ++Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G 
Subjt:  GEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGE

Query:  E-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL
        E LPPLLGI+TG GK+ YSD+GLA++FE+HLKEL+APFHEAP+K GWFLTT VAAK WLES+   EL
Subjt:  E-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic2.8e-20754.64Show/hide
Query:  TTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL-RKQSYEARYASLTRISESLDSC
        ++L  + HR   L+  P++P        VS+Q+        P PSD     NP     S++   VWVNP SPRA+ L R ++   R A L   + +L +C
Subjt:  TTLCKFSHR-FKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL-RKQSYEARYASLTRISESLDSC

Query:  NPCEEDVADVLK-VIGSNILEQDAVVVLNNMSNSQTA-LLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCA
           E  VA  L+        EQDAV+VLN  S    A +LAL +F    +  K  IL+NV LK  RK R    AE L++EML+ GV+PDN TFST+ISCA
Subjt:  NPCEEDVADVLK-VIGSNILEQDAVVVLNNMSNSQTA-LLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCA

Query:  RLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNS
        R C +P KAVEWFEKMP F C+PD +TYSA+IDAYGRAG+ + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN+
Subjt:  RLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNS

Query:  LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCS
        +LDAMGRA RPW +KTI++E++     P+ ATY  LL AY RARYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY+ EA EIF+DMK+S      
Subjt:  LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSS--GTCS

Query:  PDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVER
        PDSW++SSM+T+YSC+G V+ AE +LNEMVE+GF PNIF+LTSLI+CYGKA R DDVVR+F +L +LG+TPDDRFCGCLL V   TP +EL K+I C++R
Subjt:  PDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVER

Query:  ANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHV
        ++ +LG VVRLL+         R  A ELL      VR  YCNCL+DL VNL  ++KAC LLD+ L L IY+++Q+R+ TQWSL+L+GLS+GAALT LHV
Subjt:  ANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHV

Query:  WINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL
        W++DL   L++G+ELPPLLGI+TG GK+ YS KGLA+VFESHLKEL+APFHEAP+K GWFLTT VAA+ WLE++   EL
Subjt:  WINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEEL

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic3.3e-27266.28Show/hide
Query:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL
        LC SPS+   D   LCN  +   K+T   F   +  N    H +  LQ T+VS+QE  PQ  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+  IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A  LYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVVRTF+ ++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic9.0e-12937.92Show/hide
Query:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW
        A  +C +P    T  H L        K +L + S   KLN      + PKT          E  P  T+ PS S+        +   +     S   SVW
Subjt:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW

Query:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVT
        VNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +         I +NVT
Subjt:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVT

Query:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA
        +K  R  R  +  E++  EM+K GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+    LY+RA    W+ D  
Subjt:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA

Query:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL
         FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K  
Subjt:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL

Query:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL
         ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DDVV  F+L
Subjt:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL

Query:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL
         I+ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + R+ +CNCLID+C   +  ++A ELL
Subjt:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL

Query:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT
         LG +  +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ ++ G F+ 
Subjt:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT

Query:  TKVAAKSWLESR
        TK    SWLES+
Subjt:  TKVAAKSWLESR

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic4.3e-4624.33Show/hide
Query:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A R+ + ++   Y +      +L S    +   E+   V   +K  G   N++  +AV+        +   +A 
Subjt:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++      I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
         D A  L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV +  +M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE
        E+  +  I +   L           C    + +  ++  F  + +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE

Query:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL
                + ++ +      A+ N L D+  +     +  EL+ L G   Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +
Subjt:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL

Query:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL
        L I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL      +L  L + +
Subjt:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-3923.01Show/hide
Query:  DVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAV
        + L+ +G  I    A  VL  M++   AL    + +           +   +    + +      KL DEM++ G +P+ VT++ +I      +  N+A+
Subjt:  DVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F +M    C PD VTY  +ID + +AG +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+ 
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI
              +Y++M   GF P   TY+ ++   G   Y E+A  V+ EM++K    +  +Y  L+ +    G + +A + ++ M  +G   P+  T +S+++ 
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR
        +    K++EA E+L  M+  G  P++   T L+ C   G++K            +++G      FCG L+                     +P   ++++
Subjt:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR

Query:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT
        +     D E + R  A+  L ++ ++ R   +   + ++D        ++A  + ++     ++ D L+ +S + W + L  +S G A+TAL   +    
Subjt:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT

Query:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPL
        K + +    P  + I TG G+         +    E  L    +PF       G F+ +      WL    +E +  L
Subjt:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPL

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein1.1e-3923.01Show/hide
Query:  DVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAV
        + L+ +G  I    A  VL  M++   AL    + +           +   +    + +      KL DEM++ G +P+ VT++ +I      +  N+A+
Subjt:  DVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F +M    C PD VTY  +ID + +AG +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+ 
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI
              +Y++M   GF P   TY+ ++   G   Y E+A  V+ EM++K    +  +Y  L+ +    G + +A + ++ M  +G   P+  T +S+++ 
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR
        +    K++EA E+L  M+  G  P++   T L+ C   G++K            +++G      FCG L+                     +P   ++++
Subjt:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVR

Query:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT
        +     D E + R  A+  L ++ ++ R   +   + ++D        ++A  + ++     ++ D L+ +S + W + L  +S G A+TAL   +    
Subjt:  LLLGEQDKEGDFRTEASELLSVVSADVR---KAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT

Query:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPL
        K + +    P  + I TG G+         +    E  L    +PF       G F+ +      WL    +E +  L
Subjt:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPL

AT2G31400.1 genomes uncoupled 13.0e-4724.33Show/hide
Query:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A R+ + ++   Y +      +L S    +   E+   V   +K  G   N++  +AV+        +   +A 
Subjt:  EISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--SNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++      I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
         D A  L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV +  +M   G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE
        E+  +  I +   L           C    + +  ++  F  + +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GYVVRLLLGEQDKE

Query:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL
                + ++ +      A+ N L D+  +     +  EL+ L G   Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +
Subjt:  GDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDL-GLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPL

Query:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL
        L I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL      +L  L + +
Subjt:  LGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEEL

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein2.3e-27366.28Show/hide
Query:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL
        LC SPS+   D   LCN  +   K+T   F   +  N    H +  LQ T+VS+QE  PQ  ++     D     P     ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+  IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A  LYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGRARYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK+  TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVVRTF+ ++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV+KAY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQD-KEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein6.4e-13037.92Show/hide
Query:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW
        A  +C +P    T  H L        K +L + S   KLN      + PKT          E  P  T+ PS S+        +   +     S   SVW
Subjt:  AFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLN---PIPRHPKTFLQITNVSLQEYAPQETQNPSPSD-----DEISKNPDGKFGSSSKSSVW

Query:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVT
        VNP  P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I       +A++VLN++   Q       + +         I +NVT
Subjt:  VNPRSPRASRLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVT

Query:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA
        +K  R  R  +  E++  EM+K GV+ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+    LY+RA    W+ D  
Subjt:  LKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPA

Query:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL
         FS L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG+AR+  DAL +++EMK K  
Subjt:  TFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGL

Query:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL
         ++ ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DDVV  F+L
Subjt:  QLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNL

Query:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL
         I+ G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + R+ +CNCLID+C   +  ++A ELL
Subjt:  LIELGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASELLSVVSADVRKAYCNCLIDLCVNLDLLDKACELL

Query:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT
         LG +  +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ ++ G F+ 
Subjt:  DLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLT

Query:  TKVAAKSWLESR
        TK    SWLES+
Subjt:  TKVAAKSWLESR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACATTCTTCACCGACCACCATTACCTATGCAATTCTTCCAATTCTCAACGCAAAACAACTCTCTGCAAGTTCTCTCACCG
TTTCAAGCTCAATCCCATACCTCGCCACCCAAAAACATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCAAGCCCCTCTGATG
ATGAAATCTCCAAAAATCCAGATGGGAAATTCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAGACTTCGGAAGCAATCTTACGAG
GCCAGGTATGCTTCTCTTACGAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAGTAACATTTTAGAACA
GGACGCAGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGGTACTTTCAGGAGGTGTTGAAATCAAGTAAAGGGGCAATTCTTTTTAATG
TGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGACGAAATGCTTAAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATT
ATTAGTTGTGCTAGGTTGTGCTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATCGA
TGCCTATGGACGTGCTGGTAATGTTGACTTGGCTTTCGGCTTGTATGACCGTGCAAGAACGGAAAACTGGCGTATTGATCCTGCAACATTCTCAACGTTGATCAAAATTC
ATGGAGTGGCTGGGAATTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAATTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATATACAAAGAGATGATTAAAAATGGGTTTTCACCAAGTTGGGCAACTTATGCTTCCCTTTTACGTGCCTACGGAAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAATGTTATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCT
ATATCAATGAAGCCGTTGAAATTTTTAAAGACATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCTTGCAGTGGAAAA
GTATCAGAGGCAGAGGAAATGTTAAACGAGATGGTGGAATCCGGTTTTGATCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTAGTGAGGACATTTAATCTACTGATAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATCCGAAACTCGGTTATGTGGTTAGACTTTTGCTAGGGGAACAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCTGAA
TTACTTAGTGTTGTCAGTGCCGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCATGCGAACTCCTGGATTTGGG
GCTTATGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTTAAGGGCCTTTCTCTTGGGGCTGCTCTCACTGCTTTACACGTTTGGA
TAAATGACTTAACGAAGGTACTTGAATCCGGGGAGGAACTTCCACCATTACTTGGAATTAATACTGGACATGGAAAACACAAATATTCTGATAAGGGTTTAGCAAGCGTT
TTTGAATCACATTTAAAGGAATTAAATGCTCCATTCCATGAGGCTCCCGAAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCTGCAAAATCATGGTTGGAGTCTAGAGG
ATTAGAAGAGTTAAGGCCTCTTGCAGAGGAATTGTTACTTGTTAGGCTCTTGAGTCCTAACAATTTTGTATCCAGCTTTGTCTTCAGTGTGTCCGTAGGAAAGAAGGTGG
ATGGTAGGTTTGAAGAAGTGGAACTGCCATCAGGAATGTTGGAAGAAGTTCAGAGGATACTGGTGATAGAGAAGACCCTAGTGGCCTCGATGGAGGGGTGTCACGCAGTT
GAAATTTCCCAAAGGTGTGTAATACGGATGGCCATGGGACTATGTAGTTCTGGCACTGGAGATCGACTCTCGAAAACTTGCTATCAAGGTCGTGAAGAAGCAGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACATTCTTCACCGACCACCATTACCTATGCAATTCTTCCAATTCTCAACGCAAAACAACTCTCTGCAAGTTCTCTCACCG
TTTCAAGCTCAATCCCATACCTCGCCACCCAAAAACATTCCTCCAAATTACCAATGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCAAGCCCCTCTGATG
ATGAAATCTCCAAAAATCCAGATGGGAAATTCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAGACTTCGGAAGCAATCTTACGAG
GCCAGGTATGCTTCTCTTACGAGAATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAGTAACATTTTAGAACA
GGACGCAGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGGTACTTTCAGGAGGTGTTGAAATCAAGTAAAGGGGCAATTCTTTTTAATG
TGACACTGAAGGTGTTTAGGAAGTGCAGAGATATGGAGGGTGCAGAGAAACTGTTCGACGAAATGCTTAAGAGAGGAGTTAAGCCTGATAATGTGACATTTTCTACAATT
ATTAGTTGTGCTAGGTTGTGCTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATCGA
TGCCTATGGACGTGCTGGTAATGTTGACTTGGCTTTCGGCTTGTATGACCGTGCAAGAACGGAAAACTGGCGTATTGATCCTGCAACATTCTCAACGTTGATCAAAATTC
ATGGAGTGGCTGGGAATTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAATTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATATACAAAGAGATGATTAAAAATGGGTTTTCACCAAGTTGGGCAACTTATGCTTCCCTTTTACGTGCCTACGGAAGAGC
CAGATATGGTGAGGATGCTCTCCTTGTGTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAATGTTATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCT
ATATCAATGAAGCCGTTGAAATTTTTAAAGACATGAAGAGTTCTGGGACTTGCTCACCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCTTGCAGTGGAAAA
GTATCAGAGGCAGAGGAAATGTTAAACGAGATGGTGGAATCCGGTTTTGATCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTAGTGAGGACATTTAATCTACTGATAGAGTTGGGATTAACTCCAGATGATCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACGCCAAAAGAGGAACTTA
GTAAGCTGATTGATTGTGTTGAGAGAGCTAATCCGAAACTCGGTTATGTGGTTAGACTTTTGCTAGGGGAACAAGACAAGGAAGGAGATTTCAGAACTGAAGCCTCTGAA
TTACTTAGTGTTGTCAGTGCCGATGTGAGAAAAGCCTACTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCATGCGAACTCCTGGATTTGGG
GCTTATGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTTAAGGGCCTTTCTCTTGGGGCTGCTCTCACTGCTTTACACGTTTGGA
TAAATGACTTAACGAAGGTACTTGAATCCGGGGAGGAACTTCCACCATTACTTGGAATTAATACTGGACATGGAAAACACAAATATTCTGATAAGGGTTTAGCAAGCGTT
TTTGAATCACATTTAAAGGAATTAAATGCTCCATTCCATGAGGCTCCCGAAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCTGCAAAATCATGGTTGGAGTCTAGAGG
ATTAGAAGAGTTAAGGCCTCTTGCAGAGGAATTGTTACTTGTTAGGCTCTTGAGTCCTAACAATTTTGTATCCAGCTTTGTCTTCAGTGTGTCCGTAGGAAAGAAGGTGG
ATGGTAGGTTTGAAGAAGTGGAACTGCCATCAGGAATGTTGGAAGAAGTTCAGAGGATACTGGTGATAGAGAAGACCCTAGTGGCCTCGATGGAGGGGTGTCACGCAGTT
GAAATTTCCCAAAGGTGTGTAATACGGATGGCCATGGGACTATGTAGTTCTGGCACTGGAGATCGACTCTCGAAAACTTGCTATCAAGGTCGTGAAGAAGCAGTGTAA
Protein sequenceShow/hide protein sequence
MAFQLCHSPSTFFTDHHYLCNSSNSQRKTTLCKFSHRFKLNPIPRHPKTFLQITNVSLQEYAPQETQNPSPSDDEISKNPDGKFGSSSKSSVWVNPRSPRASRLRKQSYE
ARYASLTRISESLDSCNPCEEDVADVLKVIGSNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKGAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGVKPDNVTFSTI
ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFGLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMG
RAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRARYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFKDMKSSGTCSPDSWTFSSMITIYSCSGK
VSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVRTFNLLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGYVVRLLLGEQDKEGDFRTEASE
LLSVVSADVRKAYCNCLIDLCVNLDLLDKACELLDLGLMLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV
FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGLEELRPLAEELLLVRLLSPNNFVSSFVFSVSVGKKVDGRFEEVELPSGMLEEVQRILVIEKTLVASMEGCHAV
EISQRCVIRMAMGLCSSGTGDRLSKTCYQGREEAV