; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014598 (gene) of Snake gourd v1 genome

Gene IDTan0014598
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG05:65491777..65494273
RNA-Seq ExpressionTan0014598
SyntenyTan0014598
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0031425 - chloroplast RNA processing (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0009941 - chloroplast envelope (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583722.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0088.07Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDR-FKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSP
        MAFQL H PSTFFTDH+SL+       K TL KSS R FKLNPIP  SK FLQIT+VS QEYAPQET+NPSPSDDEISK  DGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDR-FKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE
        RASKLRKQSYEARYASL +ISESLDSCNPCE+DVADVLK I + ILEQDA+ VLNNMSNSQTALL LRYFQ+VLKSSK+A+ +NVTLKVFRKCRD EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEML+RG+KPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPD++TYS MIDAYGRAGNVD+AFSLYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIYKEM KNGFSPSWATYASLLRAY R+RY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCG
        ADVGY+NEA+E+F+DMK SGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVV+TF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLGFVV+LLLGE+D EGDFRTEASEL SVVS DVR+AYCNCLIDLCVNLDLLDKACE+LDLGL++QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        SPTQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVS
        ELV+
Subjt:  ELVS

XP_004139516.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis sativus]0.0e+0089.9Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH LS+ L  QRK TL  SS  FKL+PIPR SK FLQIT+VSLQE+APQ+TQN  PS DEISK  D KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL R+SESLDS NPCE DVADVLKVIGNNILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RG+KPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK SGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVV+TFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLGFVV LLLGEQD+EG+FRTEASEL SVVSADVR+AYCNCLIDLCVNLDLLDKACE+LDLGLTLQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

XP_008464281.1 PREDICTED: pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Cucumis melo]0.0e+0091.32Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HHSLS+ L  QRK TL  SS  FKLNPIPR S  FLQIT++SLQE++PQET N  PSDDEISK SD KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RG+KPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVV+TFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLGFVV LLLGEQD+EG+FRTEASEL SVVSADVR+AYCNCLIDLCVNLDLLDKACE+L+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

XP_022142513.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Momordica charantia]0.0e+0091.18Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH LS+ LNSQ + TL KSS RFKLNP P  SKT L+IT+VSLQEYA QE QNP P+ DE SK  DGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLR QSYEARYASLTRISESLDSCNPCEEDVADVLK +G+NILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK+AIL+NVTLKV RK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRG+KPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIFEDMK SG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVV+TF+RL+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLG+VV+LLLGEQD+EGD RTEASELLSVVSADVR+AYCNCLIDLCVNLDLL+KACE+LDLGLTLQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

XP_038877791.1 pentatricopeptide repeat-containing protein At4g16390, chloroplastic [Benincasa hispida]0.0e+0093.03Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFFTDHHSLS+ L SQRK TL  SS  FKLNPIPR SK FLQIT+VSLQEYAPQET NPSPS+DEISK  DGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASLTRISESLDSCNPC+EDVADVLK IG+NIL+QDAVVVLNNMSNSQTALLALRYFQ+VLKSSK+AI +NVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EMLKRG+KPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+ EAVE+F+DMK SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVE+GFDPNIFVLTSLIQCYGKAKRVDDVV+TF+RLIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCV RANPKLGFVV+LL+GEQD+EGDFRTEASEL SVVSADVR+AYCNCLIDLCVNLDLLDKACE+LDLGLTLQ+Y DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGA LTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

TrEMBL top hitse value%identityAlignment
A0A0A0LVP1 Smr domain-containing protein0.0e+0089.9Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLC+SP TFFT+HH LS+ L  QRK TL  SS  FKL+PIPR SK FLQIT+VSLQE+APQ+TQN  PS DEISK  D KSGSSS SSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL R+SESLDS NPCE DVADVLKVIGNNILE+DA++VLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EM+ RG+KPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYN LLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK SGTCSPDSWTFSSMITIYSC GKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVV+TFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPK EL KLIDCV RANPKLGFVV LLLGEQD+EG+FRTEASEL SVVSADVR+AYCNCLIDLCVNLDLLDKACE+LDLGLTLQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWI DLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

A0A1S3CL39 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.32Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HHSLS+ L  QRK TL  SS  FKLNPIPR S  FLQIT++SLQE++PQET N  PSDDEISK SD KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RG+KPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVV+TFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLGFVV LLLGEQD+EG+FRTEASEL SVVSADVR+AYCNCLIDLCVNLDLLDKACE+L+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

A0A5A7TLM5 Pentatricopeptide repeat-containing protein0.0e+0091.32Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSP TFFT HHSLS+ L  QRK TL  SS  FKLNPIPR S  FLQIT++SLQE++PQET N  PSDDEISK SD KSGSSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLRKQSYEARYASL RISESLDSCNPCE DVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQ++LKSSK+ I +NVTLKVFRKCRDMEGAE+
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LF+EML RG+KPDNVTFSTIISCARLCSLP+KAVEWFEKMPSFDCNPDDVTYS MIDAYGRAGNVD+AFSLYDRARTENWRIDPATFST+IKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIK+GFSPSWATYASLLRAYGR+RYGEDAL+VYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIF+DMK SGTCSPDSWTFSSMITIYSCSGKVSEAEEMLN+MVE+GFDPNIFVLTSLIQCYGKAKRVDDVV+TFN+LIELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPKEE+SKLIDCV RANPKLGFVV LLLGEQD+EG+FRTEASEL SVVSADVR+AYCNCLIDLCVNLDLLDKACE+L+LGLTLQIY DLQSRS
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

A0A6J1CNE5 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0091.18Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR
        MAFQLCHSPSTFF+DHH LS+ LNSQ + TL KSS RFKLNP P  SKT L+IT+VSLQEYA QE QNP P+ DE SK  DGKS SSSKSSVWVNPRSPR
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPR

Query:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK
        ASKLR QSYEARYASLTRISESLDSCNPCEEDVADVLK +G+NILEQDAV VLNNMSNS TALLAL+ FQ+VLKSSK+AIL+NVTLKV RK RDMEGAEK
Subjt:  ASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEK

Query:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
        LFDEMLKRG+KPDNVTFST+ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVD+AFSLYDRARTENWRIDPATFSTLIKIHGVAGNY
Subjt:  LFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNY

Query:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
        DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGR+RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA
Subjt:  DGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCA

Query:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC
        DVGY+NEAVEIFEDMK SG CSPDSWTFSSMITIYSCSGKVSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGK KRVDDVV+TF+RL+ELGLTPDDRFCGC
Subjt:  DVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGC

Query:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS
        LLNVITQTPKEELSKLIDCVERAN KLG+VV+LLLGEQD+EGD RTEASELLSVVSADVR+AYCNCLIDLCVNLDLL+KACE+LDLGLTLQIYT LQS S
Subjt:  LLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRS

Query:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
        PTQWSL+LKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE
Subjt:  PTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPE

Query:  LVS
        LV+
Subjt:  LVS

A0A6J1IDL0 pentatricopeptide repeat-containing protein At4g16390, chloroplastic0.0e+0087.64Show/hide
Query:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDR-FKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSP
        MAFQL H P TFFTDH+SL+       K TL KSS R FKLNPIP  SK FLQIT+VS QEYAPQET+NPSPSDDEISK  DGKSGSSSK+SVWVNP SP
Subjt:  MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDR-FKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSP

Query:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE
        RASKLRKQSYEARYASL +ISESLDSCNPCE+DVADVLK I   ILEQDA+ VLNNMSNSQTALL LRYFQ+VLKSSK+A+ +NVTLKVFRKC+D EGAE
Subjt:  RASKLRKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAE

Query:  KLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGN
        KLFDEMLKRG+KPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDD+TYS MIDAYGRAGNVD+AFSLYDRARTENWRID +TFST+IKIHGVAGN
Subjt:  KLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGN

Query:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC
        YDGCLNVYEEMKA+GIKPNL IYNSLL AMGRAKRPWQIKTIY+EMIKNGFSPSWATYASLLRAY R+RY ED +LVYKEMKEKGLQLNVILYNTLLAMC
Subjt:  YDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMC

Query:  ADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCG
        ADVGY+NEA+E+F+DMK SGTCSPDSWTFSSMITIYSCSG VSEAEEMLNEM+E+GFDPNIFVLTSLIQCYGKAKRVDDVV+TF+RL+ELGLTPDDRFCG
Subjt:  ADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCG

Query:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSR
        CLLNVITQTPK ELSKLIDCVERANPKLGF+VRLL+GE+D EGDFRTEASEL SVVS DVR+AYCNCLIDLCVNLDLLDKACE+LDLGL++QIYTDLQSR
Subjt:  CLLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSR

Query:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP
        S TQWSLYLKGLSLGAALTALHVWINDLTK L+SGEELPPLLGINTGHGKHKYSDKGL+SVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESR SP
Subjt:  SPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSP

Query:  ELVS
        ELV+
Subjt:  ELVS

SwissProt top hitse value%identityAlignment
B4F8Z1 Pentatricopeptide repeat-containing protein ATP4, chloroplastic7.2e-20753.39Show/hide
Query:  LCHSPSTFFTD--HHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRAS
        LC SPS+      H  +S+  N +  +           +P+            VS+QE  PQ  Q+PSP  D  + N    S SS+   +WVNP SPRA+
Subjt:  LCHSPSTFFTD--HHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRAS

Query:  KL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-VIGNNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGA
         + R ++   R A L   + +L +C   E  V   L+        EQDAV+VLN    + ++TA+LALR+F    K  K+ IL+NV LK+ RK R     
Subjt:  KL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK-VIGNNILEQDAVVVLNN--MSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGA

Query:  EKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAG
        E L+ EML+ G++PDN TFST+ISCAR C L +KAVEWF+KMP F C+PD +TYSA+IDAYG AGN + A  LYDRAR E W++DP   ST+IK+H  +G
Subjt:  EKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAG

Query:  NYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAM
        N+DG LNV+EEMKAIG++PNLV+YN++LDAMGRA RPW +KTI++EM+     PS ATY  LL AY R+RYGEDA+ VY+ MK++ + ++V+LYN LL+M
Subjt:  NYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAM

Query:  CADVGYINEAVEIFEDMKRS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDR
        CAD+GY++EA EIF DMK S      PDSW++SSM+T+YS +  V  AE +LNEMVE+GF PNIFVLTSLI+CYGK  R DDVV++F  L +LG+ PDDR
Subjt:  CADVGYINEAVEIFEDMKRS--GTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDR

Query:  FCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDL
        FCGCLL+V   TP EEL K+I C+ER+N +LG VV+LL+     E  FR  A ELL      V+  YCNCL+DLCVNL+ ++KAC +LD    L IY ++
Subjt:  FCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDL

Query:  QSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLES
        Q+R+ TQWSL+L+GLS+GAALT LHVW+NDL   L++G E LPPLLGI+TG GK+ YSD+GLA++FE+HLKEL+APFHEAP+K GWFLTT VAAK WLES
Subjt:  QSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEE-LPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLES

Query:  RGSPELVS
        + + ELV+
Subjt:  RGSPELVS

Q10PZ4 Pentatricopeptide repeat-containing protein ATP4 homolog, chloroplastic2.5e-20755.16Show/hide
Query:  NPIPRPSKTFLQITSVSLQEYAPQETQ-NPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK
        NP P P+        VS+Q+  P  +  NPSP          G+S ++S+  VWVNP SPRA+ L R ++   R A L   + +L +C   E  VA  L+
Subjt:  NPIPRPSKTFLQITSVSLQEYAPQETQ-NPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRASKL-RKQSYEARYASLTRISESLDSCNPCEEDVADVLK

Query:  -VIGNNILEQDAVVVLNNMSNSQTA-LLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEW
                EQDAV+VLN  S    A +LAL +F    +  KE IL+NV LK  RK R    AE L++EML+ G++PDN TFST+ISCAR C +P KAVEW
Subjt:  -VIGNNILEQDAVVVLNNMSNSQTA-LLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEW

Query:  FEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPW
        FEKMP F C+PD +TYSA+IDAYGRAG+ + A  LYDRAR E W++DP   +T+I++H  +GN+DG LNV+EEMKA G+KPNLV+YN++LDAMGRA RPW
Subjt:  FEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPW

Query:  QIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRS--GTCSPDSWTFSSMITI
         +KTI++E++     P+ ATY  LL AY R+RYGEDA+ VY+ MK++ + ++V+LYN LL+MCAD+GY+ EA EIF DMK S      PDSW++SSM+T+
Subjt:  QIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRS--GTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVRLL
        YSC+G V+ AE +LNEMVE+GF PNIF+LTSLI+CYGKA R DDVV++F  L +LG+TPDDRFCGCLL V   TP +EL K+I C++R++ +LG VVRLL
Subjt:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVRLL

Query:  LGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESG
        +         R  A ELL      VR  YCNCL+DL VNL  ++KAC +LD+ L L IY+++Q+R+ TQWSL+L+GLS+GAALT LHVW++DL   L++G
Subjt:  LGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESG

Query:  EELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVS
        +ELPPLLGI+TG GK+ YS KGLA+VFESHLKEL+APFHEAP+K GWFLTT VAA+ WLE++ S ELV+
Subjt:  EELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVS

Q8GWE0 Pentatricopeptide repeat-containing protein At4g16390, chloroplastic4.6e-27065.48Show/hide
Query:  LCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRASKL
        LC SPS+   D   L + L+   K+T       +  N     S+  LQ T VS+QE  PQ     S     +  +      ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+E IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A SLYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGR+RYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK   TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVV+TF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQD-QEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV++AY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQD-QEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Query:  S
        S
Subjt:  S

Q9LS25 Pentatricopeptide repeat-containing protein At5g46580, chloroplastic3.2e-13037.57Show/hide
Query:  AFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSD-----DEISKNSDGKSGSSSKSSVWVNP
        A  +C +P    T  HSL        K +L++ S   KLN       +   +      E  P  T+ PS S+        +   +     S   SVWVNP
Subjt:  AFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSD-----DEISKNSDGKSGSSSKSSVWVNP

Query:  RSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKV
          P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I +     +A++VLN++   Q       + +       E I +NVT+K 
Subjt:  RSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKV

Query:  FRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFS
         R  R  +  E++  EM+K G++ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D   FS
Subjt:  FRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFS

Query:  TLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLN
         L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG++R+  DAL +++EMK K   ++
Subjt:  TLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLN

Query:  VILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIE
         ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DDVV  F+  I+
Subjt:  VILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIE

Query:  LGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLG
         G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + RR +CNCLID+C   +  ++A E+L LG
Subjt:  LGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLG

Query:  LTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKV
            +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ ++ G F+ TK 
Subjt:  LTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKV

Query:  AAKSWLESRGSPELVS
           SWLES+  P + S
Subjt:  AAKSWLESRGSPELVS

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic5.8e-4724.13Show/hide
Query:  EISKNSDGKSGSSSKSSVWVNPRSPRASKLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--NNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A ++ + ++   Y +      +L S    +   E+   V   +K  G   N++  +AV+        +   +A 
Subjt:  EISKNSDGKSGSSSKSSVWVNPRSPRASKLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--NNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++    + I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
         D A +L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV + ++M + G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GFVVRLLLGEQDQE
        E+  +  I +   L           C    + +  +++ F ++ +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GFVVRLLLGEQDQE

Query:  GDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLL
                + ++ +      A+ N L D+  +      A  V   G + Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +L
Subjt:  GDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLL

Query:  GINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
         I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL    + +L+
Subjt:  GINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Arabidopsis top hitse value%identityAlignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein6.4e-4123.63Show/hide
Query:  DVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAV
        + L+ +G  I    A  VL  M++   AL    + +       +   +   +    + +      KL DEM++ G +P+ VT++ +I      +  N+A+
Subjt:  DVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAV

Query:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR
          F +M    C PD VTY  +ID + +AG +D+A  +Y R +      D  T+S +I   G AG+      ++ EM   G  PNLV YN ++D   +A+ 
Subjt:  EWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKR

Query:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITI
              +Y++M   GF P   TY+ ++   G   Y E+A  V+ EM++K    +  +Y  L+ +    G + +A + ++ M  +G   P+  T +S+++ 
Subjt:  PWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITI

Query:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVR
        +    K++EA E+L  M+  G  P++   T L+ C   G++K            +++G      FCG L+                     +P   F+++
Subjt:  YSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCY--GKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVR

Query:  LLLGEQDQEGDFRTEASELLSVVSADVR---RAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT
        +     D E + R  A+  L ++ ++ R   R   + ++D        ++A  V ++     ++ D L+ +S + W + L  +S G A+TAL   +    
Subjt:  LLLGEQDQEGDFRTEASELLSVVSADVR---RAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTD-LQSRSPTQWSLYLKGLSLGAALTALHVWINDLT

Query:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL
        K + +    P  + I TG G+         +    E  L    +PF       G F+ +      WL
Subjt:  KVLESGEELPPLLGINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWL

AT2G31400.1 genomes uncoupled 14.1e-4824.13Show/hide
Query:  EISKNSDGKSGSSSKSSVWVNPRSPRASKLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--NNILEQDAVVVLNNMSNSQTALLAL
        E  KN  GK  S+  S++    +   A ++ + ++   Y +      +L S    +   E+   V   +K  G   N++  +AV+        +   +A 
Subjt:  EISKNSDGKSGSSSKSSVWVNPRSPRASKLRKQSYEARYASLTRISESLDSC---NPCEEDVADV---LKVIG--NNILEQDAVVVLNNMSNSQTALLAL

Query:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN
        ++F E+ ++    + I FN  L V  +    E A  LFDEM  R ++ D  +++T++         + A E   +MP     P+ V+YS +ID + +AG 
Subjt:  RYFQEVLKS--SKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGN

Query:  VDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY
         D A +L+   R     +D  +++TL+ I+   G  +  L++  EM ++GIK ++V YN+LL   G+  +  ++K ++ EM +    P+  TY++L+  Y
Subjt:  VDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAY

Query:  GRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV
         +    ++A+ +++E K  GL+ +V+LY+ L+      G +  AV + ++M + G  SP+  T++S+I              YS  G +  +   L+ + 
Subjt:  GRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITI------------YSCSGKVSEAEEMLNEMV

Query:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GFVVRLLLGEQDQE
        E+  +  I +   L           C    + +  +++ F ++ +L + P+      +LN  ++    E+ S L++ +   + K+ G V  LL+G+++  
Subjt:  ESGFDPNIFVLTSLI---------QCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPK-EELSKLIDCVERANPKL-GFVVRLLLGEQDQE

Query:  GDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLL
                + ++ +      A+ N L D+  +      A  V   G + Q++ ++ S S     L L  +S GAA   +H W+ ++  ++  G ELP +L
Subjt:  GDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLL

Query:  GINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
         I TG GKH     D  L    E  L+ ++APFH +   +G F ++     +WL    + +L+
Subjt:  GINTGHGKHK--YSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

AT4G16390.1 pentatricopeptide (PPR) repeat-containing protein3.3e-27165.48Show/hide
Query:  LCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRASKL
        LC SPS+   D   L + L+   K+T       +  N     S+  LQ T VS+QE  PQ     S     +  +      ++SKS VWVNP+SPRAS+L
Subjt:  LCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRASKL

Query:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE
        R++SY++RY+SL +++ESLD+C P E DV DV+   G  + EQDAVV LNNM+N +TA L L    E +K S+E IL+NVT+KVFRK +D+E +EKLFDE
Subjt:  RKQSYEARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDE

Query:  MLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL
        ML+RG+KPDN TF+TIISCAR   +P +AVEWFEKM SF C PD+VT +AMIDAYGRAGNVD+A SLYDRARTE WRID  TFSTLI+I+GV+GNYDGCL
Subjt:  MLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCL

Query:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY
        N+YEEMKA+G+KPNLVIYN L+D+MGRAKRPWQ K IYK++I NGF+P+W+TYA+L+RAYGR+RYG+DAL +Y+EMKEKGL L VILYNTLL+MCAD  Y
Subjt:  NVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGY

Query:  INEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNV
        ++EA EIF+DMK   TC PDSWTFSS+IT+Y+CSG+VSEAE  L +M E+GF+P +FVLTS+IQCYGKAK+VDDVV+TF++++ELG+TPDDRFCGCLLNV
Subjt:  INEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNV

Query:  ITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQD-QEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQ
        +TQTP EE+ KLI CVE+A PKLG VV++L+ EQ+ +EG F+ EASEL+  + +DV++AY NCLIDLCVNL+ L++ACE+L LGL   IYT LQS+S TQ
Subjt:  ITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQD-QEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQ

Query:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV
        WSL+LK LSLGAALTALHVW+NDL++  LESGEE PPLLGINTGHGKHKYSDKGLA+VFESHLKELNAPFHEAP+KVGWFLTT VAAK+WLESR S   V
Subjt:  WSLYLKGLSLGAALTALHVWINDLTK-VLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELV

Query:  S
        S
Subjt:  S

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein9.8e-4229.12Show/hide
Query:  FQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNV-DL
        +Q +L +S  AI+ ++  K  R    +  A  +F+ + + G   D  +++++IS         +AV  F+KM    C P  +TY+ +++ +G+ G   + 
Subjt:  FQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNV-DL

Query:  AFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRS
          SL ++ +++    D  T++TLI        +     V+EEMKA G   + V YN+LLD  G++ RP +   +  EM+ NGFSPS  TY SL+ AY R 
Subjt:  AFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRS

Query:  RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLI
           ++A+ +  +M EKG + +V  Y TLL+     G +  A+ IFE+M+ +G C P+  TF++ I +Y   GK +E  ++ +E+   G  P+I    +L+
Subjt:  RYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLI

Query:  QCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQ
          +G+     +V   F  +   G  P+      L++  ++
Subjt:  QCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQ

AT5G46580.1 pentatricopeptide (PPR) repeat-containing protein2.3e-13137.57Show/hide
Query:  AFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSD-----DEISKNSDGKSGSSSKSSVWVNP
        A  +C +P    T  HSL        K +L++ S   KLN       +   +      E  P  T+ PS S+        +   +     S   SVWVNP
Subjt:  AFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSD-----DEISKNSDGKSGSSSKSSVWVNP

Query:  RSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKV
          P+ S L  Q       SY  +   L   +  L+S    E+ +   +L  I +     +A++VLN++   Q       + +       E I +NVT+K 
Subjt:  RSPRASKLRKQ-------SYEARYASLTRISESLDSCNPCEE-DVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKV

Query:  FRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFS
         R  R  +  E++  EM+K G++ DN+T+STII+CA+ C+L NKA+EWFE+M      PD+VTYSA++D Y ++G V+   SLY+RA    W+ D   FS
Subjt:  FRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTIISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFS

Query:  TLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLN
         L K+ G AG+YDG   V +EMK++ +KPN+V+YN+LL+AMGRA +P   ++++ EM++ G +P+  T  +L++ YG++R+  DAL +++EMK K   ++
Subjt:  TLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMGRAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLN

Query:  VILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIE
         ILYNTLL MCAD+G   EA  +F DMK S  C PD++++++M+ IY   GK  +A E+  EM+++G   N+   T L+QC GKAKR+DDVV  F+  I+
Subjt:  VILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGKVSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIE

Query:  LGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLG
         G+ PDDR CGCLL+V+      E+  K++ C+ERAN KL   V L++ E+ +    + E   +++    + RR +CNCLID+C   +  ++A E+L LG
Subjt:  LGLTPDDRFCGCLLNVITQ-TPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASELLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLG

Query:  LTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKV
            +Y  L +++  +WSL ++ LS+GAA TAL  W+  L  +++  EELP L    TG G H++S +GLA+ F  HL++L+APF ++ ++ G F+ TK 
Subjt:  LTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASVFESHLKELNAPFHEAPEKVGWFLTTKV

Query:  AAKSWLESRGSPELVS
           SWLES+  P + S
Subjt:  AAKSWLESRGSPELVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCCAGCTCTGCCATTCGCCGTCCACCTTCTTCACCGACCACCATTCTCTCAGCAGTTGTCTAAATTCTCAACGCAAAGCAACTCTCTACAAGTCCTCTGACCG
TTTCAAGCTCAATCCCATACCTCGCCCCTCAAAAACATTCCTCCAAATTACCAGTGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCGAGCCCCTCTGACG
ATGAAATCTCGAAAAATTCAGATGGGAAATCCGGTTCCTCGTCCAAAAGCTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAAACTTCGGAAGCAATCGTACGAG
GCCAGATATGCTTCTCTTACGAGGATATCGGAGTCTTTGGACTCTTGTAATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAATAACATTTTAGAACA
GGACGCTGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTGCTTGCTCTTCGGTACTTTCAGGAGGTTTTGAAATCAAGTAAAGAGGCGATTCTTTTTAATG
TGACACTGAAGGTGTTTAGGAAGTGTAGAGATATGGAGGGTGCAGAGAAACTGTTCGACGAAATGCTTAAGAGAGGACTCAAACCTGATAATGTGACATTTTCTACGATT
ATTAGTTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTTGAGTGGTTTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATTGA
TGCCTATGGACGTGCTGGTAATGTTGATCTGGCTTTCAGCTTGTATGACCGTGCAAGAACAGAAAACTGGCGTATTGATCCTGCAACATTCTCGACGTTGATCAAAATTC
ATGGAGTGGCAGGAAACTATGATGGGTGCTTGAATGTGTATGAAGAAATGAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACAGCTTGCTGGATGCTATGGGT
AGGGCTAAAAGACCCTGGCAGATCAAGACCATTTACAAAGAGATGATTAAAAATGGATTTTCACCAAGTTGGGCGACTTATGCTTCTCTTTTACGCGCCTATGGGAGATC
CAGATATGGTGAGGATGCTCTTCTTGTGTACAAGGAGATGAAGGAAAAGGGACTGCAGTTAAATGTAATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCT
ACATTAATGAGGCTGTTGAAATTTTTGAAGATATGAAGAGATCTGGGACTTGCTCCCCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCCTGCAGTGGAAAA
GTATCCGAGGCGGAGGAAATGTTGAACGAGATGGTGGAATCCGGTTTCGACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGA
TGATGTAGTGAAGACATTTAATCGACTAATAGAGTTGGGATTAACTCCAGACGACCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACTCCAAAAGAGGAACTTA
GCAAGCTGATTGATTGTGTTGAGAGAGCTAATCCAAAACTTGGTTTTGTGGTTAGACTTTTGCTAGGGGAACAAGACCAAGAAGGAGATTTCAGAACTGAAGCCTCAGAA
TTACTTAGTGTTGTCAGTGCTGATGTGAGAAGAGCCTATTGCAATTGCTTAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCGTGTGAAGTACTGGATTTGGG
GCTTACGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAGTGGTCTCTATATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTTACTGCATTACACGTTTGGA
TAAATGACTTAACAAAGGTACTTGAATCCGGGGAGGAACTTCCACCATTACTTGGAATAAATACTGGACATGGAAAACACAAATATTCAGATAAGGGTTTGGCAAGCGTC
TTTGAATCACATTTAAAGGAATTAAATGCTCCATTCCATGAGGCTCCAGAAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCAGCAAAATCCTGGTTGGAATCTAGAGG
TTCACCTGAATTAGTTTCGACATAG
mRNA sequenceShow/hide mRNA sequence
GTTGAAACTTGAAAGTATCCATAAAAATTCGTCTCAATGATATTTTTTTCGGAAGAAAACGCCCTTTAGTTTTAAATAGGATATTTTAATGTTAAACTCCAGCCGATAAG
GCGGACGAGCCACTCAACCGGAAGCCAAAAGAACTCATCGGAGAGGCAATCAGTCACCGGAATGGCTTTCCAGCTCTGCCATTCGCCGTCCACCTTCTTCACCGACCACC
ATTCTCTCAGCAGTTGTCTAAATTCTCAACGCAAAGCAACTCTCTACAAGTCCTCTGACCGTTTCAAGCTCAATCCCATACCTCGCCCCTCAAAAACATTCCTCCAAATT
ACCAGTGTCTCGCTACAGGAATACGCTCCTCAAGAAACCCAGAATCCGAGCCCCTCTGACGATGAAATCTCGAAAAATTCAGATGGGAAATCCGGTTCCTCGTCCAAAAG
CTCCGTTTGGGTCAATCCCAGAAGCCCCAGAGCTTCGAAACTTCGGAAGCAATCGTACGAGGCCAGATATGCTTCTCTTACGAGGATATCGGAGTCTTTGGACTCTTGTA
ATCCATGTGAGGAAGATGTTGCTGATGTCTTGAAGGTGATAGGTAATAACATTTTAGAACAGGACGCTGTTGTAGTGCTGAATAACATGTCGAATTCCCAAACTGCGTTG
CTTGCTCTTCGGTACTTTCAGGAGGTTTTGAAATCAAGTAAAGAGGCGATTCTTTTTAATGTGACACTGAAGGTGTTTAGGAAGTGTAGAGATATGGAGGGTGCAGAGAA
ACTGTTCGACGAAATGCTTAAGAGAGGACTCAAACCTGATAATGTGACATTTTCTACGATTATTAGTTGTGCTAGGTTGTGTTCGTTGCCAAATAAGGCTGTTGAGTGGT
TTGAGAAGATGCCAAGTTTTGACTGTAATCCTGATGATGTCACTTACTCTGCGATGATTGATGCCTATGGACGTGCTGGTAATGTTGATCTGGCTTTCAGCTTGTATGAC
CGTGCAAGAACAGAAAACTGGCGTATTGATCCTGCAACATTCTCGACGTTGATCAAAATTCATGGAGTGGCAGGAAACTATGATGGGTGCTTGAATGTGTATGAAGAAAT
GAAGGCTATAGGCATCAAGCCAAACTTGGTTATATATAACAGCTTGCTGGATGCTATGGGTAGGGCTAAAAGACCCTGGCAGATCAAGACCATTTACAAAGAGATGATTA
AAAATGGATTTTCACCAAGTTGGGCGACTTATGCTTCTCTTTTACGCGCCTATGGGAGATCCAGATATGGTGAGGATGCTCTTCTTGTGTACAAGGAGATGAAGGAAAAG
GGACTGCAGTTAAATGTAATTCTCTACAATACACTTTTAGCTATGTGTGCTGATGTTGGCTACATTAATGAGGCTGTTGAAATTTTTGAAGATATGAAGAGATCTGGGAC
TTGCTCCCCTGACAGTTGGACTTTTTCTTCCATGATCACCATATATTCCTGCAGTGGAAAAGTATCCGAGGCGGAGGAAATGTTGAACGAGATGGTGGAATCCGGTTTCG
ACCCTAATATCTTTGTCTTGACATCACTAATCCAGTGTTATGGGAAAGCCAAACGTGTTGATGATGTAGTGAAGACATTTAATCGACTAATAGAGTTGGGATTAACTCCA
GACGACCGATTCTGTGGCTGTCTTCTCAATGTAATTACCCAGACTCCAAAAGAGGAACTTAGCAAGCTGATTGATTGTGTTGAGAGAGCTAATCCAAAACTTGGTTTTGT
GGTTAGACTTTTGCTAGGGGAACAAGACCAAGAAGGAGATTTCAGAACTGAAGCCTCAGAATTACTTAGTGTTGTCAGTGCTGATGTGAGAAGAGCCTATTGCAATTGCT
TAATTGATCTCTGTGTGAATTTAGATCTTTTGGATAAGGCGTGTGAAGTACTGGATTTGGGGCTTACGCTTCAGATATATACAGATTTGCAGTCCAGGTCTCCAACTCAG
TGGTCTCTATATCTTAAGGGTCTTTCTCTTGGGGCTGCTCTTACTGCATTACACGTTTGGATAAATGACTTAACAAAGGTACTTGAATCCGGGGAGGAACTTCCACCATT
ACTTGGAATAAATACTGGACATGGAAAACACAAATATTCAGATAAGGGTTTGGCAAGCGTCTTTGAATCACATTTAAAGGAATTAAATGCTCCATTCCATGAGGCTCCAG
AAAAGGTTGGGTGGTTTTTGACGACTAAAGTGGCAGCAAAATCCTGGTTGGAATCTAGAGGTTCACCTGAATTAGTTTCGACATAGGTTGTTCCTGAAAGGTTCCATAAT
ATCTGTAGTGATATCTTTTGTTTAAATATTTTTTTCCTTTCTTTTAATCGGCCATATCTTTCTTGAAAGAGAATTTTATGTTAAGGATGCAATGAAGAGTTCTGCTATCT
TTGTTCTTCTGTTGTATTATAGCTCTTTGGAAGGCCAGATCAATATTCAAACACTCTACTTTCTTGTTTACTGTCAA
Protein sequenceShow/hide protein sequence
MAFQLCHSPSTFFTDHHSLSSCLNSQRKATLYKSSDRFKLNPIPRPSKTFLQITSVSLQEYAPQETQNPSPSDDEISKNSDGKSGSSSKSSVWVNPRSPRASKLRKQSYE
ARYASLTRISESLDSCNPCEEDVADVLKVIGNNILEQDAVVVLNNMSNSQTALLALRYFQEVLKSSKEAILFNVTLKVFRKCRDMEGAEKLFDEMLKRGLKPDNVTFSTI
ISCARLCSLPNKAVEWFEKMPSFDCNPDDVTYSAMIDAYGRAGNVDLAFSLYDRARTENWRIDPATFSTLIKIHGVAGNYDGCLNVYEEMKAIGIKPNLVIYNSLLDAMG
RAKRPWQIKTIYKEMIKNGFSPSWATYASLLRAYGRSRYGEDALLVYKEMKEKGLQLNVILYNTLLAMCADVGYINEAVEIFEDMKRSGTCSPDSWTFSSMITIYSCSGK
VSEAEEMLNEMVESGFDPNIFVLTSLIQCYGKAKRVDDVVKTFNRLIELGLTPDDRFCGCLLNVITQTPKEELSKLIDCVERANPKLGFVVRLLLGEQDQEGDFRTEASE
LLSVVSADVRRAYCNCLIDLCVNLDLLDKACEVLDLGLTLQIYTDLQSRSPTQWSLYLKGLSLGAALTALHVWINDLTKVLESGEELPPLLGINTGHGKHKYSDKGLASV
FESHLKELNAPFHEAPEKVGWFLTTKVAAKSWLESRGSPELVST