CuGenDBv2

Sgr028442 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028442
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153145:2255396..2261185
RNA-Seq Expression: Sgr028442
Synteny: Sgr028442
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR005500 - Protein of unknown function DUF309
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR023203 - TTHA0068-like superfamily
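
The genome location above is given in a "contig:start..end" form. As a minimal sketch of how such a string could be split into its parts, the Python snippet below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical helpers introduced here for illustration, and the span calculation assumes 1-based inclusive coordinates, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'contig:start..end' genome location (hypothetical helper)."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'contig:start..end' string into contig name and integer coordinates."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a contig:start..end location: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in: {text!r}")
    return Location(contig, start, end)


if __name__ == "__main__":
    loc = parse_location("tig00153145:2255396..2261185")
    print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene model on the contig, introns included, so it is not the length of the coding sequence.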


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6596202.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e-value: 4.8e-200 | identity: 74.64%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSC SI+HLKQ+HA AIKT S SL  QF +PKLISLS  SSSSPDLFYIRSILL+   DAQF LNLCNA IH I+ NS+G+S +ST+LRAMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG
        ML IG++PD FTLP+VLKALA++Q +REGQQIHA SIK GL+RFNVY   C+T +    + G      +LF     R L                ++AVG
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG

Query:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA
         F+E+CDL LRADGRTLVVVLSACSNLGDLNLGRKVH+YI H+ID+NADVF+GNAL+DMYLKC+DS+SAYK+FDEMP+RNV+TWNAMISGLAYQGRY+EA
Subjt:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA

Query:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS
        LD+FR MQ TGPKPDEVTLV VLNSCANLGVLELGKWVHAYMRRNH + DKFVGNALLDMYAKCGRIDEA RVF+SMKRRDVYSYT MIVG ALHG+A+ 
Subjt:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS

Query:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA
        AF VFS MLR G+EPNEVTFLGLLMACSH GLV++GKKYFFDM NTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSM+I PDA
Subjt:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA

XP_022154280.1 pentatricopeptide repeat-containing protein At1g31430-like [Momordica charantia] | e-value: 1.9e-204 | identity: 76.17%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSCKSI+ LKQIHA AIK AS SLQKQFFYPKLISLSS SSSS DLFYIRSI+L+  DDAQFCL+LCNAII  I  NS+ ++  ST   AMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV--LTGIWCLGPRLFKHLLRRAL----------------QEAVGVF
        ML +G+EPDEFTLPYVLKALA+I+ MREGQQIHARSIK GLLRFNVY  +    +  + G+     +LF     R L                + A+G F
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV--LTGIWCLGPRLFKHLLRRAL----------------QEAVGVF

Query:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD
        L++CDLNLRADGR LVVVLSACSNLGDLNLGRKVH+YIRH+IDMNADVFLGNALIDMYLKCNDS+SAY++F+EMP+RNV+TWNA+ISGLAYQGRYREALD
Subjt:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD

Query:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF
        VFR MQS G KPDEVTLV VLNSCANLGVLELGKWVH YMRRN+ +ADKFVGNALLDMY KCGRIDEA RVFQ MKRRDVYSYT+MIVG ALHGKA+SAF
Subjt:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF

Query:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG
         +FSEM RVGIEPNEVTFLGLLMACSHGGLVAEGKKY FDMSNTY LRPQAEHYGCMIDLLGRAGLVKEAEEII  MQISPD     ALLG
Subjt:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG

XP_022947818.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita moschata] | e-value: 2.6e-198 | identity: 74.43%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSC SI+HLKQ+HA AIKT S SL  Q  +PKLISLSS SS SPDLFYIRSILL+   DAQF LNLCNA IH I+ NS+G+S +ST LRAMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG
        ML IG++PD FTLP+VLKALA+IQ +REGQQIHA SIK GL+RFNVY   C+T +    + G      +LF     R L                ++AVG
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG

Query:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA
         F+E+CDL LRADGRTLVVVLSACSNLGDLNLGRKVH+YI H+ID+NADVF+GNAL+DMYLKC+DS+SAYK+FDEMP+RNV+TWNAMI GLAYQGRY+EA
Subjt:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA

Query:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS
        LD+FR MQ TGPKPDEVTLV VLNSCANLGVLELGKWVHAYMRRNH +ADKFVGNALLDMYAKCGRIDEA RVF+ MKRRDVYSYT MIVG ALHG+A+ 
Subjt:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS

Query:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA
        AF VFS MLR G+EPNEVTFLGLLMACSH GLV++GKK FFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSM+I PDA
Subjt:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA

XP_022971561.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita maxima] | e-value: 1.1e-196 | identity: 73.71%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSC SI+HLKQ+HA AIKT S SL  QF + KLISLS  SSSSPDLFYIRSILL+ L DAQF LNLCNA IH I+ NS+G+S +ST LRAMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNV---------YSRSCSTNVLTGIWCLGPR--------LFKHLLRRAL-QEAVGVF
        ML IG++PD FTLP+VLKALA++Q +REGQQIHA SIK GL+RFNV         YS   S + +  ++   P         L +   +  L ++AVG F
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNV---------YSRSCSTNVLTGIWCLGPR--------LFKHLLRRAL-QEAVGVF

Query:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD
        +E+CDL LRADGRTLVVVLSACSNLGDLNLGRK+H+YI H+ID+N DVF+GNAL+DMYLKC+DS+SAYK+FDEMP+RNV+TWNAMI GLAYQGRY+EALD
Subjt:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD

Query:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF
        +FR MQ TGPKPDEVTLV VLNSCANLGVLELG+WVHAYMRRN+ +ADKFVGNALLDMYAKCG IDEA RVF+SMKRRDVYSYT MIVG ALHG+A+ AF
Subjt:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF

Query:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA
         VFS MLR G+EPNEVTFLGLLMACSH GLV++GKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSM+I PDA
Subjt:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA

XP_023521817.1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 [Cucurbita pepo subsp. pepo] | e-value: 1.0e-197 | identity: 74.43%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSC SI+HLKQ+HA AIKT S SL  Q  +PKLISL   SSSSPDLFYIRSILL+   DAQF LNLCNA IH I+ NS G+S +ST+LRAMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG
        ML IG++PD FTLP+VLKALA+IQ +REGQQIHA SIK GL+RFNVY   C+T +    + G      +LF     R L                ++AVG
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG

Query:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA
         F+E+CDL LR DGRTLVVVLSA SNLGDLNLGRKVHAYI H+ID+NADVF+GNAL+DMYLKC+DS+SAYK+FDEMP+RNV+TWNAMISGLAYQGRY+EA
Subjt:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA

Query:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS
        LD+FR MQ TGPKPDEVTLV VLNSCANLGVLELGKWVHAYMRRNH +ADKFVGNALLDMYAKCGRIDEA RVF+SMKRRDVYSYT MIVG ALHG+A+ 
Subjt:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS

Query:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA
        AF VFS MLR G+EPNEVTFLGLLMACSH GLV++GKKYFFDM NTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSM+I PDA
Subjt:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7UKB6 Pentatricopeptide repeat-containing protein | e-value: 9.5e-186 | identity: 70.3%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  L+SCKS+SHLKQIH  AIKT S SL      PKLI LSS+SSSSPDLFYIRSILL+   DAQF LNLCNAI+ SI+ N       STNL  MEFL E
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWCL------GPRLFKHLLRRAL----------------QEA
        ML IG+EPD FTLP VLKALA+ + +REGQQIHARSIK G++  NVY     TN L  ++ +        ++F     R L                  A
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWCL------GPRLFKHLLRRAL----------------QEA

Query:  VGVFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYR
        V  F+E+CDL LRADGRTLVVVLSACSNLGDLNLG+KVH+YIR++IDMNADVF+GNALIDMYLKC+D +SA K+FDEMP+RNV+TWNAMISGLAYQGRYR
Subjt:  VGVFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYR

Query:  EALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKA
        EALD FR MQ+ G KPDEVTLV VLNSCANLGVLE+GKWVHAYMRRNH +AD+FVGNALLDMYAKCG IDEA RVF+SMK+RDVYSYT MIVG ALHG+A
Subjt:  EALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKA

Query:  SSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG
        + AF VFSEM RVGIEPNEVTFLGLLMACSHGGLVAEGKKYFF+MS+ YKLRPQ+EHYGCMIDLLGR GLVKEAEEI+H M+I PD     ALLG
Subjt:  SSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG

A0A6J1DJ70 pentatricopeptide repeat-containing protein At1g31430-like | e-value: 9.2e-205 | identity: 76.17%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSCKSI+ LKQIHA AIK AS SLQKQFFYPKLISLSS SSSS DLFYIRSI+L+  DDAQFCL+LCNAII  I  NS+ ++  ST   AMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV--LTGIWCLGPRLFKHLLRRAL----------------QEAVGVF
        ML +G+EPDEFTLPYVLKALA+I+ MREGQQIHARSIK GLLRFNVY  +    +  + G+     +LF     R L                + A+G F
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV--LTGIWCLGPRLFKHLLRRAL----------------QEAVGVF

Query:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD
        L++CDLNLRADGR LVVVLSACSNLGDLNLGRKVH+YIRH+IDMNADVFLGNALIDMYLKCNDS+SAY++F+EMP+RNV+TWNA+ISGLAYQGRYREALD
Subjt:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD

Query:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF
        VFR MQS G KPDEVTLV VLNSCANLGVLELGKWVH YMRRN+ +ADKFVGNALLDMY KCGRIDEA RVFQ MKRRDVYSYT+MIVG ALHGKA+SAF
Subjt:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF

Query:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG
         +FSEM RVGIEPNEVTFLGLLMACSHGGLVAEGKKY FDMSNTY LRPQAEHYGCMIDLLGRAGLVKEAEEII  MQISPD     ALLG
Subjt:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG

A0A6J1G7N7 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 | e-value: 1.3e-198 | identity: 74.43%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSC SI+HLKQ+HA AIKT S SL  Q  +PKLISLSS SS SPDLFYIRSILL+   DAQF LNLCNA IH I+ NS+G+S +ST LRAMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG
        ML IG++PD FTLP+VLKALA+IQ +REGQQIHA SIK GL+RFNVY   C+T +    + G      +LF     R L                ++AVG
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG

Query:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA
         F+E+CDL LRADGRTLVVVLSACSNLGDLNLGRKVH+YI H+ID+NADVF+GNAL+DMYLKC+DS+SAYK+FDEMP+RNV+TWNAMI GLAYQGRY+EA
Subjt:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA

Query:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS
        LD+FR MQ TGPKPDEVTLV VLNSCANLGVLELGKWVHAYMRRNH +ADKFVGNALLDMYAKCGRIDEA RVF+ MKRRDVYSYT MIVG ALHG+A+ 
Subjt:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS

Query:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA
        AF VFS MLR G+EPNEVTFLGLLMACSH GLV++GKK FFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSM+I PDA
Subjt:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA

A0A6J1G856 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X2 | e-value: 1.7e-171 | identity: 72.21%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSC SI+HLKQ+HA AIKT S SL  Q  +PKLISLSS SS SPDLFYIRSILL+   DAQF LNLCNA IH I+ NS+G+S +ST LRAMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG
        ML IG++PD FTLP+VLKALA+IQ +REGQQIHA SIK GL+RFNVY   C+T +    + G      +LF     R L                ++AVG
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNV----LTGIWCLGPRLFKHLLRRAL----------------QEAVG

Query:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA
         F+E+CDL LRADGRTLVVVLSACSNLGDLNLGRKVH+YI H+ID+NADVF+GNAL+DMYLKC+DS+SAYK+FDEMP+RNV+TWNAMI GLAYQGRY+EA
Subjt:  VFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREA

Query:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS
        LD+FR MQ TGPKPDEVTLV VLNSCANLGVLELGKWVHAYMRRNH +ADKFVGNALLDMYAKCGRIDEA RVF+ MKRRDVYSYT MIVG ALHG+A+ 
Subjt:  LDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASS

Query:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKY
        AF VFS MLR G+EPNEVTFLGLLMACSH GLV++GKK+
Subjt:  AFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKY

A0A6J1I3M1 pentatricopeptide repeat-containing protein At1g08070, chloroplastic-like isoform X1 | e-value: 5.4e-197 | identity: 73.71%
Query:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE
        P  LNSC SI+HLKQ+HA AIKT S SL  QF + KLISLS  SSSSPDLFYIRSILL+ L DAQF LNLCNA IH I+ NS+G+S +ST LRAMEFLRE
Subjt:  PPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLRE

Query:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNV---------YSRSCSTNVLTGIWCLGPR--------LFKHLLRRAL-QEAVGVF
        ML IG++PD FTLP+VLKALA++Q +REGQQIHA SIK GL+RFNV         YS   S + +  ++   P         L +   +  L ++AVG F
Subjt:  MLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNV---------YSRSCSTNVLTGIWCLGPR--------LFKHLLRRAL-QEAVGVF

Query:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD
        +E+CDL LRADGRTLVVVLSACSNLGDLNLGRK+H+YI H+ID+N DVF+GNAL+DMYLKC+DS+SAYK+FDEMP+RNV+TWNAMI GLAYQGRY+EALD
Subjt:  LEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALD

Query:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF
        +FR MQ TGPKPDEVTLV VLNSCANLGVLELG+WVHAYMRRN+ +ADKFVGNALLDMYAKCG IDEA RVF+SMKRRDVYSYT MIVG ALHG+A+ AF
Subjt:  VFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAF

Query:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA
         VFS MLR G+EPNEVTFLGLLMACSH GLV++GKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSM+I PDA
Subjt:  HVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDA

SwissProt top hits (e-value, % identity, alignment)
Q9C866 Pentatricopeptide repeat-containing protein At1g31430 | e-value: 6.6e-75 | identity: 35.15%
Query:  LNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWC------
        L + N ++ S+A    GKS +    + +    E+   G+ PD FTLP VLK++ +++ + EG+++H  ++K G L F+ Y     +N L G++       
Subjt:  LNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWC------

Query:  LGPRLFKHLLRR----------------ALQEAVGVFLEIC-DLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCN
        +  ++F  + +R                  ++A+GVF  +  + NL+ D  T+V  LSACS L +L +G +++ ++    +M+  V +GNAL+DM+ KC 
Subjt:  LGPRLFKHLLRR----------------ALQEAVGVFLEIC-DLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCN

Query:  DSSSAYKLFDEM-------------------------------PLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLE
            A  +FD M                               P+++V+ W AM++G     R+ EAL++FR MQ+ G +PD   LV +L  CA  G LE
Subjt:  DSSSAYKLFDEM-------------------------------PLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLE

Query:  LGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLV
         GKW+H Y+  N    DK VG AL+DMYAKCG I+ AL VF  +K RD  S+T++I G A++G +  A  ++ EM  VG+  + +TF+ +L AC+HGG V
Subjt:  LGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLV

Query:  AEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDALLGELY
        AEG+K F  M+  + ++P++EH  C+IDLL RAGL+ EAEE+I  M+   D  L  +Y
Subjt:  AEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDALLGELY

Q9FI80 Pentatricopeptide repeat-containing protein At5g48910 | e-value: 1.7e-70 | identity: 35.7%
Query:  PLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSS-SSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAM
        P S  PQ+N+C++I  L QIHA  IK  S  ++      +++   +TS     DL Y   I  +Q+     C +  N II   + +   K+     L A+
Subjt:  PLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSS-SSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAM

Query:  EFLREMLAIG-IEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWCLGPRLFKHLLRRALQEAVGVFLEICDLNLRADG
            EM++   +EP+ FT P VLKA A+   ++EG+QIH  ++K G   F       S  V   + C                               D 
Subjt:  EFLREMLAIG-IEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWCLGPRLFKHLLRRALQEAVGVFLEICDLNLRADG

Query:  RTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKP
        R L             N+  K    +      + ++ L N +ID Y++  D  +A  LFD+M  R+V++WN MISG +  G +++A++VFR M+    +P
Subjt:  RTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKP

Query:  DEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIE
        + VTLV VL + + LG LELG+W+H Y   +    D  +G+AL+DMY+KCG I++A+ VF+ + R +V +++ MI GFA+HG+A  A   F +M + G+ 
Subjt:  DEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIE

Query:  PNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG
        P++V ++ LL ACSHGGLV EG++YF  M +   L P+ EHYGCM+DLLGR+GL+ EAEE I +M I PD     ALLG
Subjt:  PNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | e-value: 1.5e-79 | identity: 33.65%
Query:  YEFPRPLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTN
        Y+  R   S   L++CK++  L+ IHA  IK    +    +   KLI     S     L Y  S+  +        +   N +I +  T   G + SS  
Subjt:  YEFPRPLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTN

Query:  LRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIK-------------------NGLLR--FNVYSRSCSTNVLT-----------GI
        + A++    M+++G+ P+ +T P+VLK+ A+ +A +EGQQIH   +K                   NG L     V+ +S   +V++           G 
Subjt:  LRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIK-------------------NGLLR--FNVYSRSCSTNVLT-----------GI

Query:  WCLGPRLFKHLLRRAL----------------QEAVGVFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKC
             +LF  +  + +                +EA+ +F ++   N+R D  T+V V+SAC+  G + LGR+VH +I  H    +++ + NALID+Y KC
Subjt:  WCLGPRLFKHLLRRAL----------------QEAVGVFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKC

Query:  NDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYM--RRNHYVADKFVGNALLDMY
         +  +A  LF+ +P ++VI+WN +I G  +   Y+EAL +F+ M  +G  P++VT++ +L +CA+LG +++G+W+H Y+  R         +  +L+DMY
Subjt:  NDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYM--RRNHYVADKFVGNALLDMY

Query:  AKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMID
        AKCG I+ A +VF S+  + + S+  MI GFA+HG+A ++F +FS M ++GI+P+++TF+GLL ACSH G++  G+  F  M+  YK+ P+ EHYGCMID
Subjt:  AKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMID

Query:  LLGRAGLVKEAEEIIHSMQISPDALL
        LLG +GL KEAEE+I+ M++ PD ++
Subjt:  LLGRAGLVKEAEEIIHSMQISPDALL

Q9MA95 Putative pentatricopeptide repeat-containing protein At3g05240 | e-value: 1.6e-68 | identity: 32.38%
Query:  QLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREML
        QL +C+S+  L Q+H   IK  S  ++      +LI   +T   + +L Y RS+  S +D     + + N++I        G S S    +A+ F +EML
Subjt:  QLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREML

Query:  AIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTN--VLTGIWCLGPRLFKHLLR----------------RALQEAVGVFLE
          G  PD FT PYVLKA + ++ ++ G  +H   +K G    N+Y  +C  +  +  G    G R+F+ + +                    +A+  F E
Subjt:  AIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTN--VLTGIWCLGPRLFKHLLR----------------RALQEAVGVFLE

Query:  ICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIR-------HHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRY
        +    ++A+   +V +L AC    D+  G+  H +++           +  +V L  +LIDMY KC D  +A  LFD MP R +++WN++I+G +  G  
Subjt:  ICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIR-------HHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRY

Query:  REALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGK
         EAL +F  M   G  PD+VT + V+ +    G  +LG+ +HAY+ +  +V D  +  AL++MYAK G  + A + F+ ++++D  ++T +I+G A HG 
Subjt:  REALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGK

Query:  ASSAFHVFSEMLRVG-IEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD
         + A  +F  M   G   P+ +T+LG+L ACSH GLV EG++YF +M + + L P  EHYGCM+D+L RAG  +EAE ++ +M + P+
Subjt:  ASSAFHVFSEMLRVG-IEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD

Q9SX45 Pentatricopeptide repeat-containing protein At1g50270 | e-value: 2.3e-72 | identity: 34.95%
Query:  HLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFL--REMLAIGIEPD
        HLKQIH   + +  F  ++  F  +L+    T+++    F     LL QL      + L +++I          SG  T  R + FL  R M   G+ P 
Subjt:  HLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFL--REMLAIGIEPD

Query:  EFTLPYVLKALAQIQAMREGQQIHARSIKNGL-----LRFNVYSRSCSTNVLTGIWCLGPRLFK----------------HLLRRALQEAVGVFLEICDL
          T P +LKA+ +++      Q HA  +K GL     +R ++ S   S    +G++    RLF                  +   +  EA+  F+E+   
Subjt:  EFTLPYVLKALAQIQAMREGQQIHARSIKNGL-----LRFNVYSRSCSTNVLTGIWCLGPRLFK----------------HLLRRALQEAVGVFLEICDL

Query:  NLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQ
         + A+  T+V VL A   + D+  GR VH        +  DVF+G++L+DMY KC+    A K+FDEMP RNV+TW A+I+G      + + + VF  M 
Subjt:  NLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQ

Query:  STGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEM
         +   P+E TL  VL++CA++G L  G+ VH YM +N    +   G  L+D+Y KCG ++EA+ VF+ +  ++VY++T MI GFA HG A  AF +F  M
Subjt:  STGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEM

Query:  LRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISP
        L   + PNEVTF+ +L AC+HGGLV EG++ F  M   + + P+A+HY CM+DL GR GL++EA+ +I  M + P
Subjt:  LRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISP

Arabidopsis top hits (e-value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e-value: 1.1e-80 | identity: 33.65%
Query:  YEFPRPLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTN
        Y+  R   S   L++CK++  L+ IHA  IK    +    +   KLI     S     L Y  S+  +        +   N +I +  T   G + SS  
Subjt:  YEFPRPLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTN

Query:  LRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIK-------------------NGLLR--FNVYSRSCSTNVLT-----------GI
        + A++    M+++G+ P+ +T P+VLK+ A+ +A +EGQQIH   +K                   NG L     V+ +S   +V++           G 
Subjt:  LRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIK-------------------NGLLR--FNVYSRSCSTNVLT-----------GI

Query:  WCLGPRLFKHLLRRAL----------------QEAVGVFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKC
             +LF  +  + +                +EA+ +F ++   N+R D  T+V V+SAC+  G + LGR+VH +I  H    +++ + NALID+Y KC
Subjt:  WCLGPRLFKHLLRRAL----------------QEAVGVFLEICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKC

Query:  NDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYM--RRNHYVADKFVGNALLDMY
         +  +A  LF+ +P ++VI+WN +I G  +   Y+EAL +F+ M  +G  P++VT++ +L +CA+LG +++G+W+H Y+  R         +  +L+DMY
Subjt:  NDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYM--RRNHYVADKFVGNALLDMY

Query:  AKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMID
        AKCG I+ A +VF S+  + + S+  MI GFA+HG+A ++F +FS M ++GI+P+++TF+GLL ACSH G++  G+  F  M+  YK+ P+ EHYGCMID
Subjt:  AKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMID

Query:  LLGRAGLVKEAEEIIHSMQISPDALL
        LLG +GL KEAEE+I+ M++ PD ++
Subjt:  LLGRAGLVKEAEEIIHSMQISPDALL

AT1G31430.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e-value: 4.7e-76 | identity: 35.15%
Query:  LNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWC------
        L + N ++ S+A    GKS +    + +    E+   G+ PD FTLP VLK++ +++ + EG+++H  ++K G L F+ Y     +N L G++       
Subjt:  LNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWC------

Query:  LGPRLFKHLLRR----------------ALQEAVGVFLEIC-DLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCN
        +  ++F  + +R                  ++A+GVF  +  + NL+ D  T+V  LSACS L +L +G +++ ++    +M+  V +GNAL+DM+ KC 
Subjt:  LGPRLFKHLLRR----------------ALQEAVGVFLEIC-DLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCN

Query:  DSSSAYKLFDEM-------------------------------PLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLE
            A  +FD M                               P+++V+ W AM++G     R+ EAL++FR MQ+ G +PD   LV +L  CA  G LE
Subjt:  DSSSAYKLFDEM-------------------------------PLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLE

Query:  LGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLV
         GKW+H Y+  N    DK VG AL+DMYAKCG I+ AL VF  +K RD  S+T++I G A++G +  A  ++ EM  VG+  + +TF+ +L AC+HGG V
Subjt:  LGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLV

Query:  AEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDALLGELY
        AEG+K F  M+  + ++P++EH  C+IDLL RAGL+ EAEE+I  M+   D  L  +Y
Subjt:  AEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDALLGELY

AT1G50270.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 1.7e-73 | identity: 34.95%
Query:  HLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFL--REMLAIGIEPD
        HLKQIH   + +  F  ++  F  +L+    T+++    F     LL QL      + L +++I          SG  T  R + FL  R M   G+ P 
Subjt:  HLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFL--REMLAIGIEPD

Query:  EFTLPYVLKALAQIQAMREGQQIHARSIKNGL-----LRFNVYSRSCSTNVLTGIWCLGPRLFK----------------HLLRRALQEAVGVFLEICDL
          T P +LKA+ +++      Q HA  +K GL     +R ++ S   S    +G++    RLF                  +   +  EA+  F+E+   
Subjt:  EFTLPYVLKALAQIQAMREGQQIHARSIKNGL-----LRFNVYSRSCSTNVLTGIWCLGPRLFK----------------HLLRRALQEAVGVFLEICDL

Query:  NLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQ
         + A+  T+V VL A   + D+  GR VH        +  DVF+G++L+DMY KC+    A K+FDEMP RNV+TW A+I+G      + + + VF  M 
Subjt:  NLRADGRTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQ

Query:  STGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEM
         +   P+E TL  VL++CA++G L  G+ VH YM +N    +   G  L+D+Y KCG ++EA+ VF+ +  ++VY++T MI GFA HG A  AF +F  M
Subjt:  STGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEM

Query:  LRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISP
        L   + PNEVTF+ +L AC+HGGLV EG++ F  M   + + P+A+HY CM+DL GR GL++EA+ +I  M + P
Subjt:  LRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISP

AT3G05240.1 mitochondrial editing factor 19 | e-value: 1.1e-69 | identity: 32.38%
Query:  QLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREML
        QL +C+S+  L Q+H   IK  S  ++      +LI   +T   + +L Y RS+  S +D     + + N++I        G S S    +A+ F +EML
Subjt:  QLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAMEFLREML

Query:  AIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTN--VLTGIWCLGPRLFKHLLR----------------RALQEAVGVFLE
          G  PD FT PYVLKA + ++ ++ G  +H   +K G    N+Y  +C  +  +  G    G R+F+ + +                    +A+  F E
Subjt:  AIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTN--VLTGIWCLGPRLFKHLLR----------------RALQEAVGVFLE

Query:  ICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIR-------HHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRY
        +    ++A+   +V +L AC    D+  G+  H +++           +  +V L  +LIDMY KC D  +A  LFD MP R +++WN++I+G +  G  
Subjt:  ICDLNLRADGRTLVVVLSACSNLGDLNLGRKVHAYIR-------HHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRY

Query:  REALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGK
         EAL +F  M   G  PD+VT + V+ +    G  +LG+ +HAY+ +  +V D  +  AL++MYAK G  + A + F+ ++++D  ++T +I+G A HG 
Subjt:  REALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGK

Query:  ASSAFHVFSEMLRVG-IEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD
         + A  +F  M   G   P+ +T+LG+L ACSH GLV EG++YF +M + + L P  EHYGCM+D+L RAG  +EAE ++ +M + P+
Subjt:  ASSAFHVFSEMLRVG-IEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 1.2e-71; % identity 35.7
Query:  PLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSS-SSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAM
        P S  PQ+N+C++I  L QIHA  IK  S  ++      +++   +TS     DL Y   I  +Q+     C +  N II   + +   K+     L A+
Subjt:  PLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSS-SSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAM

Query:  EFLREMLAIG-IEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWCLGPRLFKHLLRRALQEAVGVFLEICDLNLRADG
            EM++   +EP+ FT P VLKA A+   ++EG+QIH  ++K G   F       S  V   + C                               D 
Subjt:  EFLREMLAIG-IEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWCLGPRLFKHLLRRALQEAVGVFLEICDLNLRADG

Query:  RTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKP
        R L             N+  K    +      + ++ L N +ID Y++  D  +A  LFD+M  R+V++WN MISG +  G +++A++VFR M+    +P
Subjt:  RTLVVVLSACSNLGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKP

Query:  DEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIE
        + VTLV VL + + LG LELG+W+H Y   +    D  +G+AL+DMY+KCG I++A+ VF+ + R +V +++ MI GFA+HG+A  A   F +M + G+ 
Subjt:  DEVTLVVVLNSCANLGVLELGKWVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIE

Query:  PNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG
        P++V ++ LL ACSHGGLV EG++YF  M +   L P+ EHYGCM+DLLGR+GL+ EAEE I +M I PD     ALLG
Subjt:  PNEVTFLGLLMACSHGGLVAEGKKYFFDMSNTYKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPD-----ALLG


Sequences
CDS sequence
ATGGCTTCCCTTCCATCCCTGTATGTTTCCTCCTCCTTATCATCCCCTCTTCGACATCTCCACCATTTGAACTCCTTTTTCCGCCATGAAAGCAGCCTCCCAAACCCTCC
AAGAAGCCGCCGCAGCAGCAGACGAAGAAGAAGCACGATATCGCTCTCCTTCCGCACCTCCTACCGATTCTCCGTCGACCATGAAGACGAAGATGAAGACGAGCAGATCA
CCGGAGATTTCGGCTTCGACGAAGCAGTGGATCTCTTCAATCAAGGAGCATATTACGATTGCCACGATATCCTTGAAATTCTATGGAACGGAGCCGAAGACCCTACTAGA
ACCCTAATTCATGGCATTCTTCAGTGCGCCGTGGGGCTTCATCATCTCTTCAATCGGAATCATAGAGGGGCGATGATGGAGCTGGGGGAGGGGCTGTGTAAGCTACGGAA
GATGGAGTTTGAGAGTGGCCCTTTCTATACTTTTGAGAGGGAGATTTCTGCAGTTTTGGATTTTGTTTACCAGACTCAGATTGAATTAGCTGCCTGTGATGAGAATGTGT
GTGTTACAATGGTGGGTTCAGAGAGATCATATGAATTGCTTGGAAGGTATGGTGCAGGACAGAAGCTGTATGATTTAGAGAGAGAAGTTGATGGAAGGATTACCACTACA
TCTGCCATAGCCGTTTATGAATTCCCAAGACCTCTATCTTCTCCCCCACAACTCAATTCCTGCAAATCTATTTCGCACCTCAAACAAATCCATGCCTTCGCCATTAAAAC
AGCTTCCTTCTCTCTCCAGAAGCAATTCTTTTATCCCAAGCTCATTTCTCTCTCTTCCACTTCTTCCTCCTCCCCCGACCTTTTCTACATCCGCTCCATCCTTCTCAGCC
AGTTGGACGATGCGCAATTTTGCCTCAATCTCTGCAACGCCATCATCCACAGCATTGCCACGAACTCCCATGGTAAGAGTGGCAGTTCTACCAATCTCAGGGCCATGGAA
TTCTTGAGGGAAATGCTTGCGATCGGCATCGAACCGGATGAGTTCACCTTGCCGTATGTTCTCAAAGCGTTGGCTCAGATTCAGGCGATGAGAGAAGGCCAACAGATTCA
CGCTCGTTCTATCAAGAATGGACTGCTGCGATTCAATGTTTATTCCAGAAGCTGTTCGACGAATGTCCTCACCGGGATTTGGTGTCTTGGACCACGCTTATTCAAGCATT
TACTCAGGCGGGCATTACAGGAAGCTGTAGGAGTTTTTCTGGAAATTTGTGATCTGAACCTAAGGGCTGATGGGCGGACTCTGGTGGTTGTCCTCTCCGCATGCTCCAAC
TTGGGAGACCTGAATTTGGGTCGAAAGGTACACGCCTATATCCGCCATCACATTGACATGAATGCAGATGTATTTCTTGGCAATGCCTTGATAGATATGTACTTAAAATG
TAATGATTCAAGCTCTGCTTACAAATTGTTCGACGAAATGCCTCTGAGAAATGTCATTACTTGGAATGCCATGATTTCCGGGTTGGCTTACCAAGGCCGGTACAGGGAAG
CTCTGGACGTGTTTCGTAGCATGCAAAGCACCGGGCCCAAGCCAGACGAGGTGACCTTAGTGGTGGTTCTCAACTCCTGCGCAAACCTTGGAGTTCTTGAGCTGGGTAAG
TGGGTTCATGCATACATGCGTAGAAATCATTATGTGGCTGATAAATTTGTTGGGAATGCGCTTCTGGATATGTATGCAAAATGTGGTAGAATAGATGAAGCTCTTAGGGT
TTTTCAGAGCATGAAAAGGAGGGATGTATATTCATACACAACCATGATTGTTGGGTTTGCCTTGCATGGCAAAGCGAGCTCGGCATTCCATGTCTTCTCTGAGATGTTAA
GAGTTGGTATCGAGCCAAACGAGGTAACATTTTTAGGTCTTCTCATGGCTTGCAGCCATGGTGGATTGGTTGCAGAAGGCAAGAAGTACTTTTTCGACATGTCAAATACA
TATAAGCTTAGACCTCAAGCAGAGCATTATGGCTGCATGATTGACCTTCTTGGTCGTGCAGGGTTAGTGAAGGAGGCAGAAGAGATTATTCACAGCATGCAAATCAGCCC
AGATGCCTTGCTTGGGGAGCTCTACTAG
mRNA sequence
ATGGCTTCCCTTCCATCCCTGTATGTTTCCTCCTCCTTATCATCCCCTCTTCGACATCTCCACCATTTGAACTCCTTTTTCCGCCATGAAAGCAGCCTCCCAAACCCTCC
AAGAAGCCGCCGCAGCAGCAGACGAAGAAGAAGCACGATATCGCTCTCCTTCCGCACCTCCTACCGATTCTCCGTCGACCATGAAGACGAAGATGAAGACGAGCAGATCA
CCGGAGATTTCGGCTTCGACGAAGCAGTGGATCTCTTCAATCAAGGAGCATATTACGATTGCCACGATATCCTTGAAATTCTATGGAACGGAGCCGAAGACCCTACTAGA
ACCCTAATTCATGGCATTCTTCAGTGCGCCGTGGGGCTTCATCATCTCTTCAATCGGAATCATAGAGGGGCGATGATGGAGCTGGGGGAGGGGCTGTGTAAGCTACGGAA
GATGGAGTTTGAGAGTGGCCCTTTCTATACTTTTGAGAGGGAGATTTCTGCAGTTTTGGATTTTGTTTACCAGACTCAGATTGAATTAGCTGCCTGTGATGAGAATGTGT
GTGTTACAATGGTGGGTTCAGAGAGATCATATGAATTGCTTGGAAGGTATGGTGCAGGACAGAAGCTGTATGATTTAGAGAGAGAAGTTGATGGAAGGATTACCACTACA
TCTGCCATAGCCGTTTATGAATTCCCAAGACCTCTATCTTCTCCCCCACAACTCAATTCCTGCAAATCTATTTCGCACCTCAAACAAATCCATGCCTTCGCCATTAAAAC
AGCTTCCTTCTCTCTCCAGAAGCAATTCTTTTATCCCAAGCTCATTTCTCTCTCTTCCACTTCTTCCTCCTCCCCCGACCTTTTCTACATCCGCTCCATCCTTCTCAGCC
AGTTGGACGATGCGCAATTTTGCCTCAATCTCTGCAACGCCATCATCCACAGCATTGCCACGAACTCCCATGGTAAGAGTGGCAGTTCTACCAATCTCAGGGCCATGGAA
TTCTTGAGGGAAATGCTTGCGATCGGCATCGAACCGGATGAGTTCACCTTGCCGTATGTTCTCAAAGCGTTGGCTCAGATTCAGGCGATGAGAGAAGGCCAACAGATTCA
CGCTCGTTCTATCAAGAATGGACTGCTGCGATTCAATGTTTATTCCAGAAGCTGTTCGACGAATGTCCTCACCGGGATTTGGTGTCTTGGACCACGCTTATTCAAGCATT
TACTCAGGCGGGCATTACAGGAAGCTGTAGGAGTTTTTCTGGAAATTTGTGATCTGAACCTAAGGGCTGATGGGCGGACTCTGGTGGTTGTCCTCTCCGCATGCTCCAAC
TTGGGAGACCTGAATTTGGGTCGAAAGGTACACGCCTATATCCGCCATCACATTGACATGAATGCAGATGTATTTCTTGGCAATGCCTTGATAGATATGTACTTAAAATG
TAATGATTCAAGCTCTGCTTACAAATTGTTCGACGAAATGCCTCTGAGAAATGTCATTACTTGGAATGCCATGATTTCCGGGTTGGCTTACCAAGGCCGGTACAGGGAAG
CTCTGGACGTGTTTCGTAGCATGCAAAGCACCGGGCCCAAGCCAGACGAGGTGACCTTAGTGGTGGTTCTCAACTCCTGCGCAAACCTTGGAGTTCTTGAGCTGGGTAAG
TGGGTTCATGCATACATGCGTAGAAATCATTATGTGGCTGATAAATTTGTTGGGAATGCGCTTCTGGATATGTATGCAAAATGTGGTAGAATAGATGAAGCTCTTAGGGT
TTTTCAGAGCATGAAAAGGAGGGATGTATATTCATACACAACCATGATTGTTGGGTTTGCCTTGCATGGCAAAGCGAGCTCGGCATTCCATGTCTTCTCTGAGATGTTAA
GAGTTGGTATCGAGCCAAACGAGGTAACATTTTTAGGTCTTCTCATGGCTTGCAGCCATGGTGGATTGGTTGCAGAAGGCAAGAAGTACTTTTTCGACATGTCAAATACA
TATAAGCTTAGACCTCAAGCAGAGCATTATGGCTGCATGATTGACCTTCTTGGTCGTGCAGGGTTAGTGAAGGAGGCAGAAGAGATTATTCACAGCATGCAAATCAGCCC
AGATGCCTTGCTTGGGGAGCTCTACTAG
Protein sequence
MASLPSLYVSSSLSSPLRHLHHLNSFFRHESSLPNPPRSRRSSRRRRSTISLSFRTSYRFSVDHEDEDEDEQITGDFGFDEAVDLFNQGAYYDCHDILEILWNGAEDPTR
TLIHGILQCAVGLHHLFNRNHRGAMMELGEGLCKLRKMEFESGPFYTFEREISAVLDFVYQTQIELAACDENVCVTMVGSERSYELLGRYGAGQKLYDLEREVDGRITTT
SAIAVYEFPRPLSSPPQLNSCKSISHLKQIHAFAIKTASFSLQKQFFYPKLISLSSTSSSSPDLFYIRSILLSQLDDAQFCLNLCNAIIHSIATNSHGKSGSSTNLRAME
FLREMLAIGIEPDEFTLPYVLKALAQIQAMREGQQIHARSIKNGLLRFNVYSRSCSTNVLTGIWCLGPRLFKHLLRRALQEAVGVFLEICDLNLRADGRTLVVVLSACSN
LGDLNLGRKVHAYIRHHIDMNADVFLGNALIDMYLKCNDSSSAYKLFDEMPLRNVITWNAMISGLAYQGRYREALDVFRSMQSTGPKPDEVTLVVVLNSCANLGVLELGK
WVHAYMRRNHYVADKFVGNALLDMYAKCGRIDEALRVFQSMKRRDVYSYTTMIVGFALHGKASSAFHVFSEMLRVGIEPNEVTFLGLLMACSHGGLVAEGKKYFFDMSNT
YKLRPQAEHYGCMIDLLGRAGLVKEAEEIIHSMQISPDALLGELY
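
The protein sequence above should correspond to a translation of the CDS sequence under the standard genetic code. The following is a minimal sketch of how that could be checked, not the database's own pipeline: the function name `translate` and the choice to write unrecognized codons as `X` are assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS (hypothetical helper): stop at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped sequence
    lines from the record can be pasted in as-is.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons with ambiguous bases are reported as X (an assumption).
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS above.
print(translate("ATGGCTTCCCTTCCATCCCTG"))
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence lines (joined without line breaks) is one way to confirm that the two fields agree.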