CuGenDBv2

Tan0016815 (gene) of Snake gourd v1 genome

Gene ID: Tan0016815
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG08:6469368..6470819
RNA-Seq Expression: Tan0016815
Synteny: Tan0016815
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
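
The genome location field uses a `scaffold:start..end` layout. A minimal sketch of splitting it into parts follows; the function name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'scaffold:start..end' string into (scaffold, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The page itself does not state the convention.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    scaffold = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if end < start:
        raise ValueError("end coordinate is smaller than start coordinate")
    return scaffold, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("LG08:6469368..6470819"))
```

Under the inclusive-coordinate assumption, the span reported for this gene would be the number of bases between the two coordinates, both ends counted.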


Homology
GenBank top hits | e value | % identity
XP_004144815.1 pentatricopeptide repeat-containing protein At1g09190 isoform X1 [Cucumis sativus] | 1.4e-254 | 88.64
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNC EIER+ILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL HFIS+C +FN+IAYA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFSSMKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
        HRIVPD+YTFAPLLKSC+NLC+Y LGQ VI E  RRGF CFGSIRIGVVELYVCCE+MEDA K+FDEM HRDVVVWNLMIRGFCK GNVDFGL LFRQMS
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        ERSLVSWNTIISCLAQN RDVEALELFQQMEEHGF+PDEVTVVT+LPVCSRLGAL+VGQRIHSYASSKG+LV  TTVGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN++ILGFALNGKGE AID+FM+MR+E +KPNDATFVAVLTACVHSGLLEKGRELFSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAE+AVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KSVKK PGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

XP_008453700.1 PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis melo] | 3.3e-256 | 89.05
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNC EIER+ILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL HFIS+C +FN+I YA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFS MKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
        HRIVPD+YTFAPLLKSC+NLC+Y LGQ VI E L RGF CFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMIRGFCKMGNVDFGL LFRQMS
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        ERSLVSWNTIISCLAQN RDVEALELFQQMEEHGF+PDEVTVVT+LPVCSRLGAL+VGQRIHSY SSKG+LV TT VGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN++ILGFALNGKGE AID+FM+MR+EDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAE+AVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KSVKK PGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

XP_022965696.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita maxima] | 1.9e-248 | 87.4
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN R IER+ILRLL GHKS THLTQIHAHFLRH LHQSNQIL HFISICG FN+IAYANRVFSQSQNPNIFLFNSMIKAHSLSGPF+QSLLLFSS+KN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
         RIVPDEYTFAPLLKSCSNL DYRLG+ VIGE LRRGFECFGSIRIGVVELYVCCERM+DA+KVFDEMPH DVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        +RSLVSWNT ISCLAQ+GRDVEALELFQQMEEHGFEPDEVTVVT+LPVCSRLGA+DVGQ IHSYA+SK DLV+TT VGNSL+DFYCK GNTE AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN+MILGFALNGKGELAID+FM+M + D KPND T VA+LTACVHSGLLEKG+E+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAE+AV ELISLEP NSGNYVLLSN LAEE RWEDVENVRR MRGK+VKK PG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

XP_023538021.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita pepo subsp. pepo] | 3.0e-249 | 88.22
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN R IER+ILRLL GHKS THLTQIHAHFLRH LHQSNQIL HFISICGAFN+IAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
         RIVPDEYTFAPLLKSCSNL DYRLG+ VIGE LRRGFE FGSIRIGVVELYVCCERM+DA+KVFDEMP RDVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        +RSLVSWNT ISCLAQ+GRDVEAL+LFQQMEEHGFEPDEVTVVT+LPVCSRLGALDVGQ IHSYA+SK DLV+TT VGNSL+DFYCK GNTE AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN+MILGFALNGKGELAID+F +M R D KPNDAT VA+LTACVHSGLLEKGRE+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAE+A  ELISLEP NSGNYVLLSN+LAEEGRWEDVENVRR MRGK+VKK PG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

XP_038890516.1 pentatricopeptide repeat-containing protein At1g09190 [Benincasa hispida] | 8.0e-255 | 88.64
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        M+KNC EIER+ILRLLHG KSRTHLT+IHAHFLRHGLHQSNQIL HFISIC AFN I+YA+R+FSQS NPNIFLFNS+IKAHSL  PFQQSLLLFSSMKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
        HRIVPDEYTFAPLLKSC+NL +Y LGQ VI E LRRGF CFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        ERSL+SWNT++SCLAQ+  D EALELFQQMEE GF+PDEVTVVT+LPVCSRLGALDVGQRIH+YASSKGD+VD TT+GNSLVDFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN+MILGFALNG GE AID+FMKM +EDVKPNDATFVAVLTACVHSGLLEKGRELFSSMA+ YEIQPKLEHFGCMVDLLGRGGC+EEAHNLI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAE+AVKEL SLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWM+GKSVKK PGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
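
In the alignments above, the unlabelled line under each `Query:` line is the BLAST match line: a letter marks an identical residue, `+` marks a conservative substitution, and a blank marks a mismatch or gap. A minimal sketch of tallying one alignment block from that line follows; the function name `midline_stats` is hypothetical, and the short strings in the example are the first 13 columns of the first GenBank alignment, not a full hit.

```python
def midline_stats(query, midline):
    """Count identities and positives for one alignment block.

    query   -- the residues from a 'Query:' line (label removed)
    midline -- the match line beneath it (leading indent removed)

    Trailing blanks are often stripped from the match line, so it is
    padded back to the query length before counting.
    """
    if not query:
        raise ValueError("query block is empty")
    if len(midline) > len(query):
        raise ValueError("match line is longer than the query block")
    padded = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in padded)   # letters = identities
    positives = identical + padded.count("+")        # '+' = conservative
    percent_identity = 100.0 * identical / len(query)
    return identical, positives, percent_identity


if __name__ == "__main__":
    # First 13 alignment columns of the first GenBank hit on this page.
    print(midline_stats("MSKNCREIERKIL", "MSKNC EIER+IL"))
```

To get a figure comparable to the % identity column, the counts would need to be summed over every block of a hit and divided by the total number of alignment columns; whether the site computes its value exactly this way is an assumption.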

TrEMBL top hits | e value | % identity
A0A0A0LJW6 Uncharacterized protein | 6.6e-255 | 88.64
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNC EIER+ILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL HFIS+C +FN+IAYA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFSSMKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
        HRIVPD+YTFAPLLKSC+NLC+Y LGQ VI E  RRGF CFGSIRIGVVELYVCCE+MEDA K+FDEM HRDVVVWNLMIRGFCK GNVDFGL LFRQMS
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        ERSLVSWNTIISCLAQN RDVEALELFQQMEEHGF+PDEVTVVT+LPVCSRLGAL+VGQRIHSYASSKG+LV  TTVGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN++ILGFALNGKGE AID+FM+MR+E +KPNDATFVAVLTACVHSGLLEKGRELFSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAE+AVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KSVKK PGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

A0A1S3BXN7 pentatricopeptide repeat-containing protein At1g09190 | 1.6e-256 | 89.05
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNC EIER+ILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL HFIS+C +FN+I YA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFS MKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
        HRIVPD+YTFAPLLKSC+NLC+Y LGQ VI E L RGF CFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMIRGFCKMGNVDFGL LFRQMS
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        ERSLVSWNTIISCLAQN RDVEALELFQQMEEHGF+PDEVTVVT+LPVCSRLGAL+VGQRIHSY SSKG+LV TT VGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN++ILGFALNGKGE AID+FM+MR+EDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAE+AVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KSVKK PGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

A0A5A7U1F2 Pentatricopeptide repeat-containing protein | 1.6e-256 | 89.05
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKNC EIER+ILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL HFIS+C +FN+I YA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFS MKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
        HRIVPD+YTFAPLLKSC+NLC+Y LGQ VI E L RGF CFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMIRGFCKMGNVDFGL LFRQMS
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        ERSLVSWNTIISCLAQN RDVEALELFQQMEEHGF+PDEVTVVT+LPVCSRLGAL+VGQRIHSY SSKG+LV TT VGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN++ILGFALNGKGE AID+FM+MR+EDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAE+AVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KSVKK PGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

A0A6J1FGY4 pentatricopeptide repeat-containing protein At1g09190 | 1.2e-248 | 87.6
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN R IER+ILRLL GHKS THLTQIHAHFLRH LHQSNQIL HFISICGA N+IAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
         RIVPDEYTFAPLLKSCSNL DYRLG+ VIGE LRRGFECFGSIRIGVVELYVCCE+M+DA+KVFDEMP RDVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        +RSLVSWNT ISCLAQ+GRDVEALELFQQ+EEHGFEPDEVTVVT+LPVCSRLGALDVGQ IHSYA+SK DL++TT VGNSL+DFYCK GNTE AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC++VVSWN+MILGFALNGKGELAID+FM+M R DVKPNDAT VA+LTACVHSGLLEKGRE+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAE+A  ELISLEP NSGNYVLLSN+LAEEGRW+DVENVR  MRGK+VKK PG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

A0A6J1HMD5 pentatricopeptide repeat-containing protein At1g09190 | 9.3e-249 | 87.4
Query:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN
        MSKN R IER+ILRLL GHKS THLTQIHAHFLRH LHQSNQIL HFISICG FN+IAYANRVFSQSQNPNIFLFNSMIKAHSLSGPF+QSLLLFSS+KN
Subjt:  MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS
         RIVPDEYTFAPLLKSCSNL DYRLG+ VIGE LRRGFECFGSIRIGVVELYVCCERM+DA+KVFDEMPH DVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS

Query:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK
        +RSLVSWNT ISCLAQ+GRDVEALELFQQMEEHGFEPDEVTVVT+LPVCSRLGA+DVGQ IHSYA+SK DLV+TT VGNSL+DFYCK GNTE AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQK

Query:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTC+SVVSWN+MILGFALNGKGELAID+FM+M + D KPND T VA+LTACVHSGLLEKG+E+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCRSVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAE+AV ELISLEP NSGNYVLLSN LAEE RWEDVENVRR MRGK+VKK PG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG

SwissProt top hits | e value | % identity
O80488 Pentatricopeptide repeat-containing protein At1g09190 | 6.3e-162 | 59.66
Query:  EIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPD
        EIERK+LRLLHGH +RT L +IHAH LRH LH SN +L HFISICG+ +   YANRVFS  QNPN+ +FN+MIK +SL GP  +SL  FSSMK+  I  D
Subjt:  EIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPD

Query:  EYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVS
        EYT+APLLKSCS+L D R G+ V GE +R GF   G IRIGVVELY    RM DA+KVFDEM  R+VVVWNLMIRGFC  G+V+ GL LF+QMSERS+VS
Subjt:  EYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVS

Query:  WNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSV
        WN++IS L++ GRD EALELF +M + GF+PDE TVVT+LP+ + LG LD G+ IHS A S G   D  TVGN+LVDFYCK G+ E A  IF+KM  R+V
Subjt:  WNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSV

Query:  VSWNSMILGFALNGKGELAIDIFMKMRRE-DVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM
        VSWN++I G A+NGKGE  ID+F  M  E  V PN+ATF+ VL  C ++G +E+G ELF  M E ++++ + EH+G MVDL+ R G + EA   +K+MP+
Subjt:  VSWNSMILGFALNGKGELAIDIFMKMRRE-DVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM

Query:  QPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS
          NA +WG+LL ACR+HG++KLAEVA  EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K  GQS
Subjt:  QPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS

Q9FFG8 Pentatricopeptide repeat-containing protein At5g44230 | 4.8e-101 | 40.69
Query:  LTQIHAHFLRHGLHQSNQILTHFISICGAFN--KIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCSNLCD
        + QIH H LR GL QS  ILT  I            YA RV    Q  N FL+ ++I+ +++ G F +++ ++  M+   I P  +TF+ LLK+C  + D
Subjt:  LTQIHAHFLRHGLHQSNQILTHFISICGAFN--KIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCSNLCD

Query:  YRLGQLVIGEALR-RGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNTIISCLAQNGRDV
          LG+    +  R RGF CF  +   ++++YV CE ++ ARKVFDEMP RDV+ W  +I  + ++GN++    LF  +  + +V+W  +++  AQN +  
Subjt:  YRLGQLVIGEALR-RGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNTIISCLAQNGRDV

Query:  EALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKG-DLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWNSMILGFALNG
        EALE F +ME+ G   DEVTV   +  C++LGA     R    A   G    D   +G++L+D Y KCGN E A N+F  M  ++V +++SMILG A +G
Subjt:  EALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKG-DLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWNSMILGFALNG

Query:  KGELAIDIFMKM-RREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC
        + + A+ +F  M  + ++KPN  TFV  L AC HSGL+++GR++F SM + + +QP  +H+ CMVDLLGR G ++EA  LIK+M ++P+  +WGALLGAC
Subjt:  KGELAIDIFMKM-RREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC

Query:  RTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS
        R H N ++AE+A + L  LEP   GNY+LLSN+ A  G W  V  VR+ ++ K +KK P  S
Subjt:  RTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS

Q9FMA1 Pentatricopeptide repeat-containing protein At5g56310 | 2.9e-98 | 39.83
Query:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNHRIV--PDEYTFAP
        +HG+  +T L Q H + +  GL++ N  +  FI  C     + YA  VF+    PN +L N+MI+A S L  P   S+ +    K   +   PD +TF  
Subjt:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNHRIV--PDEYTFAP

Query:  LLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS--ERSLVSWNTI
        +LK    + D   G+ + G+ +  GF+    +  G++++Y  C  + DARK+FDEM  +DV VWN ++ G+ K+G +D    L   M    R+ VSW  +
Subjt:  LLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS--ERSLVSWNTI

Query:  ISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWN
        IS  A++GR  EA+E+FQ+M     EPDEVT++ +L  C+ LG+L++G+RI SY   +G +    ++ N+++D Y K GN   A ++F+ +  R+VV+W 
Subjt:  ISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWN

Query:  SMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT
        ++I G A +G G  A+ +F +M +  V+PND TF+A+L+AC H G ++ G+ LF+SM ++Y I P +EH+GCM+DLLGR G + EA  +IKSMP + NA 
Subjt:  SMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT

Query:  LWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA
        +WG+LL A   H +L+L E A+ ELI LEP NSGNY+LL+N+ +  GRW++   +R  M+G  VKK+ G+S+
Subjt:  LWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 4.9e-98 | 33.74
Query:  LRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISIC---GAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYT
        L LLH  K+   L  IHA  ++ GLH +N  L+  I  C     F  + YA  VF   Q PN+ ++N+M + H+LS     +L L+  M +  ++P+ YT
Subjt:  LRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISIC---GAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYT

Query:  FAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNT
        F  +LKSC+    ++ GQ + G  L+ G +    +   ++ +YV   R+EDA KVFD+ PHRDVV +  +I+G+   G ++   +LF ++  + +VSWN 
Subjt:  FAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNT

Query:  IISCLAQNGRDVEALELFQQM-----------------------------------EEHGF---------------------------------------
        +IS  A+ G   EALELF+ M                                   ++HGF                                       
Subjt:  IISCLAQNGRDVEALELFQQM-----------------------------------EEHGF---------------------------------------

Query:  ---------------------------EPDEVTVVTILPVCSRLGALDVGQRIHSYASSK-GDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVV
                                    P++VT+++ILP C+ LGA+D+G+ IH Y   +   + + +++  SL+D Y KCG+ E A+ +F  +  +S+ 
Subjt:  ---------------------------EPDEVTVVTILPVCSRLGALDVGQRIHSYASSK-GDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVV

Query:  SWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMA-EYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQP
        SWN+MI GFA++G+ + + D+F +MR+  ++P+D TFV +L+AC HSG+L+ GR +F +M  +Y++ PKLEH+GCM+DLLG  G  +EA  +I  M M+P
Subjt:  SWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMA-EYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQP

Query:  NATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA
        +  +W +LL AC+ HGN++L E   + LI +EP N G+YVLLSN+ A  GRW +V   R  +  K +KKVPG S+
Subjt:  NATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA

Q9SIL5 Pentatricopeptide repeat-containing protein At2g20540 | 5.5e-105 | 38.57
Query:  REIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRI-V
        RE+E   +  L   KSR    +I+A  + HGL QS+ ++T  +  C     + YA R+F+Q  NPN+FL+NS+I+A++ +  +   + ++  +      +
Subjt:  REIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRI-V

Query:  PDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSL
        PD +TF  + KSC++L    LG+ V G   + G          ++++Y+  + + DA KVFDEM  RDV+ WN ++ G+ ++G +     LF  M ++++
Subjt:  PDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSL

Query:  VSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCR
        VSW  +IS     G  VEA++ F++M+  G EPDE++++++LP C++LG+L++G+ IH YA  +G  +  T V N+L++ Y KCG    A  +F +M  +
Subjt:  VSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCR

Query:  SVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP
         V+SW++MI G+A +G    AI+ F +M+R  VKPN  TF+ +L+AC H G+ ++G   F  M  +Y+I+PK+EH+GC++D+L R G +E A  + K+MP
Subjt:  SVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP

Query:  MQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS
        M+P++ +WG+LL +CRT GNL +A VA+  L+ LEP + GNYVLL+N+ A+ G+WEDV  +R+ +R +++KK PG S
Subjt:  MQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS

Arabidopsis top hits | e value | % identity
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 3.5e-99 | 33.74
Query:  LRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISIC---GAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYT
        L LLH  K+   L  IHA  ++ GLH +N  L+  I  C     F  + YA  VF   Q PN+ ++N+M + H+LS     +L L+  M +  ++P+ YT
Subjt:  LRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISIC---GAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYT

Query:  FAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNT
        F  +LKSC+    ++ GQ + G  L+ G +    +   ++ +YV   R+EDA KVFD+ PHRDVV +  +I+G+   G ++   +LF ++  + +VSWN 
Subjt:  FAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNT

Query:  IISCLAQNGRDVEALELFQQM-----------------------------------EEHGF---------------------------------------
        +IS  A+ G   EALELF+ M                                   ++HGF                                       
Subjt:  IISCLAQNGRDVEALELFQQM-----------------------------------EEHGF---------------------------------------

Query:  ---------------------------EPDEVTVVTILPVCSRLGALDVGQRIHSYASSK-GDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVV
                                    P++VT+++ILP C+ LGA+D+G+ IH Y   +   + + +++  SL+D Y KCG+ E A+ +F  +  +S+ 
Subjt:  ---------------------------EPDEVTVVTILPVCSRLGALDVGQRIHSYASSK-GDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVV

Query:  SWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMA-EYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQP
        SWN+MI GFA++G+ + + D+F +MR+  ++P+D TFV +L+AC HSG+L+ GR +F +M  +Y++ PKLEH+GCM+DLLG  G  +EA  +I  M M+P
Subjt:  SWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMA-EYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQP

Query:  NATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA
        +  +W +LL AC+ HGN++L E   + LI +EP N G+YVLLSN+ A  GRW +V   R  +  K +KKVPG S+
Subjt:  NATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA

AT1G09190.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 4.5e-163 | 59.66
Query:  EIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPD
        EIERK+LRLLHGH +RT L +IHAH LRH LH SN +L HFISICG+ +   YANRVFS  QNPN+ +FN+MIK +SL GP  +SL  FSSMK+  I  D
Subjt:  EIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPD

Query:  EYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVS
        EYT+APLLKSCS+L D R G+ V GE +R GF   G IRIGVVELY    RM DA+KVFDEM  R+VVVWNLMIRGFC  G+V+ GL LF+QMSERS+VS
Subjt:  EYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVS

Query:  WNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSV
        WN++IS L++ GRD EALELF +M + GF+PDE TVVT+LP+ + LG LD G+ IHS A S G   D  TVGN+LVDFYCK G+ E A  IF+KM  R+V
Subjt:  WNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSV

Query:  VSWNSMILGFALNGKGELAIDIFMKMRRE-DVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM
        VSWN++I G A+NGKGE  ID+F  M  E  V PN+ATF+ VL  C ++G +E+G ELF  M E ++++ + EH+G MVDL+ R G + EA   +K+MP+
Subjt:  VSWNSMILGFALNGKGELAIDIFMKMRRE-DVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM

Query:  QPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS
          NA +WG+LL ACR+HG++KLAEVA  EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K  GQS
Subjt:  QPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS

AT2G20540.1 mitochondrial editing factor 21 | 3.9e-106 | 38.57
Query:  REIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRI-V
        RE+E   +  L   KSR    +I+A  + HGL QS+ ++T  +  C     + YA R+F+Q  NPN+FL+NS+I+A++ +  +   + ++  +      +
Subjt:  REIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRI-V

Query:  PDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSL
        PD +TF  + KSC++L    LG+ V G   + G          ++++Y+  + + DA KVFDEM  RDV+ WN ++ G+ ++G +     LF  M ++++
Subjt:  PDEYTFAPLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSL

Query:  VSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCR
        VSW  +IS     G  VEA++ F++M+  G EPDE++++++LP C++LG+L++G+ IH YA  +G  +  T V N+L++ Y KCG    A  +F +M  +
Subjt:  VSWNTIISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCR

Query:  SVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP
         V+SW++MI G+A +G    AI+ F +M+R  VKPN  TF+ +L+AC H G+ ++G   F  M  +Y+I+PK+EH+GC++D+L R G +E A  + K+MP
Subjt:  SVVSWNSMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP

Query:  MQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS
        M+P++ +WG+LL +CRT GNL +A VA+  L+ LEP + GNYVLL+N+ A+ G+WEDV  +R+ +R +++KK PG S
Subjt:  MQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS

AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein | 3.4e-102 | 40.69
Query:  LTQIHAHFLRHGLHQSNQILTHFISICGAFN--KIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCSNLCD
        + QIH H LR GL QS  ILT  I            YA RV    Q  N FL+ ++I+ +++ G F +++ ++  M+   I P  +TF+ LLK+C  + D
Subjt:  LTQIHAHFLRHGLHQSNQILTHFISICGAFN--KIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCSNLCD

Query:  YRLGQLVIGEALR-RGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNTIISCLAQNGRDV
          LG+    +  R RGF CF  +   ++++YV CE ++ ARKVFDEMP RDV+ W  +I  + ++GN++    LF  +  + +V+W  +++  AQN +  
Subjt:  YRLGQLVIGEALR-RGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNTIISCLAQNGRDV

Query:  EALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKG-DLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWNSMILGFALNG
        EALE F +ME+ G   DEVTV   +  C++LGA     R    A   G    D   +G++L+D Y KCGN E A N+F  M  ++V +++SMILG A +G
Subjt:  EALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKG-DLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWNSMILGFALNG

Query:  KGELAIDIFMKM-RREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC
        + + A+ +F  M  + ++KPN  TFV  L AC HSGL+++GR++F SM + + +QP  +H+ CMVDLLGR G ++EA  LIK+M ++P+  +WGALLGAC
Subjt:  KGELAIDIFMKM-RREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAE-YEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC

Query:  RTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS
        R H N ++AE+A + L  LEP   GNY+LLSN+ A  G W  V  VR+ ++ K +KK P  S
Subjt:  RTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQS

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein    e value: 2.1e-99    % identity: 39.83
Query:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNHRIV--PDEYTFAP
        +HG+  +T L Q H + +  GL++ N  +  FI  C     + YA  VF+    PN +L N+MI+A S L  P   S+ +    K   +   PD +TF  
Subjt:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHS-LSGPFQQSLLLFSSMKNHRIV--PDEYTFAP

Query:  LLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS--ERSLVSWNTI
        +LK    + D   G+ + G+ +  GF+    +  G++++Y  C  + DARK+FDEM  +DV VWN ++ G+ K+G +D    L   M    R+ VSW  +
Subjt:  LLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMS--ERSLVSWNTI

Query:  ISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWN
        IS  A++GR  EA+E+FQ+M     EPDEVT++ +L  C+ LG+L++G+RI SY   +G +    ++ N+++D Y K GN   A ++F+ +  R+VV+W 
Subjt:  ISCLAQNGRDVEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWN

Query:  SMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT
        ++I G A +G G  A+ +F +M +  V+PND TF+A+L+AC H G ++ G+ LF+SM ++Y I P +EH+GCM+DLLGR G + EA  +IKSMP + NA 
Subjt:  SMILGFALNGKGELAIDIFMKMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSM-AEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT

Query:  LWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA
        +WG+LL A   H +L+L E A+ ELI LEP NSGNY+LL+N+ +  GRW++   +R  M+G  VKK+ G+S+
Subjt:  LWGALLGACRTHGNLKLAEVAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSA
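
In the alignments above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal Python sketch of how identity and positive counts could be tallied from such a row is given below. The function names and the toy midline are hypothetical and are not part of CuGenDB; the percentages shown on this page were computed by the database, not by this code.

```python
def midline_counts(midline: str) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST-style midline string.

    Letters count as identities; '+' counts as a conservative substitution.
    Positives are identities plus conservative substitutions.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives


def percent(count: int, aligned_columns: int) -> float:
    """Express a count as a percentage of the aligned columns."""
    return round(100.0 * count / aligned_columns, 2)


# Hypothetical 5-column midline, used only for illustration.
identities, positives = midline_counts("MK+ L")
print(identities, positives, percent(identities, 5))
```

To apply this to a full hit, the counts from every midline row would be summed and divided by the total alignment length, gap columns included.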


Sequences
CDS sequence
ATGAGCAAGAACTGTCGCGAGATCGAGCGGAAAATCCTCCGGCTCCTTCACGGCCACAAATCCCGAACCCATCTCACTCAAATCCACGCCCATTTCCTCCGCCATGGCCT
TCATCAATCCAACCAAATCCTCACCCATTTCATCTCCATTTGCGGAGCTTTCAACAAGATTGCCTATGCCAATCGCGTCTTTTCCCAATCTCAAAATCCCAACATCTTCC
TCTTCAATTCCATGATCAAAGCCCATTCCCTTTCCGGCCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCCATGAAGAACCACAGGATTGTCCCTGACGAGTACACTTTT
GCGCCATTGTTGAAATCCTGTTCGAATCTTTGCGATTATAGGCTTGGCCAGCTTGTGATTGGTGAGGCTTTGCGTCGTGGGTTTGAGTGTTTTGGGTCCATTCGTATTGG
GGTGGTTGAGTTGTATGTTTGTTGTGAGAGGATGGAGGATGCGCGGAAGGTGTTTGATGAAATGCCTCATAGAGATGTGGTTGTTTGGAACTTGATGATTCGTGGGTTTT
GCAAGATGGGTAATGTTGATTTTGGATTGCGTCTCTTTAGGCAAATGAGTGAACGTAGCCTTGTTTCTTGGAACACTATTATTTCCTGTTTAGCTCAAAATGGGCGTGAT
GTTGAAGCTTTGGAACTCTTTCAACAGATGGAAGAACATGGTTTTGAACCAGATGAGGTGACTGTGGTCACAATATTGCCTGTATGTTCTCGTTTGGGAGCTCTTGATGT
TGGACAAAGGATCCATTCTTATGCAAGTTCTAAGGGAGATTTAGTAGATACTACAACGGTTGGGAATTCGCTTGTCGATTTCTACTGTAAATGTGGGAATACAGAAGGGG
CTTACAACATTTTTCAGAAAATGACTTGCAGAAGTGTTGTTTCTTGGAATTCAATGATCTTGGGCTTTGCTTTGAATGGGAAGGGAGAGCTTGCTATTGACATTTTCATG
AAGATGAGAAGAGAGGATGTGAAGCCTAATGATGCAACATTCGTAGCCGTCTTGACCGCTTGTGTTCATTCGGGATTGTTAGAGAAGGGTCGAGAGCTATTTTCTTCAAT
GGCTGAGTACGAAATCCAGCCAAAACTTGAACATTTTGGTTGTATGGTTGATCTTTTGGGACGAGGTGGATGTGTGGAGGAGGCTCATAACTTGATTAAAAGCATGCCAA
TGCAGCCAAATGCCACTTTATGGGGTGCTTTGCTTGGTGCTTGCCGAACTCATGGCAACTTGAAACTTGCAGAAGTAGCAGTGAAAGAGCTCATCAGTCTTGAACCATGG
AACTCTGGTAATTATGTGTTGCTGTCAAATATGTTGGCAGAAGAAGGAAGATGGGAAGATGTTGAGAATGTTAGACGTTGGATGAGAGGAAAAAGCGTCAAGAAAGTACC
TGGGCAGAGTGCAAGTGGGTAA
mRNA sequence
ATGAGCAAGAACTGTCGCGAGATCGAGCGGAAAATCCTCCGGCTCCTTCACGGCCACAAATCCCGAACCCATCTCACTCAAATCCACGCCCATTTCCTCCGCCATGGCCT
TCATCAATCCAACCAAATCCTCACCCATTTCATCTCCATTTGCGGAGCTTTCAACAAGATTGCCTATGCCAATCGCGTCTTTTCCCAATCTCAAAATCCCAACATCTTCC
TCTTCAATTCCATGATCAAAGCCCATTCCCTTTCCGGCCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCCATGAAGAACCACAGGATTGTCCCTGACGAGTACACTTTT
GCGCCATTGTTGAAATCCTGTTCGAATCTTTGCGATTATAGGCTTGGCCAGCTTGTGATTGGTGAGGCTTTGCGTCGTGGGTTTGAGTGTTTTGGGTCCATTCGTATTGG
GGTGGTTGAGTTGTATGTTTGTTGTGAGAGGATGGAGGATGCGCGGAAGGTGTTTGATGAAATGCCTCATAGAGATGTGGTTGTTTGGAACTTGATGATTCGTGGGTTTT
GCAAGATGGGTAATGTTGATTTTGGATTGCGTCTCTTTAGGCAAATGAGTGAACGTAGCCTTGTTTCTTGGAACACTATTATTTCCTGTTTAGCTCAAAATGGGCGTGAT
GTTGAAGCTTTGGAACTCTTTCAACAGATGGAAGAACATGGTTTTGAACCAGATGAGGTGACTGTGGTCACAATATTGCCTGTATGTTCTCGTTTGGGAGCTCTTGATGT
TGGACAAAGGATCCATTCTTATGCAAGTTCTAAGGGAGATTTAGTAGATACTACAACGGTTGGGAATTCGCTTGTCGATTTCTACTGTAAATGTGGGAATACAGAAGGGG
CTTACAACATTTTTCAGAAAATGACTTGCAGAAGTGTTGTTTCTTGGAATTCAATGATCTTGGGCTTTGCTTTGAATGGGAAGGGAGAGCTTGCTATTGACATTTTCATG
AAGATGAGAAGAGAGGATGTGAAGCCTAATGATGCAACATTCGTAGCCGTCTTGACCGCTTGTGTTCATTCGGGATTGTTAGAGAAGGGTCGAGAGCTATTTTCTTCAAT
GGCTGAGTACGAAATCCAGCCAAAACTTGAACATTTTGGTTGTATGGTTGATCTTTTGGGACGAGGTGGATGTGTGGAGGAGGCTCATAACTTGATTAAAAGCATGCCAA
TGCAGCCAAATGCCACTTTATGGGGTGCTTTGCTTGGTGCTTGCCGAACTCATGGCAACTTGAAACTTGCAGAAGTAGCAGTGAAAGAGCTCATCAGTCTTGAACCATGG
AACTCTGGTAATTATGTGTTGCTGTCAAATATGTTGGCAGAAGAAGGAAGATGGGAAGATGTTGAGAATGTTAGACGTTGGATGAGAGGAAAAAGCGTCAAGAAAGTACC
TGGGCAGAGTGCAAGTGGGTAA
Protein sequence
MSKNCREIERKILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILTHFISICGAFNKIAYANRVFSQSQNPNIFLFNSMIKAHSLSGPFQQSLLLFSSMKNHRIVPDEYTF
APLLKSCSNLCDYRLGQLVIGEALRRGFECFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLRLFRQMSERSLVSWNTIISCLAQNGRD
VEALELFQQMEEHGFEPDEVTVVTILPVCSRLGALDVGQRIHSYASSKGDLVDTTTVGNSLVDFYCKCGNTEGAYNIFQKMTCRSVVSWNSMILGFALNGKGELAIDIFM
KMRREDVKPNDATFVAVLTACVHSGLLEKGRELFSSMAEYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACRTHGNLKLAEVAVKELISLEPW
NSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSVKKVPGQSASG
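
The listed genome location (LG08:6469368..6470819) can be cross-checked against the sequences above with a little arithmetic. The Python sketch below assumes the coordinates are 1-based and inclusive, and that the CDS spans the whole locus and ends in a stop codon (the CDS listing ends in TAA). Both function names are hypothetical helpers, not part of any CuGenDB tooling.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


locus_bp = span_length(6469368, 6470819)
print(locus_bp)                           # 1452
print(expected_protein_length(locus_bp))  # 483
```

If the CDS or protein listing had a different length than these values, that would point to introns, untranslated regions, or a truncated record rather than a single uninterrupted coding region.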