; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G19922 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G19922
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionPentatricopeptide repeat-containing protein
Genome locationctg4:3268685..3270748
RNA-Seq ExpressionCucsat.G19922
SyntenyCucsat.G19922
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004144815.1 pentatricopeptide repeat-containing protein At1g09190 isoform X1 [Cucumis sativus]0.0100Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
        HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

XP_008453700.1 PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis melo]0.096.9Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRI YADRLFSQSHNPNIFLFNSIIKAHSLS PFHQSLLLFS MKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
        HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEV  RGFYCFGSIRIGVVELYVCCEKMEDAWK FDEMSHRDVVVWNLMIRGFCK GNVDFGLCLFRQMS
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSY SSKGNLVG T VGNSLIDFYCKCGNIE AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKE +KPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

XP_023538021.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita pepo subsp. pepo]4.25e-29783.68Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQILAHFIS+C +FN IAYA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFSSMKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
         RIVPD+YTFAPLLKSC+NL +Y LG+CVI EV RRGF  FGSIRIGVVELYVCCE+M+DA K+FDEM  RDVVVWNLMIRGFCK GNVD GL LFRQM+
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEAL+LFQQMEEHGF+PDEVTVVTMLPVCSRLGAL+VGQ IHSYA+SK +LV  T VGNSLIDFYCK GN EKAYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNT+ILGFALNGKGE AIDLF EM +   KPNDAT VA+LTACVHSGLLEKGRE+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        +SMPMQPNATLWGA+LGACRTHGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRWE+VENVR+ MR K+VKKAPG+SASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

XP_031737169.1 pentatricopeptide repeat-containing protein At1g09190 isoform X2 [Cucumis sativus]0.094.21Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
        HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNL                            K
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

XP_038890516.1 pentatricopeptide repeat-containing protein At1g09190 [Benincasa hispida]0.088.43Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        M+KNC+EIERRILRLLHG KSRTHLT+IHAHFLRHGLHQSNQILAHFIS+CA+FN I+YA RLFSQSHNPNIFLFNSIIKAHSL  PF QSLLLFSSMKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
        HRIVPD+YTFAPLLKSCANL EYSLGQCVI+EV RRGFYCFGSIRIGVVELYVCCE+MEDA K+FDEM HRDVVVWNLMIRGFCK GNVDFGL LFRQM+
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        ERSL+SWNT++SCLAQ+R D EALELFQQMEE GFKPDEVTVVTMLPVCSRLGAL+VGQRIH+YASSKG++V  TT+GNSL+DFYCKCGN+E+AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNT+ILGFALNG GEFAIDLFM+M KE +KPNDATFVAVLTACVHSGLLEKGRELFSSMA+ YEIQPKLEHFGCMVDLLGRGGC+EEAH LI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAEMAVKEL SLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WM+ KSVKKAPGQSASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

TrEMBL top hitse value%identityAlignment
A0A0A0LJW6 Uncharacterized protein0.0100Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
        HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

A0A1S3BXN7 pentatricopeptide repeat-containing protein At1g091900.096.9Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRI YADRLFSQSHNPNIFLFNSIIKAHSLS PFHQSLLLFS MKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
        HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEV  RGFYCFGSIRIGVVELYVCCEKMEDAWK FDEMSHRDVVVWNLMIRGFCK GNVDFGLCLFRQMS
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSY SSKGNLVG T VGNSLIDFYCKCGNIE AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKE +KPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

A0A5A7U1F2 Pentatricopeptide repeat-containing protein0.096.9Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRI YADRLFSQSHNPNIFLFNSIIKAHSLS PFHQSLLLFS MKN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
        HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEV  RGFYCFGSIRIGVVELYVCCEKMEDAWK FDEMSHRDVVVWNLMIRGFCK GNVDFGLCLFRQMS
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSY SSKGNLVG T VGNSLIDFYCKCGNIE AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKE +KPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

A0A6J1DUP9 pentatricopeptide repeat-containing protein At1g091902.92e-29782.64Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        M+KN  EIER+ILRLLHGHK+R HLTQ HAHFLRHGLHQSNQILAHFIS+C + +++ YA R+FSQS NPNIFLFNS+IKAHSL  PF QSLLLFSSMK 
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
         RIVPD+YTFAPLLKSC+NLC+Y LGQCV  EV RRGF  FGSIRIGVVELYVCCE+MEDA K+FD M HRDV+VWNLMIRGFCKTGNVD G+ LFRQMS
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        +RS+VSWNTIISCLAQ+ RDVEALELFQQMEE GF PDEVTVVT+LPVCSRLGA  VGQRIH+YASSKGNLV IT VGNSL+DFYCKCGN E AYNIF K
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNT+ILGFALNGKGE A DLFMEM ++ +KPNDATFVA+LTACVHSGLLEKGRE+FSSM   Y+I PKLEHFGCMVDLLGR G VEEAH LI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAE+AVKELISLEPWNSGNYVLLSNMLA EGRWE+VENVR WMR KSV KAPGQSA+G
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

A0A6J1HMD5 pentatricopeptide repeat-containing protein At1g091902.92e-29783.26Show/hide
Query:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQILAHFIS+C  FN IAYA+R+FSQS NPNIFLFNS+IKAHSLS PF QSLLLFSS+KN
Subjt:  MSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKN

Query:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS
         RIVPD+YTFAPLLKSC+NL +Y LG+CVI EV RRGF CFGSIRIGVVELYVCCE+M+DA K+FDEM H DVVVWNLMIRGFCK GNVD GL LFRQM+
Subjt:  HRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS

Query:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQQMEEHGF+PDEVTVVTMLPVCSRLGA++VGQ IHSYA+SK +LV  T VGNSLIDFYCK GN E+AYNIFQK
Subjt:  ERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQK

Query:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI
        MTCKSVVSWNT+ILGFALNGKGE AIDLFMEM +   KPND T VA+LTACVHSGLLEKG+E+FSSMAE YEI+PKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKSVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLI

Query:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG
        +SMPMQPNATLWGA+LGACRTHGNLKLAEMAV ELISLEP NSGNYVLLSN LAEE RWE+VENVR+ MR K+VKKAPG+SASG
Subjt:  KSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG

SwissProt top hitse value%identityAlignment
O80488 Pentatricopeptide repeat-containing protein At1g091901.6e-15556.39Show/hide
Query:  MEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVP
        MEIER++LRLLHGH +RT L +IHAH LRH LH SN +LAHFIS+C S +   YA+R+FS   NPN+ +FN++IK +SL  P  +SL  FSSMK+  I  
Subjt:  MEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVP

Query:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV
        D+YT+APLLKSC++L +   G+CV  E+ R GF+  G IRIGVVELY    +M DA K+FDEMS R+VVVWNLMIRGFC +G+V+ GL LF+QMSERS+V
Subjt:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV

Query:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS
        SWN++IS L++  RD EALELF +M + GF PDE TVVT+LP+ + LG L+ G+ IHS A S G      TVGN+L+DFYCK G++E A  IF+KM  ++
Subjt:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS

Query:  VVSWNTIILGFALNGKGEFAIDLFMEMRKE-YLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP
        VVSWNT+I G A+NGKGEF IDLF  M +E  + PN+ATF+ VL  C ++G +E+G ELF  M E ++++ + EH+G MVDL+ R G + EA K +K+MP
Subjt:  VVSWNTIILGFALNGKGEFAIDLFMEMRKE-YLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP

Query:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        +  NA +WG++L ACR+HG++KLAE+A  EL+ +EP NSGNYVLLSN+ AEEGRW++VE VR  M++  ++K+ GQS
Subjt:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS

Q9FFG8 Pentatricopeptide repeat-containing protein At5g442303.1e-9536.77Show/hide
Query:  SSSSSSLLDNMSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFN--RIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPF
        +S++S     +S     +   ++  L    +   + QIH H LR GL QS  IL   I            YA R+       N FL+ ++I+ +++   F
Subjt:  SSSSSSLLDNMSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFN--RIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPF

Query:  HQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGN
         +++ ++  M+   I P  +TF+ LLK+C  + + +LG+   ++ FR   +CF  +   ++++YV CE ++ A K+FDEM  RDV+ W  +I  + + GN
Subjt:  HQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGN

Query:  VDFGLCLFRQMSERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKG-NLVGITTVGNSLIDFYCK
        ++    LF  +  + +V+W  +++  AQN +  EALE F +ME+ G + DEVTV   +  C++LGA +   R    A   G +      +G++LID Y K
Subjt:  VDFGLCLFRQMSERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKG-NLVGITTVGNSLIDFYCK

Query:  CGNIEKAYNIFQKMTCKSVVSWNTIILGFALNGKGEFAIDLFMEM-RKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDL
        CGN+E+A N+F  M  K+V +++++ILG A +G+ + A+ LF  M  +  +KPN  TFV  L AC HSGL+++GR++F SM + + +QP  +H+ CMVDL
Subjt:  CGNIEKAYNIFQKMTCKSVVSWNTIILGFALNGKGEFAIDLFMEM-RKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDL

Query:  LGRGGCVEEAHKLIKSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        LGR G ++EA +LIK+M ++P+  +WGA+LGACR H N ++AE+A + L  LEP   GNY+LLSN+ A  G W  V  VR+ ++EK +KK P  S
Subjt:  LGRGGCVEEAHKLIKSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665201.5e-9439.22Show/hide
Query:  LTQIHAHFLRHGLHQSNQILAHFISVC---ASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLC
        L QIHA  L+ GL Q +  +  F+S C    S + + YA  +F     P+ FL+N +I+  S S    +SLLL+  M       + YTF  LLK+C+NL 
Subjt:  LTQIHAHFLRHGLHQSNQILAHFISVC---ASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLC

Query:  EYSLGQCVISEVFRRGF----YCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLVSWNTIISCLAQN
         +     + +++ + G+    Y   S+    +  Y      + A  +FD +   D V WN +I+G+ K G +D  L LFR+M+E++ +SW T+IS   Q 
Subjt:  EYSLGQCVISEVFRRGF----YCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLVSWNTIISCLAQN

Query:  RRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKSVVSWNTIILGFA
          + EAL+LF +M+    +PD V++   L  C++LGALE G+ IHSY  +K  +   + +G  LID Y KCG +E+A  +F+ +  KSV +W  +I G+A
Subjt:  RRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKSVVSWNTIILGFA

Query:  LNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPMQPNATLWGAVLG
         +G G  AI  FMEM+K  +KPN  TF AVLTAC ++GL+E+G+ +F SM  DY ++P +EH+GC+VDLLGR G ++EA + I+ MP++PNA +WGA+L 
Subjt:  LNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPMQPNATLWGAVLG

Query:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        ACR H N++L E   + LI+++P++ G YV  +N+ A + +W++    R+ M+E+ V K PG S
Subjt:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS

Q9FMA1 Pentatricopeptide repeat-containing protein At5g563101.4e-9538.91Show/hide
Query:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIK---------AHSLSVPFHQSLLLFSSMKNHRIVPD
        +HG+  +T L Q H + +  GL++ N  +A FI  C++   + YA  +F+    PN +L N++I+         AHS+++  ++ L    +       PD
Subjt:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIK---------AHSLSVPFHQSLLLFSSMKNHRIVPD

Query:  QYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS--ERSL
         +TF  +LK    + +   G+ +  +V   GF     +  G++++Y  C  + DA KMFDEM  +DV VWN ++ G+ K G +D    L   M    R+ 
Subjt:  QYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS--ERSL

Query:  VSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCK
        VSW  +IS  A++ R  EA+E+FQ+M     +PDEVT++ +L  C+ LG+LE+G+RI SY   +G +    ++ N++ID Y K GNI KA ++F+ +  +
Subjt:  VSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCK

Query:  SVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP
        +VV+W TII G A +G G  A+ +F  M K  ++PND TF+A+L+AC H G ++ G+ LF+SM   Y I P +EH+GCM+DLLGR G + EA ++IKSMP
Subjt:  SVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP

Query:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSA
         + NA +WG++L A   H +L+L E A+ ELI LEP NSGNY+LL+N+ +  GRW+E   +R  M+   VKK  G+S+
Subjt:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSA

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205405.8e-10238.45Show/hide
Query:  EIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRI-VP
        E+E   +  L   KSR    +I+A  + HGL QS+ ++   +  C     + YA RLF+Q  NPN+FL+NSII+A++ +  +   + ++  +      +P
Subjt:  EIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRI-VP

Query:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV
        D++TF  + KSCA+L    LG+ V   + + G          ++++Y+  + + DA K+FDEM  RDV+ WN ++ G+ + G +     LF  M ++++V
Subjt:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV

Query:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS
        SW  +IS        VEA++ F++M+  G +PDE++++++LP C++LG+LE+G+ IH YA  +G L   T V N+LI+ Y KCG I +A  +F +M  K 
Subjt:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS

Query:  VVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPM
        V+SW+T+I G+A +G    AI+ F EM++  +KPN  TF+ +L+AC H G+ ++G   F  M +DY+I+PK+EH+GC++D+L R G +E A ++ K+MPM
Subjt:  VVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPM

Query:  QPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        +P++ +WG++L +CRT GNL +A +A+  L+ LEP + GNYVLL+N+ A+ G+WE+V  +R+ +R +++KK PG S
Subjt:  QPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS

Arabidopsis top hitse value%identityAlignment
AT1G09190.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-15656.39Show/hide
Query:  MEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVP
        MEIER++LRLLHGH +RT L +IHAH LRH LH SN +LAHFIS+C S +   YA+R+FS   NPN+ +FN++IK +SL  P  +SL  FSSMK+  I  
Subjt:  MEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVP

Query:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV
        D+YT+APLLKSC++L +   G+CV  E+ R GF+  G IRIGVVELY    +M DA K+FDEMS R+VVVWNLMIRGFC +G+V+ GL LF+QMSERS+V
Subjt:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV

Query:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS
        SWN++IS L++  RD EALELF +M + GF PDE TVVT+LP+ + LG L+ G+ IHS A S G      TVGN+L+DFYCK G++E A  IF+KM  ++
Subjt:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS

Query:  VVSWNTIILGFALNGKGEFAIDLFMEMRKE-YLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP
        VVSWNT+I G A+NGKGEF IDLF  M +E  + PN+ATF+ VL  C ++G +E+G ELF  M E ++++ + EH+G MVDL+ R G + EA K +K+MP
Subjt:  VVSWNTIILGFALNGKGEFAIDLFMEMRKE-YLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP

Query:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        +  NA +WG++L ACR+HG++KLAE+A  EL+ +EP NSGNYVLLSN+ AEEGRW++VE VR  M++  ++K+ GQS
Subjt:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS

AT2G20540.1 mitochondrial editing factor 214.1e-10338.45Show/hide
Query:  EIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRI-VP
        E+E   +  L   KSR    +I+A  + HGL QS+ ++   +  C     + YA RLF+Q  NPN+FL+NSII+A++ +  +   + ++  +      +P
Subjt:  EIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRI-VP

Query:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV
        D++TF  + KSCA+L    LG+ V   + + G          ++++Y+  + + DA K+FDEM  RDV+ WN ++ G+ + G +     LF  M ++++V
Subjt:  DQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV

Query:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS
        SW  +IS        VEA++ F++M+  G +PDE++++++LP C++LG+LE+G+ IH YA  +G L   T V N+LI+ Y KCG I +A  +F +M  K 
Subjt:  SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKS

Query:  VVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPM
        V+SW+T+I G+A +G    AI+ F EM++  +KPN  TF+ +L+AC H G+ ++G   F  M +DY+I+PK+EH+GC++D+L R G +E A ++ K+MPM
Subjt:  VVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPM

Query:  QPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        +P++ +WG++L +CRT GNL +A +A+  L+ LEP + GNYVLL+N+ A+ G+WE+V  +R+ +R +++KK PG S
Subjt:  QPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS

AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-9636.77Show/hide
Query:  SSSSSSLLDNMSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFN--RIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPF
        +S++S     +S     +   ++  L    +   + QIH H LR GL QS  IL   I            YA R+       N FL+ ++I+ +++   F
Subjt:  SSSSSSLLDNMSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFN--RIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPF

Query:  HQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGN
         +++ ++  M+   I P  +TF+ LLK+C  + + +LG+   ++ FR   +CF  +   ++++YV CE ++ A K+FDEM  RDV+ W  +I  + + GN
Subjt:  HQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGN

Query:  VDFGLCLFRQMSERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKG-NLVGITTVGNSLIDFYCK
        ++    LF  +  + +V+W  +++  AQN +  EALE F +ME+ G + DEVTV   +  C++LGA +   R    A   G +      +G++LID Y K
Subjt:  VDFGLCLFRQMSERSLVSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKG-NLVGITTVGNSLIDFYCK

Query:  CGNIEKAYNIFQKMTCKSVVSWNTIILGFALNGKGEFAIDLFMEM-RKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDL
        CGN+E+A N+F  M  K+V +++++ILG A +G+ + A+ LF  M  +  +KPN  TFV  L AC HSGL+++GR++F SM + + +QP  +H+ CMVDL
Subjt:  CGNIEKAYNIFQKMTCKSVVSWNTIILGFALNGKGEFAIDLFMEM-RKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDL

Query:  LGRGGCVEEAHKLIKSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        LGR G ++EA +LIK+M ++P+  +WGA+LGACR H N ++AE+A + L  LEP   GNY+LLSN+ A  G W  V  VR+ ++EK +KK P  S
Subjt:  LGRGGCVEEAHKLIKSMPMQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein9.9e-9738.91Show/hide
Query:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIK---------AHSLSVPFHQSLLLFSSMKNHRIVPD
        +HG+  +T L Q H + +  GL++ N  +A FI  C++   + YA  +F+    PN +L N++I+         AHS+++  ++ L    +       PD
Subjt:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIK---------AHSLSVPFHQSLLLFSSMKNHRIVPD

Query:  QYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS--ERSL
         +TF  +LK    + +   G+ +  +V   GF     +  G++++Y  C  + DA KMFDEM  +DV VWN ++ G+ K G +D    L   M    R+ 
Subjt:  QYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMS--ERSL

Query:  VSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCK
        VSW  +IS  A++ R  EA+E+FQ+M     +PDEVT++ +L  C+ LG+LE+G+RI SY   +G +    ++ N++ID Y K GNI KA ++F+ +  +
Subjt:  VSWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCK

Query:  SVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP
        +VV+W TII G A +G G  A+ +F  M K  ++PND TF+A+L+AC H G ++ G+ LF+SM   Y I P +EH+GCM+DLLGR G + EA ++IKSMP
Subjt:  SVVSWNTIILGFALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMP

Query:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSA
         + NA +WG++L A   H +L+L E A+ ELI LEP NSGNY+LL+N+ +  GRW+E   +R  M+   VKK  G+S+
Subjt:  MQPNATLWGAVLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSA

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-9539.22Show/hide
Query:  LTQIHAHFLRHGLHQSNQILAHFISVC---ASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLC
        L QIHA  L+ GL Q +  +  F+S C    S + + YA  +F     P+ FL+N +I+  S S    +SLLL+  M       + YTF  LLK+C+NL 
Subjt:  LTQIHAHFLRHGLHQSNQILAHFISVC---ASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLFSSMKNHRIVPDQYTFAPLLKSCANLC

Query:  EYSLGQCVISEVFRRGF----YCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLVSWNTIISCLAQN
         +     + +++ + G+    Y   S+    +  Y      + A  +FD +   D V WN +I+G+ K G +D  L LFR+M+E++ +SW T+IS   Q 
Subjt:  EYSLGQCVISEVFRRGF----YCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLVSWNTIISCLAQN

Query:  RRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKSVVSWNTIILGFA
          + EAL+LF +M+    +PD V++   L  C++LGALE G+ IHSY  +K  +   + +G  LID Y KCG +E+A  +F+ +  KSV +W  +I G+A
Subjt:  RRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKSVVSWNTIILGFA

Query:  LNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPMQPNATLWGAVLG
         +G G  AI  FMEM+K  +KPN  TF AVLTAC ++GL+E+G+ +F SM  DY ++P +EH+GC+VDLLGR G ++EA + I+ MP++PNA +WGA+L 
Subjt:  LNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPMQPNATLWGAVLG

Query:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS
        ACR H N++L E   + LI+++P++ G YV  +N+ A + +W++    R+ M+E+ V K PG S
Subjt:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TACTCTTCGTTCACTTCTTCTTCTTCTTCTTCTCTCCTCGATAACATGAGCAAGAACTGTATGGAGATCGAGCGGAGAATCCTACGCCTTCTTCACGGCCACAAATCTCG
AACCCATCTCACTCAAATCCACGCCCACTTCCTCCGCCATGGCCTCCATCAATCCAACCAAATTCTCGCCCATTTCATCTCCGTTTGCGCCTCTTTCAACCGCATTGCCT
ATGCCGATCGCCTCTTTTCCCAATCTCACAACCCCAACATCTTCCTATTCAATTCAATCATCAAAGCTCACTCCCTCTCTGTTCCCTTCCATCAATCCCTTCTCCTGTTT
TCCTCCATGAAGAATCACAGGATTGTTCCTGATCAGTACACTTTTGCGCCGTTGCTGAAATCCTGTGCCAATCTCTGTGAGTATAGTCTTGGTCAGTGTGTGATTTCTGA
AGTTTTTCGTCGTGGGTTTTACTGTTTTGGGTCTATTCGTATTGGGGTTGTTGAGTTGTATGTTTGTTGTGAGAAGATGGAGGATGCATGGAAGATGTTTGATGAAATGT
CTCACAGGGATGTGGTTGTTTGGAACTTGATGATTCGTGGATTTTGCAAGACGGGAAATGTTGATTTTGGGCTGTGTCTCTTTAGACAAATGAGTGAACGTAGTCTTGTT
TCTTGGAACACTATTATTTCCTGTTTAGCTCAAAATAGGCGTGATGTTGAAGCTTTGGAACTCTTTCAACAAATGGAAGAACATGGGTTTAAACCAGATGAGGTTACTGT
GGTCACAATGTTGCCTGTATGTTCTCGTTTGGGAGCTCTTGAAGTTGGACAAAGAATCCATTCTTATGCAAGTTCTAAGGGAAATTTGGTAGGTATTACGACTGTTGGGA
ATTCACTCATTGATTTTTACTGTAAATGTGGTAACATAGAAAAGGCTTACAACATATTTCAGAAAATGACTTGCAAAAGCGTCGTCTCTTGGAATACAATTATTTTGGGC
TTTGCTTTAAATGGGAAGGGGGAGTTTGCTATTGACCTTTTTATGGAGATGCGAAAGGAGTACTTGAAGCCAAATGATGCAACATTTGTAGCTGTCTTGACTGCTTGTGT
TCATTCGGGATTGTTAGAGAAGGGTCGAGAGCTATTTTCTTCAATGGCTGAGGATTACGAAATCCAGCCAAAACTCGAACATTTTGGTTGTATGGTTGATCTTTTGGGAC
GTGGTGGGTGTGTGGAGGAGGCTCATAAATTGATTAAAAGCATGCCAATGCAACCAAATGCCACTTTATGGGGAGCTGTGCTTGGTGCTTGCCGAACCCATGGCAACTTG
AAACTTGCAGAAATGGCAGTGAAAGAGCTCATCAGTCTTGAACCATGGAACTCTGGTAATTATGTATTGTTGTCAAATATGTTGGCAGAGGAAGGAAGATGGGAAGAAGT
TGAGAATGTCAGACAGTGGATGAGAGAAAAGAGCGTCAAGAAAGCACCTGGGCAGAGTGCAAGTGGGTAA
mRNA sequenceShow/hide mRNA sequence
TACTCTTCGTTCACTTCTTCTTCTTCTTCTTCTCTCCTCGATAACATGAGCAAGAACTGTATGGAGATCGAGCGGAGAATCCTACGCCTTCTTCACGGCCACAAATCTCG
AACCCATCTCACTCAAATCCACGCCCACTTCCTCCGCCATGGCCTCCATCAATCCAACCAAATTCTCGCCCATTTCATCTCCGTTTGCGCCTCTTTCAACCGCATTGCCT
ATGCCGATCGCCTCTTTTCCCAATCTCACAACCCCAACATCTTCCTATTCAATTCAATCATCAAAGCTCACTCCCTCTCTGTTCCCTTCCATCAATCCCTTCTCCTGTTT
TCCTCCATGAAGAATCACAGGATTGTTCCTGATCAGTACACTTTTGCGCCGTTGCTGAAATCCTGTGCCAATCTCTGTGAGTATAGTCTTGGTCAGTGTGTGATTTCTGA
AGTTTTTCGTCGTGGGTTTTACTGTTTTGGGTCTATTCGTATTGGGGTTGTTGAGTTGTATGTTTGTTGTGAGAAGATGGAGGATGCATGGAAGATGTTTGATGAAATGT
CTCACAGGGATGTGGTTGTTTGGAACTTGATGATTCGTGGATTTTGCAAGACGGGAAATGTTGATTTTGGGCTGTGTCTCTTTAGACAAATGAGTGAACGTAGTCTTGTT
TCTTGGAACACTATTATTTCCTGTTTAGCTCAAAATAGGCGTGATGTTGAAGCTTTGGAACTCTTTCAACAAATGGAAGAACATGGGTTTAAACCAGATGAGGTTACTGT
GGTCACAATGTTGCCTGTATGTTCTCGTTTGGGAGCTCTTGAAGTTGGACAAAGAATCCATTCTTATGCAAGTTCTAAGGGAAATTTGGTAGGTATTACGACTGTTGGGA
ATTCACTCATTGATTTTTACTGTAAATGTGGTAACATAGAAAAGGCTTACAACATATTTCAGAAAATGACTTGCAAAAGCGTCGTCTCTTGGAATACAATTATTTTGGGC
TTTGCTTTAAATGGGAAGGGGGAGTTTGCTATTGACCTTTTTATGGAGATGCGAAAGGAGTACTTGAAGCCAAATGATGCAACATTTGTAGCTGTCTTGACTGCTTGTGT
TCATTCGGGATTGTTAGAGAAGGGTCGAGAGCTATTTTCTTCAATGGCTGAGGATTACGAAATCCAGCCAAAACTCGAACATTTTGGTTGTATGGTTGATCTTTTGGGAC
GTGGTGGGTGTGTGGAGGAGGCTCATAAATTGATTAAAAGCATGCCAATGCAACCAAATGCCACTTTATGGGGAGCTGTGCTTGGTGCTTGCCGAACCCATGGCAACTTG
AAACTTGCAGAAATGGCAGTGAAAGAGCTCATCAGTCTTGAACCATGGAACTCTGGTAATTATGTATTGTTGTCAAATATGTTGGCAGAGGAAGGAAGATGGGAAGAAGT
TGAGAATGTCAGACAGTGGATGAGAGAAAAGAGCGTCAAGAAAGCACCTGGGCAGAGTGCAAGTGGGTAA
Protein sequenceShow/hide protein sequence
YSSFTSSSSSSLLDNMSKNCMEIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILAHFISVCASFNRIAYADRLFSQSHNPNIFLFNSIIKAHSLSVPFHQSLLLF
SSMKNHRIVPDQYTFAPLLKSCANLCEYSLGQCVISEVFRRGFYCFGSIRIGVVELYVCCEKMEDAWKMFDEMSHRDVVVWNLMIRGFCKTGNVDFGLCLFRQMSERSLV
SWNTIISCLAQNRRDVEALELFQQMEEHGFKPDEVTVVTMLPVCSRLGALEVGQRIHSYASSKGNLVGITTVGNSLIDFYCKCGNIEKAYNIFQKMTCKSVVSWNTIILG
FALNGKGEFAIDLFMEMRKEYLKPNDATFVAVLTACVHSGLLEKGRELFSSMAEDYEIQPKLEHFGCMVDLLGRGGCVEEAHKLIKSMPMQPNATLWGAVLGACRTHGNL
KLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEEVENVRQWMREKSVKKAPGQSASG