CuGenDBv2

ClCG00G004375 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG00G004375
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Pentatricopeptide repeat-containing protein
Genome location: CG_Chr00:10279993..10281447
RNA-Seq Expression: ClCG00G004375
Synteny: ClCG00G004375
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology

GenBank top hits (e value; % identity; alignment)
XP_004144815.1 pentatricopeptide repeat-containing protein At1g09190 isoform X1 [Cucumis sativus] (e value: 1.6e-263; identity: 91.32%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA+RLFSQSHNPNIFLFNSIIKAHSLS PF QSLLLFSSMKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
        HRIVPD+YTFAPLLKSCANLCEY LGQCVISEV RRGFYCFGSIRIGVVELYVCCE+MEDA K+FDEM HRDVVVWNLMIRGFCK GNVDFGLCLFRQMS
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        ERSLVSWNT+ISCLAQNRRDVEALELFQ+MEE+GFKPDEVTVVTMLPVCSRLGAL+VGQRIHS+ASSKG+LV ITTVGNSL+DFYCKCGN E+AYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +E +KPNDATFVAVLTACVHSGLLEKGRE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

XP_008453700.1 PREDICTED: pentatricopeptide repeat-containing protein At1g09190 [Cucumis melo] (e value: 8.6e-265; identity: 91.74%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA+RLFSQSHNPNIFLFNSIIKAHSLSPPF QSLLLFS MKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
        HRIVPD+YTFAPLLKSCANLCEY LGQCVISEVL RGFYCFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        ERSLVSWNT+ISCLAQNRRDVEALELFQ+MEE+GFKPDEVTVVTMLPVCSRLGAL+VGQRIHS+ SSKG+LV  T VGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +EDVKPNDATFVAVLTACVHSGLLEKGRE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

XP_022965696.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita maxima] (e value: 3.7e-244; identity: 85.95%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQIL+HFISIC  FN I YANR+FSQS NPNIFLFNS+IKAHSLS PF+QSLLLFSS+KN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
         RIVPDEYTFAPLLKSC+NL +YRLG+CVI EVLRRGF CFGSIRIGVVELYVCCERM+DA+KVFDEMPH DVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQ+MEE+GF+PDEVTVVTMLPVCSRLGA+DVGQ IHS+A+SK DLV+ T VGNSL+DFYCK GNTERAYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNTMILGFALNGKGE AIDLFM MG+ D KPND T VA+LTACVHSGLLEKG+E+FSSMAEKYEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMAV ELISLEP NSGNYVLLSN LAEE RWEDVENVRR MRGK++KKAPG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

XP_023538021.1 pentatricopeptide repeat-containing protein At1g09190 [Cucurbita pepo subsp. pepo] (e value: 1.3e-244; identity: 86.57%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQIL+HFISIC AFN I YANR+FSQS NPNIFLFNS+IKAHSLS PFQQSLLLFSSMKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
         RIVPDEYTFAPLLKSC+NL +YRLG+CVI EVLRRGF  FGSIRIGVVELYVCCERM+DA+KVFDEMP RDVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEAL+LFQ+MEE+GF+PDEVTVVTMLPVCSRLGALDVGQ IHS+A+SK DLV+ T VGNSL+DFYCK GNTE+AYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNTMILGFALNGKGE AIDLF  MGR D KPNDAT VA+LTACVHSGLLEKGRE+FSSMAEKYEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRWEDVENVRR MRGK++KKAPG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

XP_038890516.1 pentatricopeptide repeat-containing protein At1g09190 [Benincasa hispida] (e value: 1.8e-267; identity: 92.56%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        M+KNCL+IERRILRLLHG KSRTHLT+IHAHFLRHGLHQSNQIL+HFISICAAFNHI YA+RLFSQSHNPNIFLFNSIIKAHSL PPFQQSLLLFSSMKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
        HRIVPDEYTFAPLLKSCANL EY LGQCVI+EVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        ERSL+SWNTM+SCLAQ+R D EALELFQ+MEE GFKPDEVTVVTMLPVCSRLGALDVGQRIH++ASSKGD+V  TT+GNSLVDFYCKCGN ERAYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNTMILGFALNG GEFAIDLFM MG+EDVKPNDATFVAVLTACVHSGLLEKGRE+FSSMA+KYEIQPKLEHFGCMVDLLGRGGC+EEAHNLI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKEL SLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWM+GKS+KKAPGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

TrEMBL top hits (e value; % identity; alignment)
A0A0A0LJW6 Uncharacterized protein (e value: 7.8e-264; identity: 91.32%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA+RLFSQSHNPNIFLFNSIIKAHSLS PF QSLLLFSSMKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
        HRIVPD+YTFAPLLKSCANLCEY LGQCVISEV RRGFYCFGSIRIGVVELYVCCE+MEDA K+FDEM HRDVVVWNLMIRGFCK GNVDFGLCLFRQMS
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        ERSLVSWNT+ISCLAQNRRDVEALELFQ+MEE+GFKPDEVTVVTMLPVCSRLGAL+VGQRIHS+ASSKG+LV ITTVGNSL+DFYCKCGN E+AYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +E +KPNDATFVAVLTACVHSGLLEKGRE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAH LI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        KSMPMQPNATLWGA+LGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

A0A1S3BXN7 pentatricopeptide repeat-containing protein At1g09190 (e value: 4.1e-265; identity: 91.74%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA+RLFSQSHNPNIFLFNSIIKAHSLSPPF QSLLLFS MKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
        HRIVPD+YTFAPLLKSCANLCEY LGQCVISEVL RGFYCFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        ERSLVSWNT+ISCLAQNRRDVEALELFQ+MEE+GFKPDEVTVVTMLPVCSRLGAL+VGQRIHS+ SSKG+LV  T VGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +EDVKPNDATFVAVLTACVHSGLLEKGRE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

A0A5A7U1F2 Pentatricopeptide repeat-containing protein (e value: 4.1e-265; identity: 91.74%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKNC++IERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQIL+HFIS+CA+FN I YA+RLFSQSHNPNIFLFNSIIKAHSLSPPF QSLLLFS MKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
        HRIVPD+YTFAPLLKSCANLCEY LGQCVISEVL RGFYCFGSIRIGVVELYVCCE+MEDA K FDEM HRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        ERSLVSWNT+ISCLAQNRRDVEALELFQ+MEE+GFKPDEVTVVTMLPVCSRLGAL+VGQRIHS+ SSKG+LV  T VGNSL+DFYCKCGN E AYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNT+ILGFALNGKGEFAIDLFM M +EDVKPNDATFVAVLTACVHSGLLEKGRE+FSSMAE YEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWE+VENVR+WMR KS+KKAPGQSASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

A0A6J1FGY4 pentatricopeptide repeat-containing protein At1g09190 (e value: 2.4e-244; identity: 86.16%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQIL+HFISIC A N I YANR+FSQS NPNIFLFNS+IKAHSLS PFQQSLLLFSSMKN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
         RIVPDEYTFAPLLKSC+NL +YRLG+CVI EVLRRGF CFGSIRIGVVELYVCCE+M+DA+KVFDEMP RDVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQ++EE+GF+PDEVTVVTMLPVCSRLGALDVGQ IHS+A+SK DL++ T VGNSL+DFYCK GNTERAYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCK+VVSWNTMILGFALNGKGE AIDLFM MGR DVKPNDAT VA+LTACVHSGLLEKGRE+FSSMAEKYEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMA  ELISLEP NSGNYVLLSN+LAEEGRW+DVENVR  MRGK++KKAPG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

A0A6J1HMD5 pentatricopeptide repeat-containing protein At1g09190 (e value: 1.8e-244; identity: 85.95%)
Query:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN
        MSKN   IERRILRLL GHKS THLTQIHAHFLRH LHQSNQIL+HFISIC  FN I YANR+FSQS NPNIFLFNS+IKAHSLS PF+QSLLLFSS+KN
Subjt:  MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKN

Query:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS
         RIVPDEYTFAPLLKSC+NL +YRLG+CVI EVLRRGF CFGSIRIGVVELYVCCERM+DA+KVFDEMPH DVVVWNLMIRGFCKMGNVD GL LFRQM+
Subjt:  HRIVPDEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS

Query:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK
        +RSLVSWNT ISCLAQ+ RDVEALELFQ+MEE+GF+PDEVTVVTMLPVCSRLGA+DVGQ IHS+A+SK DLV+ T VGNSL+DFYCK GNTERAYNIFQK
Subjt:  ERSLVSWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQK

Query:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI
        MTCKSVVSWNTMILGFALNGKGE AIDLFM MG+ D KPND T VA+LTACVHSGLLEKG+E+FSSMAEKYEI+PKLEHFGCMVDLLGRGGCVEEAH+LI
Subjt:  MTCKSVVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLI

Query:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG
        +SMPMQPNATLWGALLGACRTHGNLKLAEMAV ELISLEP NSGNYVLLSN LAEE RWEDVENVRR MRGK++KKAPG+SASG
Subjt:  KSMPMQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG

SwissProt top hits (e value; % identity; alignment)
O80488 Pentatricopeptide repeat-containing protein At1g09190 (e value: 1.5e-158; identity: 57.65%)
Query:  LQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVP
        ++IER++LRLLHGH +RT L +IHAH LRH LH SN +L+HFISIC + ++ DYANR+FS   NPN+ +FN++IK +SL  P  +SL  FSSMK+  I  
Subjt:  LQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVP

Query:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV
        DEYT+APLLKSC++L + R G+CV  E++R GF+  G IRIGVVELY    RM DA+KVFDEM  R+VVVWNLMIRGFC  G+V+ GL LF+QMSERS+V
Subjt:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV

Query:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS
        SWN+MIS L++  RD EALELF  M + GF PDE TVVT+LP+ + LG LD G+ IHS A S G      TVGN+LVDFYCK G+ E A  IF+KM  ++
Subjt:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS

Query:  VVSWNTMILGFALNGKGEFAIDLFMVMGRE-DVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP
        VVSWNT+I G A+NGKGEF IDLF  M  E  V PN+ATF+ VL  C ++G +E+G E+F  M E+++++ + EH+G MVDL+ R G + EA   +K+MP
Subjt:  VVSWNTMILGFALNGKGEFAIDLFMVMGRE-DVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP

Query:  MQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        +  NA +WG+LL ACR+HG++KLAE+A  EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K+ GQS
Subjt:  MQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS

Q9FFG8 Pentatricopeptide repeat-containing protein At5g44230 (e value: 1.5e-99; identity: 39.61%)
Query:  LTQIHAHFLRHGLHQSNQILSHFISICAAFN--HIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCE
        + QIH H LR GL QS  IL+  I            YA R+       N FL+ ++I+ +++   F +++ ++  M+   I P  +TF+ LLK+C  + +
Subjt:  LTQIHAHFLRHGLHQSNQILSHFISICAAFN--HIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCE

Query:  YRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQNRRDVE
          LG+   ++  R   +CF  +   ++++YV CE ++ ARKVFDEMP RDV+ W  +I  + ++GN++    LF  +  + +V+W  M++  AQN +  E
Subjt:  YRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQNRRDVE

Query:  ALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKG--DLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFALNG
        ALE F RME+ G + DEVTV   +  C++LGA     R    A   G     H+  +G++L+D Y KCGN E A N+F  M  K+V ++++MILG A +G
Subjt:  ALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKG--DLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFALNG

Query:  KGEFAIDLFMVM-GREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC
        + + A+ LF  M  + ++KPN  TFV  L AC HSGL+++GR++F SM + + +QP  +H+ CMVDLLGR G ++EA  LIK+M ++P+  +WGALLGAC
Subjt:  KGEFAIDLFMVM-GREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC

Query:  RTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        R H N ++AE+A + L  LEP   GNY+LLSN+ A  G W  V  VR+ ++ K +KK P  S
Subjt:  RTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 (e value: 4.8e-93; identity: 38.58%)
Query:  LTQIHAHFLRHGLHQSNQILSHFISICAAFNHID---YANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLC
        L QIHA  L+ GL Q +  ++ F+S C +    D   YA  +F     P+ FL+N +I+  S S   ++SLLL+  M       + YTF  LLK+C+NL 
Subjt:  LTQIHAHFLRHGLHQSNQILSHFISICAAFNHID---YANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLC

Query:  EYRLGQCVISEVLRRGF----YCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQN
         +     + +++ + G+    Y   S+    +  Y      + A  +FD +P  D V WN +I+G+ K G +D  L LFR+M+E++ +SW TMIS   Q 
Subjt:  EYRLGQCVISEVLRRGF----YCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQN

Query:  RRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFA
          + EAL+LF  M+    +PD V++   L  C++LGAL+ G+ IHS+  +K  +   + +G  L+D Y KCG  E A  +F+ +  KSV +W  +I G+A
Subjt:  RRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFA

Query:  LNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLG
         +G G  AI  FM M +  +KPN  TF AVLTAC ++GL+E+G+ IF SM   Y ++P +EH+GC+VDLLGR G ++EA   I+ MP++PNA +WGALL 
Subjt:  LNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLG

Query:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        ACR H N++L E   + LI+++P++ G YV  +N+ A + +W+     RR M+ + + K PG S
Subjt:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS

Q9FMA1 Pentatricopeptide repeat-containing protein At5g56310 (e value: 1.0e-95; identity: 38.77%)
Query:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHS-LSPPFQQSLLLFSSMKNHRIV--PDEYTFAP
        +HG+  +T L Q H + +  GL++ N  ++ FI  C+   H+ YA  +F+    PN +L N++I+A S L  P   S+ +    K   +   PD +TF  
Subjt:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHS-LSPPFQQSLLLFSSMKNHRIV--PDEYTFAP

Query:  LLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS--ERSLVSWNTM
        +LK    + +   G+ +  +V+  GF     +  G++++Y  C  + DARK+FDEM  +DV VWN ++ G+ K+G +D    L   M    R+ VSW  +
Subjt:  LLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS--ERSLVSWNTM

Query:  ISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWN
        IS  A++ R  EA+E+FQRM     +PDEVT++ +L  C+ LG+L++G+RI S+   +G +    ++ N+++D Y K GN  +A ++F+ +  ++VV+W 
Subjt:  ISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWN

Query:  TMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT
        T+I G A +G G  A+ +F  M +  V+PND TF+A+L+AC H G ++ G+ +F+SM  KY I P +EH+GCM+DLLGR G + EA  +IKSMP + NA 
Subjt:  TMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT

Query:  LWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSA
        +WG+LL A   H +L+L E A+ ELI LEP NSGNY+LL+N+ +  GRW++   +R  M+G  +KK  G+S+
Subjt:  LWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSA

Q9SIL5 Pentatricopeptide repeat-containing protein At2g20540 (e value: 6.7e-103; identity: 38.66%)
Query:  QIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRI-VP
        ++E   +  L   KSR    +I+A  + HGL QS+ +++  +  C     +DYA RLF+Q  NPN+FL+NSII+A++ +  +   + ++  +      +P
Subjt:  QIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRI-VP

Query:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV
        D +TF  + KSCA+L    LG+ V   + + G          ++++Y+  + + DA KVFDEM  RDV+ WN ++ G+ ++G +     LF  M ++++V
Subjt:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV

Query:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS
        SW  MIS        VEA++ F+ M+  G +PDE++++++LP C++LG+L++G+ IH +A  +G L   T V N+L++ Y KCG   +A  +F +M  K 
Subjt:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS

Query:  VVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM
        V+SW+TMI G+A +G    AI+ F  M R  VKPN  TF+ +L+AC H G+ ++G   F  M + Y+I+PK+EH+GC++D+L R G +E A  + K+MPM
Subjt:  VVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM

Query:  QPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        +P++ +WG+LL +CRT GNL +A +A+  L+ LEP + GNYVLL+N+ A+ G+WEDV  +R+ +R +++KK PG S
Subjt:  QPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS

Arabidopsis top hits (e value; % identity; alignment)
AT1G09190.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.0e-159; identity: 57.65%)
Query:  LQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVP
        ++IER++LRLLHGH +RT L +IHAH LRH LH SN +L+HFISIC + ++ DYANR+FS   NPN+ +FN++IK +SL  P  +SL  FSSMK+  I  
Subjt:  LQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVP

Query:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV
        DEYT+APLLKSC++L + R G+CV  E++R GF+  G IRIGVVELY    RM DA+KVFDEM  R+VVVWNLMIRGFC  G+V+ GL LF+QMSERS+V
Subjt:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV

Query:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS
        SWN+MIS L++  RD EALELF  M + GF PDE TVVT+LP+ + LG LD G+ IHS A S G      TVGN+LVDFYCK G+ E A  IF+KM  ++
Subjt:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS

Query:  VVSWNTMILGFALNGKGEFAIDLFMVMGRE-DVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP
        VVSWNT+I G A+NGKGEF IDLF  M  E  V PN+ATF+ VL  C ++G +E+G E+F  M E+++++ + EH+G MVDL+ R G + EA   +K+MP
Subjt:  VVSWNTMILGFALNGKGEFAIDLFMVMGRE-DVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMP

Query:  MQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        +  NA +WG+LL ACR+HG++KLAE+A  EL+ +EP NSGNYVLLSN+ AEEGRW+DVE VR  M+   ++K+ GQS
Subjt:  MQPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS

AT2G20540.1 mitochondrial editing factor 21 (e value: 4.7e-104; identity: 38.66%)
Query:  QIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRI-VP
        ++E   +  L   KSR    +I+A  + HGL QS+ +++  +  C     +DYA RLF+Q  NPN+FL+NSII+A++ +  +   + ++  +      +P
Subjt:  QIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRI-VP

Query:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV
        D +TF  + KSCA+L    LG+ V   + + G          ++++Y+  + + DA KVFDEM  RDV+ WN ++ G+ ++G +     LF  M ++++V
Subjt:  DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLV

Query:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS
        SW  MIS        VEA++ F+ M+  G +PDE++++++LP C++LG+L++G+ IH +A  +G L   T V N+L++ Y KCG   +A  +F +M  K 
Subjt:  SWNTMISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKS

Query:  VVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM
        V+SW+TMI G+A +G    AI+ F  M R  VKPN  TF+ +L+AC H G+ ++G   F  M + Y+I+PK+EH+GC++D+L R G +E A  + K+MPM
Subjt:  VVSWNTMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPM

Query:  QPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        +P++ +WG+LL +CRT GNL +A +A+  L+ LEP + GNYVLL+N+ A+ G+WEDV  +R+ +R +++KK PG S
Subjt:  QPNATLWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS

AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.1e-100; identity: 39.61%)
Query:  LTQIHAHFLRHGLHQSNQILSHFISICAAFN--HIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCE
        + QIH H LR GL QS  IL+  I            YA R+       N FL+ ++I+ +++   F +++ ++  M+   I P  +TF+ LLK+C  + +
Subjt:  LTQIHAHFLRHGLHQSNQILSHFISICAAFN--HIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLCE

Query:  YRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQNRRDVE
          LG+   ++  R   +CF  +   ++++YV CE ++ ARKVFDEMP RDV+ W  +I  + ++GN++    LF  +  + +V+W  M++  AQN +  E
Subjt:  YRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQNRRDVE

Query:  ALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKG--DLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFALNG
        ALE F RME+ G + DEVTV   +  C++LGA     R    A   G     H+  +G++L+D Y KCGN E A N+F  M  K+V ++++MILG A +G
Subjt:  ALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKG--DLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFALNG

Query:  KGEFAIDLFMVM-GREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC
        + + A+ LF  M  + ++KPN  TFV  L AC HSGL+++GR++F SM + + +QP  +H+ CMVDLLGR G ++EA  LIK+M ++P+  +WGALLGAC
Subjt:  KGEFAIDLFMVM-GREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGAC

Query:  RTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        R H N ++AE+A + L  LEP   GNY+LLSN+ A  G W  V  VR+ ++ K +KK P  S
Subjt:  RTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 7.3e-97; identity: 38.77%)
Query:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHS-LSPPFQQSLLLFSSMKNHRIV--PDEYTFAP
        +HG+  +T L Q H + +  GL++ N  ++ FI  C+   H+ YA  +F+    PN +L N++I+A S L  P   S+ +    K   +   PD +TF  
Subjt:  LHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHS-LSPPFQQSLLLFSSMKNHRIV--PDEYTFAP

Query:  LLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS--ERSLVSWNTM
        +LK    + +   G+ +  +V+  GF     +  G++++Y  C  + DARK+FDEM  +DV VWN ++ G+ K+G +D    L   M    R+ VSW  +
Subjt:  LLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMS--ERSLVSWNTM

Query:  ISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWN
        IS  A++ R  EA+E+FQRM     +PDEVT++ +L  C+ LG+L++G+RI S+   +G +    ++ N+++D Y K GN  +A ++F+ +  ++VV+W 
Subjt:  ISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWN

Query:  TMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT
        T+I G A +G G  A+ +F  M +  V+PND TF+A+L+AC H G ++ G+ +F+SM  KY I P +EH+GCM+DLLGR G + EA  +IKSMP + NA 
Subjt:  TMILGFALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNAT

Query:  LWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSA
        +WG+LL A   H +L+L E A+ ELI LEP NSGNY+LL+N+ +  GRW++   +R  M+G  +KK  G+S+
Subjt:  LWGALLGACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSA

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein  e value: 3.4e-94  %identity: 38.58
Query:  LTQIHAHFLRHGLHQSNQILSHFISICAAFNHID---YANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLC
        L QIHA  L+ GL Q +  ++ F+S C +    D   YA  +F     P+ FL+N +I+  S S   ++SLLL+  M       + YTF  LLK+C+NL 
Subjt:  LTQIHAHFLRHGLHQSNQILSHFISICAAFNHID---YANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVPDEYTFAPLLKSCANLC

Query:  EYRLGQCVISEVLRRGF----YCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQN
         +     + +++ + G+    Y   S+    +  Y      + A  +FD +P  D V WN +I+G+ K G +D  L LFR+M+E++ +SW TMIS   Q 
Subjt:  EYRLGQCVISEVLRRGF----YCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTMISCLAQN

Query:  RRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFA
          + EAL+LF  M+    +PD V++   L  C++LGAL+ G+ IHS+  +K  +   + +G  L+D Y KCG  E A  +F+ +  KSV +W  +I G+A
Subjt:  RRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILGFA

Query:  LNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLG
         +G G  AI  FM M +  +KPN  TF AVLTAC ++GL+E+G+ IF SM   Y ++P +EH+GC+VDLLGR G ++EA   I+ MP++PNA +WGALL 
Subjt:  LNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLG

Query:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS
        ACR H N++L E   + LI+++P++ G YV  +N+ A + +W+     RR M+ + + K PG S
Subjt:  ACRTHGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQS


Sequences
CDS sequence
ATGAGCAAGAACTGTCTCCAGATCGAGAGAAGAATCCTCCGCCTCCTTCATGGCCACAAATCCCGAACCCATCTCACTCAAATCCACGCCCACTTCCTCCGCCAT
GGCCTCCACCAATCCAACCAAATCCTCTCCCATTTCATCTCCATTTGCGCCGCTTTCAACCACATTGACTATGCCAATCGCCTCTTTTCCCAATCCCATAACCCC
AATATCTTCCTCTTCAATTCCATAATCAAAGCTCACTCCCTCTCCCCTCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCTATGAAGAATCACAGGATTGTTCCT
GACGAATACACTTTTGCGCCGTTGCTTAAATCCTGCGCGAATCTCTGTGAGTATAGACTTGGTCAGTGTGTGATATCTGAAGTTTTGCGTCGTGGATTTTACTGT
TTTGGGTCTATTCGTATTGGGGTGGTTGAGTTGTATGTCTGTTGTGAAAGGATGGAGGATGCGCGTAAGGTGTTTGATGAAATGCCTCACAGGGATGTGGTTGTT
TGGAACTTGATGATTCGTGGGTTTTGCAAGATGGGCAATGTTGATTTTGGGTTGTGTCTCTTTAGGCAAATGAGTGAACGTAGCCTTGTTTCTTGGAACACTATG
ATTTCCTGTTTAGCTCAAAATAGACGTGATGTTGAAGCTTTGGAACTCTTTCAACGGATGGAAGAATATGGTTTTAAACCAGATGAAGTTACAGTGGTCACAATG
TTGCCTGTATGTTCTCGTTTGGGAGCTCTTGATGTTGGACAAAGGATCCATTCTTTTGCAAGTTCGAAGGGAGATTTGGTACATATTACGACGGTTGGGAATTCG
CTAGTTGATTTTTACTGTAAATGTGGGAATACAGAAAGGGCTTACAACATTTTTCAGAAAATGACTTGCAAAAGTGTTGTCTCTTGGAATACAATGATTTTGGGC
TTTGCTTTAAATGGGAAGGGGGAGTTTGCCATTGACCTTTTTATGGTGATGGGAAGAGAGGATGTGAAGCCTAATGATGCAACATTTGTAGCTGTCTTGACCGCT
TGTGTCCATTCGGGATTGTTAGAGAAGGGTCGAGAGATATTTTCTTCAATGGCTGAGAAGTATGAAATCCAGCCAAAACTCGAACATTTTGGTTGTATGGTTGAT
CTTTTGGGACGTGGTGGATGTGTGGAGGAGGCTCATAACTTGATTAAAAGCATGCCAATGCAACCAAATGCCACTTTATGGGGTGCTTTGCTTGGTGCTTGCCGA
ACTCATGGTAACTTGAAACTTGCGGAAATGGCAGTGAAGGAGCTCATCAGTCTTGAACCATGGAACTCTGGTAATTATGTATTGTTGTCAAATATGTTGGCAGAA
GAAGGAAGATGGGAAGATGTTGAGAATGTCAGACGTTGGATGAGAGGAAAGAGCATCAAGAAAGCACCTGGGCAGAGTGCAAGTGGGTAA
mRNA sequence
ATGAGCAAGAACTGTCTCCAGATCGAGAGAAGAATCCTCCGCCTCCTTCATGGCCACAAATCCCGAACCCATCTCACTCAAATCCACGCCCACTTCCTCCGCCAT
GGCCTCCACCAATCCAACCAAATCCTCTCCCATTTCATCTCCATTTGCGCCGCTTTCAACCACATTGACTATGCCAATCGCCTCTTTTCCCAATCCCATAACCCC
AATATCTTCCTCTTCAATTCCATAATCAAAGCTCACTCCCTCTCCCCTCCCTTCCAACAATCCCTTCTCCTGTTTTCCTCTATGAAGAATCACAGGATTGTTCCT
GACGAATACACTTTTGCGCCGTTGCTTAAATCCTGCGCGAATCTCTGTGAGTATAGACTTGGTCAGTGTGTGATATCTGAAGTTTTGCGTCGTGGATTTTACTGT
TTTGGGTCTATTCGTATTGGGGTGGTTGAGTTGTATGTCTGTTGTGAAAGGATGGAGGATGCGCGTAAGGTGTTTGATGAAATGCCTCACAGGGATGTGGTTGTT
TGGAACTTGATGATTCGTGGGTTTTGCAAGATGGGCAATGTTGATTTTGGGTTGTGTCTCTTTAGGCAAATGAGTGAACGTAGCCTTGTTTCTTGGAACACTATG
ATTTCCTGTTTAGCTCAAAATAGACGTGATGTTGAAGCTTTGGAACTCTTTCAACGGATGGAAGAATATGGTTTTAAACCAGATGAAGTTACAGTGGTCACAATG
TTGCCTGTATGTTCTCGTTTGGGAGCTCTTGATGTTGGACAAAGGATCCATTCTTTTGCAAGTTCGAAGGGAGATTTGGTACATATTACGACGGTTGGGAATTCG
CTAGTTGATTTTTACTGTAAATGTGGGAATACAGAAAGGGCTTACAACATTTTTCAGAAAATGACTTGCAAAAGTGTTGTCTCTTGGAATACAATGATTTTGGGC
TTTGCTTTAAATGGGAAGGGGGAGTTTGCCATTGACCTTTTTATGGTGATGGGAAGAGAGGATGTGAAGCCTAATGATGCAACATTTGTAGCTGTCTTGACCGCT
TGTGTCCATTCGGGATTGTTAGAGAAGGGTCGAGAGATATTTTCTTCAATGGCTGAGAAGTATGAAATCCAGCCAAAACTCGAACATTTTGGTTGTATGGTTGAT
CTTTTGGGACGTGGTGGATGTGTGGAGGAGGCTCATAACTTGATTAAAAGCATGCCAATGCAACCAAATGCCACTTTATGGGGTGCTTTGCTTGGTGCTTGCCGA
ACTCATGGTAACTTGAAACTTGCGGAAATGGCAGTGAAGGAGCTCATCAGTCTTGAACCATGGAACTCTGGTAATTATGTATTGTTGTCAAATATGTTGGCAGAA
GAAGGAAGATGGGAAGATGTTGAGAATGTCAGACGTTGGATGAGAGGAAAGAGCATCAAGAAAGCACCTGGGCAGAGTGCAAGTGGGTAA
Protein sequence
MSKNCLQIERRILRLLHGHKSRTHLTQIHAHFLRHGLHQSNQILSHFISICAAFNHIDYANRLFSQSHNPNIFLFNSIIKAHSLSPPFQQSLLLFSSMKNHRIVP
DEYTFAPLLKSCANLCEYRLGQCVISEVLRRGFYCFGSIRIGVVELYVCCERMEDARKVFDEMPHRDVVVWNLMIRGFCKMGNVDFGLCLFRQMSERSLVSWNTM
ISCLAQNRRDVEALELFQRMEEYGFKPDEVTVVTMLPVCSRLGALDVGQRIHSFASSKGDLVHITTVGNSLVDFYCKCGNTERAYNIFQKMTCKSVVSWNTMILG
FALNGKGEFAIDLFMVMGREDVKPNDATFVAVLTACVHSGLLEKGREIFSSMAEKYEIQPKLEHFGCMVDLLGRGGCVEEAHNLIKSMPMQPNATLWGALLGACR
THGNLKLAEMAVKELISLEPWNSGNYVLLSNMLAEEGRWEDVENVRRWMRGKSIKKAPGQSASG