CuGenDBv2

Sgr018932 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr018932
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153228:871108..883019
RNA-Seq Expression: Sgr018932
Synteny: Sgr018932
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
                  IPR012881 - Protein of unknown function DUF1685
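
The genome location above is written as `contig:start..end`. The following is a minimal sketch of splitting such a string into its parts and deriving the span of the gene. It assumes the coordinates are 1-based and inclusive, which the page does not state; `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00153228:871108..883019")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span is `end - start + 1`; with half-open coordinates it would be `end - start` instead.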


Homology
GenBank top hits (hit, e-value, % identity, alignment)
XP_022145163.1 pentatricopeptide repeat-containing protein At5g39710-like [Momordica charantia] | e-value: 5.6e-286 | % identity: 89.13
Query:  TRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLVLQ
        +RNGSVL K  TLHFS FSS FFGTST    IA APR  ARRPTSR+AP+PRALDT+SSTDVVNSVCSLLSNKNHQTTNLDLD LLKRFNE LSSDLVL+
Subjt:  TRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLVLQ

Query:  ILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
        ILMNYR+LGRAKTLEFFSWSGLQMGY+FDESVVEYMADF GRRKLFDDMKCLLVTVS HKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
Subjt:  ILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD

Query:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR
        NLVFNN+LYALCKKE TGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TALEVF+EM+R G VPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR
Subjt:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR

Query:  RPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSH
        RPFT+LVPNVNSKSGAIEPAVGVFW ANR+ALVPS+F++VQLISELCRLGQMQEAI+VLKVVEDGKLRC EEC+SIVMQALCE+R+VEEASDLFGRMLS 
Subjt:  RPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSH

Query:  GMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEM
        GMKPKLAVYNSVICMLCKLGN+VDAERVFKIM+RKRCVPD VTYSALIHAY E  NWS AYSLLKEMLSLGMSPHFHLYS V+KLMRE+GQ+DLCLKLEM
Subjt:  GMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEM

Query:  KWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI
        KWEAQILQKLCKQGQL AAYEKLKSMLEKG +PP YVRDAF +AFQKNGK+KIARELLEKI
Subjt:  KWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata] | e-value: 1.9e-278 | % identity: 86.44
Query:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV
        S  RN S   K +   FS +SS+   TS+  K  A APRA+ARRPTSRTAPIPRALD    TD V+SVCSLLSNKNHQTTNL+LDHLLKRF E LSSD V
Subjt:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV

Query:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
        LQILMNYRL GRAKTLEFFSWSGLQMGY+FDESVVEYMADFLGRRKLFDDMKCLLVTVS +KGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
Subjt:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK

Query:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS
        PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+VRVRS
Subjt:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS

Query:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML
        TRRPFT+LVPNVN KSGAI+ AVGVFW ANRLALVPSTF+IV+LISELCRLGQMQEAI+VLKVVE  KLRC EECYSIVMQALCEHRRV+EASDLFGRML
Subjt:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML

Query:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL
        S  MKPKLA+YNSVICMLCKLGNL DAERVFKIM+RKRCVPDHVTYSALIHAYGE +NWS AYSLLKEMLSLG+SPHFH+YS+V+KLMRE GQ DLCLKL
Subjt:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL

Query:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
        EMKWE+QILQKLCKQGQLG AYEKLKSMLEKGFYPPIYVRDAF SAFQK GKFKIARELL+ +DGV +
Subjt:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima] | e-value: 2.8e-277 | % identity: 86.09
Query:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV
        S  RN S   K +   FS +SS+   TS+ TK  A APRA+ARRPTSRTA IPRALD    TD V+SVCSLLSNK+HQTTNL+LDHLLKRF E LSSD V
Subjt:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV

Query:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
        LQILMNYRL GRAKTLEFFSWSGLQMGY+FDESVVEYMADFLGRRKLFDDMKCLLVTVS +KGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCK
Subjt:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK

Query:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS
        PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTA+EVFDEM RA LVPTRSAVNILIGDLCSLSAKEGA+E+VRVRS
Subjt:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS

Query:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML
        TRRPFT+LVPNVN KSGAIEPAVGVFW ANR+ALVPS F+IV+LISELCRLGQMQEAI+VLKVVE  KLRC EECYSIVMQALCEHRRV+EASDLFGRML
Subjt:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML

Query:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL
        S  MKPKLA+YNSVICMLCKLGNL DAERVFKIM+RKRCVPDHVTYSALIHAYGE +NWS AYSLLKEMLSLG+SPHFH+YS+V+KLMRE GQ DLCLKL
Subjt:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL

Query:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
        EMKWE+QILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAF SAFQK GKFKIARELL+ +DGV +
Subjt:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK

XP_023526018.1 pentatricopeptide repeat-containing protein At4g20090-like isoform X1 [Cucurbita pepo subsp. pepo] | e-value: 2.4e-276 | % identity: 85.56
Query:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV
        S  RN S   K +     GFSS  F TS+ TK  A APRA+ARRPTSRTAPIPRALD    TD V+SVCSLLSNKNHQT NL+LDHLLKRF E +SSD V
Subjt:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV

Query:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
        LQILMNYRL GRAKTLEFFSWS LQMGY+FDESVVEYMADFLGRRKLFDDMKCLLVTVS +KGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
Subjt:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK

Query:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS
        PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+VRVRS
Subjt:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS

Query:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML
        TRRPFT+LVPNVN KSGAIEPAVGVFW ANRLALVPS F+IV+LI ELCRLGQMQEAI+VLKVVE  KLRC EECYSIVMQALCEHRRV+EASDL GRML
Subjt:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML

Query:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL
        S  MKPKLA+YNSVICMLCKLGNL DAERVFKIM+RK+CVPDHVTYSALIHAYGE +NWS  YSLLK+MLSLG+SPHFH+YS+V+KLMRE GQ DLCLKL
Subjt:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL

Query:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
        EMKWE+QILQKLCKQGQLG AYEKLKSMLEKGFYPPIYVRDAF SAFQK GKFKIARELL+ +DGV +
Subjt:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida] | e-value: 3.4e-291 | % identity: 88.11
Query:  MLSKT--RNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLS
        MLSK   RN S   K   LHF GFSS FF TST TKHIA APRA+ARRPTSRTAPIPRA DT+ S+DVVNSVCSLLSNKNHQT NLDLDHLLKRF + LS
Subjt:  MLSKT--RNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLS

Query:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPK
        SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGY+FDE+VVEYMADFLGRRKLFDDMKCLLVTVS HKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPK
Subjt:  SDLVLQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPK

Query:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV
        FGCKPDNLVFNNMLYALCKKEPTGELIDTAL+IFRRIELPDKYSYSN+IIGLCKFGRFGTA+EVFDEM+RAGLVPTRSAVNILIGDLCSLSAKEGA+E+V
Subjt:  FGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKV

Query:  RVRSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLF
        RVRSTRRPFT+LVPNVN KSGAIEPAVG+FW AN+LALVPS F+IVQLISELCRLGQMQEAIKVLKVVE  KLRCAEECYS+VM+ALCEHR VEEASDLF
Subjt:  RVRSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLF

Query:  GRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDL
        GR+LS GMKPKLA+YNS+ICMLCK+GNL DAERVFKIM+RKRC PDHVTYS+LIHAYGET+NWS AYSLLKEMLSLGMSPHFHLYSLV+KLMRE+GQIDL
Subjt:  GRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDL

Query:  CLKLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
        CLKLEMKWEAQILQKLCK GQL AAYEK+KSMLEKGFYPPIYVRD+F SAFQK GKFKIARELL+KIDGV +
Subjt:  CLKLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
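
Each alignment block above has a middle line that, in the usual BLAST protein convention, repeats the residue where query and subject are identical, shows `+` for a conservative substitution, and leaves a space otherwise. The following is a minimal sketch of counting identities and positives from one such block. It assumes that convention applies here; the page does not state how its % identity column was computed, and `midline_stats` is a hypothetical helper name.

```python
from typing import Tuple


def midline_stats(query: str, midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, aligned_length) for one alignment block.

    Assumes the BLAST midline convention: a letter marks an identical
    residue, '+' a conservative substitution, a space a mismatch or gap.
    """
    # Trailing spaces in the midline are often lost in copied text,
    # so pad (or trim) it to the length of the query line.
    midline = midline.ljust(len(query))[: len(query)]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last block of the first GenBank hit, copied from the page.
query = "KWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI"
mid = "KWEAQILQKLCKQGQL AAYEKLKSMLEKG +PP YVRDAF +AFQKNGK+KIARELLEKI"
ident, pos, length = midline_stats(query, mid)
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%), {pos} positives")
```

A per-hit percentage would sum identities and aligned lengths over all blocks of that hit before dividing, rather than averaging the per-block percentages.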

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X1 | e-value: 3.3e-276 | % identity: 83.68
Query:  MLSKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSD
        ML   R+ S L K+++LHF GFSS FF TS  TKHIA APRA+ RRPTSRTAP PR+ +TV S+DVVNSVCSLLSNKN QT NLD++HLLKRF + LSSD
Subjt:  MLSKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSD

Query:  LVLQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
        LVLQILMNY+LLGRAKTLEFFSWSGLQMG++FD SVVEYMADFLGRRKLFDDMKCLLVTV  HKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  LVLQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM RAGLVPTRSA NILIG+LCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRV

Query:  RSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGR
        RST RPFT+LVPNVN KSGAIEPAVG+FW AN+L LVPS+F+ VQLISELCR+GQMQEAIKVLKVVE  KLRCAEECYS+VM+ALCEHR ++EASDLFGR
Subjt:  RSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGR

Query:  MLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCL
        MLS GMKPKLA+YN VICMLCKLGNL  AERVF IM++KRC PDHVTYSALIHAYGE +NWS AY LLKEMLSLGMSPHFH+YSLV+KLMRE+GQ+DLCL
Subjt:  MLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
        KLEMKWEAQILQKLCKQGQL AAYEK+KSMLEKG  PPIYVRDAF SAFQK GKFKIARELL+K+DGV +
Subjt:  KLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK

A0A5D3CYL1 Pentatricopeptide repeat-containing protein | e-value: 9.7e-268 | % identity: 84.03
Query:  MLSKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSD
        ML   R+ S L K+++LHF GFSS FF TS  TKHIA APRA+ RRPTSRTAP PR+ +TV S+DVVNSVCSLLSNKN QT NLD++HLLKRF + LSSD
Subjt:  MLSKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSD

Query:  LVLQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
        LVLQILMNY+LLGRAKTLEFFSWSGLQMG++FD SVVEYMADFLGRRKLFDDMKCLLVTV  HKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG
Subjt:  LVLQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRV
        CKPDNLVFNNMLYALCKKEPTGELIDTAL IFRRIELPDKYSYSN+IIGLCKFGR+ TA+E F EM RAGLVPTRSA NILIG+LCSLSAKEGA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRV

Query:  RSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGR
        RST RPFT+LVPNVN KSGAIEPAVG+FW AN+L LVPS+F+ VQLISELCR+GQMQEAIKVLKVVE  KLRCAEECYS+VM+ALCEHR ++EASDLFGR
Subjt:  RSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGR

Query:  MLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCL
        MLS GMKPKLA+YN VICMLCKLGNL  AERVF IM++KRC PDHVTYSALIHAYGE +NWS AY LLKEMLSLGMSPHFH+YSLV+KLMRE+GQ+DLCL
Subjt:  MLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQK
        KLEMKWEAQILQKLCKQGQL AAYEK+KSMLEKG  PPIYVRDAF SAFQK
Subjt:  KLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQK

A0A6J1CV78 pentatricopeptide repeat-containing protein At5g39710-like | e-value: 2.7e-286 | % identity: 89.13
Query:  TRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLVLQ
        +RNGSVL K  TLHFS FSS FFGTST    IA APR  ARRPTSR+AP+PRALDT+SSTDVVNSVCSLLSNKNHQTTNLDLD LLKRFNE LSSDLVL+
Subjt:  TRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLVLQ

Query:  ILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
        ILMNYR+LGRAKTLEFFSWSGLQMGY+FDESVVEYMADF GRRKLFDDMKCLLVTVS HKGR+SCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD
Subjt:  ILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPD

Query:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR
        NLVFNN+LYALCKKE TGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRF TALEVF+EM+R G VPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR
Subjt:  NLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTR

Query:  RPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSH
        RPFT+LVPNVNSKSGAIEPAVGVFW ANR+ALVPS+F++VQLISELCRLGQMQEAI+VLKVVEDGKLRC EEC+SIVMQALCE+R+VEEASDLFGRMLS 
Subjt:  RPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSH

Query:  GMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEM
        GMKPKLAVYNSVICMLCKLGN+VDAERVFKIM+RKRCVPD VTYSALIHAY E  NWS AYSLLKEMLSLGMSPHFHLYS V+KLMRE+GQ+DLCLKLEM
Subjt:  GMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEM

Query:  KWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI
        KWEAQILQKLCKQGQL AAYEKLKSMLEKG +PP YVRDAF +AFQKNGK+KIARELLEKI
Subjt:  KWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like | e-value: 9.4e-279 | % identity: 86.44
Query:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV
        S  RN S   K +   FS +SS+   TS+  K  A APRA+ARRPTSRTAPIPRALD    TD V+SVCSLLSNKNHQTTNL+LDHLLKRF E LSSD V
Subjt:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV

Query:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
        LQILMNYRL GRAKTLEFFSWSGLQMGY+FDESVVEYMADFLGRRKLFDDMKCLLVTVS +KGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
Subjt:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK

Query:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS
        PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTA+EVFDEM RAGLVPTRSAVNILIGDLCSLSAKEGA+E+VRVRS
Subjt:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS

Query:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML
        TRRPFT+LVPNVN KSGAI+ AVGVFW ANRLALVPSTF+IV+LISELCRLGQMQEAI+VLKVVE  KLRC EECYSIVMQALCEHRRV+EASDLFGRML
Subjt:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML

Query:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL
        S  MKPKLA+YNSVICMLCKLGNL DAERVFKIM+RKRCVPDHVTYSALIHAYGE +NWS AYSLLKEMLSLG+SPHFH+YS+V+KLMRE GQ DLCLKL
Subjt:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL

Query:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
        EMKWE+QILQKLCKQGQLG AYEKLKSMLEKGFYPPIYVRDAF SAFQK GKFKIARELL+ +DGV +
Subjt:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like | e-value: 1.4e-277 | % identity: 86.09
Query:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV
        S  RN S   K +   FS +SS+   TS+ TK  A APRA+ARRPTSRTA IPRALD    TD V+SVCSLLSNK+HQTTNL+LDHLLKRF E LSSD V
Subjt:  SKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLV

Query:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK
        LQILMNYRL GRAKTLEFFSWSGLQMGY+FDESVVEYMADFLGRRKLFDDMKCLLVTVS +KGRISCRTFSICIRFLGRQGRVREALCLFEEMEP FGCK
Subjt:  LQILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCK

Query:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS
        PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTA+EVFDEM RA LVPTRSAVNILIGDLCSLSAKEGA+E+VRVRS
Subjt:  PDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRS

Query:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML
        TRRPFT+LVPNVN KSGAIEPAVGVFW ANR+ALVPS F+IV+LISELCRLGQMQEAI+VLKVVE  KLRC EECYSIVMQALCEHRRV+EASDLFGRML
Subjt:  TRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRML

Query:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL
        S  MKPKLA+YNSVICMLCKLGNL DAERVFKIM+RKRCVPDHVTYSALIHAYGE +NWS AYSLLKEMLSLG+SPHFH+YS+V+KLMRE GQ DLCLKL
Subjt:  SHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKL

Query:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK
        EMKWE+QILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAF SAFQK GKFKIARELL+ +DGV +
Subjt:  EMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKIDGVRK

SwissProt top hits (hit, e-value, % identity, alignment)
Q3EDF8 Pentatricopeptide repeat-containing protein At1g09900 | e-value: 2.0e-28 | % identity: 25.24
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAG
        T+++ I    + G +  AL + + M       PD + +N +L +LC      + ++    + +R   PD  +Y+ +I   C+    G A+++ DEM   G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAG

Query:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIEP--AVGVFWVANRL-------ALVPSTFIIVQLISELCRLGQMQEAIK
          P     N+L+  +C    KEG +++        P +   PNV + +  +    + G +  A +L          PS      LI+ LCR G +  AI 
Subjt:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIEP--AVGVFWVANRL-------ALVPSTFIIVQLISELCRLGQMQEAIK

Query:  VLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNW
        +L+ +     +     Y+ ++   C+ ++++ A +   RM+S G  P +  YN+++  LCK G + DA  +   +S K C P  +TY+ +I    +    
Subjt:  VLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNW

Query:  SVAYSLLKEMLSLGMSPHFHLY-SLVNKLMREYGQIDLCLKLEMKWEA-----------QILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAF
          A  LL EM +  + P    Y SLV  L RE G++D  +K   ++E             I+  LCK  Q   A + L  M+ +G  P            
Subjt:  SVAYSLLKEMLSLGMSPHFHLY-SLVNKLMREYGQIDLCLKLEMKWEA-----------QILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAF

Query:  QKNGKFKIARELLEKI
           G  K A ELL ++
Subjt:  QKNGKFKIARELLEKI

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial | e-value: 3.8e-27 | % identity: 23.69
Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        ++L  + +LGRA       W   ++GY+ D      + +         +   L+  +   K R    T S  I  L  +GRV EAL L + M  ++G +P
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR
        D + +  +L  LCK   +      AL +FR++E          YS +I  LCK G F  AL +F+EM+  G+       + LIG LC+    +   + +R
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR

Query:  VRSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFG
            R     ++P+V + S  I+    VF    +L          +L +E+   G   + I                 Y+ ++   C+   + EA+ +F 
Subjt:  VRSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFG

Query:  RMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLC
         M+S G +P +  Y+ +I   CK   + D  R+F+ +S K  +P+ +TY+ L+  + ++   + A  L +EM+S G+ P    Y ++   + + G+++  
Subjt:  RMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLC

Query:  LKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI
        L++  K +             I+  +C   ++  A+    S+ +KG  P +   +       K G    A  L  K+
Subjt:  LKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI

Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | e-value: 5.0e-27 | % identity: 27.19
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCK----------------------------------KEPTGELIDTALT-IFRRI
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK                                  +E   + +   LT + RR 
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCK----------------------------------KEPTGELIDTALT-IFRRI

Query:  ELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIE---KVRVRS---TRRPFTLLVPNVNSKSGAIEPAVGVFW
           D+ +Y+ +I G CK G F  AL +  EM R GL P+      LI  +C       A+E   ++RVR      R +T LV   + K G +  A  V  
Subjt:  ELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIE---KVRVRS---TRRPFTLLVPNVNSKSGAIEPAVGVFW

Query:  VANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDA
          N     PS      LI+  C  G+M++AI VL+ +++  L      YS V+   C    V+EA  +   M+  G+KP    Y+S+I   C+     +A
Subjt:  VANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDA

Query:  ERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYS-LVNKLMRE-------------------------YGQIDLCLKLE
          +++ M R    PD  TY+ALI+AY    +   A  L  EM+  G+ P    YS L+N L ++                         +  I+ C  +E
Subjt:  ERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYS-LVNKLMRE-------------------------YGQIDLCLKLE

Query:  MKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYP
         K    +++  C +G +  A +  +SML K   P
Subjt:  MKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYP

Q9LFC5 Pentatricopeptide repeat-containing protein At5g01110 | e-value: 1.5e-28 | % identity: 24.39
Query:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEM
        I+  T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ +I GLCK G++  A EVF EM
Subjt:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEM

Query:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLL--------VPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQE
         R+GL P  +    L+ + C    K   +E  +V S  R   ++        + ++ ++SG ++ A+  F       L+P   I   LI   CR G +  
Subjt:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLL--------VPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQE

Query:  AIKVL-KVVEDGKLRCAEE--CYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAY
        A+ +  ++++ G   CA +   Y+ ++  LC+ + + EA  LF  M    + P       +I   CKLGNL +A  +F+ M  KR   D VTY+ L+  +
Subjt:  AIKVL-KVVEDGKLRCAEE--CYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAY

Query:  GETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKF
        G+  +   A  +  +M+S  + P    YS+                        ++  LC +G L  A+     M+ K   P + + ++    + ++G  
Subjt:  GETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKF

Query:  KIARELLEKI
              LEK+
Subjt:  KIARELLEKI

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic | e-value: 5.7e-31 | % identity: 26.1
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNI

Query:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIE---------PAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKL
        LI  LC  +  E A E  RV +++     ++P+V + +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIE---------PAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKL

Query:  RCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEM
          +   Y+ ++   C+  +  EA ++F  M  HG+      YN++I  LCK   + DA ++   M  +   PD  TY++L+  +    +   A  +++ M
Subjt:  RCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEM

Query:  LSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPP--IYVRDAFGSAFQKNGKFKIA
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+   PP  +  R  F       G  + A
Subjt:  LSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPP--IYVRDAFGSAFQKNGKFKIA

Query:  ----RELLEK
             ELLEK
Subjt:  ----RELLEK

Arabidopsis top hits (hit, e-value, % identity, alignment)
AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein | e-value: 1.4e-29 | % identity: 25.24
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAG
        T+++ I    + G +  AL + + M       PD + +N +L +LC      + ++    + +R   PD  +Y+ +I   C+    G A+++ DEM   G
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAG

Query:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIEP--AVGVFWVANRL-------ALVPSTFIIVQLISELCRLGQMQEAIK
          P     N+L+  +C    KEG +++        P +   PNV + +  +    + G +  A +L          PS      LI+ LCR G +  AI 
Subjt:  LVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIEP--AVGVFWVANRL-------ALVPSTFIIVQLISELCRLGQMQEAIK

Query:  VLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNW
        +L+ +     +     Y+ ++   C+ ++++ A +   RM+S G  P +  YN+++  LCK G + DA  +   +S K C P  +TY+ +I    +    
Subjt:  VLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNW

Query:  SVAYSLLKEMLSLGMSPHFHLY-SLVNKLMREYGQIDLCLKLEMKWEA-----------QILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAF
          A  LL EM +  + P    Y SLV  L RE G++D  +K   ++E             I+  LCK  Q   A + L  M+ +G  P            
Subjt:  SVAYSLLKEMLSLGMSPHFHLY-SLVNKLMREYGQIDLCLKLEMKWEA-----------QILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAF

Query:  QKNGKFKIARELLEKI
           G  K A ELL ++
Subjt:  QKNGKFKIARELLEKI

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 2.7e-28 | % identity: 23.69
Query:  QILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP
        ++L  + +LGRA       W   ++GY+ D      + +         +   L+  +   K R    T S  I  L  +GRV EAL L + M  ++G +P
Subjt:  QILMNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKP

Query:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR
        D + +  +L  LCK   +      AL +FR++E          YS +I  LCK G F  AL +F+EM+  G+       + LIG LC+    +   + +R
Subjt:  DNLVFNNMLYALCKKEPTGELIDTALTIFRRIE----LPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVR

Query:  VRSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFG
            R     ++P+V + S  I+    VF    +L          +L +E+   G   + I                 Y+ ++   C+   + EA+ +F 
Subjt:  VRSTRRPFTLLVPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFG

Query:  RMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLC
         M+S G +P +  Y+ +I   CK   + D  R+F+ +S K  +P+ +TY+ L+  + ++   + A  L +EM+S G+ P    Y ++   + + G+++  
Subjt:  RMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLC

Query:  LKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI
        L++  K +             I+  +C   ++  A+    S+ +KG  P +   +       K G    A  L  K+
Subjt:  LKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKFKIARELLEKI

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein | e-value: 4.0e-32 | % identity: 26.1
Query:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I+    + +    PD Y+Y+++I GLCK G    A+EV D+M      P     N 
Subjt:  RQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNI

Query:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIE---------PAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKL
        LI  LC  +  E A E  RV +++     ++P+V + +  I+          A+ +F         P  F    LI  LC  G++ EA+ +LK +E    
Subjt:  LIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIE---------PAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKL

Query:  RCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEM
          +   Y+ ++   C+  +  EA ++F  M  HG+      YN++I  LCK   + DA ++   M  +   PD  TY++L+  +    +   A  +++ M
Subjt:  RCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEM

Query:  LSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPP--IYVRDAFGSAFQKNGKFKIA
         S G  P    Y  +   + + G++++  KL    + +           ++Q L ++ +   A    + MLE+   PP  +  R  F       G  + A
Subjt:  LSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQ-----------ILQKLCKQGQLGAAYEKLKSMLEKGFYPP--IYVRDAFGSAFQKNGKFKIA

Query:  ----RELLEK
             ELLEK
Subjt:  ----RELLEK

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein   e value: 1.1e-29   % identity: 24.39
Query:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEM
        I+  T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  +    +  +   P  Y+Y+ +I GLCK G++  A EVF EM
Subjt:  ISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCKKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEM

Query:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLL--------VPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQE
         R+GL P  +    L+ + C    K   +E  +V S  R   ++        + ++ ++SG ++ A+  F       L+P   I   LI   CR G +  
Subjt:  DRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLL--------VPNVNSKSGAIEPAVGVFWVANRLALVPSTFIIVQLISELCRLGQMQE

Query:  AIKVL-KVVEDGKLRCAEE--CYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAY
        A+ +  ++++ G   CA +   Y+ ++  LC+ + + EA  LF  M    + P       +I   CKLGNL +A  +F+ M  KR   D VTY+ L+  +
Subjt:  AIKVL-KVVEDGKLRCAEE--CYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIMSRKRCVPDHVTYSALIHAY

Query:  GETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKF
        G+  +   A  +  +M+S  + P    YS+                        ++  LC +G L  A+     M+ K   P + + ++    + ++G  
Subjt:  GETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFGSAFQKNGKF

Query:  KIARELLEKI
              LEK+
Subjt:  KIARELLEKI

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein   e value: 3.5e-28   % identity: 27.19
Query:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCK----------------------------------KEPTGELIDTALT-IFRRI
        T++I IR     G +  AL LF++ME K GC P+ + +N ++   CK                                  +E   + +   LT + RR 
Subjt:  TFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALCK----------------------------------KEPTGELIDTALT-IFRRI

Query:  ELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIE---KVRVRS---TRRPFTLLVPNVNSKSGAIEPAVGVFW
           D+ +Y+ +I G CK G F  AL +  EM R GL P+      LI  +C       A+E   ++RVR      R +T LV   + K G +  A  V  
Subjt:  ELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIE---KVRVRS---TRRPFTLLVPNVNSKSGAIEPAVGVFW

Query:  VANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDA
          N     PS      LI+  C  G+M++AI VL+ +++  L      YS V+   C    V+EA  +   M+  G+KP    Y+S+I   C+     +A
Subjt:  VANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDA

Query:  ERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYS-LVNKLMRE-------------------------YGQIDLCLKLE
          +++ M R    PD  TY+ALI+AY    +   A  L  EM+  G+ P    YS L+N L ++                         +  I+ C  +E
Subjt:  ERVFKIMSRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYS-LVNKLMRE-------------------------YGQIDLCLKLE

Query:  MKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYP
         K    +++  C +G +  A +  +SML K   P
Subjt:  MKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYP


Sequences
CDS sequence
ATGTTGCTTTTGCTATTCACTCAGCTTTATAAAGCCGGCTTTCTCTCCAGCTCTGTTGCGGACACGTTTTGCAGAGAGAGAAGCGAGTTTTGGGAGGGAAAAGAAGGAAA
AAAAAGAAAGAAAATGATTAATTTCTTTAGCGAATCTGAATCTTGGGTTTCTTCTGTGAATGGAATTCTAGAAGACCCAGATGATGAGAGCATCAGAGAAGAAGATGGAG
AAGAAGAAATTGGGTCGGACTCGGATTTCGACGAGTCGGCTCGAATGGGGAGCGTGGAGAGGGAGACAATGAAGAAGAAGAACCAGGTTTTGCTCGAAGGCTACGATGAG
ATTCCGGAGCTCTGCAACACTTTGCCGGCGCTGGAGCTCTGTTATTCCATGAGCCAGAAGTACATGGACGACCACCAGAAGTCGCCGGAGAGCTCTCCGGCCTCGGCGGC
GGACTCGTGTTCGTCGGTGTCGAGTCCGATTGCGAACTGGAAGATCTCCAGCCCCGGTGATCATCCAGAAGATGTTAAGGCAAGGCTCAAATTTTGGGCCCAGGCAGTGG
CATGCCTCCTAGAAACGAAGGAACGAACGGATCGTTTGATGAGCCTGACTTATGAAGAAGAAGAAGAATGTGTCAGGTTGGAGTTGAACCTCTTGGCTCTGCTTCTGCAC
TGCAACTTGCCGATGTTGAGCAAGACTAGGAATGGTTCAGTGTTACCCAAAATCATTACTCTTCATTTTTCTGGGTTCTCTTCAACCTTTTTCGGAACTTCAACGGCTAC
AAAGCACATTGCCACAGCTCCAAGAGCTGTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACCGTCAGCTCTACCGACGTCGTCAATTCAG
TATGTTCTTTACTTTCAAACAAAAATCACCAAACAACAAATCTCGATCTTGATCATTTATTGAAAAGGTTCAACGAAAAGTTAAGTTCGGATCTCGTGCTTCAAATTCTG
ATGAATTATAGGCTGTTGGGTCGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTATCAGTTTGATGAGTCCGTGGTTGAATACATGGCTGATTT
CTTAGGTAGAAGGAAACTGTTTGATGATATGAAGTGTCTGTTGGTGACTGTGTCATATCATAAGGGTCGGATTTCATGTCGGACATTTTCAATTTGTATCAGATTTTTGG
GTAGGCAAGGGAGGGTTAGAGAAGCGCTTTGCTTGTTCGAAGAAATGGAGCCAAAATTTGGGTGTAAACCTGATAATCTTGTCTTTAACAACATGCTTTATGCGCTATGT
AAGAAGGAACCAACTGGGGAATTGATTGATACTGCTCTTACAATCTTCAGAAGAATCGAATTGCCTGATAAATATTCATACAGTAATATAATTATAGGGTTGTGTAAATT
TGGTAGGTTTGGTACAGCTCTTGAAGTGTTTGATGAAATGGACAGGGCAGGTTTGGTTCCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTG
CCAAAGAAGGGGCTATAGAAAAAGTTAGAGTCAGAAGTACTCGTAGGCCTTTTACCCTCCTAGTTCCAAATGTGAATTCAAAGAGCGGTGCCATTGAACCTGCAGTTGGA
GTTTTTTGGGTAGCTAATAGGCTGGCTTTAGTTCCCAGTACGTTCATAATAGTTCAGCTTATCTCGGAACTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAAAGTATT
GAAGGTTGTTGAGGATGGCAAGCTAAGATGTGCAGAAGAGTGTTACTCCATTGTGATGCAAGCATTATGTGAACATCGTCGGGTCGAAGAAGCTAGTGATCTGTTTGGCA
GGATGCTTTCTCACGGTATGAAGCCAAAGTTGGCTGTTTACAATTCTGTTATTTGCATGCTATGCAAATTGGGAAATTTGGTTGATGCTGAAAGGGTTTTTAAGATTATG
AGTAGGAAGAGATGTGTACCTGATCATGTTACTTATTCGGCACTAATCCATGCCTACGGTGAAACTAAGAATTGGTCGGTGGCCTACAGTTTATTGAAGGAAATGCTGAG
TTTAGGCATGTCTCCTCATTTTCATTTGTATAGTTTAGTGAATAAACTAATGAGGGAATATGGGCAAATAGATCTGTGTTTGAAGCTGGAAATGAAGTGGGAGGCCCAAA
TTTTGCAGAAGCTTTGTAAACAAGGCCAACTGGGGGCTGCTTATGAAAAGCTTAAATCAATGCTTGAAAAGGGTTTTTACCCTCCTATCTATGTGAGAGATGCTTTTGGG
AGTGCATTTCAAAAGAATGGTAAGTTTAAGATCGCACGGGAGTTGCTGGAGAAGATAGACGGAGTCCGCAAACCTTTGAGGAGGATCATATCGGTTAGAATCACCCGCGC
TTTCAGAACTTCACTTCGCCTCATAAAACCGAGCAGTCAAGAAAATGCCTCAACACCTCTTCATCTCTTGTGCCTATATGGTGGTCTGGTCTTGTGGACTAAGCTTTTGG
AGATTTTCAACCTGCAGTGGGCTTTTGGTGATCCGGGTTTGACTGATCCAAGATTTAAAAGTGCACGAAGGTTTTGTGGTCAAATACTTTTGTGCTCTTCTTTGGCACTT
GATTGGAAAGCTAACATAAGAATTTTTCAGGATGAAGAGAAGACCGGAAAGAAACTCTAG
mRNA sequence
ATGTTGCTTTTGCTATTCACTCAGCTTTATAAAGCCGGCTTTCTCTCCAGCTCTGTTGCGGACACGTTTTGCAGAGAGAGAAGCGAGTTTTGGGAGGGAAAAGAAGGAAA
AAAAAGAAAGAAAATGATTAATTTCTTTAGCGAATCTGAATCTTGGGTTTCTTCTGTGAATGGAATTCTAGAAGACCCAGATGATGAGAGCATCAGAGAAGAAGATGGAG
AAGAAGAAATTGGGTCGGACTCGGATTTCGACGAGTCGGCTCGAATGGGGAGCGTGGAGAGGGAGACAATGAAGAAGAAGAACCAGGTTTTGCTCGAAGGCTACGATGAG
ATTCCGGAGCTCTGCAACACTTTGCCGGCGCTGGAGCTCTGTTATTCCATGAGCCAGAAGTACATGGACGACCACCAGAAGTCGCCGGAGAGCTCTCCGGCCTCGGCGGC
GGACTCGTGTTCGTCGGTGTCGAGTCCGATTGCGAACTGGAAGATCTCCAGCCCCGGTGATCATCCAGAAGATGTTAAGGCAAGGCTCAAATTTTGGGCCCAGGCAGTGG
CATGCCTCCTAGAAACGAAGGAACGAACGGATCGTTTGATGAGCCTGACTTATGAAGAAGAAGAAGAATGTGTCAGGTTGGAGTTGAACCTCTTGGCTCTGCTTCTGCAC
TGCAACTTGCCGATGTTGAGCAAGACTAGGAATGGTTCAGTGTTACCCAAAATCATTACTCTTCATTTTTCTGGGTTCTCTTCAACCTTTTTCGGAACTTCAACGGCTAC
AAAGCACATTGCCACAGCTCCAAGAGCTGTTGCAAGAAGACCCACTTCGCGAACTGCCCCAATCCCTCGGGCTTTGGACACCGTCAGCTCTACCGACGTCGTCAATTCAG
TATGTTCTTTACTTTCAAACAAAAATCACCAAACAACAAATCTCGATCTTGATCATTTATTGAAAAGGTTCAACGAAAAGTTAAGTTCGGATCTCGTGCTTCAAATTCTG
ATGAATTATAGGCTGTTGGGTCGGGCTAAAACGTTGGAATTCTTTTCTTGGTCTGGATTGCAAATGGGGTATCAGTTTGATGAGTCCGTGGTTGAATACATGGCTGATTT
CTTAGGTAGAAGGAAACTGTTTGATGATATGAAGTGTCTGTTGGTGACTGTGTCATATCATAAGGGTCGGATTTCATGTCGGACATTTTCAATTTGTATCAGATTTTTGG
GTAGGCAAGGGAGGGTTAGAGAAGCGCTTTGCTTGTTCGAAGAAATGGAGCCAAAATTTGGGTGTAAACCTGATAATCTTGTCTTTAACAACATGCTTTATGCGCTATGT
AAGAAGGAACCAACTGGGGAATTGATTGATACTGCTCTTACAATCTTCAGAAGAATCGAATTGCCTGATAAATATTCATACAGTAATATAATTATAGGGTTGTGTAAATT
TGGTAGGTTTGGTACAGCTCTTGAAGTGTTTGATGAAATGGACAGGGCAGGTTTGGTTCCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCATTGAGTG
CCAAAGAAGGGGCTATAGAAAAAGTTAGAGTCAGAAGTACTCGTAGGCCTTTTACCCTCCTAGTTCCAAATGTGAATTCAAAGAGCGGTGCCATTGAACCTGCAGTTGGA
GTTTTTTGGGTAGCTAATAGGCTGGCTTTAGTTCCCAGTACGTTCATAATAGTTCAGCTTATCTCGGAACTTTGTCGATTAGGTCAAATGCAAGAAGCAATTAAAGTATT
GAAGGTTGTTGAGGATGGCAAGCTAAGATGTGCAGAAGAGTGTTACTCCATTGTGATGCAAGCATTATGTGAACATCGTCGGGTCGAAGAAGCTAGTGATCTGTTTGGCA
GGATGCTTTCTCACGGTATGAAGCCAAAGTTGGCTGTTTACAATTCTGTTATTTGCATGCTATGCAAATTGGGAAATTTGGTTGATGCTGAAAGGGTTTTTAAGATTATG
AGTAGGAAGAGATGTGTACCTGATCATGTTACTTATTCGGCACTAATCCATGCCTACGGTGAAACTAAGAATTGGTCGGTGGCCTACAGTTTATTGAAGGAAATGCTGAG
TTTAGGCATGTCTCCTCATTTTCATTTGTATAGTTTAGTGAATAAACTAATGAGGGAATATGGGCAAATAGATCTGTGTTTGAAGCTGGAAATGAAGTGGGAGGCCCAAA
TTTTGCAGAAGCTTTGTAAACAAGGCCAACTGGGGGCTGCTTATGAAAAGCTTAAATCAATGCTTGAAAAGGGTTTTTACCCTCCTATCTATGTGAGAGATGCTTTTGGG
AGTGCATTTCAAAAGAATGGTAAGTTTAAGATCGCACGGGAGTTGCTGGAGAAGATAGACGGAGTCCGCAAACCTTTGAGGAGGATCATATCGGTTAGAATCACCCGCGC
TTTCAGAACTTCACTTCGCCTCATAAAACCGAGCAGTCAAGAAAATGCCTCAACACCTCTTCATCTCTTGTGCCTATATGGTGGTCTGGTCTTGTGGACTAAGCTTTTGG
AGATTTTCAACCTGCAGTGGGCTTTTGGTGATCCGGGTTTGACTGATCCAAGATTTAAAAGTGCACGAAGGTTTTGTGGTCAAATACTTTTGTGCTCTTCTTTGGCACTT
GATTGGAAAGCTAACATAAGAATTTTTCAGGATGAAGAGAAGACCGGAAAGAAACTCTAG
Protein sequence
MLLLLFTQLYKAGFLSSSVADTFCRERSEFWEGKEGKKRKKMINFFSESESWVSSVNGILEDPDDESIREEDGEEEIGSDSDFDESARMGSVERETMKKKNQVLLEGYDE
IPELCNTLPALELCYSMSQKYMDDHQKSPESSPASAADSCSSVSSPIANWKISSPGDHPEDVKARLKFWAQAVACLLETKERTDRLMSLTYEEEEECVRLELNLLALLLH
CNLPMLSKTRNGSVLPKIITLHFSGFSSTFFGTSTATKHIATAPRAVARRPTSRTAPIPRALDTVSSTDVVNSVCSLLSNKNHQTTNLDLDHLLKRFNEKLSSDLVLQIL
MNYRLLGRAKTLEFFSWSGLQMGYQFDESVVEYMADFLGRRKLFDDMKCLLVTVSYHKGRISCRTFSICIRFLGRQGRVREALCLFEEMEPKFGCKPDNLVFNNMLYALC
KKEPTGELIDTALTIFRRIELPDKYSYSNIIIGLCKFGRFGTALEVFDEMDRAGLVPTRSAVNILIGDLCSLSAKEGAIEKVRVRSTRRPFTLLVPNVNSKSGAIEPAVG
VFWVANRLALVPSTFIIVQLISELCRLGQMQEAIKVLKVVEDGKLRCAEECYSIVMQALCEHRRVEEASDLFGRMLSHGMKPKLAVYNSVICMLCKLGNLVDAERVFKIM
SRKRCVPDHVTYSALIHAYGETKNWSVAYSLLKEMLSLGMSPHFHLYSLVNKLMREYGQIDLCLKLEMKWEAQILQKLCKQGQLGAAYEKLKSMLEKGFYPPIYVRDAFG
SAFQKNGKFKIARELLEKIDGVRKPLRRIISVRITRAFRTSLRLIKPSSQENASTPLHLLCLYGGLVLWTKLLEIFNLQWAFGDPGLTDPRFKSARRFCGQILLCSSLAL
DWKANIRIFQDEEKTGKKL