; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr012369 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr012369
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein DOT4
Genome locationtig00153348:43920..53357
RNA-Seq ExpressionSgr012369
SyntenySgr012369
Gene Ontology termsGO:0006412 - translation (biological process)
GO:0005840 - ribosome (cellular component)
GO:0003735 - structural constituent of ribosome (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR020929 - Ribosomal protein L5, conserved site
IPR022803 - Ribosomal protein L5 domain superfamily
IPR031309 - Ribosomal protein L5, C-terminal
IPR031310 - Ribosomal protein L5, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583948.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]6.7e-25283.49Show/hide
Query:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIY-PPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL
        M LLVAK PP FW S AG+DHRG VNLKF  S    KPNS+ SFSNSA+A TE Y P A + K+Y+DV+L+NS  IV FCEVGDLKNA+ELLCS QNSNL
Subjt:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIY-PPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL

Query:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK
        +LDTYC ILQLCAEQKSIRDG+RVHSIIESN VVIDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKVFLWNLMISEY+GSGNYGESINLFK+MLE GI 
Subjt:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK

Query:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI
        PNSYTFSSV KCFAAVARVEEG QVHGLICKLGF+SYN VVNSLISFYFVGRKVR+A+KLFDE+SDRDVISWNSMISGYVKNG ED+GIEIF++MLVFS+
Subjt:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI

Query:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS
        DVDLAT+VNVLVA ANMGTL LGK LHSY+IK   +LDR+VMFNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMIAGYVREGLSDGAI LF++MKS
Subjt:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS

Query:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ
        RGV+PDVYAV SILHA A NGNLNSGK +HNYI+ENN+ETNSFVSNALMDMYAKCGSM+DA  VFSHMKRKDVISWNTMIGGYSKNRLPNEAL+LFAEMQ
Subjt:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ

Query:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH
         ESKPDGTTVACILPACASLAA    +K  +IH
Subjt:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH

KAG7019566.1 Pentatricopeptide repeat-containing protein DOT4, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-25482.54Show/hide
Query:  AATYWGCAAKAMLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIY-PPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAM
        A + W  +AKAM LLVAK PP FW S AG+DHRG VNLKF  S    KPNS+ SFSNSA+A TE Y P A + K+Y+DV+L+NS  IV FCEVGDLKNA+
Subjt:  AATYWGCAAKAMLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIY-PPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAM

Query:  ELLCSYQNSNLNLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESIN
        ELLCS QNSNL+LDTYC ILQLCAEQKSIRDG+RVHSIIESN VVIDGILGAKL+FMYVKCGDL+EGRMIFDKLSEKKVFLWNLMISEY+GSGNYGESIN
Subjt:  ELLCSYQNSNLNLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESIN

Query:  LFKQMLEFGIKPNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGI
        LFK+MLE GI PNSYTFSSV KCFAAVARVEEG QVHGLICKLGF+SYN VVNSLISFYFVGRKVR+A+KLFDE+SDRDVISWNSMISGYVKNG ED+GI
Subjt:  LFKQMLEFGIKPNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGI

Query:  EIFIKMLVFSIDVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSD
        EIF++MLVFS+DVDLAT+VNVLVA ANMGTL LGK LHSY+IK   +LDR+VMFNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMIAGYVREGLSD
Subjt:  EIFIKMLVFSIDVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSD

Query:  GAITLFDKMKSRGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLP
        GAI LF++MKSRGV+PDVYAV SILHA A NGNLNSGK +HNYI+ENN+ETNSFVSNALMDMYAKCGSM+DA  VFSHMKRKDVISWNTMIGGYSKNRLP
Subjt:  GAITLFDKMKSRGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLP

Query:  NEALNLFAEMQGESKPDGTTVACILPACASLAAWIEAEKSMDIH
        NEAL+LFAEMQ ESKPDGTTVACILPACASLAA    +K  +IH
Subjt:  NEALNLFAEMQGESKPDGTTVACILPACASLAAWIEAEKSMDIH

XP_022139839.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Momordica charantia]1.1e-25985.88Show/hide
Query:  LLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLNL
        +LLVAK PP FW SP GHD  G VNLKF HS V AKP SKFSFSNSAYACT+IYP  SQTKSYLD++LDNSA IV+FCEVGDLKNAMELLCS  N+NL+L
Subjt:  LLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLNL

Query:  DTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKPN
        +TYCS+LQLCAE+KSIR GKRVHSIIESNGVV+DGILGAKLVFMYVKCGDLKE RMIFDKLSE+KVFLWNLMISEYAG+GNY ES+NLFK+M+E GIKPN
Subjt:  DTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKPN

Query:  SYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSIDV
        SYTFSSV KC AAVARVE+GR VHG ICKLGFSSYNTVVNSLISFYFV +KVR+AQKLFDELSDRDVISWNSMISGYVKNG EDKGIEIFIKML FS+DV
Subjt:  SYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSIDV

Query:  DLATLVNVLVASANMGTLLLGKALHSYAIKDC-SLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG
        DLAT+VNVLVA AN GTLLLGKALHSYAIK   SLDREVMF NTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAI LFD+MKSRG
Subjt:  DLATLVNVLVASANMGTLLLGKALHSYAIKDC-SLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG

Query:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE
        VVPDVYAVTSILHA A+NGNL+SGKIVHNYI++NN+ETNSFVSNALMDMYAKCGSMKDA SVFSHMK KDVISWNTMIGGYSKN LPNEALNLFAEMQ E
Subjt:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE

Query:  SKPDGTTVACILPACASLAAWIEAEKSMDIH
        SKPDGTTVACILPACASLAA    ++  +IH
Subjt:  SKPDGTTVACILPACASLAAWIEAEKSMDIH

XP_023000778.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Cucurbita maxima]3.6e-25383.68Show/hide
Query:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPAS-QTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL
        M LLVAK PP FW S AG+DHRG VNLKF  S    KPNS+ SFSNSAYA TE Y PA+ + K+Y+D +L+NS  IV FCEVGDLKNA+ELLCS QNSNL
Subjt:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPAS-QTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL

Query:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK
        +LDTYC ILQLCAEQKSIRDG+RVHSIIESN VVIDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKVFLWNLMISEY+GSGNYGESINLFK+MLE GI 
Subjt:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK

Query:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI
        PNSYTFSSV KCFAAV RVEEGRQVHGLICKLGF+SYN VVNSLISFYFVGRKVR+A+KLFDE+SDRDVISWNSMISGYVKNG ED+GIEIF++MLVFS+
Subjt:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI

Query:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS
        DVDLAT+VNVLVA ANMGTL LGK LHSY+IK   +LDR+VMFNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMIAGYVREGLSDGAI LF++MKS
Subjt:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS

Query:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ
        RGV+PDVYAV SILHA A+NGNLNSGK +HNYIKENN+ETNSFVSNALMDMYAKCGSMKDA  VFSH+KRKDVISWNTMIGGYSKNRLPNEAL+LFAEMQ
Subjt:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ

Query:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH
         ESKPDGTTVACILPACASLAA    +K  +IH
Subjt:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH

XP_038893908.1 pentatricopeptide repeat-containing protein DOT4, chloroplastic [Benincasa hispida]1.3e-25584.21Show/hide
Query:  LLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQT--KSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL
        +LLVAKPP  FW SP G+DHRG ++LKF  S V  KPNSKFSFSNSA+ACTE Y PA +T  KSY+DV+LDNS  IV+FCE+GDLKNAMELLC  QNS  
Subjt:  LLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQT--KSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL

Query:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK
        +LDTYCSILQLCAEQKSIRDG+RVHSIIESNGV+IDGILG KLVFMYVKCGDLKEGR+IFDKLSE KVFLWNLMISEY+G+GNYGESINLFKQMLE GIK
Subjt:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK

Query:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI
        PNSYTFSSV KC AAVARVEEGRQVHGLICKLGF+SYNTVVNSLISFYFV RKVR AQKLFDEL+DRDVISWNSMISGYVKNG EDKGIEIFIKML FSI
Subjt:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI

Query:  DVDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSR
        D DLAT+VNVLVA ANMGTLLLGKALHSY IK  +L++EVMFNNTLLDMYSKCG LNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAI LFD+MKS+
Subjt:  DVDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSR

Query:  GVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQG
        G++PDVYAVTSILHA A+NGNLNSGKIVHNYI+EN +ETNSFVSNALMDMYAK GSMKDAH VFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ 
Subjt:  GVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQG

Query:  ESKPDGTTVACILPACASLAAWIEAEKSMDIH
        E KPD TTVACILPACASLAA    ++  +IH
Subjt:  ESKPDGTTVACILPACASLAAWIEAEKSMDIH

TrEMBL top hitse value%identityAlignment
A0A1S3B857 pentatricopeptide repeat-containing protein DOT4, chloroplastic5.5e-25282.67Show/hide
Query:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLN
        M+LL AK P  FW SPAGHDHRGSVNLKF  S + AKPNSK SFS+ AYA      PA +TKSY+DV+LD+S  IV+FCEVGDLKNAMELLCS QNSN +
Subjt:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLN

Query:  LDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKP
        LD +CSILQLCAE+KSIRDG+RVHSIIES+GV+IDGILG KLVFMYVKCGDLKEGRMIFDKLSE KVF+WNLMISEY G+GNYGESINLFKQMLE GIKP
Subjt:  LDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKP

Query:  NSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSID
        NSYTFSSV KCFAAVA VEEGRQVHGLI KLG++SYNTVVNSLISFYFVGRKVR AQKLFDEL+DRDVISWNSMISGYVKNG +D+GIEIFIKMLVF ++
Subjt:  NSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSID

Query:  VDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG
        +DLAT+VNVLVA AN GTLL GK LHSY+IK  +LDREV FNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAI LFD+MKSRG
Subjt:  VDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG

Query:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE
        VVPDVYAVTSILHA A+NGNL SG+IVH+YI+ENN+ETNSFVSNAL DMYAKCGSMKDAH VFSHMK+KDVISWNTMIGGYSKNRLPNEAL LFAEMQ E
Subjt:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE

Query:  SKPDGTTVACILPACASLAAWIEAEKSMDIH
        SKPDGTTVACILPACASLAA    ++  +IH
Subjt:  SKPDGTTVACILPACASLAAWIEAEKSMDIH

A0A5A7UPC9 Pentatricopeptide repeat-containing protein DOT45.5e-25282.67Show/hide
Query:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLN
        M+LL AK P  FW SPAGHDHRGSVNLKF  S + AKPNSK SFS+ AYA      PA +TKSY+DV+LD+S  IV+FCEVGDLKNAMELLCS QNSN +
Subjt:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLN

Query:  LDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKP
        LD +CSILQLCAE+KSIRDG+RVHSIIES+GV+IDGILG KLVFMYVKCGDLKEGRMIFDKLSE KVF+WNLMISEY G+GNYGESINLFKQMLE GIKP
Subjt:  LDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKP

Query:  NSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSID
        NSYTFSSV KCFAAVA VEEGRQVHGLI KLG++SYNTVVNSLISFYFVGRKVR AQKLFDEL+DRDVISWNSMISGYVKNG +D+GIEIFIKMLVF ++
Subjt:  NSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSID

Query:  VDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG
        +DLAT+VNVLVA AN GTLL GK LHSY+IK  +LDREV FNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMI GYVREGLSDGAI LFD+MKSRG
Subjt:  VDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG

Query:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE
        VVPDVYAVTSILHA A+NGNL SG+IVH+YI+ENN+ETNSFVSNAL DMYAKCGSMKDAH VFSHMK+KDVISWNTMIGGYSKNRLPNEAL LFAEMQ E
Subjt:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE

Query:  SKPDGTTVACILPACASLAAWIEAEKSMDIH
        SKPDGTTVACILPACASLAA    ++  +IH
Subjt:  SKPDGTTVACILPACASLAAWIEAEKSMDIH

A0A6J1CE34 pentatricopeptide repeat-containing protein DOT4, chloroplastic5.5e-26085.88Show/hide
Query:  LLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLNL
        +LLVAK PP FW SP GHD  G VNLKF HS V AKP SKFSFSNSAYACT+IYP  SQTKSYLD++LDNSA IV+FCEVGDLKNAMELLCS  N+NL+L
Subjt:  LLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLNL

Query:  DTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKPN
        +TYCS+LQLCAE+KSIR GKRVHSIIESNGVV+DGILGAKLVFMYVKCGDLKE RMIFDKLSE+KVFLWNLMISEYAG+GNY ES+NLFK+M+E GIKPN
Subjt:  DTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIKPN

Query:  SYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSIDV
        SYTFSSV KC AAVARVE+GR VHG ICKLGFSSYNTVVNSLISFYFV +KVR+AQKLFDELSDRDVISWNSMISGYVKNG EDKGIEIFIKML FS+DV
Subjt:  SYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSIDV

Query:  DLATLVNVLVASANMGTLLLGKALHSYAIKDC-SLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG
        DLAT+VNVLVA AN GTLLLGKALHSYAIK   SLDREVMF NTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAI LFD+MKSRG
Subjt:  DLATLVNVLVASANMGTLLLGKALHSYAIKDC-SLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKSRG

Query:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE
        VVPDVYAVTSILHA A+NGNL+SGKIVHNYI++NN+ETNSFVSNALMDMYAKCGSMKDA SVFSHMK KDVISWNTMIGGYSKN LPNEALNLFAEMQ E
Subjt:  VVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQGE

Query:  SKPDGTTVACILPACASLAAWIEAEKSMDIH
        SKPDGTTVACILPACASLAA    ++  +IH
Subjt:  SKPDGTTVACILPACASLAAWIEAEKSMDIH

A0A6J1EHU9 pentatricopeptide repeat-containing protein DOT4, chloroplastic9.4e-25283.3Show/hide
Query:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIY-PPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL
        M LLVAK PP FW S AG+DHRG VNLKF  S    KPNS+ SFSNSA+A TE Y P A + K+Y+DV+L+NS  IV FCEVGDLKNA+ELLCS QNSNL
Subjt:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIY-PPASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL

Query:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK
        +LDTYC ILQLCAEQKSIRDG+RVHSIIESN VVIDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKVFLWNLMISEY+GSGNYGESINLFK+MLE GI 
Subjt:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK

Query:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI
        PNSYTFSSV KCFAAVARVEEG QVHGLICKLGF+SYN VVNSLISFYFVGRKVR+A+KLFDE+SDRDVISWNSMISGYVKNG ED+GIEIF++MLVFS+
Subjt:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI

Query:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS
        DVDLAT+VNVLVA ANMGTL LGK LHSY+IK   +LDR+VMFNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTS+IAGYVREGLSDGAI LF++MKS
Subjt:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS

Query:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ
        RGV+PDVYAV SILHA A NGNLNSGK +HNYI+ENN+ETNSFVSNALMDMYAKCGSM+DA  VFSHMKRKDVISWNTMIGGYSKNRLPNEAL+LFAEMQ
Subjt:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ

Query:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH
         ESKPDGTTVACILPACASLAA    +K  +IH
Subjt:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH

A0A6J1KNK7 pentatricopeptide repeat-containing protein DOT4, chloroplastic1.7e-25383.68Show/hide
Query:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPAS-QTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL
        M LLVAK PP FW S AG+DHRG VNLKF  S    KPNS+ SFSNSAYA TE Y PA+ + K+Y+D +L+NS  IV FCEVGDLKNA+ELLCS QNSNL
Subjt:  MLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYPPAS-QTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNL

Query:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK
        +LDTYC ILQLCAEQKSIRDG+RVHSIIESN VVIDGILGAKLVFMYVKCGDL+EGRMIFDKLSEKKVFLWNLMISEY+GSGNYGESINLFK+MLE GI 
Subjt:  NLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEYAGSGNYGESINLFKQMLEFGIK

Query:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI
        PNSYTFSSV KCFAAV RVEEGRQVHGLICKLGF+SYN VVNSLISFYFVGRKVR+A+KLFDE+SDRDVISWNSMISGYVKNG ED+GIEIF++MLVFS+
Subjt:  PNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISGYVKNGFEDKGIEIFIKMLVFSI

Query:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS
        DVDLAT+VNVLVA ANMGTL LGK LHSY+IK   +LDR+VMFNNTLLDMYSKCGDLNSAIRVFE+MDEKTVVSWTSMIAGYVREGLSDGAI LF++MKS
Subjt:  DVDLATLVNVLVASANMGTLLLGKALHSYAIK-DCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLSDGAITLFDKMKS

Query:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ
        RGV+PDVYAV SILHA A+NGNLNSGK +HNYIKENN+ETNSFVSNALMDMYAKCGSMKDA  VFSH+KRKDVISWNTMIGGYSKNRLPNEAL+LFAEMQ
Subjt:  RGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAEMQ

Query:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH
         ESKPDGTTVACILPACASLAA    +K  +IH
Subjt:  GESKPDGTTVACILPACASLAAWIEAEKSMDIH

SwissProt top hitse value%identityAlignment
A2YDY2 60S ribosomal protein L117.5e-8993.18Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKK SNPMR+IKVQKLVLNISVGESGDRLTRA+KVLEQLSGQ+PVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFS+
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDF+VVLER GYRV RRRRCKSRVGIQHRVTKEDAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR

P42794 60S ribosomal protein L11-21.4e-9094.32Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRA+KVLEQLSGQ PVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDF+VVLERPGYRV RRRRCK+RVGIQHRVTK+DAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR

P42795 60S ribosomal protein L11-11.4e-9094.32Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRA+KVLEQLSGQ PVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDF+VVLERPGYRV RRRRCK+RVGIQHRVTK+DAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR

P46287 60S ribosomal protein L113.6e-9194.89Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKKLSNPMR+IKVQKLVLNISVGESGDRLTRAAKVLEQLSGQ PVFSKARYTVRSFGIRRNEKIACYVTVRG+KAMQLLESGLKVKEYELLRRNFSD
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCK+RVGIQHRVTK+DAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.3e-13352.41Show/hide
Query:  FCEVGDLKNAMELLCSYQNSNLNLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEY
        FCE G+L+NA++LLC     +++  T CS+LQLCA+ KS++DGK V + I  NG VID  LG+KL  MY  CGDLKE   +FD++  +K   WN++++E 
Subjt:  FCEVGDLKNAMELLCSYQNSNLNLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEY

Query:  AGSGNYGESINLFKQMLEFGIKPNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISG
        A SG++  SI LFK+M+  G++ +SYTFS V K F+++  V  G Q+HG I K GF   N+V NSL++FY   ++V +A+K+FDE+++RDVISWNS+I+G
Subjt:  AGSGNYGESINLFKQMLEFGIKPNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISG

Query:  YVKNGFEDKGIEIFIKMLVFSIDVDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI
        YV NG  +KG+ +F++MLV  I++DLAT+V+V    A+   + LG+A+HS  +K C   RE  F NTLLDMYSKCGDL+SA  VF +M +++VVS+TSMI
Subjt:  YVKNGFEDKGIEIFIKMLVFSIDVDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI

Query:  AGYVREGLSDGAITLFDKMKSRGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTM
        AGY REGL+  A+ LF++M+  G+ PDVY VT++L+  A    L+ GK VH +IKEN++  + FVSNALMDMYAKCGSM++A  VFS M+ KD+ISWNT+
Subjt:  AGYVREGLSDGAITLFDKMKSRGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTM

Query:  IGGYSKNRLPNEALNLFAEMQGESK--PDGTTVACILPACASLAAWIEAEKSMDIH
        IGGYSKN   NEAL+LF  +  E +  PD  TVAC+LPACASL+A+   +K  +IH
Subjt:  IGGYSKNRLPNEALNLFAEMQGESK--PDGTTVACILPACASLAAWIEAEKSMDIH

Arabidopsis top hitse value%identityAlignment
AT2G42740.1 ribosomal protein large subunit 16A9.8e-9294.32Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRA+KVLEQLSGQ PVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDF+VVLERPGYRV RRRRCK+RVGIQHRVTK+DAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR

AT3G58700.1 Ribosomal L5P family protein9.8e-9294.32Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRA+KVLEQLSGQ PVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDF+VVLERPGYRV RRRRCK+RVGIQHRVTK+DAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR

AT4G18730.1 ribosomal protein L16B9.8e-9294.32Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRA+KVLEQLSGQ PVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDF+VVLERPGYRV RRRRCK+RVGIQHRVTK+DAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein9.3e-13552.41Show/hide
Query:  FCEVGDLKNAMELLCSYQNSNLNLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEY
        FCE G+L+NA++LLC     +++  T CS+LQLCA+ KS++DGK V + I  NG VID  LG+KL  MY  CGDLKE   +FD++  +K   WN++++E 
Subjt:  FCEVGDLKNAMELLCSYQNSNLNLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKKVFLWNLMISEY

Query:  AGSGNYGESINLFKQMLEFGIKPNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISG
        A SG++  SI LFK+M+  G++ +SYTFS V K F+++  V  G Q+HG I K GF   N+V NSL++FY   ++V +A+K+FDE+++RDVISWNS+I+G
Subjt:  AGSGNYGESINLFKQMLEFGIKPNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMISG

Query:  YVKNGFEDKGIEIFIKMLVFSIDVDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI
        YV NG  +KG+ +F++MLV  I++DLAT+V+V    A+   + LG+A+HS  +K C   RE  F NTLLDMYSKCGDL+SA  VF +M +++VVS+TSMI
Subjt:  YVKNGFEDKGIEIFIKMLVFSIDVDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMI

Query:  AGYVREGLSDGAITLFDKMKSRGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTM
        AGY REGL+  A+ LF++M+  G+ PDVY VT++L+  A    L+ GK VH +IKEN++  + FVSNALMDMYAKCGSM++A  VFS M+ KD+ISWNT+
Subjt:  AGYVREGLSDGAITLFDKMKSRGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTM

Query:  IGGYSKNRLPNEALNLFAEMQGESK--PDGTTVACILPACASLAAWIEAEKSMDIH
        IGGYSKN   NEAL+LF  +  E +  PD  TVAC+LPACASL+A+   +K  +IH
Subjt:  IGGYSKNRLPNEALNLFAEMQGESK--PDGTTVACILPACASLAAWIEAEKSMDIH

AT5G45775.2 Ribosomal L5P family protein9.8e-9294.32Show/hide
Query:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
        MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRA+KVLEQLSGQ PVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD
Subjt:  MASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSD

Query:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR
        TGCFGFGIQEHIDLGIKYDPSTGIYGMDF+VVLERPGYRV RRRRCK+RVGIQHRVTK+DAMKWFQVKYE V  N+
Subjt:  TGCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAGAGGATTATGTGATAACTCCAATCTTACCAAGCTTGTTAAATTGGTCAAAAAACTGAAAATGCTCCTTTCTGAAGAGGGAGATTCAGTGGTGAAGTTGTTGAG
CTCCAAATTTATCCATTGGAGGTTTTTGAATTTACCAAAAACGCTGGGAATCAACCCAGAAAATGAGTTTTCTACAACATCAAATAGAGTGAGCTTAGAAGCATTGGAGA
TAGATTCGGGAATGGTTCCTGTGAGTTTGTTTCTTCCTAAAATAAGCTGGACAAGATTAGGAAGTCCAGCGCCAAGAAGTTCTCTTGCAGCGCCAAATACTCAAGGTTTT
GAAGATAACCCAATTCATAAGCTGACCCCGAGGAAGATCCGCGACACCCGGGTTAAGTTCCCGATACTTCTGGCGTCGAAGGAAGTCCGCCGGATAGCCGATTATAAGAC
AAATACAATCCTCCCAGCGCCGGAAGATCTTCACAGATGCCGTCGGGAAGCCCGCCGGCAGATTGATGAGCTCAATAGGAAGTGGATCATGAAAGCTGTTGTTCTTGATG
GTAACGTAAGTGAGAAAAGAGAGAGTTCCCACTTCAGAAGGAAATGTGCCTCGACATCGGAGAAAAACCCTAAAACCCTCTCCTCCTCCACTTCGTCTGCTGTGGCCGTC
ACTCTTCCTCGGCTGCTTCATCCGACAATGGCTTCGGAGAAGAAGCTCTCGAACCCCATGCGAGATATTAAAGTCCAGAAGCTGGTTCTTAACATTTCTGTCGGTGAAAG
CGGCGATCGTCTCACCAGAGCCGCCAAGGTGCTCGAACAACTCAGTGGCCAAGCCCCCGTCTTCTCCAAAGCTCGTTACACCGTTCGGTCTTTTGGGATCAGGCGTAACG
AGAAGATCGCGTGCTATGTGACAGTGAGGGGCGAAAAAGCAATGCAACTTCTTGAAAGCGGTTTGAAGGTTAAGGAATACGAACTTTTGCGCAGGAACTTCAGTGATACT
GGCTGCTTCGGTTTTGGTATTCAGGAGCATATTGATCTTGGAATCAAGTATGACCCTTCTACTGGTATATATGGAATGGACTTCTTTGTTGTTCTAGAACGGCCAGGTTA
TCGTGTAGGTCGTCGACGTCGATGCAAGTCACGTGTTGGAATTCAGCATAGAGTCACCAAGGAGGATGCAATGAAGTGGTTTCAAGTTAAATATGAAGTAGTTGATGCTA
ACAGGGATTGCCAATACGTGTGTGTGCATGCACACGCGTTTGCATCCTTGCCAGGATTATGGCCTCTGTGGAACCACCCGGCAAGTTTCATATTCTCTACACTTGGGAAC
TTCAACTCCCTGCTTTCAGCTATGGTTTCAGATGCTTCTTCTCTAAGCTGTGAACCATTTCCCCCTGGGGGACATGGGATTGGTGATCGCTTCACAAATTTAGATGGTGG
TCGAACTAACTTCAAATCTGCTGCTGGCAAAAAATCCTATGCAAAAGCAAAACATTGCAATAGCGTAGAGCTTGTTCTGTATGTTTTCTTAATGGATTGCAAAATTCGAG
AAGATGGTTTTTCTGGGATTGGGGAATTCGAGTGGGAGTTTCAAGAGGAAGAAGGAGCTTCATACCTTTCAAAGCGAAAATGGGTCTCCTTCCGAAAGAGACCAAATTGG
GAAAGCGGTGTACAGTTCGAGGCTGGAAAACTACTGCTTCCATCTGATTTTCCCGGAAAATGCAATTCTAAGAATATCCAAACTACATCAGATCCACAAGCTGAGAAATC
CGCTGCCATTTTGATTTCTGCTGCTACTTACTGGGGCTGTGCGGCAAAAGCCATGTTGTTACTGGTAGCTAAACCCCCTCCAAAGTTCTGGTTTTCTCCGGCCGGGCACG
ATCACCGTGGCTCAGTGAACTTGAAATTCCCACATTCTGTCGTCCTTGCCAAACCAAATTCAAAATTTTCCTTTTCGAATTCGGCCTATGCTTGTACGGAGATTTACCCT
CCAGCATCGCAAACGAAAAGCTATCTCGATGTTAAACTGGATAACTCCGCCGGAATTGTCGATTTCTGTGAAGTGGGTGATCTAAAAAATGCTATGGAGCTTCTTTGCAG
CTACCAAAATTCCAACCTTAACTTGGACACTTACTGCTCCATCTTGCAGCTATGTGCTGAACAAAAATCGATACGAGATGGAAAAAGAGTTCATTCAATAATTGAATCTA
ATGGGGTTGTGATAGATGGAATCTTGGGGGCGAAACTAGTTTTTATGTATGTAAAATGCGGGGATCTAAAAGAAGGGAGGATGATTTTTGATAAACTATCAGAAAAGAAG
GTTTTCCTCTGGAACCTTATGATCAGTGAGTATGCGGGAAGCGGTAACTATGGAGAGAGTATAAATTTGTTCAAGCAAATGCTGGAGTTTGGGATAAAACCTAATTCTTA
TACATTTTCTAGTGTTTTCAAATGTTTCGCAGCAGTTGCACGTGTAGAAGAGGGTAGGCAGGTTCATGGGCTGATCTGCAAGTTGGGTTTCTCTTCCTATAATACAGTCG
TTAATTCGCTAATCTCTTTCTACTTTGTGGGTAGAAAGGTAAGAACTGCACAGAAGTTGTTCGATGAATTGAGTGACCGAGACGTCATATCATGGAACTCTATGATCAGT
GGCTATGTTAAGAATGGTTTTGAAGACAAGGGAATTGAGATCTTCATAAAGATGTTAGTTTTCAGCATTGATGTTGATTTGGCTACATTGGTCAATGTGCTTGTGGCTAG
TGCAAATATGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCGTATGCCATAAAGGATTGTTCTCTTGACAGAGAAGTTATGTTCAATAATACTTTACTGGACATGTACT
CAAAATGTGGGGATTTGAACAGTGCCATTCGGGTTTTTGAGAAAATGGATGAGAAAACTGTTGTATCTTGGACTTCGATGATTGCAGGCTATGTCCGTGAAGGTCTATCT
GATGGCGCAATCACGTTGTTTGACAAAATGAAAAGCAGAGGTGTTGTCCCGGATGTTTATGCTGTTACAAGCATCCTTCATGCTTTTGCTGTCAATGGCAACCTGAATAG
TGGGAAGATTGTACACAACTACATCAAGGAAAACAACATGGAAACTAACTCGTTTGTTAGTAATGCTCTTATGGACATGTATGCCAAATGCGGCAGCATGAAGGACGCTC
ACAGTGTTTTTTCTCACATGAAAAGGAAGGATGTTATATCATGGAATACTATGATTGGAGGTTACTCGAAGAACCGTCTTCCAAATGAAGCTCTTAACTTGTTCGCAGAG
ATGCAAGGAGAATCAAAGCCTGACGGCACAACAGTGGCGTGCATCCTTCCAGCCTGTGCGAGTCTTGCAGCTTGGATAGAGGCAGAGAAATCCATGGATATTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCCAGAGGATTATGTGATAACTCCAATCTTACCAAGCTTGTTAAATTGGTCAAAAAACTGAAAATGCTCCTTTCTGAAGAGGGAGATTCAGTGGTGAAGTTGTTGAG
CTCCAAATTTATCCATTGGAGGTTTTTGAATTTACCAAAAACGCTGGGAATCAACCCAGAAAATGAGTTTTCTACAACATCAAATAGAGTGAGCTTAGAAGCATTGGAGA
TAGATTCGGGAATGGTTCCTGTGAGTTTGTTTCTTCCTAAAATAAGCTGGACAAGATTAGGAAGTCCAGCGCCAAGAAGTTCTCTTGCAGCGCCAAATACTCAAGGTTTT
GAAGATAACCCAATTCATAAGCTGACCCCGAGGAAGATCCGCGACACCCGGGTTAAGTTCCCGATACTTCTGGCGTCGAAGGAAGTCCGCCGGATAGCCGATTATAAGAC
AAATACAATCCTCCCAGCGCCGGAAGATCTTCACAGATGCCGTCGGGAAGCCCGCCGGCAGATTGATGAGCTCAATAGGAAGTGGATCATGAAAGCTGTTGTTCTTGATG
GTAACGTAAGTGAGAAAAGAGAGAGTTCCCACTTCAGAAGGAAATGTGCCTCGACATCGGAGAAAAACCCTAAAACCCTCTCCTCCTCCACTTCGTCTGCTGTGGCCGTC
ACTCTTCCTCGGCTGCTTCATCCGACAATGGCTTCGGAGAAGAAGCTCTCGAACCCCATGCGAGATATTAAAGTCCAGAAGCTGGTTCTTAACATTTCTGTCGGTGAAAG
CGGCGATCGTCTCACCAGAGCCGCCAAGGTGCTCGAACAACTCAGTGGCCAAGCCCCCGTCTTCTCCAAAGCTCGTTACACCGTTCGGTCTTTTGGGATCAGGCGTAACG
AGAAGATCGCGTGCTATGTGACAGTGAGGGGCGAAAAAGCAATGCAACTTCTTGAAAGCGGTTTGAAGGTTAAGGAATACGAACTTTTGCGCAGGAACTTCAGTGATACT
GGCTGCTTCGGTTTTGGTATTCAGGAGCATATTGATCTTGGAATCAAGTATGACCCTTCTACTGGTATATATGGAATGGACTTCTTTGTTGTTCTAGAACGGCCAGGTTA
TCGTGTAGGTCGTCGACGTCGATGCAAGTCACGTGTTGGAATTCAGCATAGAGTCACCAAGGAGGATGCAATGAAGTGGTTTCAAGTTAAATATGAAGTAGTTGATGCTA
ACAGGGATTGCCAATACGTGTGTGTGCATGCACACGCGTTTGCATCCTTGCCAGGATTATGGCCTCTGTGGAACCACCCGGCAAGTTTCATATTCTCTACACTTGGGAAC
TTCAACTCCCTGCTTTCAGCTATGGTTTCAGATGCTTCTTCTCTAAGCTGTGAACCATTTCCCCCTGGGGGACATGGGATTGGTGATCGCTTCACAAATTTAGATGGTGG
TCGAACTAACTTCAAATCTGCTGCTGGCAAAAAATCCTATGCAAAAGCAAAACATTGCAATAGCGTAGAGCTTGTTCTGTATGTTTTCTTAATGGATTGCAAAATTCGAG
AAGATGGTTTTTCTGGGATTGGGGAATTCGAGTGGGAGTTTCAAGAGGAAGAAGGAGCTTCATACCTTTCAAAGCGAAAATGGGTCTCCTTCCGAAAGAGACCAAATTGG
GAAAGCGGTGTACAGTTCGAGGCTGGAAAACTACTGCTTCCATCTGATTTTCCCGGAAAATGCAATTCTAAGAATATCCAAACTACATCAGATCCACAAGCTGAGAAATC
CGCTGCCATTTTGATTTCTGCTGCTACTTACTGGGGCTGTGCGGCAAAAGCCATGTTGTTACTGGTAGCTAAACCCCCTCCAAAGTTCTGGTTTTCTCCGGCCGGGCACG
ATCACCGTGGCTCAGTGAACTTGAAATTCCCACATTCTGTCGTCCTTGCCAAACCAAATTCAAAATTTTCCTTTTCGAATTCGGCCTATGCTTGTACGGAGATTTACCCT
CCAGCATCGCAAACGAAAAGCTATCTCGATGTTAAACTGGATAACTCCGCCGGAATTGTCGATTTCTGTGAAGTGGGTGATCTAAAAAATGCTATGGAGCTTCTTTGCAG
CTACCAAAATTCCAACCTTAACTTGGACACTTACTGCTCCATCTTGCAGCTATGTGCTGAACAAAAATCGATACGAGATGGAAAAAGAGTTCATTCAATAATTGAATCTA
ATGGGGTTGTGATAGATGGAATCTTGGGGGCGAAACTAGTTTTTATGTATGTAAAATGCGGGGATCTAAAAGAAGGGAGGATGATTTTTGATAAACTATCAGAAAAGAAG
GTTTTCCTCTGGAACCTTATGATCAGTGAGTATGCGGGAAGCGGTAACTATGGAGAGAGTATAAATTTGTTCAAGCAAATGCTGGAGTTTGGGATAAAACCTAATTCTTA
TACATTTTCTAGTGTTTTCAAATGTTTCGCAGCAGTTGCACGTGTAGAAGAGGGTAGGCAGGTTCATGGGCTGATCTGCAAGTTGGGTTTCTCTTCCTATAATACAGTCG
TTAATTCGCTAATCTCTTTCTACTTTGTGGGTAGAAAGGTAAGAACTGCACAGAAGTTGTTCGATGAATTGAGTGACCGAGACGTCATATCATGGAACTCTATGATCAGT
GGCTATGTTAAGAATGGTTTTGAAGACAAGGGAATTGAGATCTTCATAAAGATGTTAGTTTTCAGCATTGATGTTGATTTGGCTACATTGGTCAATGTGCTTGTGGCTAG
TGCAAATATGGGCACTCTTTTGTTGGGTAAGGCACTTCATTCGTATGCCATAAAGGATTGTTCTCTTGACAGAGAAGTTATGTTCAATAATACTTTACTGGACATGTACT
CAAAATGTGGGGATTTGAACAGTGCCATTCGGGTTTTTGAGAAAATGGATGAGAAAACTGTTGTATCTTGGACTTCGATGATTGCAGGCTATGTCCGTGAAGGTCTATCT
GATGGCGCAATCACGTTGTTTGACAAAATGAAAAGCAGAGGTGTTGTCCCGGATGTTTATGCTGTTACAAGCATCCTTCATGCTTTTGCTGTCAATGGCAACCTGAATAG
TGGGAAGATTGTACACAACTACATCAAGGAAAACAACATGGAAACTAACTCGTTTGTTAGTAATGCTCTTATGGACATGTATGCCAAATGCGGCAGCATGAAGGACGCTC
ACAGTGTTTTTTCTCACATGAAAAGGAAGGATGTTATATCATGGAATACTATGATTGGAGGTTACTCGAAGAACCGTCTTCCAAATGAAGCTCTTAACTTGTTCGCAGAG
ATGCAAGGAGAATCAAAGCCTGACGGCACAACAGTGGCGTGCATCCTTCCAGCCTGTGCGAGTCTTGCAGCTTGGATAGAGGCAGAGAAATCCATGGATATTCATTAA
Protein sequenceShow/hide protein sequence
MSRGLCDNSNLTKLVKLVKKLKMLLSEEGDSVVKLLSSKFIHWRFLNLPKTLGINPENEFSTTSNRVSLEALEIDSGMVPVSLFLPKISWTRLGSPAPRSSLAAPNTQGF
EDNPIHKLTPRKIRDTRVKFPILLASKEVRRIADYKTNTILPAPEDLHRCRREARRQIDELNRKWIMKAVVLDGNVSEKRESSHFRRKCASTSEKNPKTLSSSTSSAVAV
TLPRLLHPTMASEKKLSNPMRDIKVQKLVLNISVGESGDRLTRAAKVLEQLSGQAPVFSKARYTVRSFGIRRNEKIACYVTVRGEKAMQLLESGLKVKEYELLRRNFSDT
GCFGFGIQEHIDLGIKYDPSTGIYGMDFFVVLERPGYRVGRRRRCKSRVGIQHRVTKEDAMKWFQVKYEVVDANRDCQYVCVHAHAFASLPGLWPLWNHPASFIFSTLGN
FNSLLSAMVSDASSLSCEPFPPGGHGIGDRFTNLDGGRTNFKSAAGKKSYAKAKHCNSVELVLYVFLMDCKIREDGFSGIGEFEWEFQEEEGASYLSKRKWVSFRKRPNW
ESGVQFEAGKLLLPSDFPGKCNSKNIQTTSDPQAEKSAAILISAATYWGCAAKAMLLLVAKPPPKFWFSPAGHDHRGSVNLKFPHSVVLAKPNSKFSFSNSAYACTEIYP
PASQTKSYLDVKLDNSAGIVDFCEVGDLKNAMELLCSYQNSNLNLDTYCSILQLCAEQKSIRDGKRVHSIIESNGVVIDGILGAKLVFMYVKCGDLKEGRMIFDKLSEKK
VFLWNLMISEYAGSGNYGESINLFKQMLEFGIKPNSYTFSSVFKCFAAVARVEEGRQVHGLICKLGFSSYNTVVNSLISFYFVGRKVRTAQKLFDELSDRDVISWNSMIS
GYVKNGFEDKGIEIFIKMLVFSIDVDLATLVNVLVASANMGTLLLGKALHSYAIKDCSLDREVMFNNTLLDMYSKCGDLNSAIRVFEKMDEKTVVSWTSMIAGYVREGLS
DGAITLFDKMKSRGVVPDVYAVTSILHAFAVNGNLNSGKIVHNYIKENNMETNSFVSNALMDMYAKCGSMKDAHSVFSHMKRKDVISWNTMIGGYSKNRLPNEALNLFAE
MQGESKPDGTTVACILPACASLAAWIEAEKSMDIH