CuGenDBv2

CSPI07G02380 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G02380
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr7:2050116..2054581
RNA-Seq Expression: CSPI07G02380
Synteny: CSPI07G02380
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily
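The Genome location field packs a chromosome name and two coordinates into one string. A minimal sketch of how it might be parsed is shown here, assuming the coordinates are 1-based and inclusive (the page does not state this); the names `Locus` and `parse_locus` are hypothetical and not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, so both endpoints count.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'Chr:start..end' string such as the Genome location field."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised locus string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Locus(chrom, start, end)


locus = parse_locus("Chr7:2050116..2054581")
print(locus.chrom, locus.start, locus.end, locus.length)  # Chr7 2050116 2054581 4466
```

Under the inclusive-coordinate assumption the gene spans 4466 bp on Chr7; with half-open coordinates the span would be one base shorter.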


Homology
GenBank top hits (e value, % identity, alignment)
KAG7011839.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.1e-150 | % identity: 78.96
Query:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D
        MGA   QLQSRL+VSKILSSTIR WE  FPLLH+PLRTPD SL P +SR F+SNSMDDD  FS SEP ++PPQRR PPD REARRVP+  G V+A Y  D
Subjt:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D

Query:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV
        NRNQRFNRHSEGSSSRF NEG  S  R+ESLSQKDFSFLEKFKLNTDNQS+  EK E  SSSA  SESM + +QS+PQ PPEADEIFRKMKETGLIPNAV
Subjt:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV

Query:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN
        AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEKFDEAIRIFRKMQ+NGI PNAFS+G+LIQGLYK K+L+DAV+FC EMLESGH PN
Subjt:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN

Query:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LTTFVGLID LC EKGVDEAH+V+ET KQKGFLI+EK LREFLNKRAPFSP IW++ FG +   FF
Subjt:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

XP_004144563.1 pentatricopeptide repeat-containing protein At4g38150 [Cucumis sativus] | e value: 1.4e-200 | % identity: 99.17
Query:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF
        MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF
Subjt:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF

Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL
        NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQ PPEADEIF KMKETGLIPNAVAMLDGL
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL

Query:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
        CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
Subjt:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG

Query:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALRE LNKRAPFSPDIWKVFFGNRKPPFF
Subjt:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

XP_008462006.1 PREDICTED: pentatricopeptide repeat-containing protein At4g38150 [Cucumis melo] | e value: 6.5e-185 | % identity: 92.5
Query:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF
        M AFQ QSRLQVSKILSS+IRKWEIDFPLLHNPLRTPDFSL PA SRRF+SNSMDDD AFS+SEP NTPPQRRFPPDHREARRVPRGGVT SYDNRNQRF
Subjt:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF

Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL
        NRHSEGSSSRFTNEGSTS QRSESLSQKDFSFLEKFKLNTDNQSSG EKTEENSSSAPVSESM EKQQS+ Q PP ADEIFRKMKE+GLIPNAVAMLDGL
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL

Query:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
        CKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQ+ GI PNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
Subjt:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG

Query:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LID LC+EKGVDEAH+VVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGN+K PFF
Subjt:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

XP_022969144.1 pentatricopeptide repeat-containing protein At4g38150-like [Cucurbita maxima] | e value: 8.9e-150 | % identity: 78.96
Query:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D
        MGA   QLQSRL+VSKIL STIR WE  FPLLH+PLRTPD SL P +SR F+SNSMDDD  FS SEP ++PPQRR PPD REARRVP+  G V+A Y  D
Subjt:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D

Query:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV
        NRNQRFNRH EGS SRF NEG  S +RSESLSQKDFSFLEKFKLNTDN S+  EKTE  SSSA  SESM + +QS+PQ PPEADEIFRKMKETGLIPNAV
Subjt:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV

Query:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN
        AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAE  DEAIRIFRKMQ+NGI PNAFS+GVLIQGLYKCKKL+DAV+FC EMLESGH PN
Subjt:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN

Query:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LTTFVGLID LC EKGVDEAH+V+ET KQKGFLI+EKALREFLNKRAPFSP IW++ FG +   FF
Subjt:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

XP_038888880.1 pentatricopeptide repeat-containing protein At4g38150 isoform X1 [Benincasa hispida] | e value: 3.0e-153 | % identity: 82.42
Query:  AFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRF-PPDHREARRVPRGG--VTASY--DNRN
        AFQ QSRL+V KILSSTIRKWE DFPLLH+ LR P+FSL PAQSR FASNS+DDD AFS SEP  +PPQRR  PPDHREARRVP     V+A +  DNRN
Subjt:  AFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRF-PPDHREARRVPRGG--VTASY--DNRN

Query:  QRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAML
        QRFNR SEGSSSRF NE S S QR+ESL+QKDFSFLE+FKLNTDNQSS  EKTEE SSSA +SESM EK QS+PQ PPEADEIFRKMKETGLIPNAVAML
Subjt:  QRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAML

Query:  DGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTT
        DGLCKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEK DEAIRIFRKMQ+NGIPPNAFSFGVLIQGLYKCKKL+DAVAFC EMLESGH PNL T
Subjt:  DGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTT

Query:  FVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNR-KPPFF
        FVGLID LCNEKGVDEAHSVVET KQKGFLI+EKALRE LNKRAPFSP IW+V FG + K  FF
Subjt:  FVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNR-KPPFF

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6C0 Uncharacterized protein | e value: 7.0e-201 | % identity: 99.17
Query:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF
        MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF
Subjt:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF

Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL
        NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQ PPEADEIF KMKETGLIPNAVAMLDGL
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL

Query:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
        CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
Subjt:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG

Query:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALRE LNKRAPFSPDIWKVFFGNRKPPFF
Subjt:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

A0A1S3CFW0 pentatricopeptide repeat-containing protein At4g38150 | e value: 3.2e-185 | % identity: 92.5
Query:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF
        M AFQ QSRLQVSKILSS+IRKWEIDFPLLHNPLRTPDFSL PA SRRF+SNSMDDD AFS+SEP NTPPQRRFPPDHREARRVPRGGVT SYDNRNQRF
Subjt:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF

Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL
        NRHSEGSSSRFTNEGSTS QRSESLSQKDFSFLEKFKLNTDNQSSG EKTEENSSSAPVSESM EKQQS+ Q PP ADEIFRKMKE+GLIPNAVAMLDGL
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL

Query:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
        CKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQ+ GI PNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
Subjt:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG

Query:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LID LC+EKGVDEAH+VVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGN+K PFF
Subjt:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

A0A5D3D5J5 Pentatricopeptide repeat-containing protein | e value: 3.2e-185 | % identity: 92.5
Query:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF
        M AFQ QSRLQVSKILSS+IRKWEIDFPLLHNPLRTPDFSL PA SRRF+SNSMDDD AFS+SEP NTPPQRRFPPDHREARRVPRGGVT SYDNRNQRF
Subjt:  MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRF

Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL
        NRHSEGSSSRFTNEGSTS QRSESLSQKDFSFLEKFKLNTDNQSSG EKTEENSSSAPVSESM EKQQS+ Q PP ADEIFRKMKE+GLIPNAVAMLDGL
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGL

Query:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
        CKDGLIQEAMKLF LIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQ+ GI PNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG
Subjt:  CKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVG

Query:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LID LC+EKGVDEAH+VVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGN+K PFF
Subjt:  LIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

A0A6J1GKZ0 pentatricopeptide repeat-containing protein At4g38150-like | e value: 1.6e-149 | % identity: 78.14
Query:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D
        MGA   QLQSRL+VSKILSSTIR WE  FPLLH+PLRTPD SL P +SR F+SNSMDDD  FS SEP ++PPQRR PPD REARRVP+  G V+A Y  D
Subjt:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D

Query:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV
        NRNQRFNRHSEGSSSRF N+G  S  R+ESLSQKDFSFLEKFKLNTDNQS+  EK E  SSSA +SESM + +QS+PQ PPEADEIFRKMKETGLIPNAV
Subjt:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV

Query:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN
        AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAEK DEAIRIFRKMQ+NGI PNAFS+G+LIQGLYK K+L+DAV+FC EMLESGH PN
Subjt:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN

Query:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LTTFVGLID LC EKGVDEAH+V+ET KQKGFLI+EK LR+FLNKRAPFSP IW++ FG +   FF
Subjt:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

A0A6J1I1Q2 pentatricopeptide repeat-containing protein At4g38150-like | e value: 4.3e-150 | % identity: 78.96
Query:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D
        MGA   QLQSRL+VSKIL STIR WE  FPLLH+PLRTPD SL P +SR F+SNSMDDD  FS SEP ++PPQRR PPD REARRVP+  G V+A Y  D
Subjt:  MGA--FQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPR--GGVTASY--D

Query:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV
        NRNQRFNRH EGS SRF NEG  S +RSESLSQKDFSFLEKFKLNTDN S+  EKTE  SSSA  SESM + +QS+PQ PPEADEIFRKMKETGLIPNAV
Subjt:  NRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAV

Query:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN
        AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVV+YTAVVDGFCKAE  DEAIRIFRKMQ+NGI PNAFS+GVLIQGLYKCKKL+DAV+FC EMLESGH PN
Subjt:  AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPN

Query:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF
        LTTFVGLID LC EKGVDEAH+V+ET KQKGFLI+EKALREFLNKRAPFSP IW++ FG +   FF
Subjt:  LTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKPPFF

SwissProt top hits (e value, % identity, alignment)
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial | e value: 2.5e-22 | % identity: 35.62
Query:  EADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYK
        EA+E F +M   G++P+ V    ++DG CK G I+ A K F  +  +   P+V+ YTA++ GFC+     EA ++F +M   G+ P++ +F  LI G  K
Subjt:  EADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYK

Query:  CKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVV
           + DA    N M+++G  PN+ T+  LID LC E  +D A+ ++
Subjt:  CKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVV

Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial | e value: 7.4e-22 | % identity: 29.8
Query:  EADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYK
        EA E++ +M   G+ P+ +   +++DG CK+  + EA ++F L+  KG  P++V Y+ +++ +CKA++ D+ +R+FR++   G+ PN  ++  L+ G  +
Subjt:  EADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYK

Query:  CKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQ
          KL+ A     EM+  G  P++ T+  L+D LC+   +++A  + E  ++
Subjt:  CKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQ

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial | e value: 1.5e-22 | % identity: 36.18
Query:  ADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKC
        A E+ RKM+E  +  +AV    ++DGLCKDG +  A  LF  +  KG   +++ Y  ++ GFC A ++D+  ++ R M    I PN  +F VLI    K 
Subjt:  ADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKC

Query:  KKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG
         KL +A     EM++ G  PN  T+  LID  C E  ++EA  +V+    KG
Subjt:  KKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG

Q9M9X9 Pentatricopeptide repeat-containing protein At1g06710, mitochondrial | e value: 1.1e-22 | % identity: 33.12
Query:  PEADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLY
        P+ D  F++  +    PN V   A+LDG CK   ++EA KL   +  +G  P  ++Y A++DG CK  K DEA  +  +M  +G P   +++  LI   +
Subjt:  PEADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLY

Query:  KCKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG
        K K+ D A    ++MLE+   PN+  +  +ID LC     DEA+ +++  ++KG
Subjt:  KCKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG

Q9SZL5 Pentatricopeptide repeat-containing protein At4g38150 | e value: 1.1e-73 | % identity: 57.36
Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPE-ADEIFRKMKETGLIPNAVAMLDG
        N H E  + +  N G +    S      D  FLE+FKL  +  S    K E+                 EP  PPE +DEIF+KMKE GLIPNAVAMLDG
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPE-ADEIFRKMKETGLIPNAVAMLDG

Query:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFV
        LCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K ++A RIFRKMQ+NGI PNAFS+GVL+QGLY C  LDDAVAFC+EMLESGH PN+ TFV
Subjt:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFV

Query:  GLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKP
         L+DALC  KGV++A S ++T  QKGF ++ KA++EF++KRAPF    W+  F  +KP
Subjt:  GLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKP

Arabidopsis top hits (e value, % identity, alignment)
AT1G06710.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 8.1e-24 | % identity: 33.12
Query:  PEADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLY
        P+ D  F++  +    PN V   A+LDG CK   ++EA KL   +  +G  P  ++Y A++DG CK  K DEA  +  +M  +G P   +++  LI   +
Subjt:  PEADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLY

Query:  KCKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG
        K K+ D A    ++MLE+   PN+  +  +ID LC     DEA+ +++  ++KG
Subjt:  KCKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG

AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.1e-23 | % identity: 36.18
Query:  ADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKC
        A E+ RKM+E  +  +AV    ++DGLCKDG +  A  LF  +  KG   +++ Y  ++ GFC A ++D+  ++ R M    I PN  +F VLI    K 
Subjt:  ADEIFRKMKETGLIPNAV---AMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKC

Query:  KKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG
         KL +A     EM++ G  PN  T+  LID  C E  ++EA  +V+    KG
Subjt:  KKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKG

AT4G38150.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 7.7e-75 | % identity: 57.36
Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPE-ADEIFRKMKETGLIPNAVAMLDG
        N H E  + +  N G +    S      D  FLE+FKL  +  S    K E+                 EP  PPE +DEIF+KMKE GLIPNAVAMLDG
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPE-ADEIFRKMKETGLIPNAVAMLDG

Query:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFV
        LCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K ++A RIFRKMQ+NGI PNAFS+GVL+QGLY C  LDDAVAFC+EMLESGH PN+ TFV
Subjt:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFV

Query:  GLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKP
         L+DALC  KGV++A S ++T  QKGF ++ KA++EF++KRAPF    W+  F  +KP
Subjt:  GLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKP

AT4G38150.2 Pentatricopeptide repeat (PPR) superfamily protein | e value: 7.7e-75 | % identity: 57.36
Query:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPE-ADEIFRKMKETGLIPNAVAMLDG
        N H E  + +  N G +    S      D  FLE+FKL  +  S    K E+                 EP  PPE +DEIF+KMKE GLIPNAVAMLDG
Subjt:  NRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPE-ADEIFRKMKETGLIPNAVAMLDG

Query:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFV
        LCKDGL+QEAMKLFGL+R+KGTIPEVVIYTAVV+ FCKA K ++A RIFRKMQ+NGI PNAFS+GVL+QGLY C  LDDAVAFC+EMLESGH PN+ TFV
Subjt:  LCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFV

Query:  GLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKP
         L+DALC  KGV++A S ++T  QKGF ++ KA++EF++KRAPF    W+  F  +KP
Subjt:  GLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFLNKRAPFSPDIWKVFFGNRKP

AT5G03560.2 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.6e-26 | % identity: 31.52
Query:  SEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSES
        S  I+    R+ PP+ R  R   R     S  N N  F     G  +R  +  +       + S  D    E+    T+ Q +   K +E     P    
Subjt:  SEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRFNRHSEGSSSRFTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSES

Query:  MLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFS
          +    EP+ P    EIF KM+  G    AV M D L KDG   EA++LF  I++K  +P+VV +TA+V+ +  A +  E +++F +M  +G+ PNA++
Subjt:  MLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKGTIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFS

Query:  FGVLIQGLYKCKKL-DDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFL
        + VLI+GL    K   DA  +  EM+ +G  PN  T+  + +A   E   + A  +++  K KGF+ DEKA+RE L
Subjt:  FGVLIQGLYKCKKL-DDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDEKALREFL
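Each hit above reports a % identity alongside a BLAST-style alignment whose middle row repeats the residue where query and subject agree, shows `+` for a positive-scoring substitution, and is blank otherwise. The sketch below shows one way such a figure could be recomputed from the middle rows, assuming identity is counted over all aligned columns including gaps. The function names and the toy rows are illustrative only and are not taken from this record; the `Query:` prefix and its padding must be stripped before calling.

```python
def midline_stats(query_row, midline):
    """Return (identical, positives, columns) for one alignment row.

    query_row: the aligned query residues, gaps ('-') included.
    midline:   the row printed between query and subject.
    """
    # Plain-text output often loses trailing spaces, so pad the midline.
    midline = midline.ljust(len(query_row))
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(query_row)


def percent_identity(rows):
    """rows: iterable of (query_row, midline) pairs belonging to one hit."""
    total_identical = 0
    total_columns = 0
    for query_row, midline in rows:
        identical, _, columns = midline_stats(query_row, midline)
        total_identical += identical
        total_columns += columns
    return 100.0 * total_identical / total_columns if total_columns else 0.0


# Toy rows (hypothetical data, not from this page).
toy_rows = [("MGAFQL", "MGA QL"), ("ACDE", "A+ E")]
print(midline_stats("ACDE", "A+ E"))   # (2, 3, 4)
print(percent_identity(toy_rows))      # 70.0
```

Only the query row and the middle row are needed for this count, so it does not depend on how the subject row is displayed.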


Sequences
CDS sequence
ATGGGGGCGTTTCAACTTCAATCTCGTCTTCAAGTTTCGAAGATTTTGTCTTCAACTATTCGGAAATGGGAAATAGATTTTCCTCTCCTCCACAACCCATTGCGAACGCC
GGATTTCTCACTCTACCCTGCGCAATCTCGTCGTTTTGCCTCCAATTCCATGGATGACGATCATGCCTTCTCGGATTCCGAGCCGATAAACACCCCGCCGCAACGTCGTT
TTCCTCCAGATCATCGCGAAGCTCGTCGTGTTCCCAGAGGGGGAGTAACTGCTTCTTACGATAATCGAAATCAACGCTTTAATAGGCATTCAGAAGGTTCCTCTTCTCGA
TTCACAAATGAAGGGTCGACTTCGCCACAAAGGAGTGAGTCTTTATCTCAGAAAGATTTTAGCTTTCTTGAAAAATTCAAGCTGAATACTGATAATCAGAGTAGTGGTAA
AGAGAAAACTGAGGAGAACTCTTCTTCTGCTCCGGTGTCTGAATCAATGCTGGAGAAGCAGCAGTCGGAACCGCAGGGTCCTCCAGAAGCTGATGAGATATTCAGGAAAA
TGAAGGAGACTGGTTTAATTCCCAATGCTGTTGCTATGCTTGATGGGCTTTGTAAAGATGGACTTATACAAGAAGCAATGAAGCTTTTTGGTTTGATTCGTGAGAAAGGT
ACAATTCCAGAAGTTGTGATTTACACTGCTGTTGTTGATGGGTTTTGCAAGGCGGAGAAGTTTGATGAAGCAATTAGGATTTTCAGGAAAATGCAGCATAACGGTATTCC
TCCAAATGCCTTCAGTTTTGGTGTCTTGATTCAGGGATTGTACAAATGCAAAAAGTTAGATGATGCTGTAGCATTTTGCAATGAGATGTTAGAATCTGGGCATTTACCAA
ATCTTACCACTTTTGTTGGCCTAATTGATGCGTTATGCAATGAGAAGGGTGTGGATGAAGCTCATAGTGTCGTAGAAACCTTTAAACAAAAGGGATTCTTGATTGATGAG
AAAGCTTTAAGGGAATTTTTGAATAAAAGAGCCCCATTTTCACCAGATATCTGGAAAGTGTTCTTTGGTAATAGAAAACCCCCATTTTTCTGA
mRNA sequence
GCTAAAACCCTACTCCCATCCCCTCTCCTCCCAACCCCATTTGAACTTTCATCAAATACCCTCTCGTACCAATATTTCTTCATAATTTCTTGGTTTAAATTTCTCATGGG
GGCGTTTCAACTTCAATCTCGTCTTCAAGTTTCGAAGATTTTGTCTTCAACTATTCGGAAATGGGAAATAGATTTTCCTCTCCTCCACAACCCATTGCGAACGCCGGATT
TCTCACTCTACCCTGCGCAATCTCGTCGTTTTGCCTCCAATTCCATGGATGACGATCATGCCTTCTCGGATTCCGAGCCGATAAACACCCCGCCGCAACGTCGTTTTCCT
CCAGATCATCGCGAAGCTCGTCGTGTTCCCAGAGGGGGAGTAACTGCTTCTTACGATAATCGAAATCAACGCTTTAATAGGCATTCAGAAGGTTCCTCTTCTCGATTCAC
AAATGAAGGGTCGACTTCGCCACAAAGGAGTGAGTCTTTATCTCAGAAAGATTTTAGCTTTCTTGAAAAATTCAAGCTGAATACTGATAATCAGAGTAGTGGTAAAGAGA
AAACTGAGGAGAACTCTTCTTCTGCTCCGGTGTCTGAATCAATGCTGGAGAAGCAGCAGTCGGAACCGCAGGGTCCTCCAGAAGCTGATGAGATATTCAGGAAAATGAAG
GAGACTGGTTTAATTCCCAATGCTGTTGCTATGCTTGATGGGCTTTGTAAAGATGGACTTATACAAGAAGCAATGAAGCTTTTTGGTTTGATTCGTGAGAAAGGTACAAT
TCCAGAAGTTGTGATTTACACTGCTGTTGTTGATGGGTTTTGCAAGGCGGAGAAGTTTGATGAAGCAATTAGGATTTTCAGGAAAATGCAGCATAACGGTATTCCTCCAA
ATGCCTTCAGTTTTGGTGTCTTGATTCAGGGATTGTACAAATGCAAAAAGTTAGATGATGCTGTAGCATTTTGCAATGAGATGTTAGAATCTGGGCATTTACCAAATCTT
ACCACTTTTGTTGGCCTAATTGATGCGTTATGCAATGAGAAGGGTGTGGATGAAGCTCATAGTGTCGTAGAAACCTTTAAACAAAAGGGATTCTTGATTGATGAGAAAGC
TTTAAGGGAATTTTTGAATAAAAGAGCCCCATTTTCACCAGATATCTGGAAAGTGTTCTTTGGTAATAGAAAACCCCCATTTTTCTGAGATTGCTGTCGTAGTGTTGTGA
AGAGACCAATCATATTACTGAGAAGAAACCACAAGTTGAAATCATTGTAACTCAGTAGTCAGAATTTGATTATCGCGTTGAAGAATAAAAATCTGGTTGAAGAGTGAACT
GTCTACAACTGTTTAAGTGAGAGCTCTAAACCATTGGTCAAGTTTGGGGAACACAAAACTTGTGCAGTCAAGTTCATTGACCTTTCGTGAAAGATTGTGTGGACTGGATC
AATGCATGGGAGACATTAGCTGCAGTTAACAACTGATGGATATTTTTTGCAAGATTGAACATCAGAAAATCTGTCCAATTCACCATTTACCTCTTAGAATTTTTTTAAAA
CAATCATGTAAGTTATGACTATGTCATTGCTTCCTAGTTGCTACCACTAATGGCTATATAGAGTTTCTTTTTGTCTCTCACTCTCTGTGCACTGTGCTTAACTTCCCTTA
TTTAATTATTTGATTGTAAAGAGTTTATATGAGTGACTAGAAAGCTTAACCTAATAGATTGATTGGTGATGTAACAGTATAAGGATGGAAGGTCTTGTATTCAAATTTTT
GTAACAAACCGTTGTTTCTTCTCCAATTAACATTGATTTCTACTTGTAGGTCTTCC
Protein sequence
MGAFQLQSRLQVSKILSSTIRKWEIDFPLLHNPLRTPDFSLYPAQSRRFASNSMDDDHAFSDSEPINTPPQRRFPPDHREARRVPRGGVTASYDNRNQRFNRHSEGSSSR
FTNEGSTSPQRSESLSQKDFSFLEKFKLNTDNQSSGKEKTEENSSSAPVSESMLEKQQSEPQGPPEADEIFRKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLIREKG
TIPEVVIYTAVVDGFCKAEKFDEAIRIFRKMQHNGIPPNAFSFGVLIQGLYKCKKLDDAVAFCNEMLESGHLPNLTTFVGLIDALCNEKGVDEAHSVVETFKQKGFLIDE
KALREFLNKRAPFSPDIWKVFFGNRKPPFF
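The CDS and protein sequences listed above should agree under the standard genetic code. The following sketch shows how that could be checked, assuming the standard nuclear code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper written for illustration, and the two inputs are the first 33 and last 27 nucleotides of the CDS shown on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequences
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGGGGCGTTTCAACTTCAATCTCGTCTTCAA"))  # MGAFQLQSRLQ
print(translate("GGTAATAGAAAACCCCCATTTTTCTGA"))        # GNRKPPFF
```

Both outputs match the start and end of the protein sequence above; passing the full CDS (line breaks are ignored) would allow the same comparison over the whole length.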