; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0012592 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0012592
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr1:42493402..42499358
RNA-Seq ExpressionLag0012592
SyntenyLag0012592
Gene Ontology termsGO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7027282.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.54Show/hide
Query:  MRVFLILG----SSSSASIAGHRCHRH-SHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLED
        MR  LILG    SSSS+SIAG RC RH SHSKA  SSLSN+ PTGTHSPLSSRPST+HSR PLLSSVQWD AGASSGG+T LRHYADVASKLAERGKL+D
Subjt:  MRVFLILG----SSSSASIAGHRCHRH-SHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLED

Query:  FAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIK
        FAMVVE+VVVAGVEPSQFAAVLAVELVAKGISRCLREG LWNVVQVLRKVEELGISAVGLCD+ AVESLRR+CRRI+KSG+LEELVEFMEV++GFGFSIK
Subjt:  FAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIK

Query:  EMMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIY-QDLLNQNVT
        EMMKPSEVIKLCVD RNPK+AIRYASILP ADILFCT INEFGKKRDLKSAFMAYTESKA+MNGPNMYIYR+IIDVCGLCGDYKKSRNIY QDLL+QNVT
Subjt:  EMMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIY-QDLLNQNVT

Query:  PNIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKED
        PN+YVFNSLMNVNAHDLNY FQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKED
Subjt:  PNIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKED

Query:  MQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCT
        MQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDD +R+NST+DNLNADPTSQLC 
Subjt:  MQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCT

Query:  TNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAI
        TNMPNA SHVHQI F GNF+FKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGS DV++AVQILTTMR+AGVDPDVVAYTTAI
Subjt:  TNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAI

Query:  KVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEIT-PCNRINM
        KVCV  KNW+LAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKS+DHYLKELIAEWCEGV+QNNNQQQ E T PCNR ++
Subjt:  KVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEIT-PCNRINM

Query:  GKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAG
        GKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIIL VSKVE DLV+QNFEVRDAITKLLQ ELGL+VLPAG
Subjt:  GKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAG

Query:  PTVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        PT A D+V  SESSNMS TKL GIMGRNKY+TR+PA VQRLKVTKKSL+DWLQRNR
Subjt:  PTVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

XP_004142106.1 pentatricopeptide repeat-containing protein At5g02830, chloroplastic [Cucumis sativus]0.0e+0090Show/hide
Query:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV
        MRVFLILG SSSASIAG R +RHSH KAPKSSLSNLSPTGTH P SS  ST HS P LLSSV+ D AGASSGGR P++HYA VASKLAE GKLEDFAMVV
Subjt:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV

Query:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP
        ESVVVAGVEPSQF A+LAVELVAKGISRCLREGK+W+VVQVLRKVEELGIS + LCD+ AVESLRRDCRR+AKSGELEELVE MEV++GFGFS++EMMKP
Subjt:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP

Query:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF
        SEVIKLCVD RNPKMAIRYASILPHADILFCTTINEFGKKRDLKSA++AYTESKANMNG NMYIYRTIIDVCGLCGDYKKSRNIYQDL+NQNV PNI+VF
Subjt:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF

Query:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
        NSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
Subjt:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV

Query:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA
        SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCN LLHACVE RQFDRAFRLFRSW+EKELWD  +RK+ST++NL+AD TSQLC T MPNA
Subjt:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA

Query:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG
        PSHVHQI FVGNF+FKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSIL+DICG SHDVE+AVQILTTMR+AGVDPDVVAYTTAIKVCVEG
Subjt:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG

Query:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI
        KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQ NNQQ VEITPCN+I++GKPRCLI
Subjt:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI

Query:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK
        LEKVA+HLQKSF ESL IDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEV+KVETDLV QNFEVRDAIT+LLQ+ELGL+VLP GPT+ALDK
Subjt:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK

Query:  VPDSESSNMSH-TKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        VP+SESS +SH TKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  VPDSESSNMSH-TKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

XP_022143740.1 pentatricopeptide repeat-containing protein At5g02830, chloroplastic [Momordica charantia]0.0e+0092.23Show/hide
Query:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV
        MRVFLILG SSSASIAG R HRHSHSK PKSSLSNLSPTG HSPLSSRPSTLHSRPPLLSSVQWD AGASSGGRTPLRHYADVASKLAERGKLEDFAMVV
Subjt:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV

Query:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP
        ESVVVAGVEPS F AVL VELVA GISRCLREGKLW VVQVLRKVEELGISAVGLCD  AVESL+RDCRR+AKSGELEELVEFMEV++GFGFSIKE+MKP
Subjt:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP

Query:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF
        SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAY+ESKANMNGPNMYIYRTIIDVCGLCGD+KKSRNIYQDL++QNVTPNIYVF
Subjt:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF

Query:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
        NSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNI+LKACCLAGRVDLAQD+YREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
Subjt:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV

Query:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA
        SPNMVTWSSLISSCANSGLVELAIQLFEEM+  GCEPN QCCNILLHACVEARQFDRAFRLFRSWR KE+WDD +RKNSTN NLNAD TSQLCTTNM NA
Subjt:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA

Query:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG
        P H HQI+FVG+F+FKPTITTYNILMKACGTDYYHAKALMEEM+SVGLTPNHISWSILIDICGGSHDVE+AVQILTTMR+AGVDPDVVAYTTAIKVCVEG
Subjt:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG

Query:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI
        KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSL EVQQCLAIYQDMRKS FKSNDHYLKELIAEWCEGVIQN++QQQ EI PC +I++GKPR LI
Subjt:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI

Query:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK
        LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDI IILEVSKVETDLVQQNFEVRDAITKLLQ+ELGL+VLPA PTVAL K
Subjt:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK

Query:  VPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        V DSESSNMS+TKLKGI+GRNKYSTRRPADVQRLKVT+KSLQ WLQRNR
Subjt:  VPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

XP_022963393.1 pentatricopeptide repeat-containing protein At5g02830, chloroplastic isoform X1 [Cucurbita moschata]0.0e+0090.29Show/hide
Query:  MRVFLILG----SSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDF
        MR  LILG    SSSS+SIAG    R SHSKA  SSLSN+ PTGTHSPLSSRPST+HSR PLLSSVQWD AGASSGG+T LRHYADVASKLAERGKL+DF
Subjt:  MRVFLILG----SSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDF

Query:  AMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKE
        AMVVE+VVVAGVEPSQFAAVLAVELVAKGISRCLREG LWNVVQVLRKVEELGISAVGLCD+ AVESLRR+CRRI+KSG+LEELVEFMEV++GFGFSIKE
Subjt:  AMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKE

Query:  MMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIY-QDLLNQNVTP
        MMKPSEVIKLCVD RNPK+AIRYASILP ADILFCT INEFGKKRDLKSAFMAYTESKA+MNGPNMYIYR+IIDVCGLCGDYKKSRNIY QDLL+QNVTP
Subjt:  MMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIY-QDLLNQNVTP

Query:  NIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDM
        N+YVFNSLMNVNAHDLNY FQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDM
Subjt:  NIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDM

Query:  QSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTT
        QSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDD +R+NST+DNLNADPTSQLC T
Subjt:  QSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTT

Query:  NMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIK
        NMPNA SHVHQI F GNF+FKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGS DV++AVQILTTMR+AGVDPDVVAYTTAIK
Subjt:  NMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIK

Query:  VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEIT-PCNRINMG
        VCV  KNW+LAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKS+DHYLKELIAEWCEGV+QNNNQQQ E T PCNR ++G
Subjt:  VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEIT-PCNRINMG

Query:  KPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGP
        KPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIIL VSKVE DLV+QNFEVRDAITKLLQ ELGL+VLPAGP
Subjt:  KPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGP

Query:  TVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        T A D+V  SESSNMS TKL GIMGRNKY+TR+PA VQRLKVTKKSL+DWLQRNR
Subjt:  TVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

XP_038881251.1 pentatricopeptide repeat-containing protein At5g02830, chloroplastic [Benincasa hispida]0.0e+0091.76Show/hide
Query:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV
        MRVF     SSSASIAG R H HSHSKAP + LS   P     PLSS PST HS PPLLSSVQ D AGASSGGR PL+HYA VASKLAERGKLEDFA+VV
Subjt:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV

Query:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP
        ESVVVAGVEPSQFAAVLAVEL+AKGISRCLREGKLW+V+QVLRKV+ELGISA+GLCD+ AVESLRRDC RIAKSGELEELVEFME +AGFGFSIKEMMKP
Subjt:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP

Query:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF
        SEVIKLCVD RNPKMAIRY+SILPHADILFCTTI+EFGKKRDLKSA++AYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDL+NQNVTPNI+VF
Subjt:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF

Query:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
        NSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
Subjt:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV

Query:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA
        SPNMVTWSSLISSCANSGLVELAIQLFEEMVS GCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDD +RK+STNDNLNAD TSQLCTTNMPNA
Subjt:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA

Query:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG
        PSHVHQI FVGNF+FKPTITTYNILMKACGTDYYHAKALMEEM+SVGLTPNHISWSILIDICGGSHDVENAVQILTTMR+AGVDPDVVAYTTAIKVCVEG
Subjt:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG

Query:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI
        KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNN+QQVEITPCNRI++GKPRCLI
Subjt:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI

Query:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK
        LEKVA+HLQKSFTESL IDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQ+ELGL+VLP G T+ALDK
Subjt:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK

Query:  VPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        VP+SES NMSHTKL+GI+GRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
Subjt:  VPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

TrEMBL top hitse value%identityAlignment
A0A0A0KX28 PPR_long domain-containing protein0.0e+0090Show/hide
Query:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV
        MRVFLILG SSSASIAG R +RHSH KAPKSSLSNLSPTGTH P SS  ST HS P LLSSV+ D AGASSGGR P++HYA VASKLAE GKLEDFAMVV
Subjt:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV

Query:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP
        ESVVVAGVEPSQF A+LAVELVAKGISRCLREGK+W+VVQVLRKVEELGIS + LCD+ AVESLRRDCRR+AKSGELEELVE MEV++GFGFS++EMMKP
Subjt:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP

Query:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF
        SEVIKLCVD RNPKMAIRYASILPHADILFCTTINEFGKKRDLKSA++AYTESKANMNG NMYIYRTIIDVCGLCGDYKKSRNIYQDL+NQNV PNI+VF
Subjt:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF

Query:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
        NSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
Subjt:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV

Query:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA
        SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCN LLHACVE RQFDRAFRLFRSW+EKELWD  +RK+ST++NL+AD TSQLC T MPNA
Subjt:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA

Query:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG
        PSHVHQI FVGNF+FKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSIL+DICG SHDVE+AVQILTTMR+AGVDPDVVAYTTAIKVCVEG
Subjt:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG

Query:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI
        KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQ NNQQ VEITPCN+I++GKPRCLI
Subjt:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI

Query:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK
        LEKVA+HLQKSF ESL IDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEV+KVETDLV QNFEVRDAIT+LLQ+ELGL+VLP GPT+ALDK
Subjt:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK

Query:  VPDSESSNMSH-TKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        VP+SESS +SH TKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  VPDSESSNMSH-TKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A1S3BHG8 Pentatricopeptide repeat-containing protein0.0e+0089.19Show/hide
Query:  MRVFLILGSSSSASIAGHR--CHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAM
        MRVFLILG S+SASIAG R   HRHSH KAPKSSLSN+SPTGTH P SS  ST HS P LLSSV+ D AGASSGGR P++HYA VA+KLAERGKLEDFAM
Subjt:  MRVFLILGSSSSASIAGHR--CHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAM

Query:  VVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMM
        VVESVVVAGVEPSQF A+LAVELVAKGISRCLREGKLW+VVQVLRKVEELGISA+GLCD+ AVESLRRDCRR+AKSGELEELVEF+EV++GFG S+KEMM
Subjt:  VVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMM

Query:  KPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIY
        KP EVIKLCVD RNPKMAIRYASILPHADILFCTTINEFGKKRDLKSA++AY ESKANMNG NMYI+R+IIDVCGLCGDYKKSRNIYQDL+NQNVTPNI+
Subjt:  KPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIY

Query:  VFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSA
        VFNSLMNVNAHDLNYTFQLYK MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSA
Subjt:  VFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSA

Query:  GVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMP
        GVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWD  +RK+S +DNL+AD TSQLCTTNM 
Subjt:  GVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMP

Query:  NAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCV
        NAPSH HQI  VGNF+FKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVE+AVQILTTMR+AGVDPDVVAYTTAIKVCV
Subjt:  NAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCV

Query:  EGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRC
        EGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TPCN+I++ KPRC
Subjt:  EGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRC

Query:  LILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVAL
        LILEKVA+HLQKSF ESL IDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEV+KV+TDL+++NFEVRDAITKLLQ+ELGL+VLP GPT+ L
Subjt:  LILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVAL

Query:  DKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        DKVP+SESSNMSHTKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  DKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A2D0WXL1 Pentatricopeptide repeat-containing protein0.0e+0089.19Show/hide
Query:  MRVFLILGSSSSASIAGHR--CHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAM
        MRVFLILG S+SASIAG R   HRHSH KAPKSSLSN+SPTGTH P SS  ST HS P LLSSV+ D AGASSGGR P++HYA VA+KLAERGKLEDFAM
Subjt:  MRVFLILGSSSSASIAGHR--CHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAM

Query:  VVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMM
        VVESVVVAGVEPSQF A+LA+ELVAKGISRCLREGKLW+VVQVLRKVEELGISA+GLCD+ AVESLRRDCRR+AKSGELEELVEF+EV++GFG S+KEMM
Subjt:  VVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMM

Query:  KPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIY
        KP EVIKLCVD RNPKMAIRYASILPHADILFCTTINEFGKKRDLKSA++AY ESKANMNG NMYI+RTIIDVCGLCGDYKKSRNIYQDL+NQNVTPNI+
Subjt:  KPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIY

Query:  VFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSA
        VFNSLMNVNAHDLNYTFQLYK MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSA
Subjt:  VFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSA

Query:  GVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMP
        GVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWD  +RK+S +DNL+AD TSQLCTTNM 
Subjt:  GVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMP

Query:  NAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCV
        NAPSH HQI  VGNF+FKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVE+AVQILTTMR+AGVDPDVVAYTTAIKVCV
Subjt:  NAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCV

Query:  EGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRC
        EGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TPCN+I++ KPRC
Subjt:  EGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRC

Query:  LILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVAL
        LILEKVA+HLQKSF ESL IDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEV+KV+TDL+++NFEVRDAITKLLQ+ELGL+VLP GPT+ L
Subjt:  LILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVAL

Query:  DKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        DKVP+SESSNMSHTKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  DKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A6J1CPM4 pentatricopeptide repeat-containing protein At5g02830, chloroplastic0.0e+0092.23Show/hide
Query:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV
        MRVFLILG SSSASIAG R HRHSHSK PKSSLSNLSPTG HSPLSSRPSTLHSRPPLLSSVQWD AGASSGGRTPLRHYADVASKLAERGKLEDFAMVV
Subjt:  MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVV

Query:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP
        ESVVVAGVEPS F AVL VELVA GISRCLREGKLW VVQVLRKVEELGISAVGLCD  AVESL+RDCRR+AKSGELEELVEFMEV++GFGFSIKE+MKP
Subjt:  ESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKP

Query:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF
        SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAY+ESKANMNGPNMYIYRTIIDVCGLCGD+KKSRNIYQDL++QNVTPNIYVF
Subjt:  SEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVF

Query:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
        NSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNI+LKACCLAGRVDLAQD+YREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV
Subjt:  NSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGV

Query:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA
        SPNMVTWSSLISSCANSGLVELAIQLFEEM+  GCEPN QCCNILLHACVEARQFDRAFRLFRSWR KE+WDD +RKNSTN NLNAD TSQLCTTNM NA
Subjt:  SPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNA

Query:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG
        P H HQI+FVG+F+FKPTITTYNILMKACGTDYYHAKALMEEM+SVGLTPNHISWSILIDICGGSHDVE+AVQILTTMR+AGVDPDVVAYTTAIKVCVEG
Subjt:  PSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEG

Query:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI
        KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSL EVQQCLAIYQDMRKS FKSNDHYLKELIAEWCEGVIQN++QQQ EI PC +I++GKPR LI
Subjt:  KNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLI

Query:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK
        LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDI IILEVSKVETDLVQQNFEVRDAITKLLQ+ELGL+VLPA PTVAL K
Subjt:  LEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDK

Query:  VPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        V DSESSNMS+TKLKGI+GRNKYSTRRPADVQRLKVT+KSLQ WLQRNR
Subjt:  VPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A6J1HHN3 pentatricopeptide repeat-containing protein At5g02830, chloroplastic isoform X10.0e+0090.29Show/hide
Query:  MRVFLILG----SSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDF
        MR  LILG    SSSS+SIAG    R SHSKA  SSLSN+ PTGTHSPLSSRPST+HSR PLLSSVQWD AGASSGG+T LRHYADVASKLAERGKL+DF
Subjt:  MRVFLILG----SSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDF

Query:  AMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKE
        AMVVE+VVVAGVEPSQFAAVLAVELVAKGISRCLREG LWNVVQVLRKVEELGISAVGLCD+ AVESLRR+CRRI+KSG+LEELVEFMEV++GFGFSIKE
Subjt:  AMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKE

Query:  MMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIY-QDLLNQNVTP
        MMKPSEVIKLCVD RNPK+AIRYASILP ADILFCT INEFGKKRDLKSAFMAYTESKA+MNGPNMYIYR+IIDVCGLCGDYKKSRNIY QDLL+QNVTP
Subjt:  MMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIY-QDLLNQNVTP

Query:  NIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDM
        N+YVFNSLMNVNAHDLNY FQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVK+LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDM
Subjt:  NIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDM

Query:  QSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTT
        QSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDD +R+NST+DNLNADPTSQLC T
Subjt:  QSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTT

Query:  NMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIK
        NMPNA SHVHQI F GNF+FKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGS DV++AVQILTTMR+AGVDPDVVAYTTAIK
Subjt:  NMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIK

Query:  VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEIT-PCNRINMG
        VCV  KNW+LAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKS+DHYLKELIAEWCEGV+QNNNQQQ E T PCNR ++G
Subjt:  VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEIT-PCNRINMG

Query:  KPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGP
        KPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIIL VSKVE DLV+QNFEVRDAITKLLQ ELGL+VLPAGP
Subjt:  KPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGP

Query:  TVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        T A D+V  SESSNMS TKL GIMGRNKY+TR+PA VQRLKVTKKSL+DWLQRNR
Subjt:  TVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

SwissProt top hitse value%identityAlignment
Q3ECK2 Pentatricopeptide repeat-containing protein At1g62680, mitochondrial1.4e-2626.1Show/hide
Query:  NMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKN
        ++Y +  +I+    C     + +I   +L     P+     SL+N     + ++    L   M  +G   D+ +YN ++ + C   RV+ A D ++E+  
Subjt:  NMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKN

Query:  LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRA
         E  G+ + +V TY+ +V    ++  W  A R+  DM    ++PN++T+S+L+ +   +G V  A +LFEEMV    +P+    + L++      + D A
Subjt:  LETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRA

Query:  FRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKAC--GTDYYHAKALMEEMKSVGLTPNHISWS
         ++F     K    D     S N  +N    ++     M      + Q   V N        TYN L++      D   A+    +M   G++P+  +++
Subjt:  FRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKAC--GTDYYHAKALMEEMKSVGLTPNHISWS

Query:  ILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIK-VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSG
        IL+     + ++E A+ I   M+   +D D+V YTT I+ +C  GK  + A+SLF  +    ++P++VTY+T++    T G LHEV+   A+Y  M++ G
Subjt:  ILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIK-VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSG

Query:  FKSNDHYLKE
           ND  L +
Subjt:  FKSNDHYLKE

Q8GYL7 Pentatricopeptide repeat-containing protein At5g02830, chloroplastic3.2e-24953.97Show/hide
Query:  MRVFLILGSSSSASIAGHRCHRHSHSKAPK--------SSLSNLSPT--GTHSPLSSRPSTLHSRPPLLSS-VQWDFAGASSGGRTPLRHYADVASKLAE
        MR F+I+  SSSA    H  HR  ++ AP+        SS + L P+    HSP  +  S  HS     S+ V+W   G+       L +YAD ASKLAE
Subjt:  MRVFLILGSSSSASIAGHRCHRHSHSKAPK--------SSLSNLSPT--GTHSPLSSRPSTLHSRPPLLSS-VQWDFAGASSGGRTPLRHYADVASKLAE

Query:  RGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIA
         G++ED A++ E++   +G   ++FA+++  +L++KGIS  LR+GK+ +VV  L+++E++GI+ + L DD +V+ +R+  R +A S ++E+ ++ ME++A
Subjt:  RGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIA

Query:  GFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDL
        G GF IKE++ P +V+K CV+  NP++AIRYA +LPH ++L C  I+ FGKK D+ S   AY   K  ++ PNMYI RT+IDVCGLCGDY KSR IY+DL
Subjt:  GFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDL

Query:  LNQNVTPNIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMA
        L +N+ PNIYV NSLMNVN+HDL YT ++YK+MQ L V ADM SYNILLK CCLAGRVDLAQDIY+E K +E++G+LKLD FTY TI+KVFADAK+WK A
Subjt:  LNQNVTPNIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMA

Query:  LRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWR----EKELWDD----KDRKNST
        L+VK+DM+S GV+PN  TWSSLIS+CAN+GLVE A  LFEEM+++GCEPN+QC NILLHACVEA Q+DRAFRLF+SW+     + L+ D    K R +S 
Subjt:  LRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWR----EKELWDD----KDRKNST

Query:  NDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRI
        N   N  P S      + N  S+   I     F FKPT  TYNIL+KACGTDYY  K LM+EMKS+GL+PN I+WS LID+CGGS DVE AV+IL TM  
Subjt:  NDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRI

Query:  AGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQ
        AG  PDVVAYTTAIK+C E K  KLAFSLFEEM+R++I+PN VTY+TLL+ARS YGSL EV+QCLAIYQDMR +G+K NDH+LKELI EWCEGVIQ N Q
Subjt:  AGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQ

Query:  QQVEITPCNRINMGKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKL
         Q +I+     N G+P  L++EKVA H+Q+    +LAIDLQ LTK+EAR+VVLAVLRMIKE+Y  G+ V DD+ II+   +  T   +Q   V++A+ KL
Subjt:  QQVEITPCNRINMGKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKL

Query:  LQEELGLQVLPAGPTVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        L++EL L VLPAG    +      + ++  +TK    +     STRRPA ++RL VTK SL  WLQR +
Subjt:  LQEELGLQVLPAGPTVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

Q940A6 Pentatricopeptide repeat-containing protein At4g19440, chloroplastic3.6e-2722.71Show/hide
Query:  LFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDY--------------------------------KKSRNIY---QDLLNQNV
        LF T IN F K   ++ A   +++ +     PN+  + T+ID  G+CG Y                                K+  + Y   +++  +  
Subjt:  LFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDY--------------------------------KKSRNIY---QDLLNQNV

Query:  TPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRV
         PN+ V+N+L++  + A  LN   ++   M + G+    ++YN L+K  C  G+ D A+ + +E+ ++       ++  ++++++ +     ++  ALR 
Subjt:  TPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRV

Query:  KEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQ
          +M    +SP     ++LIS     G    A++L+ + ++ G   +T+  N LLH   EA + D AFR+     +KE+      +    D ++ +    
Subjt:  KEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQ

Query:  LCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACG----TDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDV
         C        + +     V     KP   TY+IL+  CG         A    ++ K  G+ P+  ++S++ID C  +   E   +    M    V P+ 
Subjt:  LCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACG----TDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDV

Query:  VAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELI
        V Y   I+         +A  L E+MK   I PN  TY++L++  S    +  V++   ++++MR  G + N  +   LI
Subjt:  VAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELI

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.3e-2923.81Show/hide
Query:  KRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILL
        KR++  A   + E   +   PN++ Y  +I      G+   +  ++  +  +   PN+  +N+L++       ++  F+L +SM   G+  ++ SYN+++
Subjt:  KRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILL

Query:  KACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP
           C  GR+     +  E+          LD  TY+T++K +     +  AL +  +M   G++P+++T++SLI S   +G +  A++  ++M   G  P
Subjt:  KACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP

Query:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKA-CGTDYYH-
        N +    L+    +    + A+R+ R                 NDN                               F P++ TYN L+   C T     
Subjt:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKA-CGTDYYH-

Query:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTY
        A A++E+MK  GL+P+ +S+S ++     S+DV+ A+++   M   G+ PD + Y++ I+   E +  K A  L+EEM R  + P+  TY+ L+ A    
Subjt:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTY

Query:  GSLHEVQQCLAIYQDMRKSG
        G L   ++ L ++ +M + G
Subjt:  GSLHEVQQCLAIYQDMRKSG

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028604.3e-2819.57Show/hide
Query:  YADVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWN-VVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELE
        Y  + S  A  G+  +   V + +   G +P+     + + +  K        G  WN +  ++ K++  GI+     D +   +L   C+R +   E  
Subjt:  YADVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWN-VVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELE

Query:  ELVEFMEVIAGFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASIL-----PHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCG
        ++ E M+  AGF +   + +  + ++ +      PK A++  + +       + + + + I+ + +   L  A     +       P+++ Y T++    
Subjt:  ELVEFMEVIAGFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASIL-----PHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCG

Query:  LCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMNVNAHDLNYT--FQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFT
          G  + + +I++++ N    PNI  FN+ + +  +   +T   +++  +   G+  D+ ++N LL    + G+  +  ++    K ++  G +  +  T
Subjt:  LCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMNVNAHDLNYT--FQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFT

Query:  YSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELW
        ++T++  ++    ++ A+ V   M  AGV+P++ T+++++++ A  G+ E + ++  EM    C+PN      LLHA    ++      L      +E++
Subjt:  YSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELW

Query:  DDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDICGGSHDVE
               +         T  L  +     P        +    F P ITT N ++   G     AKA  +++ MK  G TP+  +++ L+ +   S D  
Subjt:  DDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDICGGSHDVE

Query:  NAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAE
         + +IL  +   G+ PD+++Y T I         + A  +F EM+   I P+++TY+T +    +Y +    ++ + + + M K G + N +    ++  
Subjt:  NAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAE

Query:  WCE
        +C+
Subjt:  WCE

Arabidopsis top hitse value%identityAlignment
AT4G19440.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-2822.71Show/hide
Query:  LFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDY--------------------------------KKSRNIY---QDLLNQNV
        LF T IN F K   ++ A   +++ +     PN+  + T+ID  G+CG Y                                K+  + Y   +++  +  
Subjt:  LFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDY--------------------------------KKSRNIY---QDLLNQNV

Query:  TPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRV
         PN+ V+N+L++  + A  LN   ++   M + G+    ++YN L+K  C  G+ D A+ + +E+ ++       ++  ++++++ +     ++  ALR 
Subjt:  TPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRV

Query:  KEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQ
          +M    +SP     ++LIS     G    A++L+ + ++ G   +T+  N LLH   EA + D AFR+     +KE+      +    D ++ +    
Subjt:  KEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQ

Query:  LCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACG----TDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDV
         C        + +     V     KP   TY+IL+  CG         A    ++ K  G+ P+  ++S++ID C  +   E   +    M    V P+ 
Subjt:  LCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACG----TDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDV

Query:  VAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELI
        V Y   I+         +A  L E+MK   I PN  TY++L++  S    +  V++   ++++MR  G + N  +   LI
Subjt:  VAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELI

AT4G19440.2 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-2822.71Show/hide
Query:  LFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDY--------------------------------KKSRNIY---QDLLNQNV
        LF T IN F K   ++ A   +++ +     PN+  + T+ID  G+CG Y                                K+  + Y   +++  +  
Subjt:  LFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDY--------------------------------KKSRNIY---QDLLNQNV

Query:  TPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRV
         PN+ V+N+L++  + A  LN   ++   M + G+    ++YN L+K  C  G+ D A+ + +E+ ++       ++  ++++++ +     ++  ALR 
Subjt:  TPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRV

Query:  KEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQ
          +M    +SP     ++LIS     G    A++L+ + ++ G   +T+  N LLH   EA + D AFR+     +KE+      +    D ++ +    
Subjt:  KEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQ

Query:  LCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACG----TDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDV
         C        + +     V     KP   TY+IL+  CG         A    ++ K  G+ P+  ++S++ID C  +   E   +    M    V P+ 
Subjt:  LCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACG----TDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDV

Query:  VAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELI
        V Y   I+         +A  L E+MK   I PN  TY++L++  S    +  V++   ++++MR  G + N  +   LI
Subjt:  VAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELI

AT5G02830.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.2e-25053.97Show/hide
Query:  MRVFLILGSSSSASIAGHRCHRHSHSKAPK--------SSLSNLSPT--GTHSPLSSRPSTLHSRPPLLSS-VQWDFAGASSGGRTPLRHYADVASKLAE
        MR F+I+  SSSA    H  HR  ++ AP+        SS + L P+    HSP  +  S  HS     S+ V+W   G+       L +YAD ASKLAE
Subjt:  MRVFLILGSSSSASIAGHRCHRHSHSKAPK--------SSLSNLSPT--GTHSPLSSRPSTLHSRPPLLSS-VQWDFAGASSGGRTPLRHYADVASKLAE

Query:  RGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIA
         G++ED A++ E++   +G   ++FA+++  +L++KGIS  LR+GK+ +VV  L+++E++GI+ + L DD +V+ +R+  R +A S ++E+ ++ ME++A
Subjt:  RGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIA

Query:  GFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDL
        G GF IKE++ P +V+K CV+  NP++AIRYA +LPH ++L C  I+ FGKK D+ S   AY   K  ++ PNMYI RT+IDVCGLCGDY KSR IY+DL
Subjt:  GFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDL

Query:  LNQNVTPNIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMA
        L +N+ PNIYV NSLMNVN+HDL YT ++YK+MQ L V ADM SYNILLK CCLAGRVDLAQDIY+E K +E++G+LKLD FTY TI+KVFADAK+WK A
Subjt:  LNQNVTPNIYVFNSLMNVNAHDLNYTFQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMA

Query:  LRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWR----EKELWDD----KDRKNST
        L+VK+DM+S GV+PN  TWSSLIS+CAN+GLVE A  LFEEM+++GCEPN+QC NILLHACVEA Q+DRAFRLF+SW+     + L+ D    K R +S 
Subjt:  LRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWR----EKELWDD----KDRKNST

Query:  NDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRI
        N   N  P S      + N  S+   I     F FKPT  TYNIL+KACGTDYY  K LM+EMKS+GL+PN I+WS LID+CGGS DVE AV+IL TM  
Subjt:  NDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRI

Query:  AGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQ
        AG  PDVVAYTTAIK+C E K  KLAFSLFEEM+R++I+PN VTY+TLL+ARS YGSL EV+QCLAIYQDMR +G+K NDH+LKELI EWCEGVIQ N Q
Subjt:  AGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQ

Query:  QQVEITPCNRINMGKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKL
         Q +I+     N G+P  L++EKVA H+Q+    +LAIDLQ LTK+EAR+VVLAVLRMIKE+Y  G+ V DD+ II+   +  T   +Q   V++A+ KL
Subjt:  QQVEITPCNRINMGKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAVLRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKL

Query:  LQEELGLQVLPAGPTVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        L++EL L VLPAG    +      + ++  +TK    +     STRRPA ++RL VTK SL  WLQR +
Subjt:  LQEELGLQVLPAGPTVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-2919.57Show/hide
Query:  YADVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWN-VVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELE
        Y  + S  A  G+  +   V + +   G +P+     + + +  K        G  WN +  ++ K++  GI+     D +   +L   C+R +   E  
Subjt:  YADVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLREGKLWN-VVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELE

Query:  ELVEFMEVIAGFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASIL-----PHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCG
        ++ E M+  AGF +   + +  + ++ +      PK A++  + +       + + + + I+ + +   L  A     +       P+++ Y T++    
Subjt:  ELVEFMEVIAGFGFSIKEMMKPSEVIKLCVDCRNPKMAIRYASIL-----PHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCG

Query:  LCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMNVNAHDLNYT--FQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFT
          G  + + +I++++ N    PNI  FN+ + +  +   +T   +++  +   G+  D+ ++N LL    + G+  +  ++    K ++  G +  +  T
Subjt:  LCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMNVNAHDLNYT--FQLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFT

Query:  YSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELW
        ++T++  ++    ++ A+ V   M  AGV+P++ T+++++++ A  G+ E + ++  EM    C+PN      LLHA    ++      L      +E++
Subjt:  YSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELW

Query:  DDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDICGGSHDVE
               +         T  L  +     P        +    F P ITT N ++   G     AKA  +++ MK  G TP+  +++ L+ +   S D  
Subjt:  DDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDICGGSHDVE

Query:  NAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAE
         + +IL  +   G+ PD+++Y T I         + A  +F EM+   I P+++TY+T +    +Y +    ++ + + + M K G + N +    ++  
Subjt:  NAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAE

Query:  WCE
        +C+
Subjt:  WCE

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.4e-3123.81Show/hide
Query:  KRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILL
        KR++  A   + E   +   PN++ Y  +I      G+   +  ++  +  +   PN+  +N+L++       ++  F+L +SM   G+  ++ SYN+++
Subjt:  KRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMN--VNAHDLNYTFQLYKSMQNLGVPADMASYNILL

Query:  KACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP
           C  GR+     +  E+          LD  TY+T++K +     +  AL +  +M   G++P+++T++SLI S   +G +  A++  ++M   G  P
Subjt:  KACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP

Query:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKA-CGTDYYH-
        N +    L+    +    + A+R+ R                 NDN                               F P++ TYN L+   C T     
Subjt:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNILMKA-CGTDYYH-

Query:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTY
        A A++E+MK  GL+P+ +S+S ++     S+DV+ A+++   M   G+ PD + Y++ I+   E +  K A  L+EEM R  + P+  TY+ L+ A    
Subjt:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLRARSTY

Query:  GSLHEVQQCLAIYQDMRKSG
        G L   ++ L ++ +M + G
Subjt:  GSLHEVQQCLAIYQDMRKSG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGAGTCTTCCTCATCCTCGGCTCCTCCTCCTCCGCCTCCATCGCCGGACATCGCTGCCACCGCCATAGCCATTCCAAGGCCCCAAAATCCTCGCTCTCCAAC
CTATCTCCCACCGGTACGCATTCGCCGCTTTCTTCTCGTCCTTCCACTCTCCATTCCCGTCCACCTCTTCTCTCCTCCGTCCAATGGGACTTTGCCGGCGCCTCA
TCCGGCGGAAGAACTCCGCTCCGTCACTACGCCGACGTCGCATCGAAGCTCGCCGAACGCGGGAAGCTTGAGGATTTTGCAATGGTGGTGGAGAGTGTGGTCGTC
GCTGGTGTTGAGCCCTCGCAGTTCGCTGCGGTGTTGGCCGTTGAACTTGTGGCCAAGGGGATTTCGCGTTGTCTTAGGGAGGGAAAGCTTTGGAATGTCGTGCAG
GTCTTGAGGAAGGTCGAGGAGCTTGGGATTTCAGCCGTGGGGCTTTGTGATGATTTTGCTGTAGAGTCGCTGAGGAGAGATTGCCGCCGTATTGCCAAGTCTGGA
GAATTGGAGGAGCTTGTTGAGTTCATGGAGGTTATTGCCGGTTTTGGTTTCTCAATCAAAGAAATGATGAAGCCATCCGAAGTAATAAAACTGTGTGTTGATTGC
CGTAATCCAAAAATGGCAATTAGGTATGCTAGCATTTTACCACACGCAGATATATTGTTCTGTACCACTATAAATGAATTTGGAAAGAAAAGGGACTTGAAATCT
GCTTTCATGGCATATACAGAATCCAAAGCTAATATGAATGGTCCTAATATGTATATCTATCGCACAATAATTGATGTCTGTGGTCTCTGTGGTGACTACAAGAAA
TCGAGGAACATCTATCAGGATTTGCTCAATCAGAATGTCACCCCAAATATATACGTTTTCAACAGTCTCATGAATGTAAATGCCCATGATTTGAACTACACATTT
CAACTATATAAAAGTATGCAGAATCTTGGTGTACCAGCTGATATGGCCTCATATAATATCCTTCTCAAGGCCTGTTGTCTAGCAGGAAGAGTTGATTTGGCCCAG
GACATTTACAGGGAAGTAAAGAATTTGGAAACAACAGGTGTGTTGAAGTTGGATGTCTTCACCTACAGCACAATCGTAAAGGTTTTCGCAGATGCGAAATTGTGG
AAAATGGCACTTAGAGTCAAAGAAGACATGCAATCAGCAGGAGTATCCCCAAATATGGTGACCTGGTCTTCATTGATCAGTTCATGTGCTAATTCGGGCCTTGTT
GAGCTGGCAATCCAATTGTTTGAAGAGATGGTTTCAGCAGGATGTGAACCTAATACACAGTGTTGTAATATTCTTTTACATGCTTGTGTCGAAGCTCGCCAGTTT
GATAGAGCTTTTCGCCTATTTCGATCCTGGAGGGAAAAGGAACTCTGGGATGATAAAGACAGAAAAAACAGCACCAATGATAATCTGAATGCAGATCCAACATCT
CAGCTTTGTACTACTAATATGCCTAATGCTCCATCTCATGTACATCAAATCCACTTTGTTGGGAACTTTTCTTTTAAACCTACAATAACGACGTATAATATTTTA
ATGAAAGCTTGTGGTACTGATTACTACCATGCTAAAGCTTTGATGGAGGAGATGAAGAGTGTTGGCCTTACGCCCAATCACATTAGCTGGTCAATTTTGATTGAC
ATATGTGGGGGATCTCACGACGTGGAAAATGCTGTACAGATCTTGACAACCATGCGTATAGCTGGAGTCGATCCTGATGTTGTTGCATACACAACGGCTATCAAG
GTTTGCGTTGAAGGTAAAAACTGGAAGCTGGCATTTTCATTATTTGAAGAAATGAAAAGATTTGAGATACAGCCAAATTTAGTGACCTATAGTACACTTCTGAGA
GCTCGCAGTACTTATGGTTCATTACACGAAGTACAGCAATGCCTTGCTATATATCAGGACATGAGGAAATCAGGGTTCAAATCCAATGATCATTATCTCAAGGAG
TTGATTGCAGAGTGGTGTGAAGGCGTTATACAGAATAACAATCAGCAGCAAGTTGAAATAACTCCCTGCAACAGAATTAACATGGGGAAGCCACGATGTTTAATT
CTTGAAAAAGTTGCTGAGCATTTGCAGAAAAGCTTTACCGAAAGCCTCGCGATTGACCTTCAGGAGCTCACAAAGGTTGAAGCCCGGATCGTTGTTCTTGCGGTT
TTGCGAATGATCAAGGAGAACTATGCATTAGGAGAATCAGTAAAAGATGATATATTCATCATCTTAGAAGTGAGTAAAGTTGAAACAGATTTAGTTCAACAGAAC
TTTGAGGTGAGAGATGCAATAACCAAACTCTTGCAAGAAGAATTAGGGCTTCAGGTTCTTCCTGCAGGACCTACAGTTGCTCTTGATAAAGTTCCAGATTCAGAA
AGCTCTAACATGTCACATACAAAACTGAAAGGAATCATGGGAAGGAATAAGTACTCCACTAGGAGACCAGCAGATGTACAGAGACTAAAAGTCACAAAGAAATCA
CTGCAAGACTGGCTGCAAAGAAATAGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCGAGTCTTCCTCATCCTCGGCTCCTCCTCCTCCGCCTCCATCGCCGGACATCGCTGCCACCGCCATAGCCATTCCAAGGCCCCAAAATCCTCGCTCTCCAAC
CTATCTCCCACCGGTACGCATTCGCCGCTTTCTTCTCGTCCTTCCACTCTCCATTCCCGTCCACCTCTTCTCTCCTCCGTCCAATGGGACTTTGCCGGCGCCTCA
TCCGGCGGAAGAACTCCGCTCCGTCACTACGCCGACGTCGCATCGAAGCTCGCCGAACGCGGGAAGCTTGAGGATTTTGCAATGGTGGTGGAGAGTGTGGTCGTC
GCTGGTGTTGAGCCCTCGCAGTTCGCTGCGGTGTTGGCCGTTGAACTTGTGGCCAAGGGGATTTCGCGTTGTCTTAGGGAGGGAAAGCTTTGGAATGTCGTGCAG
GTCTTGAGGAAGGTCGAGGAGCTTGGGATTTCAGCCGTGGGGCTTTGTGATGATTTTGCTGTAGAGTCGCTGAGGAGAGATTGCCGCCGTATTGCCAAGTCTGGA
GAATTGGAGGAGCTTGTTGAGTTCATGGAGGTTATTGCCGGTTTTGGTTTCTCAATCAAAGAAATGATGAAGCCATCCGAAGTAATAAAACTGTGTGTTGATTGC
CGTAATCCAAAAATGGCAATTAGGTATGCTAGCATTTTACCACACGCAGATATATTGTTCTGTACCACTATAAATGAATTTGGAAAGAAAAGGGACTTGAAATCT
GCTTTCATGGCATATACAGAATCCAAAGCTAATATGAATGGTCCTAATATGTATATCTATCGCACAATAATTGATGTCTGTGGTCTCTGTGGTGACTACAAGAAA
TCGAGGAACATCTATCAGGATTTGCTCAATCAGAATGTCACCCCAAATATATACGTTTTCAACAGTCTCATGAATGTAAATGCCCATGATTTGAACTACACATTT
CAACTATATAAAAGTATGCAGAATCTTGGTGTACCAGCTGATATGGCCTCATATAATATCCTTCTCAAGGCCTGTTGTCTAGCAGGAAGAGTTGATTTGGCCCAG
GACATTTACAGGGAAGTAAAGAATTTGGAAACAACAGGTGTGTTGAAGTTGGATGTCTTCACCTACAGCACAATCGTAAAGGTTTTCGCAGATGCGAAATTGTGG
AAAATGGCACTTAGAGTCAAAGAAGACATGCAATCAGCAGGAGTATCCCCAAATATGGTGACCTGGTCTTCATTGATCAGTTCATGTGCTAATTCGGGCCTTGTT
GAGCTGGCAATCCAATTGTTTGAAGAGATGGTTTCAGCAGGATGTGAACCTAATACACAGTGTTGTAATATTCTTTTACATGCTTGTGTCGAAGCTCGCCAGTTT
GATAGAGCTTTTCGCCTATTTCGATCCTGGAGGGAAAAGGAACTCTGGGATGATAAAGACAGAAAAAACAGCACCAATGATAATCTGAATGCAGATCCAACATCT
CAGCTTTGTACTACTAATATGCCTAATGCTCCATCTCATGTACATCAAATCCACTTTGTTGGGAACTTTTCTTTTAAACCTACAATAACGACGTATAATATTTTA
ATGAAAGCTTGTGGTACTGATTACTACCATGCTAAAGCTTTGATGGAGGAGATGAAGAGTGTTGGCCTTACGCCCAATCACATTAGCTGGTCAATTTTGATTGAC
ATATGTGGGGGATCTCACGACGTGGAAAATGCTGTACAGATCTTGACAACCATGCGTATAGCTGGAGTCGATCCTGATGTTGTTGCATACACAACGGCTATCAAG
GTTTGCGTTGAAGGTAAAAACTGGAAGCTGGCATTTTCATTATTTGAAGAAATGAAAAGATTTGAGATACAGCCAAATTTAGTGACCTATAGTACACTTCTGAGA
GCTCGCAGTACTTATGGTTCATTACACGAAGTACAGCAATGCCTTGCTATATATCAGGACATGAGGAAATCAGGGTTCAAATCCAATGATCATTATCTCAAGGAG
TTGATTGCAGAGTGGTGTGAAGGCGTTATACAGAATAACAATCAGCAGCAAGTTGAAATAACTCCCTGCAACAGAATTAACATGGGGAAGCCACGATGTTTAATT
CTTGAAAAAGTTGCTGAGCATTTGCAGAAAAGCTTTACCGAAAGCCTCGCGATTGACCTTCAGGAGCTCACAAAGGTTGAAGCCCGGATCGTTGTTCTTGCGGTT
TTGCGAATGATCAAGGAGAACTATGCATTAGGAGAATCAGTAAAAGATGATATATTCATCATCTTAGAAGTGAGTAAAGTTGAAACAGATTTAGTTCAACAGAAC
TTTGAGGTGAGAGATGCAATAACCAAACTCTTGCAAGAAGAATTAGGGCTTCAGGTTCTTCCTGCAGGACCTACAGTTGCTCTTGATAAAGTTCCAGATTCAGAA
AGCTCTAACATGTCACATACAAAACTGAAAGGAATCATGGGAAGGAATAAGTACTCCACTAGGAGACCAGCAGATGTACAGAGACTAAAAGTCACAAAGAAATCA
CTGCAAGACTGGCTGCAAAGAAATAGGTGA
Protein sequenceShow/hide protein sequence
MRVFLILGSSSSASIAGHRCHRHSHSKAPKSSLSNLSPTGTHSPLSSRPSTLHSRPPLLSSVQWDFAGASSGGRTPLRHYADVASKLAERGKLEDFAMVVESVVV
AGVEPSQFAAVLAVELVAKGISRCLREGKLWNVVQVLRKVEELGISAVGLCDDFAVESLRRDCRRIAKSGELEELVEFMEVIAGFGFSIKEMMKPSEVIKLCVDC
RNPKMAIRYASILPHADILFCTTINEFGKKRDLKSAFMAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLLNQNVTPNIYVFNSLMNVNAHDLNYTF
QLYKSMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKNLETTGVLKLDVFTYSTIVKVFADAKLWKMALRVKEDMQSAGVSPNMVTWSSLISSCANSGLV
ELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDDKDRKNSTNDNLNADPTSQLCTTNMPNAPSHVHQIHFVGNFSFKPTITTYNIL
MKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVENAVQILTTMRIAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLR
ARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPCNRINMGKPRCLILEKVAEHLQKSFTESLAIDLQELTKVEARIVVLAV
LRMIKENYALGESVKDDIFIILEVSKVETDLVQQNFEVRDAITKLLQEELGLQVLPAGPTVALDKVPDSESSNMSHTKLKGIMGRNKYSTRRPADVQRLKVTKKS
LQDWLQRNR