CuGenDBv2

Clc04G09060 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G09060
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: ClcChr04:22702018..22708606
RNA-Seq Expression: Clc04G09060
Synteny: Clc04G09060
Gene Ontology terms:
    GO:0003729 - mRNA binding (molecular function)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
    IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology
GenBank top hits | e value | % identity | Alignment
ARV85580.1 pentatricopeptide repeat-containing protein [Cucumis melo] | 0.0e+00 | 89.44
Query:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA
        MRVFLILG  S+SASIAGPR  RHRHSH KAPKSSLSN++PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVA+KLAERGKLEDFA
Subjt:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA

Query:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM
        MVVESVVVAGVEPSQF A+LA+ELVAKGISRCL+EGKLWSV+QV+RKVEELGIS LGLCDESAVESLRRDC R+AKSGELEELVEF+EVL+GFG S+KEM
Subjt:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM

Query:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI
        MKP EVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAY ESKANMNG NMYI+RTIIDVCGLCGDYKKSRNIYQDLV+QNVTPNI
Subjt:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI

Query:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS
        FVFNSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQS
Subjt:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS

Query:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM
        AGVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSS +DN +ADSTSQLCTTNM
Subjt:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM

Query:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
         NAPSH HQIS VGNFAFKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
Subjt:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC

Query:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR
        VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TP N+I+  KPR
Subjt:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR

Query:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA
        CLILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKV+TD   +NFEVRDAITKLLQDELGLEVLP GPTI 
Subjt:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA

Query:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        LDKVPNS+S NMSHTKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

ARV85582.1 pentatricopeptide repeat-containing protein [Cucumis melo] | 0.0e+00 | 89.44
Query:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA
        MRVFLILG  S+SASIAGPR  RHRHSH KAPKSSLSN++PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVA+KLAERGKLEDFA
Subjt:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA

Query:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM
        MVVESVVVAGVEPSQF A+LAVELVAKGISRCL+EGKLWSV+QV+RKVEELGIS LGLCDESAVESLRRDC R+AKSGELEELVEF+EVL+GFG S+KEM
Subjt:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM

Query:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI
        MKP EVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAY ESKANMNG NMYI+RTIIDVCGLCGDYKKSRNIYQDLV+QNVTPNI
Subjt:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI

Query:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS
        FVFNSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQS
Subjt:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS

Query:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM
        AGVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSS +DN +ADSTSQLCTTNM
Subjt:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM

Query:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
         NAPSH HQIS VGNFAFKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
Subjt:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC

Query:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR
        VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TP N+I+  KPR
Subjt:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR

Query:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA
        CLILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKV+TD   +NFEVRDAITKLLQDELGLEVLP GPTI 
Subjt:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA

Query:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        LDKVPNS+S NMSHTKLKG  GRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

XP_004142106.1 pentatricopeptide repeat-containing protein At5g02830, chloroplastic [Cucumis sativus] | 0.0e+00 | 90.13
Query:  MRVFLILGTSSSSASIAGPRRHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFAMV
        MRVFLILG  SSSASIAGPRR+RHSH KAPKSSLSNL+PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVASKLAE GKLEDFAMV
Subjt:  MRVFLILGTSSSSASIAGPRRHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFAMV

Query:  VESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEMMK
        VESVVVAGVEPSQF A+LAVELVAKGISRCL+EGK+WSV+QV+RKVEELGISVL LCDE AVESLRRDC R+AKSGELEELVE MEVL+GFGFS++EMMK
Subjt:  VESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEMMK

Query:  PSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFV
        PSEVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAYTESKANMNG NMYIYRTIIDVCGLCGDYKKSRNIYQDLV+QNV PNIFV
Subjt:  PSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFV

Query:  FNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAG
        FNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQSAG
Subjt:  FNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAG

Query:  VSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMAN
        VSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCN LLHACVE RQFDRAFRLFRSW+EKELWDGIERKSS ++N +ADSTSQLC T M N
Subjt:  VSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMAN

Query:  APSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVE
        APSH+HQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSIL+DICG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVE
Subjt:  APSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVE

Query:  GKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPRCL
        GKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQ NNQQ VEITP N+I+ GKPRCL
Subjt:  GKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPRCL

Query:  ILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIALD
        ILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKVETD  PQNFEVRDAIT+LLQDELGLEVLP GPTIALD
Subjt:  ILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIALD

Query:  KVPNSKSPNMSH-TKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        KVPNS+S  +SH TKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  KVPNSKSPNMSH-TKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

XP_008447192.1 PREDICTED: pentatricopeptide repeat-containing protein At5g02830, chloroplastic isoform X1 [Cucumis melo] | 0.0e+00 | 89.44
Query:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA
        MRVFLILG  S+SASIAGPR  RHRHSH KAPKSSLSN++PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVA+KLAERGKLEDFA
Subjt:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA

Query:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM
        MVVESVVVAGVEPSQF A+LAVELVAKGISRCL+EGKLWSV+QV+RKVEELGIS LGLCDESAVESLRRDC R+AKSGELEELVEF+EVL+GFG S+KEM
Subjt:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM

Query:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI
        MKP EVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAY ESKANMNG NMYI+R+IIDVCGLCGDYKKSRNIYQDLV+QNVTPNI
Subjt:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI

Query:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS
        FVFNSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQS
Subjt:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS

Query:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM
        AGVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSS +DN +ADSTSQLCTTNM
Subjt:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM

Query:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
         NAPSH HQIS VGNFAFKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
Subjt:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC

Query:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR
        VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TP N+I+  KPR
Subjt:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR

Query:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA
        CLILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKV+TD   +NFEVRDAITKLLQDELGLEVLP GPTI 
Subjt:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA

Query:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        LDKVPNS+S NMSHTKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

XP_038881251.1 pentatricopeptide repeat-containing protein At5g02830, chloroplastic [Benincasa hispida] | 0.0e+00 | 92.02
Query:  SSSASIAGPRRHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFAMVVESVVVAGVE
        SSSASIAGPRRH HSHSKAP + LS   P    LPLSS PSTRHS PPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA+VVESVVVAGVE
Subjt:  SSSASIAGPRRHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFAMVVESVVVAGVE

Query:  PSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEMMKPSEVIKLCVD
        PSQFAAVLAVEL+AKGISRCL+EGKLWSVLQV+RKV+ELGIS LGLCDESAVESLRRDCHRIAKSGELEELVEFME LAGFGFSIKEMMKPSEVIKLCVD
Subjt:  PSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEMMKPSEVIKLCVD

Query:  YRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAH
        YRNPKMAIRY+S+LPHADIL CTTI+EFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLV+QNVTPNIFVFNSLMNVNAH
Subjt:  YRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAH

Query:  DLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSS
        DLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQSAGVSPNMVTWSS
Subjt:  DLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSS

Query:  LISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISF
        LISSCANSGLVELAIQLFEEMVS GCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWD IERKSS NDN NADSTSQLCTTNM NAPSH+HQISF
Subjt:  LISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISF

Query:  VGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSL
        VGNFAFKPTITTYNILMKACGTDYYHAKALMEEM+SVGLTPNHISWSILIDICGGSHDVE+AVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSL
Subjt:  VGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSL

Query:  FEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPRCLILEKVADHLQ
        FEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNN+QQVEITP NRI+ GKPRCLILEKVADHLQ
Subjt:  FEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPRCLILEKVADHLQ

Query:  KSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIALDKVPNSKSPNM
        KSFTESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEV+KVETD   QNFEVRDAITKLLQDELGLEVLP G TIALDKVPNS+SPNM
Subjt:  KSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIALDKVPNSKSPNM

Query:  SHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        SHTKL+G++GRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
Subjt:  SHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KX28 PPR_long domain-containing protein | 0.0e+00 | 90.13
Query:  MRVFLILGTSSSSASIAGPRRHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFAMV
        MRVFLILG  SSSASIAGPRR+RHSH KAPKSSLSNL+PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVASKLAE GKLEDFAMV
Subjt:  MRVFLILGTSSSSASIAGPRRHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFAMV

Query:  VESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEMMK
        VESVVVAGVEPSQF A+LAVELVAKGISRCL+EGK+WSV+QV+RKVEELGISVL LCDE AVESLRRDC R+AKSGELEELVE MEVL+GFGFS++EMMK
Subjt:  VESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEMMK

Query:  PSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFV
        PSEVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAYTESKANMNG NMYIYRTIIDVCGLCGDYKKSRNIYQDLV+QNV PNIFV
Subjt:  PSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFV

Query:  FNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAG
        FNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQSAG
Subjt:  FNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAG

Query:  VSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMAN
        VSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCN LLHACVE RQFDRAFRLFRSW+EKELWDGIERKSS ++N +ADSTSQLC T M N
Subjt:  VSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMAN

Query:  APSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVE
        APSH+HQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSIL+DICG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVE
Subjt:  APSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVE

Query:  GKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPRCL
        GKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQ NNQQ VEITP N+I+ GKPRCL
Subjt:  GKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPRCL

Query:  ILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIALD
        ILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKVETD  PQNFEVRDAIT+LLQDELGLEVLP GPTIALD
Subjt:  ILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIALD

Query:  KVPNSKSPNMSH-TKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        KVPNS+S  +SH TKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  KVPNSKSPNMSH-TKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A1S3BHG8 Pentatricopeptide repeat-containing protein | 0.0e+00 | 89.44
Query:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA
        MRVFLILG  S+SASIAGPR  RHRHSH KAPKSSLSN++PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVA+KLAERGKLEDFA
Subjt:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA

Query:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM
        MVVESVVVAGVEPSQF A+LAVELVAKGISRCL+EGKLWSV+QV+RKVEELGIS LGLCDESAVESLRRDC R+AKSGELEELVEF+EVL+GFG S+KEM
Subjt:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM

Query:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI
        MKP EVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAY ESKANMNG NMYI+R+IIDVCGLCGDYKKSRNIYQDLV+QNVTPNI
Subjt:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI

Query:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS
        FVFNSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQS
Subjt:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS

Query:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM
        AGVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSS +DN +ADSTSQLCTTNM
Subjt:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM

Query:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
         NAPSH HQIS VGNFAFKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
Subjt:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC

Query:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR
        VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TP N+I+  KPR
Subjt:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR

Query:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA
        CLILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKV+TD   +NFEVRDAITKLLQDELGLEVLP GPTI 
Subjt:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA

Query:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        LDKVPNS+S NMSHTKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A2D0WXK6 Pentatricopeptide repeat-containing protein | 0.0e+00 | 89.32
Query:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA
        MRVFLILG  S+SASIAGPR  RHRHSH KAPKSSLSN++PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVA+KLAERGKLEDFA
Subjt:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA

Query:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM
        MVVESVVVAGVEPSQF A+LAVELVAKGISRCL+EGKLWSV+QV+RKV+ELGIS LGLCDESAVESLRRDC R+AKSGELEELVEF+EVL+GFG S+KEM
Subjt:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM

Query:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI
        MKP EVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAY ESKANMNG NMYI+R+IIDVCGLCGDYKKSRNIYQDLV+QNVTPNI
Subjt:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI

Query:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS
        FVFNSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQS
Subjt:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS

Query:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM
        AGVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSS +DN +ADSTSQLCTTNM
Subjt:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM

Query:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
         NAPSH HQIS VGNFAFKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
Subjt:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC

Query:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR
        VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TP N+I+  KPR
Subjt:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR

Query:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA
        CLILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKV+TD   +NFEVRDAITKLLQDELGLEVLP GPTI 
Subjt:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA

Query:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        LDKVPNS+S NMSHTKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A2D0WXL1 Pentatricopeptide repeat-containing protein | 0.0e+00 | 89.44
Query:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA
        MRVFLILG  S+SASIAGPR  RHRHSH KAPKSSLSN++PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVA+KLAERGKLEDFA
Subjt:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA

Query:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM
        MVVESVVVAGVEPSQF A+LA+ELVAKGISRCL+EGKLWSV+QV+RKVEELGIS LGLCDESAVESLRRDC R+AKSGELEELVEF+EVL+GFG S+KEM
Subjt:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM

Query:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI
        MKP EVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAY ESKANMNG NMYI+RTIIDVCGLCGDYKKSRNIYQDLV+QNVTPNI
Subjt:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI

Query:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS
        FVFNSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQS
Subjt:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS

Query:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM
        AGVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSS +DN +ADSTSQLCTTNM
Subjt:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM

Query:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
         NAPSH HQIS VGNFAFKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
Subjt:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC

Query:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR
        VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TP N+I+  KPR
Subjt:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR

Query:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA
        CLILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKV+TD   +NFEVRDAITKLLQDELGLEVLP GPTI 
Subjt:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA

Query:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        LDKVPNS+S NMSHTKLKG MGRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

A0A2D0WXL2 Pentatricopeptide repeat-containing protein | 0.0e+00 | 89.44
Query:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA
        MRVFLILG  S+SASIAGPR  RHRHSH KAPKSSLSN++PTGTHLP SSH STRHS P LLSSV+LDIAGASSGGRIP+QHYAGVA+KLAERGKLEDFA
Subjt:  MRVFLILGTSSSSASIAGPR--RHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFA

Query:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM
        MVVESVVVAGVEPSQF A+LAVELVAKGISRCL+EGKLWSV+QV+RKVEELGIS LGLCDESAVESLRRDC R+AKSGELEELVEF+EVL+GFG S+KEM
Subjt:  MVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEM

Query:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI
        MKP EVIKLCVDYRNPKMAIRYAS+LPHADIL CTTINEFGKKRDLKSAYIAY ESKANMNG NMYI+RTIIDVCGLCGDYKKSRNIYQDLV+QNVTPNI
Subjt:  MKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNI

Query:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS
        FVFNSLMNVNAHDLNYTFQLYK+MQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMAL VKEDMQS
Subjt:  FVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQS

Query:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM
        AGVSPN+VTWSSLISSCANSGLVELAIQLFEEMVSAG EPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSS +DN +ADSTSQLCTTNM
Subjt:  AGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNM

Query:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
         NAPSH HQIS VGNFAFKPT+TTYN LMKACGTDYYHAKALMEEMKSVGLTPNHISWSILID+CG SHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC
Subjt:  ANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVC

Query:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR
        VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLL ARSTYGSLHEVQQCLA+YQDMRKSGFKSNDHYLKELIAEWCEGV+QNNNQQQVE TP N+I+  KPR
Subjt:  VEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPR

Query:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA
        CLILEKVADHLQKSF ESLTIDLQELTK  +    + +   ++ N A+GESVKDDIFIILEVNKV+TD   +NFEVRDAITKLLQDELGLEVLP GPTI 
Subjt:  CLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDELGLEVLPAGPTIA

Query:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
        LDKVPNS+S NMSHTKLKG  GRNKY TR+PADVQRLKVTKKSLQDWLQRNR
Subjt:  LDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

SwissProt top hits | e value | % identity | Alignment
Q3ECK2 Pentatricopeptide repeat-containing protein At1g62680, mitochondrial | 8.9e-26 | 25.37
Query:  NMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKH
        ++Y +  +I+    C     + +I   ++     P+     SL+N     + ++    L   M  +G   D+ +YN ++ + C   RV+ A D ++E+  
Subjt:  NMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKH

Query:  LETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRA
         E  G+ + +V TY+ +V    ++  W  A  +  DM    ++PN++T+S+L+ +   +G V  A +LFEEMV    +P+    + L++      + D A
Subjt:  LETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRA

Query:  FRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKAC--GTDYYHAKALMEEMKSVGLTPNHISWS
         ++F     K     +    S N   N    ++     M      + Q   V N        TYN L++      D   A+    +M   G++P+  +++
Subjt:  FRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKAC--GTDYYHAKALMEEMKSVGLTPNHISWS

Query:  ILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIK-VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSG
        IL+     + ++E A+ I   M+   +D D+V YTT I+ +C  GK  + A+SLF  +    ++P++VTY+T++    T G LHEV+   A+Y  M++ G
Subjt:  ILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIK-VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSG

Query:  FKSNDHYLKE
           ND  L +
Subjt:  FKSNDHYLKE

Q8GYL7 Pentatricopeptide repeat-containing protein At5g02830, chloroplastic | 1.3e-234 | 51.16
Query:  MRVFLILGTSSSSASIAGP-RRHRHSHSKAPK--------SSLSNLTPT--GTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLA
        MR F+I+    SS++I  P   HR  ++ AP+        SS + L P+    H P  +  S  HS     S+V   I   S      L++YA  ASKLA
Subjt:  MRVFLILGTSSSSASIAGP-RRHRHSHSKAPK--------SSLSNLTPT--GTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLA

Query:  ERGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVL
        E G++ED A++ E++   +G   ++FA+++  +L++KGIS  L++GK+ SV+  ++++E++GI+ L L D+S+V+ +R+    +A S ++E+ ++ ME+L
Subjt:  ERGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVL

Query:  AGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQD
        AG GF IKE++ P +V+K CV+  NP++AIRYA +LPH ++L C  I+ FGKK D+ S   AY   K  ++ PNMYI RT+IDVCGLCGDY KSR IY+D
Subjt:  AGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQD

Query:  LVSQNVTPNIFVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKM
        L+ +N+ PNI+V NSLMNVN+HDL YT ++YKNMQ L V ADM SYNILLK CCLAGRVDLAQDIY+E K +E++G+LKLD FTY TI+KVFADAK+WK 
Subjt:  LVSQNVTPNIFVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKM

Query:  ALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKS--SNNDNSN
        AL VK+DM+S GV+PN  TWSSLIS+CAN+GLVE A  LFEEM+++GCEPN+QC NILLHACVEA Q+DRAFRLF+SW+   + + +      S    S+
Subjt:  ALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKS--SNNDNSN

Query:  ADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDP
         +        ++ N  S+   I     F FKPT  TYNIL+KACGTDYY  K LM+EMKS+GL+PN I+WS LID+CGGS DVE AV+IL TM  AG  P
Subjt:  ADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDP

Query:  DVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEI
        DVVAYTTAIK+C E K  KLAFSLFEEM+R++I+PN VTY+TLL ARS YGSL EV+QCLAIYQDMR +G+K NDH+LKELI EWCEGVIQ N Q Q +I
Subjt:  DVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEI

Query:  TPYNRIENGKPRCLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDEL
        +       G+P  L++EKVA H+Q+    +L IDLQ LTK  +    + +   ++ +   G+ V DD+ II+  ++  T S  Q   V++A+ KLL+DEL
Subjt:  TPYNRIENGKPRCLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDEL

Query:  GLEVLPAGPTIALDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
         L VLPAG    +         +  +TK    +     STRRPA ++RL VTK SL  WLQR +
Subjt:  GLEVLPAGPTIALDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

Q9FIX3 Pentatricopeptide repeat-containing protein At5g39710 | 5.9e-30 | 23.57
Query:  KRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILL
        KR++  A   + E   +   PN++ Y  +I      G+   +  ++  + ++   PN+  +N+L++       ++  F+L ++M   G+  ++ SYN+++
Subjt:  KRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILL

Query:  KACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP
           C  GR+     +  E+          LD  TY+T++K +     +  AL +  +M   G++P+++T++SLI S   +G +  A++  ++M   G  P
Subjt:  KACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP

Query:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKA-CGTDYYH-
        N +    L+    +    + A+R+ R                 NDN                               F P++ TYN L+   C T     
Subjt:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKA-CGTDYYH-

Query:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTY
        A A++E+MK  GL+P+ +S+S ++     S+DV+ A+++   M   G+ PD + Y++ I+   E +  K A  L+EEM R  + P+  TY+ L+ A    
Subjt:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTY

Query:  GSLHEVQQCLAIYQDMRKSG
        G L   ++ L ++ +M + G
Subjt:  GSLHEVQQCLAIYQDMRKSG

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g02860 | 3.6e-27 | 20.13
Query:  GGRIPLQHYAGVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWS-VLQVMRKVEELGISVLGLCDESAVESLRRDCHR
        G  + +  Y  + S  A  G+  +   V + +   G +P+     + + +         K G  W+ +  ++ K++  GI+     D     +L   C R
Subjt:  GGRIPLQHYAGVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWS-VLQVMRKVEELGISVLGLCDESAVESLRRDCHR

Query:  IAKSGELEELVEFMEVLAGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYAS-MLPHADILSCTTINE----FGKKRDLKSAYIAYTESKANMNGPNMYIY
         +   E  ++ E M+  AGF +   + +  + ++ +      PK A++  + M+ +    S  T N     + +   L  A     +       P+++ Y
Subjt:  IAKSGELEELVEFMEVLAGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYAS-MLPHADILSCTTINE----FGKKRDLKSAYIAYTESKANMNGPNMYIY

Query:  RTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAHDLNYT--FQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTG
         T++      G  + + +I++++ +    PNI  FN+ + +  +   +T   +++  +   G+  D+ ++N LL    + G+  +  ++    K ++  G
Subjt:  RTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAHDLNYT--FQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTG

Query:  VLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFR
         +  +  T++T++  ++    ++ A+ V   M  AGV+P++ T+++++++ A  G+ E + ++  EM    C+PN      LLHA    ++      L  
Subjt:  VLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFR

Query:  SWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDI
            +E++ G+         +    T  L  +     P      S +    F P ITT N ++   G     AKA  +++ MK  G TP+  +++ L+ +
Subjt:  SWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDI

Query:  CGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDH
           S D   + +IL  +   G+ PD+++Y T I         + A  +F EM+   I P+++TY+T +G   +Y +    ++ + + + M K G + N +
Subjt:  CGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDH

Query:  YLKELIAEWCE
            ++  +C+
Subjt:  YLKELIAEWCE

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic | 4.0e-26 | 21.79
Query:  INEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAH---DLNYTFQLYKNMQNLGVPADM
        I+  G++  L      + E  +     +++ Y  +I+  G  G Y+ S  +   + ++ ++P+I  +N+++N  A    D      L+  M++ G+  D+
Subjt:  INEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAH---DLNYTFQLYKNMQNLGVPADM

Query:  ASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEM
         +YN LL AC + G  D A+ ++R +      G +  D+ TYS +V+ F   +  +    +  +M S G  P++ +++ L+ + A SG ++ A+ +F +M
Subjt:  ASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEM

Query:  VSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFA---FKPTITTYNILMK
         +AGC PN    ++LL+   ++ ++D   +LF           +E KSSN D   A  T  +              ++   +      +P + TY  ++ 
Subjt:  VSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFA---FKPTITTYNILMK

Query:  ACGTDYYH--AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTY
        ACG    H  A+ +++ M +  + P+  +++ +I+  G +   E A+    TM   G +P +  + + +     G   K + ++   +    I  N  T+
Subjt:  ACGTDYYH--AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTY

Query:  STLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVE
        +  + A    G   E    +  Y DM KS    ++  L+ +++ +    + +  ++Q E
Subjt:  STLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVE

Arabidopsis top hits | e value | %identity | Alignment
AT1G62680.1 Pentatricopeptide repeat (PPR) superfamily protein | 6.3e-27 | 25.37
Query:  NMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKH
        ++Y +  +I+    C     + +I   ++     P+     SL+N     + ++    L   M  +G   D+ +YN ++ + C   RV+ A D ++E+  
Subjt:  NMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKH

Query:  LETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRA
         E  G+ + +V TY+ +V    ++  W  A  +  DM    ++PN++T+S+L+ +   +G V  A +LFEEMV    +P+    + L++      + D A
Subjt:  LETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRA

Query:  FRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKAC--GTDYYHAKALMEEMKSVGLTPNHISWS
         ++F     K     +    S N   N    ++     M      + Q   V N        TYN L++      D   A+    +M   G++P+  +++
Subjt:  FRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKAC--GTDYYHAKALMEEMKSVGLTPNHISWS

Query:  ILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIK-VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSG
        IL+     + ++E A+ I   M+   +D D+V YTT I+ +C  GK  + A+SLF  +    ++P++VTY+T++    T G LHEV+   A+Y  M++ G
Subjt:  ILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIK-VCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSG

Query:  FKSNDHYLKE
           ND  L +
Subjt:  FKSNDHYLKE

AT1G74850.1 plastid transcriptionally active 2 | 2.8e-27 | 21.79
Query:  INEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAH---DLNYTFQLYKNMQNLGVPADM
        I+  G++  L      + E  +     +++ Y  +I+  G  G Y+ S  +   + ++ ++P+I  +N+++N  A    D      L+  M++ G+  D+
Subjt:  INEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAH---DLNYTFQLYKNMQNLGVPADM

Query:  ASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEM
         +YN LL AC + G  D A+ ++R +      G +  D+ TYS +V+ F   +  +    +  +M S G  P++ +++ L+ + A SG ++ A+ +F +M
Subjt:  ASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEM

Query:  VSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFA---FKPTITTYNILMK
         +AGC PN    ++LL+   ++ ++D   +LF           +E KSSN D   A  T  +              ++   +      +P + TY  ++ 
Subjt:  VSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFA---FKPTITTYNILMK

Query:  ACGTDYYH--AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTY
        ACG    H  A+ +++ M +  + P+  +++ +I+  G +   E A+    TM   G +P +  + + +     G   K + ++   +    I  N  T+
Subjt:  ACGTDYYH--AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTY

Query:  STLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVE
        +  + A    G   E    +  Y DM KS    ++  L+ +++ +    + +  ++Q E
Subjt:  STLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVE

AT5G02830.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 9.2e-236 | 51.16
Query:  MRVFLILGTSSSSASIAGP-RRHRHSHSKAPK--------SSLSNLTPT--GTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLA
        MR F+I+    SS++I  P   HR  ++ AP+        SS + L P+    H P  +  S  HS     S+V   I   S      L++YA  ASKLA
Subjt:  MRVFLILGTSSSSASIAGP-RRHRHSHSKAPK--------SSLSNLTPT--GTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLA

Query:  ERGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVL
        E G++ED A++ E++   +G   ++FA+++  +L++KGIS  L++GK+ SV+  ++++E++GI+ L L D+S+V+ +R+    +A S ++E+ ++ ME+L
Subjt:  ERGKLEDFAMVVESVVV-AGVEPSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVL

Query:  AGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQD
        AG GF IKE++ P +V+K CV+  NP++AIRYA +LPH ++L C  I+ FGKK D+ S   AY   K  ++ PNMYI RT+IDVCGLCGDY KSR IY+D
Subjt:  AGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQD

Query:  LVSQNVTPNIFVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKM
        L+ +N+ PNI+V NSLMNVN+HDL YT ++YKNMQ L V ADM SYNILLK CCLAGRVDLAQDIY+E K +E++G+LKLD FTY TI+KVFADAK+WK 
Subjt:  LVSQNVTPNIFVFNSLMNVNAHDLNYTFQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKM

Query:  ALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKS--SNNDNSN
        AL VK+DM+S GV+PN  TWSSLIS+CAN+GLVE A  LFEEM+++GCEPN+QC NILLHACVEA Q+DRAFRLF+SW+   + + +      S    S+
Subjt:  ALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKS--SNNDNSN

Query:  ADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDP
         +        ++ N  S+   I     F FKPT  TYNIL+KACGTDYY  K LM+EMKS+GL+PN I+WS LID+CGGS DVE AV+IL TM  AG  P
Subjt:  ADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDP

Query:  DVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEI
        DVVAYTTAIK+C E K  KLAFSLFEEM+R++I+PN VTY+TLL ARS YGSL EV+QCLAIYQDMR +G+K NDH+LKELI EWCEGVIQ N Q Q +I
Subjt:  DVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDHYLKELIAEWCEGVIQNNNQQQVEI

Query:  TPYNRIENGKPRCLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDEL
        +       G+P  L++EKVA H+Q+    +L IDLQ LTK  +    + +   ++ +   G+ V DD+ II+  ++  T S  Q   V++A+ KLL+DEL
Subjt:  TPYNRIENGKPRCLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQNFEVRDAITKLLQDEL

Query:  GLEVLPAGPTIALDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR
         L VLPAG    +         +  +TK    +     STRRPA ++RL VTK SL  WLQR +
Subjt:  GLEVLPAGPTIALDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.6e-28 | 20.13
Query:  GGRIPLQHYAGVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWS-VLQVMRKVEELGISVLGLCDESAVESLRRDCHR
        G  + +  Y  + S  A  G+  +   V + +   G +P+     + + +         K G  W+ +  ++ K++  GI+     D     +L   C R
Subjt:  GGRIPLQHYAGVASKLAERGKLEDFAMVVESVVVAGVEPSQFAAVLAVELVAKGISRCLKEGKLWS-VLQVMRKVEELGISVLGLCDESAVESLRRDCHR

Query:  IAKSGELEELVEFMEVLAGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYAS-MLPHADILSCTTINE----FGKKRDLKSAYIAYTESKANMNGPNMYIY
         +   E  ++ E M+  AGF +   + +  + ++ +      PK A++  + M+ +    S  T N     + +   L  A     +       P+++ Y
Subjt:  IAKSGELEELVEFMEVLAGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRYAS-MLPHADILSCTTINE----FGKKRDLKSAYIAYTESKANMNGPNMYIY

Query:  RTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAHDLNYT--FQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTG
         T++      G  + + +I++++ +    PNI  FN+ + +  +   +T   +++  +   G+  D+ ++N LL    + G+  +  ++    K ++  G
Subjt:  RTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAHDLNYT--FQLYKNMQNLGVPADMASYNILLKACCLAGRVDLAQDIYREVKHLETTG

Query:  VLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFR
         +  +  T++T++  ++    ++ A+ V   M  AGV+P++ T+++++++ A  G+ E + ++  EM    C+PN      LLHA    ++      L  
Subjt:  VLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNTQCCNILLHACVEARQFDRAFRLFR

Query:  SWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDI
            +E++ G+         +    T  L  +     P      S +    F P ITT N ++   G     AKA  +++ MK  G TP+  +++ L+ +
Subjt:  SWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKA--LMEEMKSVGLTPNHISWSILIDI

Query:  CGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDH
           S D   + +IL  +   G+ PD+++Y T I         + A  +F EM+   I P+++TY+T +G   +Y +    ++ + + + M K G + N +
Subjt:  CGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSNDH

Query:  YLKELIAEWCE
            ++  +C+
Subjt:  YLKELIAEWCE

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 4.2e-31 | 23.57
Query:  KRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILL
        KR++  A   + E   +   PN++ Y  +I      G+   +  ++  + ++   PN+  +N+L++       ++  F+L ++M   G+  ++ SYN+++
Subjt:  KRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMN--VNAHDLNYTFQLYKNMQNLGVPADMASYNILL

Query:  KACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP
           C  GR+     +  E+          LD  TY+T++K +     +  AL +  +M   G++P+++T++SLI S   +G +  A++  ++M   G  P
Subjt:  KACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEP

Query:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKA-CGTDYYH-
        N +    L+    +    + A+R+ R                 NDN                               F P++ TYN L+   C T     
Subjt:  NTQCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKA-CGTDYYH-

Query:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTY
        A A++E+MK  GL+P+ +S+S ++     S+DV+ A+++   M   G+ PD + Y++ I+   E +  K A  L+EEM R  + P+  TY+ L+ A    
Subjt:  AKALMEEMKSVGLTPNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTY

Query:  GSLHEVQQCLAIYQDMRKSG
        G L   ++ L ++ +M + G
Subjt:  GSLHEVQQCLAIYQDMRKSG


Sequences
CDS sequence
ATGCGAGTCTTCCTCATCCTCGGCACCTCCTCCTCCTCCGCCTCCATTGCCGGACCTCGTCGCCACCGCCATAGCCACTCCAAAGCCCCCAAATCCTCTCTCTCCAACCT
AACCCCCACCGGAACGCATTTGCCGCTCTCTTCACACCCGTCCACTCGCCATTCCCGTCCCCCTCTTCTGTCCTCTGTCCAATTGGACATTGCCGGCGCCTCATCCGGCG
GAAGAATTCCTCTCCAGCACTATGCTGGTGTCGCATCGAAGCTCGCTGAGCGCGGGAAGCTTGAGGATTTTGCAATGGTGGTGGAGAGTGTGGTCGTCGCTGGTGTTGAG
CCCTCGCAGTTCGCTGCGGTGTTGGCTGTTGAGCTTGTAGCCAAGGGGATATCGCGATGTCTGAAAGAGGGAAAGCTATGGAGTGTTTTGCAGGTCATGAGGAAGGTCGA
GGAGCTTGGGATTTCGGTTTTGGGGCTTTGTGATGAGTCTGCCGTAGAATCGCTGAGGAGAGACTGTCACCGTATTGCCAAGTCCGGAGAATTAGAAGAGCTTGTGGAGT
TCATGGAGGTTCTTGCCGGTTTTGGTTTCTCAATCAAAGAAATGATGAAGCCGTCCGAAGTAATTAAATTGTGTGTTGATTACCGTAATCCGAAAATGGCCATTAGGTAT
GCTAGCATGTTACCACATGCAGATATATTGTCCTGTACAACTATAAATGAATTTGGAAAGAAAAGGGACTTGAAATCTGCTTACATAGCATATACAGAATCCAAGGCTAA
TATGAATGGTCCTAATATGTATATCTACCGCACAATCATTGATGTATGTGGCCTCTGTGGGGACTACAAGAAATCGAGGAACATCTATCAGGATTTGGTCAGTCAGAATG
TCACCCCAAATATATTCGTTTTCAACAGTCTCATGAATGTAAATGCCCATGATTTGAACTACACATTTCAACTATACAAAAATATGCAGAATCTCGGTGTACCAGCTGAT
ATGGCCTCATATAATATCCTTCTCAAGGCCTGTTGTCTAGCAGGAAGAGTTGATTTGGCTCAGGACATTTACAGGGAAGTAAAGCATTTGGAAACAACAGGTGTGTTGAA
GTTGGATGTCTTCACCTACAGCACAATTGTAAAGGTTTTCGCAGATGCGAAATTGTGGAAAATGGCACTTGGAGTCAAAGAAGACATGCAATCAGCTGGAGTATCCCCAA
ATATGGTGACCTGGTCTTCTTTGATAAGTTCATGTGCTAATTCGGGTCTTGTTGAGCTGGCTATCCAATTGTTCGAAGAGATGGTTTCAGCAGGATGTGAACCTAATACA
CAGTGTTGTAATATCCTTTTACATGCTTGTGTTGAAGCTCGCCAGTTCGATAGAGCTTTTCGTTTATTTCGCTCCTGGAGGGAAAAGGAACTCTGGGATGGCATAGAAAG
AAAAAGCAGCAACAATGATAATTCGAATGCAGATTCAACATCTCAACTTTGTACTACAAATATGGCTAATGCACCATCTCATATACATCAAATCAGCTTTGTAGGGAATT
TTGCCTTCAAACCTACAATTACAACGTATAATATTCTAATGAAAGCTTGTGGTACTGATTACTACCATGCTAAAGCTTTGATGGAGGAGATGAAAAGTGTTGGTCTTACT
CCCAATCACATTAGCTGGTCAATTCTGATTGACATATGTGGAGGATCTCATGATGTGGAAAGCGCTGTACAGATCTTGACTACCATGCGAATGGCTGGAGTCGATCCTGA
TGTTGTTGCATACACGACGGCTATCAAGGTTTGCGTTGAAGGTAAAAACTGGAAGCTGGCATTTTCATTATTTGAAGAAATGAAAAGATTTGAGATACAGCCGAATTTAG
TGACCTATAGTACACTTCTGGGAGCTCGCAGTACGTATGGTTCATTACACGAAGTACAGCAATGCCTTGCTATATATCAGGACATGAGGAAATCGGGGTTCAAATCCAAT
GATCATTATCTCAAAGAGTTGATTGCAGAGTGGTGTGAAGGAGTTATACAGAATAACAATCAGCAGCAAGTTGAGATAACTCCCTACAACAGAATTGAAAATGGGAAACC
ACGATGTTTGATTCTTGAAAAAGTTGCTGATCATCTGCAGAAGAGCTTTACCGAAAGCCTTACAATTGACCTTCAGGAGCTCACAAAGTACTTAAGCCTAGCTGCATCTA
TTACTCTTTGGGAATGCTTACGTTTTAACTTGGCTGTAGGAGAATCAGTAAAAGATGATATATTCATCATCTTAGAAGTGAATAAAGTTGAAACAGATTCAGCTCCACAG
AACTTTGAGGTGAGAGATGCAATAACCAAACTCTTGCAAGACGAATTAGGGCTTGAGGTTCTTCCTGCAGGACCTACAATTGCTCTTGATAAAGTGCCAAATTCAAAAAG
CCCTAACATGTCACATACAAAACTGAAAGGAGTTATGGGGAGGAATAAGTACTCCACTAGGAGACCAGCAGATGTACAGAGGCTAAAAGTCACCAAGAAATCACTGCAAG
ATTGGCTGCAAAGAAATAGGTAA
mRNA sequence
ATGCGAGTCTTCCTCATCCTCGGCACCTCCTCCTCCTCCGCCTCCATTGCCGGACCTCGTCGCCACCGCCATAGCCACTCCAAAGCCCCCAAATCCTCTCTCTCCAACCT
AACCCCCACCGGAACGCATTTGCCGCTCTCTTCACACCCGTCCACTCGCCATTCCCGTCCCCCTCTTCTGTCCTCTGTCCAATTGGACATTGCCGGCGCCTCATCCGGCG
GAAGAATTCCTCTCCAGCACTATGCTGGTGTCGCATCGAAGCTCGCTGAGCGCGGGAAGCTTGAGGATTTTGCAATGGTGGTGGAGAGTGTGGTCGTCGCTGGTGTTGAG
CCCTCGCAGTTCGCTGCGGTGTTGGCTGTTGAGCTTGTAGCCAAGGGGATATCGCGATGTCTGAAAGAGGGAAAGCTATGGAGTGTTTTGCAGGTCATGAGGAAGGTCGA
GGAGCTTGGGATTTCGGTTTTGGGGCTTTGTGATGAGTCTGCCGTAGAATCGCTGAGGAGAGACTGTCACCGTATTGCCAAGTCCGGAGAATTAGAAGAGCTTGTGGAGT
TCATGGAGGTTCTTGCCGGTTTTGGTTTCTCAATCAAAGAAATGATGAAGCCGTCCGAAGTAATTAAATTGTGTGTTGATTACCGTAATCCGAAAATGGCCATTAGGTAT
GCTAGCATGTTACCACATGCAGATATATTGTCCTGTACAACTATAAATGAATTTGGAAAGAAAAGGGACTTGAAATCTGCTTACATAGCATATACAGAATCCAAGGCTAA
TATGAATGGTCCTAATATGTATATCTACCGCACAATCATTGATGTATGTGGCCTCTGTGGGGACTACAAGAAATCGAGGAACATCTATCAGGATTTGGTCAGTCAGAATG
TCACCCCAAATATATTCGTTTTCAACAGTCTCATGAATGTAAATGCCCATGATTTGAACTACACATTTCAACTATACAAAAATATGCAGAATCTCGGTGTACCAGCTGAT
ATGGCCTCATATAATATCCTTCTCAAGGCCTGTTGTCTAGCAGGAAGAGTTGATTTGGCTCAGGACATTTACAGGGAAGTAAAGCATTTGGAAACAACAGGTGTGTTGAA
GTTGGATGTCTTCACCTACAGCACAATTGTAAAGGTTTTCGCAGATGCGAAATTGTGGAAAATGGCACTTGGAGTCAAAGAAGACATGCAATCAGCTGGAGTATCCCCAA
ATATGGTGACCTGGTCTTCTTTGATAAGTTCATGTGCTAATTCGGGTCTTGTTGAGCTGGCTATCCAATTGTTCGAAGAGATGGTTTCAGCAGGATGTGAACCTAATACA
CAGTGTTGTAATATCCTTTTACATGCTTGTGTTGAAGCTCGCCAGTTCGATAGAGCTTTTCGTTTATTTCGCTCCTGGAGGGAAAAGGAACTCTGGGATGGCATAGAAAG
AAAAAGCAGCAACAATGATAATTCGAATGCAGATTCAACATCTCAACTTTGTACTACAAATATGGCTAATGCACCATCTCATATACATCAAATCAGCTTTGTAGGGAATT
TTGCCTTCAAACCTACAATTACAACGTATAATATTCTAATGAAAGCTTGTGGTACTGATTACTACCATGCTAAAGCTTTGATGGAGGAGATGAAAAGTGTTGGTCTTACT
CCCAATCACATTAGCTGGTCAATTCTGATTGACATATGTGGAGGATCTCATGATGTGGAAAGCGCTGTACAGATCTTGACTACCATGCGAATGGCTGGAGTCGATCCTGA
TGTTGTTGCATACACGACGGCTATCAAGGTTTGCGTTGAAGGTAAAAACTGGAAGCTGGCATTTTCATTATTTGAAGAAATGAAAAGATTTGAGATACAGCCGAATTTAG
TGACCTATAGTACACTTCTGGGAGCTCGCAGTACGTATGGTTCATTACACGAAGTACAGCAATGCCTTGCTATATATCAGGACATGAGGAAATCGGGGTTCAAATCCAAT
GATCATTATCTCAAAGAGTTGATTGCAGAGTGGTGTGAAGGAGTTATACAGAATAACAATCAGCAGCAAGTTGAGATAACTCCCTACAACAGAATTGAAAATGGGAAACC
ACGATGTTTGATTCTTGAAAAAGTTGCTGATCATCTGCAGAAGAGCTTTACCGAAAGCCTTACAATTGACCTTCAGGAGCTCACAAAGTACTTAAGCCTAGCTGCATCTA
TTACTCTTTGGGAATGCTTACGTTTTAACTTGGCTGTAGGAGAATCAGTAAAAGATGATATATTCATCATCTTAGAAGTGAATAAAGTTGAAACAGATTCAGCTCCACAG
AACTTTGAGGTGAGAGATGCAATAACCAAACTCTTGCAAGACGAATTAGGGCTTGAGGTTCTTCCTGCAGGACCTACAATTGCTCTTGATAAAGTGCCAAATTCAAAAAG
CCCTAACATGTCACATACAAAACTGAAAGGAGTTATGGGGAGGAATAAGTACTCCACTAGGAGACCAGCAGATGTACAGAGGCTAAAAGTCACCAAGAAATCACTGCAAG
ATTGGCTGCAAAGAAATAGGTAA
Protein sequence
MRVFLILGTSSSSASIAGPRRHRHSHSKAPKSSLSNLTPTGTHLPLSSHPSTRHSRPPLLSSVQLDIAGASSGGRIPLQHYAGVASKLAERGKLEDFAMVVESVVVAGVE
PSQFAAVLAVELVAKGISRCLKEGKLWSVLQVMRKVEELGISVLGLCDESAVESLRRDCHRIAKSGELEELVEFMEVLAGFGFSIKEMMKPSEVIKLCVDYRNPKMAIRY
ASMLPHADILSCTTINEFGKKRDLKSAYIAYTESKANMNGPNMYIYRTIIDVCGLCGDYKKSRNIYQDLVSQNVTPNIFVFNSLMNVNAHDLNYTFQLYKNMQNLGVPAD
MASYNILLKACCLAGRVDLAQDIYREVKHLETTGVLKLDVFTYSTIVKVFADAKLWKMALGVKEDMQSAGVSPNMVTWSSLISSCANSGLVELAIQLFEEMVSAGCEPNT
QCCNILLHACVEARQFDRAFRLFRSWREKELWDGIERKSSNNDNSNADSTSQLCTTNMANAPSHIHQISFVGNFAFKPTITTYNILMKACGTDYYHAKALMEEMKSVGLT
PNHISWSILIDICGGSHDVESAVQILTTMRMAGVDPDVVAYTTAIKVCVEGKNWKLAFSLFEEMKRFEIQPNLVTYSTLLGARSTYGSLHEVQQCLAIYQDMRKSGFKSN
DHYLKELIAEWCEGVIQNNNQQQVEITPYNRIENGKPRCLILEKVADHLQKSFTESLTIDLQELTKYLSLAASITLWECLRFNLAVGESVKDDIFIILEVNKVETDSAPQ
NFEVRDAITKLLQDELGLEVLPAGPTIALDKVPNSKSPNMSHTKLKGVMGRNKYSTRRPADVQRLKVTKKSLQDWLQRNR