; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G18220 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G18220
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr1:13667097..13669810
RNA-Seq ExpressionCSPI01G18220
SyntenyCSPI01G18220
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR033443 - Pentacotripeptide-repeat region of PRORP


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035127.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0093.68Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPIS ATQN SPNDP ASS+AALPQT ESAAVNGVQQVKGRIPRGRPR+ EKLE +IC+MMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        +REWTTRLQNSIRSLVPQFDH LVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETH KIIEILG ASKLNHARCILLDMPNKGV+WDEDLFVVLI+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIF+KMKELGVERS KSYDALFK I+RRGRYMMAKRYFNAMLNEG+EP RHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSV R DD LRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMV R+ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAM+RLSIPTEAGHYGILIENCCKAGMYD+AVKLL+ LVEKEIIL+PQSTLEMEASAYNLIIQYLCNHGQTGKA+ F
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHAKEGNP+ AFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEE LGRINLLMNCNCPPDF+SLLSVLCEKGKT +AFKLL+FGLERECNI+FSSYEKVLDAL+GAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAYAILCKIMEKGGAKDWSSCDDLIK+LNQEGNTKQADILSRM+KGGDRKRSKK SL A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

XP_004149878.1 pentatricopeptide repeat-containing protein At2g37230 isoform X1 [Cucumis sativus]0.0e+0099.87Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSL A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

XP_008443807.2 PREDICTED: pentatricopeptide repeat-containing protein At2g37230 [Cucumis melo]0.0e+0093.88Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPIS ATQN SPNDP ASS+AALPQT ESAAVNGVQQVKGRIPRGRPR+ EKLE +IC+MMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        +REWTTRLQNSIRSLVPQFDH LVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETH KIIEILG ASKLNHARCILLDMPNKGV+WDEDLFVVLI+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIF+KMKELGVERS KSYDALFK I+RRGRYMMAKRYFNAMLNEG+EP RHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSV R DD LRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMV R+ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAM+RLSIPTEAGHYGILIENCCKAGMYD+AVKLL+ LVEKEIIL+PQSTLEMEASAYNLIIQYLCNHGQTGKA+ F
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHAKEGNP+ AFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEE LGRINLLMNCNCPPDF+SLLSVLCEKGKT +AFKLL+FGLERECNI+FSSYEKVLDAL+GAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKR
        LNAYAILCKIMEKGGAKDWSSCDDLIK+LNQEGNTKQADILSRM+KGGDRKR
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKR

XP_023515807.1 pentatricopeptide repeat-containing protein At2g37230-like [Cucurbita pepo subsp. pepo]0.0e+0086.05Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHIS+SK H++H +VLSSSSISKP + NSLHFFSS Q+P +TATQN SP DP  SSDAA+PQ  E  AVNG  QVK  IPRG  R+PEKLE IIC+MMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        NREWTTRLQNSIRSLVPQFDH++V+NVLHAAK S+HAL FFRWVER+GLFQHDR THFKIIEILGRASKLNHARCILLDMPNKGV+WDEDLFV++I+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKELGVERS+KSY+ LFK I+RRGRYMMAKRYFNAMLNEGIEP  HTYNVMLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGY RFKMMEEAEQFFTEMKG N+ PTVISYTTMIKGYVS  R DD LRLFEEMKA G KPND TYSTLLPGLCDAEK+ EAR+ILTEMV ++ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAG+YD+AVKLL+ LVEKEIIL+PQSTLEMEASAYN IIQYLC+HGQTGKA+TF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGH+KEGNP+LAFE+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE+GH PDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITEN+DLVAKILEALFMRGH EEALGR++LLM C+CPPDF+SLLSVLCEKGKT +A KLLDFGLERECNIE SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAY+ILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRMIKGGDR R KK S  A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

XP_038880029.1 pentatricopeptide repeat-containing protein At2g37230 [Benincasa hispida]0.0e+0090.53Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHISVSKLH +HYRVLSSSSI KPTAL SLHFFSSTQEPISTATQN SPN PSASSDAA+PQ  ES AVNG +QVK R PRG+PR+PEKLE +ICKMMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        NREWTTRLQNSIRSLVPQFDH+LV+NVLHAAK S+HALNFFRWVERAGLFQHDRETH KIIEILGRASKLNHARCILLDM NKG++WDEDLFV+LIESYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKELGVERSVKSYDALFK I+RRGRYMMAKRYFNAMLNEGIEP RHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGY RFKMMEEAEQFFTEMKGKNI PTVISYTTMIKGYVS  R DD LRLFEEMKA G KPNDITYSTLLPGLCDAEK+ EAR+ILTEMV ++ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQC HGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYD+AVKLL+ LVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGH+KEGNP+LAFE+LKIMGRR VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE+GH PDSALFRSVMESLF DGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEAL MRGH EEALGRI+LLM+CNCPPDF+SLLSVLCE+GKT +A KLLDFGLERECNIEFSSYEKVLDALLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAYAILCKIMEKGGA DW S DDLIKSLNQEGNTKQADILSR +KGGDRKR KKPSL A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

TrEMBL top hitse value%identityAlignment
A0A0A0LTL3 PPR_long domain-containing protein0.0e+0099.87Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSL A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

A0A1S3B8Y6 pentatricopeptide repeat-containing protein At2g372300.0e+0093.88Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPIS ATQN SPNDP ASS+AALPQT ESAAVNGVQQVKGRIPRGRPR+ EKLE +IC+MMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        +REWTTRLQNSIRSLVPQFDH LVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETH KIIEILG ASKLNHARCILLDMPNKGV+WDEDLFVVLI+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIF+KMKELGVERS KSYDALFK I+RRGRYMMAKRYFNAMLNEG+EP RHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSV R DD LRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMV R+ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAM+RLSIPTEAGHYGILIENCCKAGMYD+AVKLL+ LVEKEIIL+PQSTLEMEASAYNLIIQYLCNHGQTGKA+ F
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHAKEGNP+ AFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEE LGRINLLMNCNCPPDF+SLLSVLCEKGKT +AFKLL+FGLERECNI+FSSYEKVLDAL+GAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKR
        LNAYAILCKIMEKGGAKDWSSCDDLIK+LNQEGNTKQADILSRM+KGGDRKR
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKR

A0A5A7T0L7 Pentatricopeptide repeat-containing protein0.0e+0093.68Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPIS ATQN SPNDP ASS+AALPQT ESAAVNGVQQVKGRIPRGRPR+ EKLE +IC+MMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        +REWTTRLQNSIRSLVPQFDH LVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETH KIIEILG ASKLNHARCILLDMPNKGV+WDEDLFVVLI+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIF+KMKELGVERS KSYDALFK I+RRGRYMMAKRYFNAMLNEG+EP RHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSV R DD LRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMV R+ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAM+RLSIPTEAGHYGILIENCCKAGMYD+AVKLL+ LVEKEIIL+PQSTLEMEASAYNLIIQYLCNHGQTGKA+ F
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGHAKEGNP+ AFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEE LGRINLLMNCNCPPDF+SLLSVLCEKGKT +AFKLL+FGLERECNI+FSSYEKVLDAL+GAGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAYAILCKIMEKGGAKDWSSCDDLIK+LNQEGNTKQADILSRM+KGGDRKRSKK SL A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

A0A6J1E1L0 pentatricopeptide repeat-containing protein At2g37230-like0.0e+0085.79Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHIS+SK H++H +VLSSSSISKP + NSLHFFSSTQ+P +TATQN SP DP  SSDAA+PQ  E  AVNG  QVK  IPRG  R+PEKLE IIC+MMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        NREWTTRLQNSIRSLVPQFDH++V+NVLHAAK S+HAL FFRWVERAGLFQHDR THFKIIEILGRASKLNHARCILLDMPNKGV+WDEDLFV++I+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKELGVERS+KSY+ LFK I+RRGRYMMAKRYFNAMLNEGIEP  HTYNVMLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGY RFKMMEEAEQFFTEMKGKN+ PTVISYTTMIKGYVS  R DD LRLFEEMKA G KPND TYSTLLPGLCDAE++ EAR+ILTEMV ++ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQCKHGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKAG+YD+AVKLL+ LVEKEIIL+PQSTLEMEASAYN +IQYLC+HGQTGKA+TF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDEVAFNNLIRGH+KEGNP+LAFE+LKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE+GH PDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSML KGITENLDLVAKILEALFMRGH EEALGR++LLM  +CPPDF+SLLSVLCEKGKT +A KLLDFGLERECNIE SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAY+ILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQADILSRMIKGGDR R KK S  A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

A0A6J1JJW0 pentatricopeptide repeat-containing protein At2g37230-like0.0e+0086.05Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA
        MAHIS+SK H++H +V SSSSISK  + NSLHFFSSTQ+PIST TQN SPNDP  SSDAA+PQ+ E  AVNG  QVK  IPRG  R+PEKLE IIC+MMA
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMA

Query:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG
        NREWTTRLQNSIRSLVPQFDH+LV+NVLHAAK S+HAL FFRWVERAGLFQHDR THFKIIEILGRASKLNHARCILLDMP KGV+WDEDLFV++I+SYG
Subjt:  NREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYG

Query:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM
        KAGIVQEAVKIFQKMKELGVERS+KSYDALFK I+RRGRYMMAKRYFN MLNEGIEP RHTYNVMLWGFFLSLRLETAKRFYEDMK+RGI+PDVVTYNTM
Subjt:  KAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTM

Query:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN
        INGY RFKMMEEAEQFFTEMKGKNI PTVISYTTMIKGYVS  R DD LRLFEEMKA G KPND+TYSTLLPGLCDAEK+ EA +ILTEMV R+ APKDN
Subjt:  INGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDN

Query:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF
        SIFMRLLSCQC HGDLDAAMHVLKAM RLS+PTEAGHYGILIENCCKAG+YD+AVKLL+ LV+KEIIL+PQSTLEMEASAYN IIQYLC+HGQTGKA+TF
Subjt:  SIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTF

Query:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT
        FRQLLKKGIQDE+AFNNLIRGH+KEGNP+LAFEMLKIMGR+ VSRDAESYKLLIKSYLSKGEPADAKTALDSMIE+GH PDSALFRSVMESLFADGRVQT
Subjt:  FRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQT

Query:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT
        ASRVMNSMLDKGITENLDLVAKILEALFMRGH EEALGR++LLM C+CPPDF+SLLSVLCEKGKT +A KLLDFGLERECNIE SSYEKVLDALL AGKT
Subjt:  ASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKT

Query:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA
        LNAY+ILCKIMEKGGAK+WSSCDDLI SLNQEG+TKQAD+LSRMIKGGD  R K  S  A
Subjt:  LNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA

SwissProt top hitse value%identityAlignment
O81908 Pentatricopeptide repeat-containing protein At1g02060, chloroplastic5.0e-9732.68Show/hide
Query:  KLEKIICKMMANREWTTRLQNSIRSLVPQ--FDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKG---
        KL + + + + +  W+  L++S+ SL P        V   L   K     L FF WV   G F H  ++ F ++E LGRA  LN AR  L  +  +    
Subjt:  KLEKIICKMMANREWTTRLQNSIRSLVPQ--FDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKG---

Query:  VQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNE-GIEPIRHTYNVMLWGFFLSLRLETAKRFYE
        V+  +  F  LI SYG AG+ QE+VK+FQ MK++G+  SV ++++L   +++RGR  MA   F+ M    G+ P  +T+N ++ GF  +  ++ A R ++
Subjt:  VQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNE-GIEPIRHTYNVMLWGFFLSLRLETAKRFYE

Query:  DMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM--KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLP
        DM+    +PDVVTYNT+I+G CR   ++ A    + M  K  ++ P V+SYTT+++GY      D+A+ +F +M + G KPN +TY+TL+ GL +A +  
Subjt:  DMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM--KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLP

Query:  EARKILT--EMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEAS
        E + IL         FAP D   F  L+   C  G LDAAM V + M+ + +  ++  Y +LI   C    +D+A  L   L EKE++L       + A+
Subjt:  EARKILT--EMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEAS

Query:  AYNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHS
        AYN + +YLC +G+T +A+  FRQL+K+G+QD  ++  LI GH +EG    A+E+L +M RR    D E+Y+LLI   L  GE   A   L  M+ + + 
Subjt:  AYNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHS

Query:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERE
        P +  F SV+  L        +  ++  ML+K I +N+DL  +++  LF     E+A   + LL +         LL  LCE  K   A  L+ F LE+ 
Subjt:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERE

Query:  CNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSR
          ++  +   V++ L    +   A+++  +++E G  +  S    L  +L   G  ++   +S+
Subjt:  CNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSR

P0C7Q7 Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial6.0e-5027.73Show/hide
Query:  HNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGV
        + L+  +    K SE  +   R VE     Q D  T+  I+  + R+   + A  +L  M  + V+ D   +  +I+S  + G +  A+ +F++M+  G+
Subjt:  HNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGV

Query:  ERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM
        + SV +Y++L + + + G++         M++  I P   T+NV+L  F    +L+ A   Y++M +RGISP+++TYNT+++GYC    + EA      M
Subjt:  ERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM

Query:  KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAM
             +P ++++T++IKGY  V R DD +++F  +   G   N +TYS L+ G C + K+  A ++  EMV+    P D   +  LL   C +G L+ A+
Subjt:  KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAM

Query:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLI
         + + + +  +      Y  +IE  CK G  + A  L  +L        P   ++     Y ++I  LC  G   +A+   R++ + G   ++  +N LI
Subjt:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLI

Query:  RGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLS
        R H ++G+   + ++++ M   G S DA S K++I   LS
Subjt:  RGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLS

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745801.6e-5024.38Show/hide
Query:  VLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDM-PNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVK
        V+   K    AL  F  + +   F+H   T+  +IE LG   K      +L+DM  N G    E ++V  +++YG+ G VQEAV +F++M     E +V 
Subjt:  VLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDM-PNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVK

Query:  SYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGY---------------------
        SY+A+   ++  G +  A + +  M + GI P  +++ + +  F  + R   A R   +M S+G   +VV Y T++ G+                     
Subjt:  SYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGY---------------------

Query:  --------------CRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEM
                      C+   ++E E+   ++  + + P + +Y   I+G       D A+R+   +   G KP+ ITY+ L+ GLC   K  EA   L +M
Subjt:  --------------CRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEM

Query:  VTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCN
        V     P D+  +  L++  CK G +  A  ++   +      +   Y  LI+  C  G  ++A+ L    + K I  +P   L      YN +I+ L N
Subjt:  VTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCN

Query:  HGQTGKADTFFRQLLKKGIQDEV-AFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVM
         G   +A     ++ +KG+  EV  FN L+ G  K G    A  ++K+M  +G   D  ++ +LI  Y ++ +  +A   LD M++NG  PD   + S++
Subjt:  HGQTGKADTFFRQLLKKGIQDEV-AFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVM

Query:  ESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPD--------------------------------------
          L    + +       +M++KG   NL     +LE+L      +EALG +  + N +  PD                                      
Subjt:  ESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPD--------------------------------------

Query:  -FNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNT-KQADILSRMIKGG-
         +N ++    EK   T A KL    ++R    +  +Y  ++D     G     Y  L ++ME G     ++   +I  L  E    + A I+ RM++ G 
Subjt:  -FNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNT-KQADILSRMIKGG-

Query:  -----------DRKRSKKPSLV
                   D+K    P LV
Subjt:  -----------DRKRSKKPSLV

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial5.8e-5328.89Show/hide
Query:  HNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGV
        + LV  +    K S+  +   R VE    FQ +  T+  ++ ++ ++ +   A  +L  M  + ++ D   + ++I+   K G +  A  +F +M+  G 
Subjt:  HNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGV

Query:  ERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM
        +  + +Y+ L       GR+    +    M+   I P   T++V++  F    +L  A +  ++M  RGI+P+ +TYN++I+G+C+   +EEA Q    M
Subjt:  ERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM

Query:  KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAM
          K   P ++++  +I GY   +R DD L LF EM   G   N +TY+TL+ G C + KL  A+K+  EMV+R   P D   +  LL   C +G+L+ A+
Subjt:  KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAM

Query:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLI
         +   + +  +  + G Y I+I   C A   D A  L  +L        P   ++++A AYN++I  LC      KAD  FR++ ++G   DE+ +N LI
Subjt:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLI

Query:  RGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGE
        R H  + +   A E+++ M   G   D  + K++I + LS GE
Subjt:  RGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGE

Q9ZUU3 Pentatricopeptide repeat-containing protein At2g372303.9e-29167.38Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSL-HFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMM
        MA IS SK + +  RV  S   S  ++L SL   FS+ +E  + A  N     P A S     +T ++      + ++ R  RG+ ++ EKLE  IC+MM
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSL-HFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMM

Query:  ANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESY
         NR WTTRLQNSIR LVP++DH+LVYNVLH AKK EHAL FFRW ER+GL +HDR+TH K+I++LG  SKLNHARCILLDMP KGV WDED+FVVLIESY
Subjt:  ANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESY

Query:  GKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT
        GKAGIVQE+VKIFQKMK+LGVER++KSY++LFK I+RRGRYMMAKRYFN M++EG+EP RHTYN+MLWGFFLSLRLETA RF+EDMK+RGISPD  T+NT
Subjt:  GKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT

Query:  MINGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKD
        MING+CRFK M+EAE+ F EMKG  I P+V+SYTTMIKGY++V R DD LR+FEEM+++G +PN  TYSTLLPGLCDA K+ EA+ IL  M+ +H APKD
Subjt:  MINGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKD

Query:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADT
        NSIF++LL  Q K GD+ AA  VLKAM  L++P EAGHYG+LIEN CKA  Y++A+KLL+ L+EKEIILR Q TLEME SAYN II+YLCN+GQT KA+ 
Subjt:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADT

Query:  FFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQ
         FRQL+K+G+QD+ A NNLIRGHAKEGNPD ++E+LKIM RRGV R++ +Y+LLIKSY+SKGEP DAKTALDSM+E+GH PDS+LFRSV+ESLF DGRVQ
Subjt:  FFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQ

Query:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGA
        TASRVM  M+DK  GI +N+DL+AKILEAL MRGH EEALGRI+LL       D +SLLSVL EKGKT +A KLLDFGLER+ ++EFSSY+KVLDALLGA
Subjt:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGA

Query:  GKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGG
        GKTLNAY++LCKIMEKG + DW S D+LIKSLNQEGNTKQAD+LSRMIK G
Subjt:  GKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGG

Arabidopsis top hitse value%identityAlignment
AT1G02060.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.6e-9832.68Show/hide
Query:  KLEKIICKMMANREWTTRLQNSIRSLVPQ--FDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKG---
        KL + + + + +  W+  L++S+ SL P        V   L   K     L FF WV   G F H  ++ F ++E LGRA  LN AR  L  +  +    
Subjt:  KLEKIICKMMANREWTTRLQNSIRSLVPQ--FDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKG---

Query:  VQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNE-GIEPIRHTYNVMLWGFFLSLRLETAKRFYE
        V+  +  F  LI SYG AG+ QE+VK+FQ MK++G+  SV ++++L   +++RGR  MA   F+ M    G+ P  +T+N ++ GF  +  ++ A R ++
Subjt:  VQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNE-GIEPIRHTYNVMLWGFFLSLRLETAKRFYE

Query:  DMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM--KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLP
        DM+    +PDVVTYNT+I+G CR   ++ A    + M  K  ++ P V+SYTT+++GY      D+A+ +F +M + G KPN +TY+TL+ GL +A +  
Subjt:  DMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM--KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLP

Query:  EARKILT--EMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEAS
        E + IL         FAP D   F  L+   C  G LDAAM V + M+ + +  ++  Y +LI   C    +D+A  L   L EKE++L       + A+
Subjt:  EARKILT--EMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEAS

Query:  AYNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHS
        AYN + +YLC +G+T +A+  FRQL+K+G+QD  ++  LI GH +EG    A+E+L +M RR    D E+Y+LLI   L  GE   A   L  M+ + + 
Subjt:  AYNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHS

Query:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERE
        P +  F SV+  L        +  ++  ML+K I +N+DL  +++  LF     E+A   + LL +         LL  LCE  K   A  L+ F LE+ 
Subjt:  PDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERE

Query:  CNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSR
          ++  +   V++ L    +   A+++  +++E G  +  S    L  +L   G  ++   +S+
Subjt:  CNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSR

AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein4.2e-5428.89Show/hide
Query:  HNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGV
        + LV  +    K S+  +   R VE    FQ +  T+  ++ ++ ++ +   A  +L  M  + ++ D   + ++I+   K G +  A  +F +M+  G 
Subjt:  HNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGV

Query:  ERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM
        +  + +Y+ L       GR+    +    M+   I P   T++V++  F    +L  A +  ++M  RGI+P+ +TYN++I+G+C+   +EEA Q    M
Subjt:  ERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM

Query:  KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAM
          K   P ++++  +I GY   +R DD L LF EM   G   N +TY+TL+ G C + KL  A+K+  EMV+R   P D   +  LL   C +G+L+ A+
Subjt:  KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAM

Query:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLI
         +   + +  +  + G Y I+I   C A   D A  L  +L        P   ++++A AYN++I  LC      KAD  FR++ ++G   DE+ +N LI
Subjt:  HVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLI

Query:  RGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGE
        R H  + +   A E+++ M   G   D  + K++I + LS GE
Subjt:  RGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGE

AT1G30290.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.1e-5823.34Show/hide
Query:  ALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMANR-EWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHF
        ++PQ+ E A    V++ + R P         L + + +++  R  W  + +  +R+L+     + V  VL +      AL FF W +R   ++HD   ++
Subjt:  ALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMANR-EWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHF

Query:  KIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPI
         ++E+L +      +R +L+ M  +G+    + F  ++ SY +AG +++A+K+   M+  GVE ++   +      +R  R   A R+   M   GI P 
Subjt:  KIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPI

Query:  RHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM-KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKA
          TYN M+ G+    R+E A    EDM S+G  PD V+Y T++   C+ K + E      +M K   + P  ++Y T+I        AD+AL   ++ + 
Subjt:  RHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEM-KGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKA

Query:  AGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKL
         G + + + YS ++  LC   ++ EA+ ++ EM+++   P D   +  +++  C+ G++D A  +L+ M           Y  L+   C+ G   +A ++
Subjt:  AGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKL

Query:  LENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKS
        +   + +E    P S        Y++I+  L   G+  +A    R+++ KG     V  N L++   ++G    A + ++    +G + +  ++  +I  
Subjt:  LENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKG-IQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKS

Query:  YLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRI-NLLMNCNCPPDFNSL
        +    E   A + LD M       D   + +++++L   GR+  A+ +M  ML KGI         ++      G  ++ +  +  ++    C   +N +
Subjt:  YLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRI-NLLMNCNCPPDFNSL

Query:  LSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQAD-ILSRMIKGG
        +  LC  GK   A  LL   L      +  +   +++  L  G  L+AY + C++  +    D   C+ L K L  +G   +AD ++ R+++ G
Subjt:  LSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQAD-ILSRMIKGG

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-5124.38Show/hide
Query:  VLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDM-PNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVK
        V+   K    AL  F  + +   F+H   T+  +IE LG   K      +L+DM  N G    E ++V  +++YG+ G VQEAV +F++M     E +V 
Subjt:  VLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDM-PNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGVERSVK

Query:  SYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGY---------------------
        SY+A+   ++  G +  A + +  M + GI P  +++ + +  F  + R   A R   +M S+G   +VV Y T++ G+                     
Subjt:  SYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGY---------------------

Query:  --------------CRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEM
                      C+   ++E E+   ++  + + P + +Y   I+G       D A+R+   +   G KP+ ITY+ L+ GLC   K  EA   L +M
Subjt:  --------------CRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEM

Query:  VTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCN
        V     P D+  +  L++  CK G +  A  ++   +      +   Y  LI+  C  G  ++A+ L    + K I  +P   L      YN +I+ L N
Subjt:  VTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCN

Query:  HGQTGKADTFFRQLLKKGIQDEV-AFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVM
         G   +A     ++ +KG+  EV  FN L+ G  K G    A  ++K+M  +G   D  ++ +LI  Y ++ +  +A   LD M++NG  PD   + S++
Subjt:  HGQTGKADTFFRQLLKKGIQDEV-AFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVM

Query:  ESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPD--------------------------------------
          L    + +       +M++KG   NL     +LE+L      +EALG +  + N +  PD                                      
Subjt:  ESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPD--------------------------------------

Query:  -FNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNT-KQADILSRMIKGG-
         +N ++    EK   T A KL    ++R    +  +Y  ++D     G     Y  L ++ME G     ++   +I  L  E    + A I+ RM++ G 
Subjt:  -FNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNT-KQADILSRMIKGG-

Query:  -----------DRKRSKKPSLV
                   D+K    P LV
Subjt:  -----------DRKRSKKPSLV

AT2G37230.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-29267.38Show/hide
Query:  MAHISVSKLHFTHYRVLSSSSISKPTALNSL-HFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMM
        MA IS SK + +  RV  S   S  ++L SL   FS+ +E  + A  N     P A S     +T ++      + ++ R  RG+ ++ EKLE  IC+MM
Subjt:  MAHISVSKLHFTHYRVLSSSSISKPTALNSL-HFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMM

Query:  ANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESY
         NR WTTRLQNSIR LVP++DH+LVYNVLH AKK EHAL FFRW ER+GL +HDR+TH K+I++LG  SKLNHARCILLDMP KGV WDED+FVVLIESY
Subjt:  ANREWTTRLQNSIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESY

Query:  GKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT
        GKAGIVQE+VKIFQKMK+LGVER++KSY++LFK I+RRGRYMMAKRYFN M++EG+EP RHTYN+MLWGFFLSLRLETA RF+EDMK+RGISPD  T+NT
Subjt:  GKAGIVQEAVKIFQKMKELGVERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNT

Query:  MINGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKD
        MING+CRFK M+EAE+ F EMKG  I P+V+SYTTMIKGY++V R DD LR+FEEM+++G +PN  TYSTLLPGLCDA K+ EA+ IL  M+ +H APKD
Subjt:  MINGYCRFKMMEEAEQFFTEMKGKNIAPTVISYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKD

Query:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADT
        NSIF++LL  Q K GD+ AA  VLKAM  L++P EAGHYG+LIEN CKA  Y++A+KLL+ L+EKEIILR Q TLEME SAYN II+YLCN+GQT KA+ 
Subjt:  NSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGILIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADT

Query:  FFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQ
         FRQL+K+G+QD+ A NNLIRGHAKEGNPD ++E+LKIM RRGV R++ +Y+LLIKSY+SKGEP DAKTALDSM+E+GH PDS+LFRSV+ESLF DGRVQ
Subjt:  FFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESYKLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQ

Query:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGA
        TASRVM  M+DK  GI +N+DL+AKILEAL MRGH EEALGRI+LL       D +SLLSVL EKGKT +A KLLDFGLER+ ++EFSSY+KVLDALLGA
Subjt:  TASRVMNSMLDK--GITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLCEKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGA

Query:  GKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGG
        GKTLNAY++LCKIMEKG + DW S D+LIKSLNQEGNTKQAD+LSRMIK G
Subjt:  GKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCACATTTCTGTATCTAAACTTCACTTCACCCATTACAGGGTTCTTTCCAGTTCTTCAATTTCGAAACCAACTGCTCTCAATTCACTTCATTTCTTTAGCTCCAC
TCAAGAGCCCATCTCCACGGCTACTCAAAATGGAAGCCCCAATGATCCATCCGCCAGTTCTGATGCGGCACTGCCTCAGACAGGGGAATCTGCAGCTGTTAATGGCGTCC
AGCAAGTTAAGGGAAGAATCCCTAGAGGTAGGCCTCGTGACCCTGAAAAGTTAGAAAAAATTATTTGTAAAATGATGGCAAATCGTGAATGGACTACGCGTTTACAGAAC
TCGATTCGTTCGTTGGTTCCTCAATTTGATCACAACCTTGTTTATAATGTGTTACATGCTGCTAAGAAATCCGAACATGCCCTCAATTTCTTCCGGTGGGTGGAGCGAGC
TGGATTATTTCAGCATGATCGCGAAACCCATTTCAAAATAATTGAGATTCTGGGTAGGGCTTCAAAGCTTAATCATGCCCGTTGTATTCTTCTTGATATGCCCAATAAGG
GGGTTCAATGGGATGAAGACTTATTCGTTGTATTGATTGAAAGCTATGGTAAAGCTGGGATAGTTCAGGAGGCTGTGAAAATTTTTCAAAAGATGAAGGAATTGGGTGTT
GAGAGGAGTGTTAAATCTTATGATGCTTTGTTTAAGGAGATTATGAGGAGGGGGCGGTATATGATGGCCAAGAGGTACTTTAATGCTATGTTGAATGAAGGAATAGAACC
GATTCGCCATACCTATAATGTGATGCTTTGGGGTTTCTTTTTGTCGTTGAGGCTTGAGACAGCCAAGAGATTTTATGAAGACATGAAGAGTAGAGGTATTTCGCCTGATG
TGGTTACGTATAACACTATGATTAATGGATACTGTCGGTTCAAAATGATGGAGGAGGCAGAGCAATTCTTTACTGAGATGAAGGGTAAGAATATTGCACCAACAGTGATA
AGCTATACTACTATGATCAAAGGTTATGTTTCTGTTAGTCGAGCAGATGATGCATTGAGATTGTTTGAAGAGATGAAGGCTGCTGGTGAGAAGCCGAATGATATTACTTA
TTCAACTCTCCTACCTGGTCTTTGTGATGCAGAGAAATTGCCCGAGGCTCGAAAAATTTTGACAGAAATGGTGACTAGGCATTTTGCACCGAAGGACAATTCAATTTTCA
TGAGGTTGTTATCTTGCCAGTGCAAGCATGGTGATTTGGATGCTGCTATGCATGTGCTGAAAGCAATGATTCGACTAAGCATTCCAACTGAGGCTGGACACTACGGTATT
CTGATTGAGAACTGTTGCAAAGCCGGGATGTATGATCAGGCAGTTAAGTTGCTTGAAAATCTTGTAGAAAAAGAAATCATATTGAGGCCACAAAGTACTTTGGAAATGGA
GGCTAGTGCATATAATCTTATAATTCAGTATCTGTGCAACCACGGGCAGACTGGAAAAGCTGATACATTTTTCCGACAGTTGTTGAAGAAGGGTATTCAGGATGAGGTTG
CATTTAACAATTTAATCCGTGGCCATGCCAAAGAAGGGAATCCTGACTTAGCATTTGAAATGTTGAAAATCATGGGTAGGAGAGGTGTGTCAAGGGATGCAGAATCTTAC
AAGTTGCTAATCAAGAGCTACTTGAGTAAAGGTGAACCAGCTGATGCTAAAACAGCTTTGGATAGCATGATTGAAAATGGGCACTCTCCTGACTCGGCATTGTTTAGATC
AGTGATGGAAAGTCTATTTGCAGACGGGAGGGTGCAGACTGCAAGCCGAGTGATGAATAGTATGTTGGATAAAGGAATAACAGAAAACTTAGACTTGGTTGCCAAAATCC
TTGAAGCCCTTTTCATGAGAGGTCATGACGAAGAAGCGTTGGGACGAATTAATTTGCTAATGAATTGCAATTGCCCACCTGATTTTAACAGTCTTTTATCTGTTCTTTGT
GAAAAGGGGAAGACAACTTCTGCCTTCAAGCTTTTAGATTTTGGATTGGAAAGAGAATGCAACATAGAGTTCTCAAGTTATGAGAAGGTTCTAGATGCGCTGTTGGGGGC
AGGGAAGACGCTTAATGCATACGCAATTCTATGCAAGATAATGGAGAAAGGAGGGGCCAAGGATTGGAGCAGCTGTGATGATTTGATCAAAAGCCTGAATCAGGAAGGGA
ACACAAAGCAAGCTGATATTCTCTCAAGAATGATAAAGGGCGGAGACAGAAAACGGAGTAAGAAACCTTCTCTAGTTGCTTGA
mRNA sequenceShow/hide mRNA sequence
AGACTTTACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATGTTTATAGATCTCTGTCTGTACAAAATTCAAATGCCCTGAGGTTTGACCTTCTCCCACCCTTCTTTACAA
ATCCCTTTCAGCGATTCCGACCTCCATGGCTCACATTTCTGTATCTAAACTTCACTTCACCCATTACAGGGTTCTTTCCAGTTCTTCAATTTCGAAACCAACTGCTCTCA
ATTCACTTCATTTCTTTAGCTCCACTCAAGAGCCCATCTCCACGGCTACTCAAAATGGAAGCCCCAATGATCCATCCGCCAGTTCTGATGCGGCACTGCCTCAGACAGGG
GAATCTGCAGCTGTTAATGGCGTCCAGCAAGTTAAGGGAAGAATCCCTAGAGGTAGGCCTCGTGACCCTGAAAAGTTAGAAAAAATTATTTGTAAAATGATGGCAAATCG
TGAATGGACTACGCGTTTACAGAACTCGATTCGTTCGTTGGTTCCTCAATTTGATCACAACCTTGTTTATAATGTGTTACATGCTGCTAAGAAATCCGAACATGCCCTCA
ATTTCTTCCGGTGGGTGGAGCGAGCTGGATTATTTCAGCATGATCGCGAAACCCATTTCAAAATAATTGAGATTCTGGGTAGGGCTTCAAAGCTTAATCATGCCCGTTGT
ATTCTTCTTGATATGCCCAATAAGGGGGTTCAATGGGATGAAGACTTATTCGTTGTATTGATTGAAAGCTATGGTAAAGCTGGGATAGTTCAGGAGGCTGTGAAAATTTT
TCAAAAGATGAAGGAATTGGGTGTTGAGAGGAGTGTTAAATCTTATGATGCTTTGTTTAAGGAGATTATGAGGAGGGGGCGGTATATGATGGCCAAGAGGTACTTTAATG
CTATGTTGAATGAAGGAATAGAACCGATTCGCCATACCTATAATGTGATGCTTTGGGGTTTCTTTTTGTCGTTGAGGCTTGAGACAGCCAAGAGATTTTATGAAGACATG
AAGAGTAGAGGTATTTCGCCTGATGTGGTTACGTATAACACTATGATTAATGGATACTGTCGGTTCAAAATGATGGAGGAGGCAGAGCAATTCTTTACTGAGATGAAGGG
TAAGAATATTGCACCAACAGTGATAAGCTATACTACTATGATCAAAGGTTATGTTTCTGTTAGTCGAGCAGATGATGCATTGAGATTGTTTGAAGAGATGAAGGCTGCTG
GTGAGAAGCCGAATGATATTACTTATTCAACTCTCCTACCTGGTCTTTGTGATGCAGAGAAATTGCCCGAGGCTCGAAAAATTTTGACAGAAATGGTGACTAGGCATTTT
GCACCGAAGGACAATTCAATTTTCATGAGGTTGTTATCTTGCCAGTGCAAGCATGGTGATTTGGATGCTGCTATGCATGTGCTGAAAGCAATGATTCGACTAAGCATTCC
AACTGAGGCTGGACACTACGGTATTCTGATTGAGAACTGTTGCAAAGCCGGGATGTATGATCAGGCAGTTAAGTTGCTTGAAAATCTTGTAGAAAAAGAAATCATATTGA
GGCCACAAAGTACTTTGGAAATGGAGGCTAGTGCATATAATCTTATAATTCAGTATCTGTGCAACCACGGGCAGACTGGAAAAGCTGATACATTTTTCCGACAGTTGTTG
AAGAAGGGTATTCAGGATGAGGTTGCATTTAACAATTTAATCCGTGGCCATGCCAAAGAAGGGAATCCTGACTTAGCATTTGAAATGTTGAAAATCATGGGTAGGAGAGG
TGTGTCAAGGGATGCAGAATCTTACAAGTTGCTAATCAAGAGCTACTTGAGTAAAGGTGAACCAGCTGATGCTAAAACAGCTTTGGATAGCATGATTGAAAATGGGCACT
CTCCTGACTCGGCATTGTTTAGATCAGTGATGGAAAGTCTATTTGCAGACGGGAGGGTGCAGACTGCAAGCCGAGTGATGAATAGTATGTTGGATAAAGGAATAACAGAA
AACTTAGACTTGGTTGCCAAAATCCTTGAAGCCCTTTTCATGAGAGGTCATGACGAAGAAGCGTTGGGACGAATTAATTTGCTAATGAATTGCAATTGCCCACCTGATTT
TAACAGTCTTTTATCTGTTCTTTGTGAAAAGGGGAAGACAACTTCTGCCTTCAAGCTTTTAGATTTTGGATTGGAAAGAGAATGCAACATAGAGTTCTCAAGTTATGAGA
AGGTTCTAGATGCGCTGTTGGGGGCAGGGAAGACGCTTAATGCATACGCAATTCTATGCAAGATAATGGAGAAAGGAGGGGCCAAGGATTGGAGCAGCTGTGATGATTTG
ATCAAAAGCCTGAATCAGGAAGGGAACACAAAGCAAGCTGATATTCTCTCAAGAATGATAAAGGGCGGAGACAGAAAACGGAGTAAGAAACCTTCTCTAGTTGCTTGATT
AATCAATTCCTCCCCCTTTTTTCCACCATTTTCTCCTTTGAAATGATATTTCATGCTTCAAGCCCTGAATAACACTTGGGTTGTTATTTGATCCAATATACTCAAGATGT
AGCTTTAACTTTTGTAGTGTTATTTTTTTTTCCCTCCCCATGATTTAAGTTTCTTCAGTTGATGAATACTAAGGACTTTGGTTTTCTTAAAAAGTTTCTTGGTAATATCT
GAAATGTGCAAGGGATGTATATTATCATTATTGGTGGATGAGAAAACATAGAGAAGTATTTTGAGAATGCAAGG
Protein sequenceShow/hide protein sequence
MAHISVSKLHFTHYRVLSSSSISKPTALNSLHFFSSTQEPISTATQNGSPNDPSASSDAALPQTGESAAVNGVQQVKGRIPRGRPRDPEKLEKIICKMMANREWTTRLQN
SIRSLVPQFDHNLVYNVLHAAKKSEHALNFFRWVERAGLFQHDRETHFKIIEILGRASKLNHARCILLDMPNKGVQWDEDLFVVLIESYGKAGIVQEAVKIFQKMKELGV
ERSVKSYDALFKEIMRRGRYMMAKRYFNAMLNEGIEPIRHTYNVMLWGFFLSLRLETAKRFYEDMKSRGISPDVVTYNTMINGYCRFKMMEEAEQFFTEMKGKNIAPTVI
SYTTMIKGYVSVSRADDALRLFEEMKAAGEKPNDITYSTLLPGLCDAEKLPEARKILTEMVTRHFAPKDNSIFMRLLSCQCKHGDLDAAMHVLKAMIRLSIPTEAGHYGI
LIENCCKAGMYDQAVKLLENLVEKEIILRPQSTLEMEASAYNLIIQYLCNHGQTGKADTFFRQLLKKGIQDEVAFNNLIRGHAKEGNPDLAFEMLKIMGRRGVSRDAESY
KLLIKSYLSKGEPADAKTALDSMIENGHSPDSALFRSVMESLFADGRVQTASRVMNSMLDKGITENLDLVAKILEALFMRGHDEEALGRINLLMNCNCPPDFNSLLSVLC
EKGKTTSAFKLLDFGLERECNIEFSSYEKVLDALLGAGKTLNAYAILCKIMEKGGAKDWSSCDDLIKSLNQEGNTKQADILSRMIKGGDRKRSKKPSLVA