; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg014401 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg014401
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold3:45374228..45378607
RNA-Seq ExpressionSpg014401
SyntenySpg014401
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002625 - Smr domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046705.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]0.0e+0095.17Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNG WTTTHK SLVK  P TPGH+A K    STSTPLSQSPNF SLCSLPTSKSELASNFSG RST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSMATRHTAIAEEVLHQ LQFGKDD SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCF+FALVREGRKNE+GKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMK SGLKPNLVTYNAVIDACGKGGVEFK+VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAAQNLFNEM+DRGIDQD+FTYNTLLDAVCKGGQMDLA+EIMLEMP KKILPNVVTYSTMADGYAKAGRLEDALNLYN+MKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDAL VC+EMGSSGVKKD VTYNALLDGYGKQGKFNEVTRVFKEMKKDRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAV LLDEM KEGIRPNVVTYNSIIDAFGRS+TAESLVD VGAS+ERQSESPSFMLIEGVDESEI+NWDDDHVFKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGKEEI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLLR
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLLR
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLLR

XP_004135985.1 pentatricopeptide repeat-containing protein At2g31400, chloroplastic [Cucumis sativus]0.0e+0095.03Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSIT AKPYQTHQYPQNNLKNHRQN RQNG WTTTHK  LVKPLP TPGH+ATK    STSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSM TRH+AIAEEVLHQ LQFGKDD SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCF+FALVREGRKNERGKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMK SGLKPNLVTYNAVIDACGKGGVEFK+VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAA+NLFNEMIDRGIDQD+FTYNTLLDAVCKGGQMDLA+EIMLEMP KKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDAL VC+EMGSSGVKKD VTYNALLDGYGKQGKFNEVTRVFKEMKKDRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAV LLDEM KEGIRPNVVTYNSIIDAFGRS+TAE LVD VGAS+ERQSESPSFMLIEGVDESEI NWDD HVFKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGKEEI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVF GHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

XP_008451469.1 PREDICTED: pentatricopeptide repeat-containing protein At2g31400, chloroplastic [Cucumis melo]0.0e+0095.16Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNG WTTTHK SLVK  P TPGH+A K    STSTPLSQSPNF SLCSLPTSKSELASNFSG RST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSMATRHTAIAEEVLHQ LQFGKDD SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCF+FALVREGRKNE+GKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMK SGLKPNLVTYNAVIDACGKGGVEFK+VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAAQNLFNEM+DRGIDQD+FTYNTLLDAVCKGGQMDLA+EIMLEMP KKILPNVVTYSTMADGYAKAGRLEDALNLYN+MKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDAL VC+EMGSSGVKKD VTYNALLDGYGKQGKFNEVTRVFKEMKKDRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAV LLDEM KEGIRPNVVTYNSIIDAFGRS+TAESLVD VGAS+ERQSESPSFMLIEGVDESEI+NWDDDHVFKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGKEEI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

XP_022147988.1 pentatricopeptide repeat-containing protein At2g31400, chloroplastic [Momordica charantia]0.0e+0094.9Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAK YQTHQYPQNNLKNHRQNPRQNGSW TT KVSLVKPLP +P  NATKS+++ST TP+SQ+PNFPSLCSLP SKSELASNFSGRRST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSMATRHT IAEEVLHQALQFGKDDGSLDN+LL+FESKLCGSEDYTFLLRELGNRGECWKAIRCFEFAL REGRKNERGKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFK+VVEIF+EMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAA+NLF+EM+DRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMK+DRV+PNLLTYSTLIDVYSKGSL+EEAMEVF EFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRS+T ESLVDAVGA+SERQ E+PSFMLIEGVDESE+ENWDDDHV KF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGK+EI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMG+SENVWIQAQ+LFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

XP_038897325.1 pentatricopeptide repeat-containing protein At2g31400, chloroplastic [Benincasa hispida]0.0e+0096.31Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAKPYQ HQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLP TPGH+ATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDD SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREG+KNERGKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMK SGLKPNLVTYNAVIDACGKGGVEFK+VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAA+NLFNEM+DRGIDQD+FTYNTLLDAVCKGGQMDLA+EIMLEMP KKILPNVVTYSTMADGYAKAGRLEDAL+LYNEMKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKD VTYNALLDGYGKQGKFNEVTRVFKEMKKDRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRS+TAES VD+VGASSERQSESPSFMLIE VDESEI+NWD DH+FKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEG  KKERLGKEEI+SILSVFKKMHEL+IKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGF+ENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

TrEMBL top hitse value%identityAlignment
A0A0A0KB90 Pentatricopeptide repeat-containing protein0.0e+0095.03Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSIT AKPYQTHQYPQNNLKNHRQN RQNG WTTTHK  LVKPLP TPGH+ATK    STSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSM TRH+AIAEEVLHQ LQFGKDD SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCF+FALVREGRKNERGKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMK SGLKPNLVTYNAVIDACGKGGVEFK+VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAA+NLFNEMIDRGIDQD+FTYNTLLDAVCKGGQMDLA+EIMLEMP KKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDAL VC+EMGSSGVKKD VTYNALLDGYGKQGKFNEVTRVFKEMKKDRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAV LLDEM KEGIRPNVVTYNSIIDAFGRS+TAE LVD VGAS+ERQSESPSFMLIEGVDESEI NWDD HVFKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGKEEI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVF GHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

A0A1S3BSC2 pentatricopeptide repeat-containing protein At2g31400, chloroplastic0.0e+0095.16Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNG WTTTHK SLVK  P TPGH+A K    STSTPLSQSPNF SLCSLPTSKSELASNFSG RST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSMATRHTAIAEEVLHQ LQFGKDD SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCF+FALVREGRKNE+GKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMK SGLKPNLVTYNAVIDACGKGGVEFK+VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAAQNLFNEM+DRGIDQD+FTYNTLLDAVCKGGQMDLA+EIMLEMP KKILPNVVTYSTMADGYAKAGRLEDALNLYN+MKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDAL VC+EMGSSGVKKD VTYNALLDGYGKQGKFNEVTRVFKEMKKDRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAV LLDEM KEGIRPNVVTYNSIIDAFGRS+TAESLVD VGAS+ERQSESPSFMLIEGVDESEI+NWDDDHVFKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGKEEI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

A0A5A7TZ92 Pentatricopeptide repeat-containing protein0.0e+0095.17Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNG WTTTHK SLVK  P TPGH+A K    STSTPLSQSPNF SLCSLPTSKSELASNFSG RST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSMATRHTAIAEEVLHQ LQFGKDD SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCF+FALVREGRKNE+GKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMK SGLKPNLVTYNAVIDACGKGGVEFK+VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAAQNLFNEM+DRGIDQD+FTYNTLLDAVCKGGQMDLA+EIMLEMP KKILPNVVTYSTMADGYAKAGRLEDALNLYN+MKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDAL VC+EMGSSGVKKD VTYNALLDGYGKQGKFNEVTRVFKEMKKDRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAV LLDEM KEGIRPNVVTYNSIIDAFGRS+TAESLVD VGAS+ERQSESPSFMLIEGVDESEI+NWDDDHVFKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGKEEI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLLR
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLLR
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLLR

A0A6J1D2T9 pentatricopeptide repeat-containing protein At2g31400, chloroplastic0.0e+0094.9Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAK YQTHQYPQNNLKNHRQNPRQNGSW TT KVSLVKPLP +P  NATKS+++ST TP+SQ+PNFPSLCSLP SKSELASNFSGRRST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFHFGRPKSSMATRHT IAEEVLHQALQFGKDDGSLDN+LL+FESKLCGSEDYTFLLRELGNRGECWKAIRCFEFAL REGRKNERGKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFD+AIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFK+VVEIF+EMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NSLLAVCSRGGLWEAA+NLF+EM+DRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMK+DRV+PNLLTYSTLIDVYSKGSL+EEAMEVF EFKQ
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRS+T ESLVDAVGA+SERQ E+PSFMLIEGVDESE+ENWDDDHV KF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGK+EI+SILSVFKKMHELEIKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMG+SENVWIQAQ+LFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

A0A6J1GMF8 pentatricopeptide repeat-containing protein At2g31400, chloroplastic-like0.0e+0093.63Show/hide
Query:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST
        MASTPPHCSITTAKPYQTHQYP NNLKNHR   RQNGS TTTH VSLVKPLPPTP H+A KS STSTSTPLSQSPNFPSL SL TSKSELASNFSGRRST
Subjt:  MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRST

Query:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST
        RFVSKFH GRPKSSMATRHTAIAEEVLHQALQFGK+D SLDNILL+FESKLCGSEDYTFLLRELGNRGECWKAIRCF+FALVREGRKNERGKLASAMIST
Subjt:  RFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMIST

Query:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY
        LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKG VEF +VVEIFEEMLRNGVQPDRITY
Subjt:  LGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITY

Query:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL
        NS LAVCSRGGLWEAA++LF+EM DRGIDQDIFTYNTLLDAVCKGGQMDLA EIMLEMP+KKI PNVVTYSTMADGYAKAGRLEDALNLYNEMK LGIGL
Subjt:  NSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGL

Query:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ
        DRVSYNTL+SIYAKLGRFEDALNVCREMGSSGVKKD VTYNALLDGYGKQGKF EVTRVFKEMK+DRV PNLLTYSTLIDVYSKGSLYEEAMEVFREFK+
Subjt:  DRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQ

Query:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF
        AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRS+TAESL   VGASS+RQSESPS MLIEGVDE E ENWDDDH FKF
Subjt:  AGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKF

Query:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
        YQQLVSEKEGPAKKERLGKEEI SIL+VFKKMHEL+IKPNVVTFSAILNACSRCKSIE+ASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV
Subjt:  YQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEV

Query:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        K+MDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRR VWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
Subjt:  KQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

SwissProt top hitse value%identityAlignment
Q76C99 Protein Rf1, mitochondrial7.1e-5432.65Show/hide
Query:  YTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSG
        Y+ L+  L   G C +A + F+ ++ + G K E     + +     +   VE+  G+ +  +  G     + FS LI AY K G  DQA+ VF  M+  G
Subjt:  YTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSG

Query:  LKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQM---DLAFE
        L PN VTY AVI    K G   +  +  FE+M+  G+ P  I YNSL+        WE A+ L  EM+DRGI  +   +N+++D+ CK G++   +  FE
Subjt:  LKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQM---DLAFE

Query:  IMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKF
        +M+ +  K   PNV+TY+T+ +GY  AG++++A+ L + M  +G+  + V+Y+TL++ Y K+ R EDAL + +EM SSGV  D +TYN +L G  +  + 
Subjt:  IMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKF

Query:  NEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTY
             ++  + +      L TY+ ++    K  L ++A+++F+      LK +   ++ +I+AL K G  D A  L       G+ PN  TY
Subjt:  NEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTY

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397102.1e-5328.33Show/hide
Query:  ELAKGVFETALSEGYG---NTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLL
        E A  VF+ +L E Y    +T   F  ++ +Y +    D+A+ +    +  G  P +++YNAV+DA  +          +F+EML + V P+  TYN L+
Subjt:  ELAKGVFETALSEGYG---NTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLL

Query:  AVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVS
              G  + A  LF++M  +G   ++ TYNTL+D  CK  ++D  F+++  M  K + PN+++Y+ + +G  + GR+++   +  EM   G  LD V+
Subjt:  AVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVS

Query:  YNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLK
        YNTL+  Y K G F  AL +  EM   G+    +TY +L+    K G  N       +M+   + PN  TY+TL+D +S+     EA  V RE    G  
Subjt:  YNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLK

Query:  ADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQL
          VV Y+ LIN  C  G ++ A+++L++M ++G+ P+VV+Y++++  F RS   +  +  V      +   P  +    + +   E          Y+++
Subjt:  ADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQL

Query:  VSEKEGPAK-------KERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLF
        +     P +            + +++  L +  +M E  + P+VVT+S ++N  ++     EA  LL  L+LF
Subjt:  VSEKEGPAK-------KERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLF

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g028607.8e-6927.1Show/hide
Query:  KPLPPTPGHNATKSTSTSTSTPLSQSPNFP--------SLCSLPTSKSELASNFSGRRSTRFVSKF---HFGRPKSSMATRHTAIAEEVLHQALQFGKDD
        KP      H+ +++   +TS  LS  P  P         L   P S+  ++S  S  R+   + K    + G+P S      +   ++VL   ++   D 
Subjt:  KPLPPTPGHNATKSTSTSTSTPLSQSPNFP--------SLCSLPTSKSELASNFSGRRSTRFVSKF---HFGRPKSSMATRHTAIAEEVLHQALQFGKDD

Query:  GSLDNILLHF-----ESKLCGSEDYTFLLRELGNRGECWKAIRCFE-FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSA
        G LD++L        +     S +    L+ LG   +   A+R F+ F   ++ +      + + +IS LG+ G+V  A  +F     +G+   V+++++
Subjt:  GSLDNILLHF-----ESKLCGSEDYTFLLRELGNRGECWKAIRCFE-FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSA

Query:  LISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQD
        LISA+  SG + +A+ VF+ M++ G KP L+TYN +++  GK G  + K+  + E+M  +G+ PD  TYN+L+  C RG L + A  +F EM   G   D
Subjt:  LISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQD

Query:  IFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSS
          TYN LLD   K  +   A +++ EM      P++VTY+++   YA+ G L++A+ L N+M   G   D  +Y TLLS + + G+ E A+++  EM ++
Subjt:  IFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSS

Query:  GVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLD
        G K +  T+NA +  YG +GKF E+ ++F E+    +SP+++T++TL+ V+ +  +  E   VF+E K+AG   +   ++ LI+A  + G  + A+++  
Subjt:  GVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLD

Query:  EMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEG-VDESEIENWDDDHVFKFYQQLVSEKEGPAKKERLGKEEIKSILSVFK
         M+  G+ P++ TYN+++ A  R    E            QSE     + +G    +E+      H +                     +EI  + S+ +
Subjt:  EMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEG-VDESEIENWDDDHVFKFYQQLVSEKEGPAKKERLGKEEIKSILSVFK

Query:  KMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEEL--RLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNALTDM
        +++   I+P  V    ++  CS+C  + EA     EL  R F   +  +   + +     +  +A  + D +K+   + + A YN+L  M
Subjt:  KMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEEL--RLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNALTDM

Q9S7Q2 Pentatricopeptide repeat-containing protein At1g74850, chloroplastic4.9e-7927.17Show/hide
Query:  NFSGRRSTRFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGK
        +FSG+   +       G P  S+     +   E L   L      GS+   L  F++KL    D+  + +E   RG+  +++R F++ + R+        
Subjt:  NFSGRRSTRFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGK

Query:  LASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNG
        + + MIS LGR G ++    VF+   S+G   +VF+++ALI+AYG++G ++ ++++ + MK+  + P+++TYN VI+AC +GG++++ ++ +F EM   G
Subjt:  LASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNG

Query:  VQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNE
        +QPD +TYN+LL+ C+  GL + A+ +F  M D GI  D+ TY+ L++   K  +++   +++ EM +   LP++ +Y+ + + YAK+G +++A+ ++++
Subjt:  VQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNE

Query:  MKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAM
        M+  G   +  +Y+ LL+++ + GR++D   +  EM SS    DA TYN L++ +G+ G F EV  +F +M ++ + P++ TY  +I    KG L+E+A 
Subjt:  MKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAM

Query:  EVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENW
        ++ +      +      Y+ +I A  +  L + A+   + M + G  P++ T++S++ +F R                ++SE+   +L   VD     N 
Subjt:  EVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENW

Query:  DDDHVFKFYQQLVSEKEG-----------PAKKERLGKEE--IKSILSV-------------FKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEE
        D      F  Q+ + K+G             +K R   +E  ++++LSV             F++M   +I P+++ +  +L    + +  ++ + LLEE
Subjt:  DDDHVFKFYQQLVSEKEG-----------PAKKERLGKEE--IKSILSV-------------FKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEE

Query:  -LRLFDNQVYGVAHGLLMG--FSENVWIQAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCL----DLHLMSSGA
         L    + ++ V   ++ G    ++ W   +Y+ D++          FYNAL D LW  GQK  A  V+ E  +R ++  L+  + L    D+H MS G 
Subjt:  -LRLFDNQVYGVAHGLLMG--FSENVWIQAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCL----DLHLMSSGA

Query:  ARAMVHAWLLGIHSVVFNGHQLPKL
            +  WL  I+ ++  G  LP+L
Subjt:  ARAMVHAWLLGIHSVVFNGHQLPKL

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic5.6e-28565.71Show/hide
Query:  QTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMA
        Q+ ++      NHRQ  RQN ++      +   P    P      S +T     LSQ PNF     L T KS+L+S+FSGRRSTRFVSK HFGR K++MA
Subjt:  QTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMA

Query:  TRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFET
        TRH++ AE+ L  A+ F  DD    +++L FESKLCGS+D T+++RELGNR EC KA+  +EFA+ RE RKNE+GKLASAMISTLGR GKV +AK +FET
Subjt:  TRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFET

Query:  ALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAA
        A + GYGNTV+AFSALISAYG+SG  ++AI VF SMK+ GL+PNLVTYNAVIDACGKGG+EFK+V + F+EM RNGVQPDRIT+NSLLAVCSRGGLWEAA
Subjt:  ALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAA

Query:  QNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLG
        +NLF+EM +R I+QD+F+YNTLLDA+CKGGQMDLAFEI+ +MP K+I+PNVV+YST+ DG+AKAGR ++ALNL+ EM++LGI LDRVSYNTLLSIY K+G
Subjt:  QNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLG

Query:  RFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINA
        R E+AL++ REM S G+KKD VTYNALL GYGKQGK++EV +VF EMK++ V PNLLTYSTLID YSKG LY+EAME+FREFK AGL+ADVVLYS LI+A
Subjt:  RFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINA

Query:  LCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQLVSEKEGPAKKE-
        LCKNGLV SAVSL+DEM KEGI PNVVTYNSIIDAFGRS+T +   D     S   S S    L E          + + V + + QL +E      K+ 
Subjt:  LCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQLVSEKEGPAKKE-

Query:  RLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL
          G +E+  IL VF+KMH+LEIKPNVVTFSAILNACSRC S E+ASMLLEELRLFDN+VYGV HGLLMG  ENVW+QAQ LFD+V +MD STASAFYNAL
Subjt:  RLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL

Query:  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        TDMLWHFGQKRGA+LV LEG+ R+VWE +WSDSCLDLHLMSSGAARAMVHAWLL I S+V+ GH+LPK+L
Subjt:  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

Arabidopsis top hitse value%identityAlignment
AT1G74850.1 plastid transcriptionally active 23.5e-8027.17Show/hide
Query:  NFSGRRSTRFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGK
        +FSG+   +       G P  S+     +   E L   L      GS+   L  F++KL    D+  + +E   RG+  +++R F++ + R+        
Subjt:  NFSGRRSTRFVSKFHFGRPKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGK

Query:  LASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNG
        + + MIS LGR G ++    VF+   S+G   +VF+++ALI+AYG++G ++ ++++ + MK+  + P+++TYN VI+AC +GG++++ ++ +F EM   G
Subjt:  LASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNG

Query:  VQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNE
        +QPD +TYN+LL+ C+  GL + A+ +F  M D GI  D+ TY+ L++   K  +++   +++ EM +   LP++ +Y+ + + YAK+G +++A+ ++++
Subjt:  VQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNE

Query:  MKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAM
        M+  G   +  +Y+ LL+++ + GR++D   +  EM SS    DA TYN L++ +G+ G F EV  +F +M ++ + P++ TY  +I    KG L+E+A 
Subjt:  MKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAM

Query:  EVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENW
        ++ +      +      Y+ +I A  +  L + A+   + M + G  P++ T++S++ +F R                ++SE+   +L   VD     N 
Subjt:  EVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENW

Query:  DDDHVFKFYQQLVSEKEG-----------PAKKERLGKEE--IKSILSV-------------FKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEE
        D      F  Q+ + K+G             +K R   +E  ++++LSV             F++M   +I P+++ +  +L    + +  ++ + LLEE
Subjt:  DDDHVFKFYQQLVSEKEG-----------PAKKERLGKEE--IKSILSV-------------FKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEE

Query:  -LRLFDNQVYGVAHGLLMG--FSENVWIQAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCL----DLHLMSSGA
         L    + ++ V   ++ G    ++ W   +Y+ D++          FYNAL D LW  GQK  A  V+ E  +R ++  L+  + L    D+H MS G 
Subjt:  -LRLFDNQVYGVAHGLLMG--FSENVWIQAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCL----DLHLMSSGA

Query:  ARAMVHAWLLGIHSVVFNGHQLPKL
            +  WL  I+ ++  G  LP+L
Subjt:  ARAMVHAWLLGIHSVVFNGHQLPKL

AT2G31400.1 genomes uncoupled 14.0e-28665.71Show/hide
Query:  QTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMA
        Q+ ++      NHRQ  RQN ++      +   P    P      S +T     LSQ PNF     L T KS+L+S+FSGRRSTRFVSK HFGR K++MA
Subjt:  QTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGRPKSSMA

Query:  TRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFET
        TRH++ AE+ L  A+ F  DD    +++L FESKLCGS+D T+++RELGNR EC KA+  +EFA+ RE RKNE+GKLASAMISTLGR GKV +AK +FET
Subjt:  TRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFET

Query:  ALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAA
        A + GYGNTV+AFSALISAYG+SG  ++AI VF SMK+ GL+PNLVTYNAVIDACGKGG+EFK+V + F+EM RNGVQPDRIT+NSLLAVCSRGGLWEAA
Subjt:  ALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAA

Query:  QNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLG
        +NLF+EM +R I+QD+F+YNTLLDA+CKGGQMDLAFEI+ +MP K+I+PNVV+YST+ DG+AKAGR ++ALNL+ EM++LGI LDRVSYNTLLSIY K+G
Subjt:  QNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLG

Query:  RFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINA
        R E+AL++ REM S G+KKD VTYNALL GYGKQGK++EV +VF EMK++ V PNLLTYSTLID YSKG LY+EAME+FREFK AGL+ADVVLYS LI+A
Subjt:  RFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINA

Query:  LCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQLVSEKEGPAKKE-
        LCKNGLV SAVSL+DEM KEGI PNVVTYNSIIDAFGRS+T +   D     S   S S    L E          + + V + + QL +E      K+ 
Subjt:  LCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQLVSEKEGPAKKE-

Query:  RLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL
          G +E+  IL VF+KMH+LEIKPNVVTFSAILNACSRC S E+ASMLLEELRLFDN+VYGV HGLLMG  ENVW+QAQ LFD+V +MD STASAFYNAL
Subjt:  RLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNAL

Query:  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL
        TDMLWHFGQKRGA+LV LEG+ R+VWE +WSDSCLDLHLMSSGAARAMVHAWLL I S+V+ GH+LPK+L
Subjt:  TDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLGIHSVVFNGHQLPKLL

AT3G18110.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-5429.26Show/hide
Query:  DYTFLLRELGNRGECW-KAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKD
        DY F+++ +G   E W +A+  FE+  +R    +   ++ +A++  LGR  +  LA  +F T      G+ V  ++A++  Y +SG F +A ++ ++M+ 
Subjt:  DYTFLLRELGNRGECW-KAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSALISAYGKSGYFDQAIKVFESMKD

Query:  SGLKPNLVTYNAVIDACGK-GGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFE
         G  P+L+++N +I+A  K GG+     VE+ + +  +G++PD ITYN+LL+ CSR    + A  +F +M       D++TYN ++    + G    A  
Subjt:  SGLKPNLVTYNAVIDACGK-GGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFE

Query:  IMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREM-GSSGVKKDAVTYNALLDGYGKQGK
        + +E+  K   P+ VTY+++   +A+    E    +Y +M+ +G G D ++YNT++ +Y K G+ + AL + ++M G SG   DA+TY  L+D  GK  +
Subjt:  IMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREM-GSSGVKKDAVTYNALLDGYGKQGK

Query:  FNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAF
          E   +  EM    + P L TYS LI  Y+K    EEA + F    ++G K D + YS +++ L +      A  L  +MI +G  P+   Y  +I   
Subjt:  FNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAF

Query:  GRSSTAESLVDAVGASSERQSESP---SFMLIEG
         + + ++ +   +    E    +P   S +L++G
Subjt:  GRSSTAESLVDAVGASSERQSESP---SFMLIEG

AT5G02860.1 Pentatricopeptide repeat (PPR) superfamily protein5.6e-7027.1Show/hide
Query:  KPLPPTPGHNATKSTSTSTSTPLSQSPNFP--------SLCSLPTSKSELASNFSGRRSTRFVSKF---HFGRPKSSMATRHTAIAEEVLHQALQFGKDD
        KP      H+ +++   +TS  LS  P  P         L   P S+  ++S  S  R+   + K    + G+P S      +   ++VL   ++   D 
Subjt:  KPLPPTPGHNATKSTSTSTSTPLSQSPNFP--------SLCSLPTSKSELASNFSGRRSTRFVSKF---HFGRPKSSMATRHTAIAEEVLHQALQFGKDD

Query:  GSLDNILLHF-----ESKLCGSEDYTFLLRELGNRGECWKAIRCFE-FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSA
        G LD++L        +     S +    L+ LG   +   A+R F+ F   ++ +      + + +IS LG+ G+V  A  +F     +G+   V+++++
Subjt:  GSLDNILLHF-----ESKLCGSEDYTFLLRELGNRGECWKAIRCFE-FALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSEGYGNTVFAFSA

Query:  LISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQD
        LISA+  SG + +A+ VF+ M++ G KP L+TYN +++  GK G  + K+  + E+M  +G+ PD  TYN+L+  C RG L + A  +F EM   G   D
Subjt:  LISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQD

Query:  IFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSS
          TYN LLD   K  +   A +++ EM      P++VTY+++   YA+ G L++A+ L N+M   G   D  +Y TLLS + + G+ E A+++  EM ++
Subjt:  IFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSS

Query:  GVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLD
        G K +  T+NA +  YG +GKF E+ ++F E+    +SP+++T++TL+ V+ +  +  E   VF+E K+AG   +   ++ LI+A  + G  + A+++  
Subjt:  GVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLD

Query:  EMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEG-VDESEIENWDDDHVFKFYQQLVSEKEGPAKKERLGKEEIKSILSVFK
         M+  G+ P++ TYN+++ A  R    E            QSE     + +G    +E+      H +                     +EI  + S+ +
Subjt:  EMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEG-VDESEIENWDDDHVFKFYQQLVSEKEGPAKKERLGKEEIKSILSVFK

Query:  KMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEEL--RLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNALTDM
        +++   I+P  V    ++  CS+C  + EA     EL  R F   +  +   + +     +  +A  + D +K+   + + A YN+L  M
Subjt:  KMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEEL--RLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNALTDM

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.5e-5428.33Show/hide
Query:  ELAKGVFETALSEGYG---NTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLL
        E A  VF+ +L E Y    +T   F  ++ +Y +    D+A+ +    +  G  P +++YNAV+DA  +          +F+EML + V P+  TYN L+
Subjt:  ELAKGVFETALSEGYG---NTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLL

Query:  AVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVS
              G  + A  LF++M  +G   ++ TYNTL+D  CK  ++D  F+++  M  K + PN+++Y+ + +G  + GR+++   +  EM   G  LD V+
Subjt:  AVCSRGGLWEAAQNLFNEMIDRGIDQDIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVS

Query:  YNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLK
        YNTL+  Y K G F  AL +  EM   G+    +TY +L+    K G  N       +M+   + PN  TY+TL+D +S+     EA  V RE    G  
Subjt:  YNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTYNALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLK

Query:  ADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQL
          VV Y+ LIN  C  G ++ A+++L++M ++G+ P+VV+Y++++  F RS   +  +  V      +   P  +    + +   E          Y+++
Subjt:  ADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIIDAFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQL

Query:  VSEKEGPAK-------KERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLF
        +     P +            + +++  L +  +M E  + P+VVT+S ++N  ++     EA  LL  L+LF
Subjt:  VSEKEGPAK-------KERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEASMLLEELRLF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTACTCCACCACACTGTTCAATCACCACGGCGAAGCCCTATCAAACTCATCAATACCCACAAAATAACCTGAAAAACCATCGTCAAAATCCCCGCCAGAATGG
CTCTTGGACGACGACCCACAAGGTTTCTCTTGTCAAGCCATTACCCCCAACTCCCGGTCATAATGCGACTAAATCGACTTCTACTTCTACTTCTACTCCGCTTTCTCAAA
GCCCTAATTTCCCATCTCTCTGCTCTCTCCCCACCTCGAAATCCGAGCTCGCTTCCAACTTTTCTGGCCGCCGATCAACTCGTTTCGTCTCTAAGTTTCACTTTGGACGA
CCCAAATCCTCCATGGCCACTCGTCACACTGCGATTGCTGAGGAGGTTTTGCACCAGGCGCTGCAGTTTGGTAAGGACGATGGGAGTTTGGATAATATTTTGCTTCATTT
CGAGTCTAAGCTTTGTGGGTCGGAAGATTATACGTTTTTGCTTCGGGAACTTGGGAATAGAGGTGAATGTTGGAAAGCGATTCGATGCTTTGAGTTTGCGCTTGTTAGAG
AAGGAAGGAAAAATGAACGAGGTAAATTGGCTAGTGCGATGATTAGTACGCTTGGTAGGCTTGGGAAAGTGGAACTTGCTAAGGGGGTTTTTGAGACAGCATTGAGTGAG
GGGTATGGGAACACTGTTTTTGCATTCTCAGCTTTGATAAGTGCTTATGGGAAAAGTGGTTACTTTGATCAGGCTATTAAGGTGTTTGAATCGATGAAAGATTCAGGATT
GAAGCCAAATTTGGTTACTTATAATGCAGTAATTGATGCATGTGGAAAAGGAGGAGTAGAATTTAAGAAAGTGGTGGAGATTTTTGAAGAAATGTTGAGGAATGGGGTCC
AACCTGATAGAATTACCTATAACTCACTTCTTGCTGTGTGTAGTCGAGGAGGGTTGTGGGAGGCAGCTCAAAACTTGTTTAATGAGATGATAGATAGAGGGATTGATCAG
GATATATTTACTTATAATACGCTTTTGGATGCGGTTTGCAAAGGTGGGCAGATGGATTTGGCCTTTGAGATTATGTTAGAAATGCCTGCAAAGAAAATACTGCCTAATGT
GGTTACTTACAGTACAATGGCTGATGGATATGCCAAGGCTGGTAGATTGGAAGATGCACTAAACTTGTACAATGAAATGAAGTTTCTGGGAATTGGGTTGGATAGGGTTT
CATATAATACATTGCTTTCGATCTATGCCAAGCTTGGCAGGTTTGAAGATGCTCTTAATGTTTGCAGAGAGATGGGGAGTTCTGGTGTTAAAAAGGATGCTGTTACTTAC
AATGCACTTCTAGATGGATATGGAAAGCAGGGGAAGTTTAATGAAGTGACTAGAGTATTTAAGGAGATGAAAAAAGACCGTGTATCCCCTAATTTGTTGACGTATTCTAC
CTTAATTGATGTATACTCTAAAGGTAGTCTATATGAGGAGGCAATGGAGGTCTTTCGTGAATTCAAGCAGGCTGGATTGAAGGCTGATGTAGTTCTCTATAGTGAACTCA
TCAATGCTTTATGTAAAAATGGTTTAGTGGATTCTGCTGTATCCTTGCTTGATGAGATGATAAAAGAAGGGATTAGGCCTAATGTTGTCACTTACAATTCCATAATTGAT
GCCTTTGGTCGTTCTTCAACAGCAGAATCTCTAGTTGATGCTGTTGGTGCATCTAGTGAAAGGCAAAGTGAATCCCCATCTTTCATGTTGATTGAGGGTGTGGATGAGAG
TGAGATTGAAAATTGGGATGATGATCATGTCTTCAAATTTTATCAGCAGCTTGTTTCTGAGAAAGAAGGACCTGCAAAGAAAGAAAGACTAGGCAAAGAAGAGATCAAGT
CCATCTTGAGTGTCTTCAAGAAGATGCATGAGCTGGAAATAAAACCAAATGTTGTCACCTTTTCAGCGATTCTAAATGCATGCAGCCGCTGCAAATCGATTGAAGAAGCT
TCAATGTTATTGGAAGAGCTCCGGTTATTTGATAATCAAGTCTATGGTGTAGCTCACGGACTCCTTATGGGCTTTAGCGAAAATGTGTGGATTCAAGCACAATATCTGTT
TGATGAAGTGAAGCAGATGGACTCTTCCACTGCATCTGCTTTCTACAATGCTTTGACAGATATGCTGTGGCATTTTGGTCAGAAACGAGGGGCCCAATTGGTCGTACTTG
AAGGAAAAAGACGCAAAGTATGGGAAACTTTATGGTCCGATTCTTGTTTAGATTTGCACCTCATGTCTTCTGGAGCTGCTCGCGCCATGGTTCATGCTTGGTTGCTGGGT
ATTCATTCTGTTGTATTTAATGGCCATCAGTTGCCAAAGTTATTAAGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTACTCCACCACACTGTTCAATCACCACGGCGAAGCCCTATCAAACTCATCAATACCCACAAAATAACCTGAAAAACCATCGTCAAAATCCCCGCCAGAATGG
CTCTTGGACGACGACCCACAAGGTTTCTCTTGTCAAGCCATTACCCCCAACTCCCGGTCATAATGCGACTAAATCGACTTCTACTTCTACTTCTACTCCGCTTTCTCAAA
GCCCTAATTTCCCATCTCTCTGCTCTCTCCCCACCTCGAAATCCGAGCTCGCTTCCAACTTTTCTGGCCGCCGATCAACTCGTTTCGTCTCTAAGTTTCACTTTGGACGA
CCCAAATCCTCCATGGCCACTCGTCACACTGCGATTGCTGAGGAGGTTTTGCACCAGGCGCTGCAGTTTGGTAAGGACGATGGGAGTTTGGATAATATTTTGCTTCATTT
CGAGTCTAAGCTTTGTGGGTCGGAAGATTATACGTTTTTGCTTCGGGAACTTGGGAATAGAGGTGAATGTTGGAAAGCGATTCGATGCTTTGAGTTTGCGCTTGTTAGAG
AAGGAAGGAAAAATGAACGAGGTAAATTGGCTAGTGCGATGATTAGTACGCTTGGTAGGCTTGGGAAAGTGGAACTTGCTAAGGGGGTTTTTGAGACAGCATTGAGTGAG
GGGTATGGGAACACTGTTTTTGCATTCTCAGCTTTGATAAGTGCTTATGGGAAAAGTGGTTACTTTGATCAGGCTATTAAGGTGTTTGAATCGATGAAAGATTCAGGATT
GAAGCCAAATTTGGTTACTTATAATGCAGTAATTGATGCATGTGGAAAAGGAGGAGTAGAATTTAAGAAAGTGGTGGAGATTTTTGAAGAAATGTTGAGGAATGGGGTCC
AACCTGATAGAATTACCTATAACTCACTTCTTGCTGTGTGTAGTCGAGGAGGGTTGTGGGAGGCAGCTCAAAACTTGTTTAATGAGATGATAGATAGAGGGATTGATCAG
GATATATTTACTTATAATACGCTTTTGGATGCGGTTTGCAAAGGTGGGCAGATGGATTTGGCCTTTGAGATTATGTTAGAAATGCCTGCAAAGAAAATACTGCCTAATGT
GGTTACTTACAGTACAATGGCTGATGGATATGCCAAGGCTGGTAGATTGGAAGATGCACTAAACTTGTACAATGAAATGAAGTTTCTGGGAATTGGGTTGGATAGGGTTT
CATATAATACATTGCTTTCGATCTATGCCAAGCTTGGCAGGTTTGAAGATGCTCTTAATGTTTGCAGAGAGATGGGGAGTTCTGGTGTTAAAAAGGATGCTGTTACTTAC
AATGCACTTCTAGATGGATATGGAAAGCAGGGGAAGTTTAATGAAGTGACTAGAGTATTTAAGGAGATGAAAAAAGACCGTGTATCCCCTAATTTGTTGACGTATTCTAC
CTTAATTGATGTATACTCTAAAGGTAGTCTATATGAGGAGGCAATGGAGGTCTTTCGTGAATTCAAGCAGGCTGGATTGAAGGCTGATGTAGTTCTCTATAGTGAACTCA
TCAATGCTTTATGTAAAAATGGTTTAGTGGATTCTGCTGTATCCTTGCTTGATGAGATGATAAAAGAAGGGATTAGGCCTAATGTTGTCACTTACAATTCCATAATTGAT
GCCTTTGGTCGTTCTTCAACAGCAGAATCTCTAGTTGATGCTGTTGGTGCATCTAGTGAAAGGCAAAGTGAATCCCCATCTTTCATGTTGATTGAGGGTGTGGATGAGAG
TGAGATTGAAAATTGGGATGATGATCATGTCTTCAAATTTTATCAGCAGCTTGTTTCTGAGAAAGAAGGACCTGCAAAGAAAGAAAGACTAGGCAAAGAAGAGATCAAGT
CCATCTTGAGTGTCTTCAAGAAGATGCATGAGCTGGAAATAAAACCAAATGTTGTCACCTTTTCAGCGATTCTAAATGCATGCAGCCGCTGCAAATCGATTGAAGAAGCT
TCAATGTTATTGGAAGAGCTCCGGTTATTTGATAATCAAGTCTATGGTGTAGCTCACGGACTCCTTATGGGCTTTAGCGAAAATGTGTGGATTCAAGCACAATATCTGTT
TGATGAAGTGAAGCAGATGGACTCTTCCACTGCATCTGCTTTCTACAATGCTTTGACAGATATGCTGTGGCATTTTGGTCAGAAACGAGGGGCCCAATTGGTCGTACTTG
AAGGAAAAAGACGCAAAGTATGGGAAACTTTATGGTCCGATTCTTGTTTAGATTTGCACCTCATGTCTTCTGGAGCTGCTCGCGCCATGGTTCATGCTTGGTTGCTGGGT
ATTCATTCTGTTGTATTTAATGGCCATCAGTTGCCAAAGTTATTAAGGTAA
Protein sequenceShow/hide protein sequence
MASTPPHCSITTAKPYQTHQYPQNNLKNHRQNPRQNGSWTTTHKVSLVKPLPPTPGHNATKSTSTSTSTPLSQSPNFPSLCSLPTSKSELASNFSGRRSTRFVSKFHFGR
PKSSMATRHTAIAEEVLHQALQFGKDDGSLDNILLHFESKLCGSEDYTFLLRELGNRGECWKAIRCFEFALVREGRKNERGKLASAMISTLGRLGKVELAKGVFETALSE
GYGNTVFAFSALISAYGKSGYFDQAIKVFESMKDSGLKPNLVTYNAVIDACGKGGVEFKKVVEIFEEMLRNGVQPDRITYNSLLAVCSRGGLWEAAQNLFNEMIDRGIDQ
DIFTYNTLLDAVCKGGQMDLAFEIMLEMPAKKILPNVVTYSTMADGYAKAGRLEDALNLYNEMKFLGIGLDRVSYNTLLSIYAKLGRFEDALNVCREMGSSGVKKDAVTY
NALLDGYGKQGKFNEVTRVFKEMKKDRVSPNLLTYSTLIDVYSKGSLYEEAMEVFREFKQAGLKADVVLYSELINALCKNGLVDSAVSLLDEMIKEGIRPNVVTYNSIID
AFGRSSTAESLVDAVGASSERQSESPSFMLIEGVDESEIENWDDDHVFKFYQQLVSEKEGPAKKERLGKEEIKSILSVFKKMHELEIKPNVVTFSAILNACSRCKSIEEA
SMLLEELRLFDNQVYGVAHGLLMGFSENVWIQAQYLFDEVKQMDSSTASAFYNALTDMLWHFGQKRGAQLVVLEGKRRKVWETLWSDSCLDLHLMSSGAARAMVHAWLLG
IHSVVFNGHQLPKLLR