; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi11G001096 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi11G001096
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionPentatricopeptide repeat
Genome locationchr11:37995036..37997664
RNA-Seq ExpressionBhi11G001096
SyntenyBhi11G001096
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606921.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.2e-26489.59Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  S  QSP+RYFSP PLSNF  H FS+ASENQSLNENV+TVFRII+SS SSA+MR SLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RRGFYH+AFS DTMLYILGR RKFEKIWDVL+D+KLKDRSLI+PRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSM DARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EMREMGV+PDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLR AFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMM   CLPNTQSCLFLMR FK+HE 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
        VEMAL+LWNDM++RGFGSYILVSEELFD LCDLGKLIEAE+CFLQMVDKGHKPS VSFKRI+VLMELANKHE LQNLSK M+ FLDP KRLPETM+  TD
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSN NS  C
Subjt:  LSNSNSFQC

XP_008447458.1 PREDICTED: putative pentatricopeptide repeat-containing protein At1g02420 isoform X1 [Cucumis melo]2.7e-26489.78Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  +S QSPVRYFSPIPLSNFF H FSSA+ NQSLNEN+ETVFRIIT+SPSS DM+ SLKSSRVFLSNELIDGVLKRVRFSHGNPLQ LEFFNYT  
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RRGFYHT++SVDTMLYILGRGRKF  IWDVLVD+KLKD SLIS RTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EM EMGVKPDVVSYN LVDVYCKNREMDKAFKV+EKMRDEDI ADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNA IRNFCIAKRL  AFDL+DEMVNKGLSPNATTYNLFFRIFFWSNDL+S+WNLYRRMMDT CLPNTQSCLFL+RLFKK+E 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
         +MALELWNDMIQ+GFGSYILVSEELFD LCDLGKLIEAE CFLQMVDKGHKPS+ SFKRIKVLMELANKHE LQNLSKKMDGFLDPQKRLP TMSYS D
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSNSNS  C
Subjt:  LSNSNSFQC

XP_022948942.1 putative pentatricopeptide repeat-containing protein At1g02420 [Cucurbita moschata]5.3e-26589.59Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  S  QSP+RYFSP PLSNF  H FS+ASENQSLNENV+TVFRII+SS SSA+MR SLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RRGFYH+AFS DTMLYILGR RKFEKIWDVL+D+KLKDRSLI+PRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSM DARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EMREMGV+PDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLR AFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMMD  CLPNTQSCLFLMR FK+HE 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
        VEMAL+LWNDM++RGFGSYILVSEELFD LCDLGKLIEAE+CFLQMVDKGHKPSNVSFKRIKVLMELANKHE LQNLSK M+ FLDP + LPETM+   D
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSN NS  C
Subjt:  LSNSNSFQC

XP_023522207.1 putative pentatricopeptide repeat-containing protein At1g02420 [Cucurbita pepo subsp. pepo]1.2e-26489.19Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  S  QSP+RYFSP PLSNF  H FS+ASENQSLNENV+TVFRII+SS SSA+MR SLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RRGFYH+AFS DTMLYILGR RKFEKIWDVL+D+KLKDRSLI+PRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSM DARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EMREMGV+PDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLR AFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMMD  CLPNTQSCLFLMR FK+HE 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
        ++MAL+LWNDM++RGFGSYILVSEELFD LCDLGKLIEAE+CFLQMVDKGHKPSNVSFKRIKVLMELANKHE LQNLSK M+ FLDP +RLPE M+   D
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSN NS  C
Subjt:  LSNSNSFQC

XP_038903108.1 putative pentatricopeptide repeat-containing protein At1g02420 [Benincasa hispida]3.7e-298100Show/hide
Query:  MSRKPTMILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEF
        MSRKPTMILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEF
Subjt:  MSRKPTMILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEF

Query:  FNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKS
        FNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKS
Subjt:  FNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKS

Query:  MMDARNVYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIG
        MMDARNVYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIG
Subjt:  MMDARNVYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIG

Query:  QPDKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRL
        QPDKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRL
Subjt:  QPDKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRL

Query:  FKKHENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPET
        FKKHENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPET
Subjt:  FKKHENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPET

Query:  MSYSTDLSNSNSFQC
        MSYSTDLSNSNSFQC
Subjt:  MSYSTDLSNSNSFQC

TrEMBL top hitse value%identityAlignment
A0A1S3BI42 putative pentatricopeptide repeat-containing protein At1g02420 isoform X11.3e-26489.78Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  +S QSPVRYFSPIPLSNFF H FSSA+ NQSLNEN+ETVFRIIT+SPSS DM+ SLKSSRVFLSNELIDGVLKRVRFSHGNPLQ LEFFNYT  
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RRGFYHT++SVDTMLYILGRGRKF  IWDVLVD+KLKD SLIS RTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EM EMGVKPDVVSYN LVDVYCKNREMDKAFKV+EKMRDEDI ADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNA IRNFCIAKRL  AFDL+DEMVNKGLSPNATTYNLFFRIFFWSNDL+S+WNLYRRMMDT CLPNTQSCLFL+RLFKK+E 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
         +MALELWNDMIQ+GFGSYILVSEELFD LCDLGKLIEAE CFLQMVDKGHKPS+ SFKRIKVLMELANKHE LQNLSKKMDGFLDPQKRLP TMSYS D
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSNSNS  C
Subjt:  LSNSNSFQC

A0A5A7T8Z7 Putative pentatricopeptide repeat-containing protein1.3e-26489.78Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  +S QSPVRYFSPIPLSNFF H FSSA+ NQSLNEN+ETVFRIIT+SPSS DM+ SLKSSRVFLSNELIDGVLKRVRFSHGNPLQ LEFFNYT  
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RRGFYHT++SVDTMLYILGRGRKF  IWDVLVD+KLKD SLIS RTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EM EMGVKPDVVSYN LVDVYCKNREMDKAFKV+EKMRDEDI ADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNA IRNFCIAKRL  AFDL+DEMVNKGLSPNATTYNLFFRIFFWSNDL+S+WNLYRRMMDT CLPNTQSCLFL+RLFKK+E 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
         +MALELWNDMIQ+GFGSYILVSEELFD LCDLGKLIEAE CFLQMVDKGHKPS+ SFKRIKVLMELANKHE LQNLSKKMDGFLDPQKRLP TMSYS D
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSNSNS  C
Subjt:  LSNSNSFQC

A0A6J1DVB2 putative pentatricopeptide repeat-containing protein At1g02420 isoform X19.9e-25789.52Show/hide
Query:  RYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDT
        RYF PIPLSN FSH FSSA+ENQSLN  VET+FRII+SS SS +MR SLKS+RVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVD+
Subjt:  RYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDT

Query:  MLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNL
        MLYILGR RKFEKIWDVLVD+KLKDRSLI+PRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDV CFNALLRTLCQEKSM DARNVYH +KSKFRPNL
Subjt:  MLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNL

Query:  QTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYP
        QTFNILLSGWKSSEEAEGFF+EMREMGVKPDVVSYNCLVDVYCKNREMDKA+KV+E+M+DEDI ADVITYTS+IGGLGLIGQPDKARNILKEMKEYGCYP
Subjt:  QTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYP

Query:  DVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQ
        DVAAYNAAIRNFCIAKRLR AFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDT CLPNTQSCLFLMRLFKKHE VEMALELWNDM+ 
Subjt:  DVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQ

Query:  RGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTDLSNSNSFQC
        RGFGSYILVSEELFD L DLGKL+EAE CFLQMVDKGHKPSNVSFKRIKVLMELANKHE LQNL+KKM+GF +P K LPETMS  TD    +SF C
Subjt:  RGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTDLSNSNSFQC

A0A6J1GAM6 putative pentatricopeptide repeat-containing protein At1g024202.6e-26589.59Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  S  QSP+RYFSP PLSNF  H FS+ASENQSLNENV+TVFRII+SS SSA+MR SLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RRGFYH+AFS DTMLYILGR RKFEKIWDVL+D+KLKDRSLI+PRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSM DARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EMREMGV+PDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLR AFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMMD  CLPNTQSCLFLMR FK+HE 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
        VEMAL+LWNDM++RGFGSYILVSEELFD LCDLGKLIEAE+CFLQMVDKGHKPSNVSFKRIKVLMELANKHE LQNLSK M+ FLDP + LPETM+   D
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSN NS  C
Subjt:  LSNSNSFQC

A0A6J1KCH6 putative pentatricopeptide repeat-containing protein At1g024201.6e-26289Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR
        MILR  S  QSP+RYFSP PLSNF  H FS+ASENQSLNENVETVFRII+SS SSA+MR SLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYT R
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGR

Query:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN
        RR FYH+AFS+DTMLYILGR RKFEKIWDVL+D+KLKDRSLI+PRTVMVVLGRIAKVCSVRQTVESFRK KKFVPEFDVTCFNALLRTLCQEKSM DARN
Subjt:  RRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARN

Query:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR
        VYH LKSKFRPNLQTFNILLSGWKSSEEAEGFF+EMREMGV+PDVVSYNCL+DVYCKNREMDKAFKVIEKMRDEDIAADVITYTS+IGGLGLIGQPDKAR
Subjt:  VYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKAR

Query:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN
        NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLR AFDLMDEM NKGL+PNATTYNLFFRIFFWSNDLQSSWNLYRRMMD  CLPNTQSCLFLMR FK+HE 
Subjt:  NILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHEN

Query:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD
        VEMAL+LWNDM++RGFGSYILVSEELFD LCDLGKLIEAE+CFLQMVDKGHKPSNVSFKRIKVLMELANKHE LQNLSK M+ FLDP K LP+TM+   D
Subjt:  VEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTD

Query:  LSNSNSFQC
        LSN NS  C
Subjt:  LSNSNSFQC

SwissProt top hitse value%identityAlignment
Q9C9A2 Pentatricopeptide repeat-containing protein At1g71060, mitochondrial7.0e-5026.94Show/hide
Query:  NFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGR
        +F + S  +       +++ E + +I+T    S  +   L  + V LS  LI+ VLK++  S+   L AL  F +   ++GF HT  + + ++  LG+ +
Subjt:  NFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGR

Query:  KFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSK-FRPNLQTFNILLS
        +F+ IW ++ D+K K   L+S  T  ++  R A+   V++ + +F K ++F  + + + FN +L TL + +++ DA+ V+  +K K F P+++++ ILL 
Subjt:  KFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSK-FRPNLQTFNILLS

Query:  GWKSS---EEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAY
        GW         +    EM++ G +PDVV+Y  +++ +CK ++ ++A +   +M   +       + S+I GLG   + + A    +  K  G   +   Y
Subjt:  GWKSS---EEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAY

Query:  NAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGS
        NA +  +C ++R+  A+  +DEM  KG+ PNA TY++           + ++ +Y+ M    C P   +   ++R+F   E ++MA+++W++M  +G   
Subjt:  NAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGS

Query:  YILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD
         + +   L   LC   KL EA   F +M+D G +P    F R+K  +    + + + +L  KMD
Subjt:  YILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD

Q9FVX2 Pentatricopeptide repeat-containing protein At1g77360, mitochondrial2.0e-4924.29Show/hide
Query:  FSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIW
        +SS+ + + + +  + + +++ SSP    +  +L  S + +S E+++ VL R R      L    FF ++ ++R + H+  +   M+    + R+++ +W
Subjt:  FSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIW

Query:  DVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW---KS
        D++    ++ + +++  T  +V+ + A+   V + + +F   +K+    ++  FN LL  LC+ K++  A+ V+ +++ +F P+ +T++ILL GW    +
Subjt:  DVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW---KS

Query:  SEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAYNAAIRNF
          +A   F EM + G  PD+V+Y+ +VD+ CK   +D+A  ++  M           Y+ ++   G   + ++A +   EM+  G   DVA +N+ I  F
Subjt:  SEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAYNAAIRNF

Query:  CIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGSYILVSEE
        C A R++  + ++ EM +KG++PN+ + N+  R      +   +++++R+M+   C P+  +   ++++F + + +E A ++W  M ++G    +     
Subjt:  CIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGSYILVSEE

Query:  LFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD
        L + LC+     +A     +M++ G +PS V+F R++ L+    + +VL+ L++KM+
Subjt:  LFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD

Q9FZ19 Putative pentatricopeptide repeat-containing protein At1g024201.1e-18566.19Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNE---NVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNY
        M++  P SS     + S   LS  F HS + +     + E   + ETVFR+I  S    ++++SL SS + LS +LID VLKRVRFSHGNP+Q LEF+ Y
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNE---NVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNY

Query:  TGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVTCFNALLRTLCQEKSMM
            RGFYH++FS+DTMLYILGR RKF++IW++L++ K KDRSLISPRT+ VVLGR+AK+CSVRQTVESF KFK+ VP+ FD  CFNALLRTLCQEKSM 
Subjt:  TGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVTCFNALLRTLCQEKSMM

Query:  DARNVYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQP
        DARNVYHSLK +F+P+LQTFNILLSGWKSSEEAE FF EM+  G+KPDVV+YN L+DVYCK+RE++KA+K+I+KMR+E+   DVITYT+VIGGLGLIGQP
Subjt:  DARNVYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQP

Query:  DKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFK
        DKAR +LKEMKEYGCYPDVAAYNAAIRNFCIA+RL  A  L+DEMV KGLSPNATTYNLFFR+   +NDL  SW LY RM+   CLPNTQSC+FL+++FK
Subjt:  DKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFK

Query:  KHENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKM
        +HE V+MA+ LW DM+ +GFGSY LVS+ L D LCDL K+ EAE+C L+MV+KGH+PSNVSFKRIK+LMELANKH+ + NL +KM
Subjt:  KHENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKM

Q9LFQ4 Pentatricopeptide repeat-containing protein At5g15010, mitochondrial5.2e-5330Show/hide
Query:  NQSLNENVETVFRIITSSPSS-ADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVD
        ++ ++E+V  + +++    S   ++R+ L+   V  SNEL+  +L RVR    +   A  FF + G+++G+  +     +M+ ILG+ RKF+  W ++ +
Subjt:  NQSLNENVETVFRIITSSPSS-ADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVD

Query:  IKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW----KSSEEA
        ++    SL++ +T+++++ +   V  V + + +F  +K+F  E  +  F +LL  LC+ K++ DA ++    K K+  + ++FNI+L+GW     S  EA
Subjt:  IKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW----KSSEEA

Query:  EGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEM-KEYGCYPDVAAYNAAIRNFCIA
        E  + EM  +GVK DVVSY+ ++  Y K   ++K  K+ ++M+ E I  D   Y +V+  L       +ARN++K M +E G  P+V  YN+ I+  C A
Subjt:  EGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEM-KEYGCYPDVAAYNAAIRNFCIA

Query:  KRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFG----SYILVSE
        ++   A  + DEM+ KGL P   TY+ F RI     ++   + L  +M    C P  ++ + L+R   +  + +  L LW++M ++  G    SYI++  
Subjt:  KRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFG----SYILVSE

Query:  ELFDFLCDLGKLIEAERCFLQMVDKGHKPS
         LF      GK+ EA   + +M DKG +P+
Subjt:  ELFDFLCDLGKLIEAERCFLQMVDKGHKPS

Q9M2C8 Pentatricopeptide repeat-containing protein At3g613609.1e-5830.63Show/hide
Query:  SSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSN---ELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGF
        SS S  R  +P+  S   S S SS S N++    +E +  II   P        + +  + LS+   E +  VL R+  +H N L+ALEFF Y+ +    
Subjt:  SSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSN---ELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGF

Query:  YHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKK--FVPEFDVTCFNALLRTLCQEKSMMDARNVY
          T+ S +  L+IL R R F++ W ++ +++    +L+S +++ ++L +IAK  S  +T+E+F K +K  F  +F V  FN LLR  C E+ M +AR+++
Subjt:  YHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKK--FVPEFDVTCFNALLRTLCQEKSMMDARNVY

Query:  HSLKSKFRPNLQTFNILLSGWKSSEE---AEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKA
          L S+F P+++T NILL G+K + +    E F++EM + G KP+ V+Y   +D +CK R   +A ++ E M   D    V   T++I G G+     KA
Subjt:  HSLKSKFRPNLQTFNILLSGWKSSEE---AEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKA

Query:  RNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSND--LQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKK
        R +  E+ + G  PD  AYNA + +      + GA  +M EM  KG+ P++ T++  F     S +         Y++M +   +P T + + LM+LF  
Subjt:  RNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSND--LQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKK

Query:  HENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLS---KKMDGFLDP
        +  V + L+LW  M+++G+  +    E L   LC   +  +A  C  Q V++G   S   ++ ++  +   N+ + L+ L    +K+  FL P
Subjt:  HENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLS---KKMDGFLDP

Arabidopsis top hitse value%identityAlignment
AT1G02420.1 Pentatricopeptide repeat (PPR) superfamily protein8.1e-18766.19Show/hide
Query:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNE---NVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNY
        M++  P SS     + S   LS  F HS + +     + E   + ETVFR+I  S    ++++SL SS + LS +LID VLKRVRFSHGNP+Q LEF+ Y
Subjt:  MILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNE---NVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNY

Query:  TGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVTCFNALLRTLCQEKSMM
            RGFYH++FS+DTMLYILGR RKF++IW++L++ K KDRSLISPRT+ VVLGR+AK+CSVRQTVESF KFK+ VP+ FD  CFNALLRTLCQEKSM 
Subjt:  TGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPE-FDVTCFNALLRTLCQEKSMM

Query:  DARNVYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQP
        DARNVYHSLK +F+P+LQTFNILLSGWKSSEEAE FF EM+  G+KPDVV+YN L+DVYCK+RE++KA+K+I+KMR+E+   DVITYT+VIGGLGLIGQP
Subjt:  DARNVYHSLKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQP

Query:  DKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFK
        DKAR +LKEMKEYGCYPDVAAYNAAIRNFCIA+RL  A  L+DEMV KGLSPNATTYNLFFR+   +NDL  SW LY RM+   CLPNTQSC+FL+++FK
Subjt:  DKARNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFK

Query:  KHENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKM
        +HE V+MA+ LW DM+ +GFGSY LVS+ L D LCDL K+ EAE+C L+MV+KGH+PSNVSFKRIK+LMELANKH+ + NL +KM
Subjt:  KHENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKM

AT1G71060.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.0e-5126.94Show/hide
Query:  NFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGR
        +F + S  +       +++ E + +I+T    S  +   L  + V LS  LI+ VLK++  S+   L AL  F +   ++GF HT  + + ++  LG+ +
Subjt:  NFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGR

Query:  KFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSK-FRPNLQTFNILLS
        +F+ IW ++ D+K K   L+S  T  ++  R A+   V++ + +F K ++F  + + + FN +L TL + +++ DA+ V+  +K K F P+++++ ILL 
Subjt:  KFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSK-FRPNLQTFNILLS

Query:  GWKSS---EEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAY
        GW         +    EM++ G +PDVV+Y  +++ +CK ++ ++A +   +M   +       + S+I GLG   + + A    +  K  G   +   Y
Subjt:  GWKSS---EEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAY

Query:  NAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGS
        NA +  +C ++R+  A+  +DEM  KG+ PNA TY++           + ++ +Y+ M    C P   +   ++R+F   E ++MA+++W++M  +G   
Subjt:  NAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGS

Query:  YILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD
         + +   L   LC   KL EA   F +M+D G +P    F R+K  +    + + + +L  KMD
Subjt:  YILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD

AT1G77360.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-5024.29Show/hide
Query:  FSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIW
        +SS+ + + + +  + + +++ SSP    +  +L  S + +S E+++ VL R R      L    FF ++ ++R + H+  +   M+    + R+++ +W
Subjt:  FSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIW

Query:  DVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW---KS
        D++    ++ + +++  T  +V+ + A+   V + + +F   +K+    ++  FN LL  LC+ K++  A+ V+ +++ +F P+ +T++ILL GW    +
Subjt:  DVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW---KS

Query:  SEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAYNAAIRNF
          +A   F EM + G  PD+V+Y+ +VD+ CK   +D+A  ++  M           Y+ ++   G   + ++A +   EM+  G   DVA +N+ I  F
Subjt:  SEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEYGCYPDVAAYNAAIRNF

Query:  CIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGSYILVSEE
        C A R++  + ++ EM +KG++PN+ + N+  R      +   +++++R+M+   C P+  +   ++++F + + +E A ++W  M ++G    +     
Subjt:  CIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFGSYILVSEE

Query:  LFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD
        L + LC+     +A     +M++ G +PS V+F R++ L+    + +VL+ L++KM+
Subjt:  LFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMD

AT3G61360.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.5e-5930.63Show/hide
Query:  SSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSN---ELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGF
        SS S  R  +P+  S   S S SS S N++    +E +  II   P        + +  + LS+   E +  VL R+  +H N L+ALEFF Y+ +    
Subjt:  SSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSN---ELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGF

Query:  YHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKK--FVPEFDVTCFNALLRTLCQEKSMMDARNVY
          T+ S +  L+IL R R F++ W ++ +++    +L+S +++ ++L +IAK  S  +T+E+F K +K  F  +F V  FN LLR  C E+ M +AR+++
Subjt:  YHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKK--FVPEFDVTCFNALLRTLCQEKSMMDARNVY

Query:  HSLKSKFRPNLQTFNILLSGWKSSEE---AEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKA
          L S+F P+++T NILL G+K + +    E F++EM + G KP+ V+Y   +D +CK R   +A ++ E M   D    V   T++I G G+     KA
Subjt:  HSLKSKFRPNLQTFNILLSGWKSSEE---AEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKA

Query:  RNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSND--LQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKK
        R +  E+ + G  PD  AYNA + +      + GA  +M EM  KG+ P++ T++  F     S +         Y++M +   +P T + + LM+LF  
Subjt:  RNILKEMKEYGCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSND--LQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKK

Query:  HENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLS---KKMDGFLDP
        +  V + L+LW  M+++G+  +    E L   LC   +  +A  C  Q V++G   S   ++ ++  +   N+ + L+ L    +K+  FL P
Subjt:  HENVEMALELWNDMIQRGFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLS---KKMDGFLDP

AT5G15010.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.7e-5430Show/hide
Query:  NQSLNENVETVFRIITSSPSS-ADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVD
        ++ ++E+V  + +++    S   ++R+ L+   V  SNEL+  +L RVR    +   A  FF + G+++G+  +     +M+ ILG+ RKF+  W ++ +
Subjt:  NQSLNENVETVFRIITSSPSS-ADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTGRRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVD

Query:  IKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW----KSSEEA
        ++    SL++ +T+++++ +   V  V + + +F  +K+F  E  +  F +LL  LC+ K++ DA ++    K K+  + ++FNI+L+GW     S  EA
Subjt:  IKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHSLKSKFRPNLQTFNILLSGW----KSSEEA

Query:  EGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEM-KEYGCYPDVAAYNAAIRNFCIA
        E  + EM  +GVK DVVSY+ ++  Y K   ++K  K+ ++M+ E I  D   Y +V+  L       +ARN++K M +E G  P+V  YN+ I+  C A
Subjt:  EGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEM-KEYGCYPDVAAYNAAIRNFCIA

Query:  KRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFG----SYILVSE
        ++   A  + DEM+ KGL P   TY+ F RI     ++   + L  +M    C P  ++ + L+R   +  + +  L LW++M ++  G    SYI++  
Subjt:  KRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQRGFG----SYILVSE

Query:  ELFDFLCDLGKLIEAERCFLQMVDKGHKPS
         LF      GK+ EA   + +M DKG +P+
Subjt:  ELFDFLCDLGKLIEAERCFLQMVDKGHKPS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGGAAGCCGACGATGATTCTCCGGCTGCCAAGCTCCTCTCAATCGCCGGTTAGGTACTTCTCACCAATCCCATTATCCAATTTCTTCTCGCACTCATTT
TCATCTGCAAGTGAGAATCAATCGCTAAATGAAAACGTAGAAACAGTATTTCGCATAATTACTAGTTCACCCTCTTCAGCAGACATGAGGGATTCTCTGAAATCG
AGTCGGGTTTTTCTCTCAAATGAATTGATCGATGGAGTTCTTAAGAGGGTTAGATTTAGCCACGGGAATCCCTTACAGGCATTGGAGTTTTTTAATTACACTGGT
AGAAGAAGGGGATTTTATCACACTGCGTTTTCTGTTGATACAATGCTTTATATCCTAGGTAGAGGCCGGAAGTTTGAAAAAATCTGGGATGTTTTGGTTGATATT
AAGCTTAAGGATCGGTCGTTAATCTCGCCGCGAACTGTTATGGTTGTATTAGGAAGAATTGCCAAAGTGTGTTCTGTGAGGCAGACTGTGGAGTCTTTTAGGAAG
TTTAAGAAGTTTGTTCCTGAGTTTGATGTCACTTGTTTTAATGCATTGTTGAGAACTCTGTGCCAGGAGAAGAGTATGATGGATGCGAGGAACGTCTACCACAGT
TTGAAGAGTAAGTTTAGACCGAATTTGCAGACGTTTAACATATTATTGTCGGGTTGGAAGTCGTCAGAAGAAGCTGAGGGATTCTTTAATGAGATGAGAGAAATG
GGGGTTAAACCTGATGTTGTTTCATACAACTGTTTGGTTGATGTTTATTGTAAGAATAGGGAAATGGACAAGGCGTTTAAGGTGATTGAGAAAATGAGGGATGAG
GATATAGCTGCTGATGTGATTACGTACACTAGTGTTATTGGGGGATTGGGATTGATTGGTCAACCCGACAAAGCGAGAAATATTTTGAAAGAAATGAAGGAGTAT
GGATGTTACCCTGATGTTGCAGCTTACAATGCTGCGATACGGAATTTTTGCATTGCAAAGAGGCTTCGCGGGGCTTTTGATTTGATGGATGAAATGGTGAATAAG
GGTCTGAGTCCAAATGCAACGACATACAACTTGTTCTTTAGGATTTTCTTCTGGTCGAATGACTTACAAAGCTCGTGGAATTTATATCGTCGAATGATGGATACA
TGTTGCTTGCCTAACACGCAATCCTGTTTGTTCCTAATGAGGTTGTTTAAGAAGCATGAAAATGTAGAAATGGCACTGGAGCTATGGAATGATATGATTCAAAGG
GGTTTTGGGTCTTATATTTTAGTATCCGAGGAGTTGTTTGATTTTCTTTGTGATTTGGGTAAGTTGATTGAAGCTGAGAGGTGTTTTCTGCAGATGGTCGATAAG
GGGCATAAGCCTAGTAACGTCTCATTTAAAAGGATCAAAGTACTCATGGAACTGGCAAATAAGCATGAGGTTCTTCAGAACTTGTCAAAGAAAATGGATGGTTTT
TTGGATCCACAAAAACGCCTTCCTGAAACAATGAGTTATTCAACGGATCTGTCAAATTCAAATTCATTCCAATGTTAA
mRNA sequenceShow/hide mRNA sequence
TTTGACCTTTAAAATTGGTCCTTCGTTTTGAAACGGCGTCGTTTGTGGTGATGAAGTTTCAATCTGCAAACCTAAAAACTCTGTTTACTGTGAAGCTCAATTCAG
TTTCATGTCAAGGAAGCCGACGATGATTCTCCGGCTGCCAAGCTCCTCTCAATCGCCGGTTAGGTACTTCTCACCAATCCCATTATCCAATTTCTTCTCGCACTC
ATTTTCATCTGCAAGTGAGAATCAATCGCTAAATGAAAACGTAGAAACAGTATTTCGCATAATTACTAGTTCACCCTCTTCAGCAGACATGAGGGATTCTCTGAA
ATCGAGTCGGGTTTTTCTCTCAAATGAATTGATCGATGGAGTTCTTAAGAGGGTTAGATTTAGCCACGGGAATCCCTTACAGGCATTGGAGTTTTTTAATTACAC
TGGTAGAAGAAGGGGATTTTATCACACTGCGTTTTCTGTTGATACAATGCTTTATATCCTAGGTAGAGGCCGGAAGTTTGAAAAAATCTGGGATGTTTTGGTTGA
TATTAAGCTTAAGGATCGGTCGTTAATCTCGCCGCGAACTGTTATGGTTGTATTAGGAAGAATTGCCAAAGTGTGTTCTGTGAGGCAGACTGTGGAGTCTTTTAG
GAAGTTTAAGAAGTTTGTTCCTGAGTTTGATGTCACTTGTTTTAATGCATTGTTGAGAACTCTGTGCCAGGAGAAGAGTATGATGGATGCGAGGAACGTCTACCA
CAGTTTGAAGAGTAAGTTTAGACCGAATTTGCAGACGTTTAACATATTATTGTCGGGTTGGAAGTCGTCAGAAGAAGCTGAGGGATTCTTTAATGAGATGAGAGA
AATGGGGGTTAAACCTGATGTTGTTTCATACAACTGTTTGGTTGATGTTTATTGTAAGAATAGGGAAATGGACAAGGCGTTTAAGGTGATTGAGAAAATGAGGGA
TGAGGATATAGCTGCTGATGTGATTACGTACACTAGTGTTATTGGGGGATTGGGATTGATTGGTCAACCCGACAAAGCGAGAAATATTTTGAAAGAAATGAAGGA
GTATGGATGTTACCCTGATGTTGCAGCTTACAATGCTGCGATACGGAATTTTTGCATTGCAAAGAGGCTTCGCGGGGCTTTTGATTTGATGGATGAAATGGTGAA
TAAGGGTCTGAGTCCAAATGCAACGACATACAACTTGTTCTTTAGGATTTTCTTCTGGTCGAATGACTTACAAAGCTCGTGGAATTTATATCGTCGAATGATGGA
TACATGTTGCTTGCCTAACACGCAATCCTGTTTGTTCCTAATGAGGTTGTTTAAGAAGCATGAAAATGTAGAAATGGCACTGGAGCTATGGAATGATATGATTCA
AAGGGGTTTTGGGTCTTATATTTTAGTATCCGAGGAGTTGTTTGATTTTCTTTGTGATTTGGGTAAGTTGATTGAAGCTGAGAGGTGTTTTCTGCAGATGGTCGA
TAAGGGGCATAAGCCTAGTAACGTCTCATTTAAAAGGATCAAAGTACTCATGGAACTGGCAAATAAGCATGAGGTTCTTCAGAACTTGTCAAAGAAAATGGATGG
TTTTTTGGATCCACAAAAACGCCTTCCTGAAACAATGAGTTATTCAACGGATCTGTCAAATTCAAATTCATTCCAATGTTAAATGGATTTATTTTACATGTCATG
GGTCTATATTTTTCTCTCAGTGAAAACTAAAAACTGACAGACCATCAACGTATTATCTGCCTTTCGAGAGCAGGACGCCGAGCAAGATCGTGGAGCTGGAAGTCG
ACCTATAACATCTTGGGGTGAGCCTCAACATCTAAATCCTTTGGCATTCTAGAAAAGTACCCCACATAGCCTTCTCAAGAAAGCTTTGCATATTAGGCTGTAGAC
ACTACCTCAAGAAAGGGACAGCAATTTACGCTTGTGTTGGATGCATTGTCCGAATAAGCATGAAGAACTATTCACTTCAAAAAGATAGTTGGAATTTGGAGGATT
GAAGATTGAGGAAGAAAAGAAATGAGGTGATTGGTTAAAGAACTGTCTCATAAGAGGGTTACCAACTCTATTGCTTCATGGGGATATGAATGTATACAAAAATCC
AGAAACTCAGATCTGGGTGTCATGCTCATGAAGATCACCATGACCGAGTACGATTACATCTCTATTCAGATTTCAGAACATGGAAAATATTTTCCAAGTTGAAGA
CAGATAAACCAAGAACTATCAATTTTAAACCTTTCTGCTGTTTTCGTCATCGAGTTTTGTTACGAGCAGCTTTCTCGAGTGTGAAATCTCATCCAGCCGACCTCT
AAGTTAGAAAAAGTTCCATCCCTTCTCTTCTGTTTAACAAAATACATGGGAAAGATTCTTGGTAATGGGAGTTCCAAACTTGAAGGCCTCACCCGTTCTTATACG
TTCTCAACGTTCTAATTTTTCAGTGTGTGAGCATATGTTGTAGGGGATCTAAGTAATTTTTTTTCCTTTTTCTCTTTTTGAAAATGAACATTATTTGCATTACAT
AAAATTGGATGGATATTTTCGAAAAATTCAGTTTGTTGGTCGTTTATTTATTTTTAATAATCAATTTCCATGCAAATATTGGTTATTTATTTATTTTTATTAATG
CAGG
Protein sequenceShow/hide protein sequence
MSRKPTMILRLPSSSQSPVRYFSPIPLSNFFSHSFSSASENQSLNENVETVFRIITSSPSSADMRDSLKSSRVFLSNELIDGVLKRVRFSHGNPLQALEFFNYTG
RRRGFYHTAFSVDTMLYILGRGRKFEKIWDVLVDIKLKDRSLISPRTVMVVLGRIAKVCSVRQTVESFRKFKKFVPEFDVTCFNALLRTLCQEKSMMDARNVYHS
LKSKFRPNLQTFNILLSGWKSSEEAEGFFNEMREMGVKPDVVSYNCLVDVYCKNREMDKAFKVIEKMRDEDIAADVITYTSVIGGLGLIGQPDKARNILKEMKEY
GCYPDVAAYNAAIRNFCIAKRLRGAFDLMDEMVNKGLSPNATTYNLFFRIFFWSNDLQSSWNLYRRMMDTCCLPNTQSCLFLMRLFKKHENVEMALELWNDMIQR
GFGSYILVSEELFDFLCDLGKLIEAERCFLQMVDKGHKPSNVSFKRIKVLMELANKHEVLQNLSKKMDGFLDPQKRLPETMSYSTDLSNSNSFQC