; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0026458 (gene) of Chayote v1 genome

Gene IDSed0026458
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG12:3752296..3755005
RNA-Seq ExpressionSed0026458
SyntenySed0026458
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152299.1 pentatricopeptide repeat-containing protein At4g20090 isoform X1 [Cucumis sativus]1.7e-25379.89Show/hide
Query:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD
        ML   R+ SALLK ++LHF   SS+      +T  IAIAP+                     S+DVV+SVCSLLSN    + NLDLDH LK+FK +L+SD
Subjt:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD

Query:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG
         VL+ILMNY+LLGRAKTLEFFSWSG QMGFRF  SVVEYMADFLGRRKLFDDMKCLLVTV S KGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFG
Subjt:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV
        CKPDNLVFNNMLYALCKKEPTGELID AL IFRRIELPDKYSYSNVIIGLCKFGR+ TA+E FGEM RAGL PTR+AVNILIG+LCSLSAK+GA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV

Query:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW
         ST RPFTVLVPNVNPKSG IEPAV +FWAAN+L LVPS+FV VQLISELCRLG+MQEA+RVLKVVEG KLRCAEECYS+VM+ALCEHRHV EASDLFG 
Subjt:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW

Query:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL
        MLS+GMKPKLAIYNYVICMLCKLGNLD AE+VF IMNKKRC PDHVTYSALIHAYGE R W+AA+ LLKEMLSLGMSPHFH++S+VD LMREHGQIDLCL
Subjt:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        KLEMKWEAQILQKLCKQGQLEAA+EK+KSMLEKG  PPIYVRDAFESAFQKKGK KIARELLQK+DG
Subjt:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

XP_008453994.1 PREDICTED: pentatricopeptide repeat-containing protein At5g65560-like isoform X1 [Cucumis melo]3.4e-25479.72Show/hide
Query:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD
        ML   R+ SALLK ++LHF  FSS+      +TK IAIAP+                     S+DVV+SVCSLLSN    + NLD++H LK+FK +L+SD
Subjt:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD

Query:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG
        LVL+ILMNY+LLGRAKTLEFFSWSG QMGFRF  SVVEYMADFLGRRKLFDDMKCLLVTV S KGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFG
Subjt:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV
        CKPDNLVFNNMLYALCKKEPTGELID AL IFRRIELPDKYSYSNVIIGLCKFGR+ TA+E FGEM RAGL PTRSA NILIG+LCSLSAK+GA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV

Query:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW
        RST RPFTVLVPNVNPKSG IEPAV +FWAAN+LGLVPS+FV VQLISELCR+G+MQEA++VLKVVE  KLRCAEECYS+VM+ALCEHRH+ EASDLFG 
Subjt:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW

Query:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL
        MLS+GMKPKLAIYNYVICMLCKLGNLD AE+VF IMNKKRC PDHVTYSALIHAYGE R W+AA+ LLKEMLSLGMSPHFH++SLVD LMREHGQ+DLCL
Subjt:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        KLEMKWEAQILQKLCKQGQLEAA+EK+KSMLEKG  PPIYVRDAFESAFQKKGK KIARELLQK+DG
Subjt:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

XP_022955543.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita moschata]1.5e-24980.11Show/hide
Query:  SATRNGSALLKSITLHFSAFSSNST---KPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY
        S  RN SA LK +   FS++SS+++   K  AIAP++                 TD V SVCSLLSN    +TNL+LDH LK+FK++L+SD VL+ILMNY
Subjt:  SATRNGSALLKSITLHFSAFSSNST---KPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY

Query:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN
        RL GRAKTLEFFSWSG QMG+RF  SVVEYMADFLGRRKLFDDMKCLLVTVSS KGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFGCKPDNLVFN
Subjt:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN

Query:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV
        NMLYALCKKEPTGELID ALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVF EM RAGL PTRSAVNILIGDLCSLSAK+GA+E+VRVRSTRRPFTV
Subjt:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV

Query:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK
        LVPNVNPKSG I+ AV VFWAANRL LVPS FVIV+LISELCRLG+MQEA+RVLKVVE  KLRC EECYSIVMQALCEHR V EASDLFG MLS+ MKPK
Subjt:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK

Query:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ
        LAIYN VICMLCKLGNLDDAE+VFKIMN+KRCVPDHVTYSALIHAYGE R W+AA++LLKEMLSLG+SPHFH++S+VD LMRE GQ DLCLKLEMKWE+Q
Subjt:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ

Query:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        ILQKLCKQGQL  A+EKLKSMLEKG YPPIYVRDAFESAFQKKGK KIARELLQ +DG
Subjt:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

XP_022979738.1 pentatricopeptide repeat-containing protein At4g20090-like [Cucurbita maxima]6.0e-25180.47Show/hide
Query:  SATRNGSALLKSITLHFSAFS---SNSTKPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY
        S  RN SA LK +   FS++S   S++TK  AIAP++                 TD V SVCSLLSN    +TNL+LDH LK+FK++L+SD VL+ILMNY
Subjt:  SATRNGSALLKSITLHFSAFS---SNSTKPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY

Query:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN
        RL GRAKTLEFFSWSG QMG+RF  SVVEYMADFLGRRKLFDDMKCLLVTVSS KGR+SCRTFSICIRFLGRQGRVREALCLFEEME  FGCKPDNLVFN
Subjt:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN

Query:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV
        NMLYALCKKEPTGELID ALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVF EM RA L PTRSAVNILIGDLCSLSAK+GA+E+VRVRSTRRPFTV
Subjt:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV

Query:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK
        LVPNVNPKSG IEPAV VFWAANR+ LVPSAFVIV+LISELCRLG+MQEA+RVLKVVE  KLRC EECYSIVMQALCEHR V EASDLFG MLS+ MKPK
Subjt:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK

Query:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ
        LAIYN VICMLCKLGNLDDAE+VFKIMN+KRCVPDHVTYSALIHAYGE R W+AA++LLKEMLSLG+SPHFH++S+VD LMRE GQ DLCLKLEMKWE+Q
Subjt:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ

Query:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        ILQKLCKQGQL AA+EKLKSMLEKG YPPIYVRDAFESAFQKKGK KIARELLQ +DG
Subjt:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

XP_038875040.1 pentatricopeptide repeat-containing protein At1g77360, mitochondrial-like [Benincasa hispida]2.3e-25881.67Show/hide
Query:  RNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLS--NHST-NLDLDHSLKKFKQSLTSDLVLRI
        RN SA LK   LHF  FSS+      +TK IAIAP+                     S+DVV+SVCSLLS  NH T NLDLDH LK+FK +L+SDLVL+I
Subjt:  RNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLS--NHST-NLDLDHSLKKFKQSLTSDLVLRI

Query:  LMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDN
        LMNYRLLGRAKTLEFFSWSG QMG+RF  +VVEYMADFLGRRKLFDDMKCLLVTVSS KGRLSCRTFSICIRFLGRQGRVREALCLFEEME KFGCKPDN
Subjt:  LMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDN

Query:  LVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRR
        LVFNNMLYALCKKEPTGELID AL+IFRRIELPDKYSYSNVIIGLCKFGRF TA+EVF EM+RAGL PTRSAVNILIGDLCSLSAK+GA+E+VRVRSTRR
Subjt:  LVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRR

Query:  PFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRG
        PFTVLVPNVNPKSG IEPAV +FWAAN+L LVPSAFVIVQLISELCRLG+MQEA++VLKVVEG KLRCAEECYS+VM+ALCEHRHV+EASDLFG +LS+G
Subjt:  PFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRG

Query:  MKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMK
        MKPKLAIYN +ICMLCK+GNL+DAE+VFKIMN+KRC PDHVTYS+LIHAYGETR W+AA++LLKEMLSLGMSPHFHL+SLVD LMREHGQIDLCLKLEMK
Subjt:  MKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMK

Query:  WEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        WEAQILQKLCK GQL+AA+EK+KSMLEKG YPPIYVRD+FESAFQKKGK KIARELLQKIDG
Subjt:  WEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

TrEMBL top hitse value%identityAlignment
A0A1S3BXL0 pentatricopeptide repeat-containing protein At5g65560-like isoform X11.7e-25479.72Show/hide
Query:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD
        ML   R+ SALLK ++LHF  FSS+      +TK IAIAP+                     S+DVV+SVCSLLSN    + NLD++H LK+FK +L+SD
Subjt:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD

Query:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG
        LVL+ILMNY+LLGRAKTLEFFSWSG QMGFRF  SVVEYMADFLGRRKLFDDMKCLLVTV S KGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFG
Subjt:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV
        CKPDNLVFNNMLYALCKKEPTGELID AL IFRRIELPDKYSYSNVIIGLCKFGR+ TA+E FGEM RAGL PTRSA NILIG+LCSLSAK+GA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV

Query:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW
        RST RPFTVLVPNVNPKSG IEPAV +FWAAN+LGLVPS+FV VQLISELCR+G+MQEA++VLKVVE  KLRCAEECYS+VM+ALCEHRH+ EASDLFG 
Subjt:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW

Query:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL
        MLS+GMKPKLAIYNYVICMLCKLGNLD AE+VF IMNKKRC PDHVTYSALIHAYGE R W+AA+ LLKEMLSLGMSPHFH++SLVD LMREHGQ+DLCL
Subjt:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        KLEMKWEAQILQKLCKQGQLEAA+EK+KSMLEKG  PPIYVRDAFESAFQKKGK KIARELLQK+DG
Subjt:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

A0A5D3CYL1 Pentatricopeptide repeat-containing protein4.4e-24779.53Show/hide
Query:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD
        ML   R+ SALLK ++LHF  FSS+      +TK IAIAP+                     S+DVV+SVCSLLSN    + NLD++H LK+FK +L+SD
Subjt:  MLSATRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSD

Query:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG
        LVL+ILMNY+LLGRAKTLEFFSWSG QMGFRF  SVVEYMADFLGRRKLFDDMKCLLVTV S KGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFG
Subjt:  LVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFG

Query:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV
        CKPDNLVFNNMLYALCKKEPTGELID AL IFRRIELPDKYSYSNVIIGLCKFGR+ TA+E FGEM RAGL PTRSA NILIG+LCSLSAK+GA+EKVRV
Subjt:  CKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRV

Query:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW
        RST RPFTVLVPNVNPKSG IEPAV +FWAAN+LGLVPS+FV VQLISELCR+G+MQEA++VLKVVE  KLRCAEECYS+VM+ALCEHRH+ EASDLFG 
Subjt:  RSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGW

Query:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL
        MLS+GMKPKLAIYNYVICMLCKLGNLD AE+VF IMNKKRC PDHVTYSALIHAYGE R W+AA+ LLKEMLSLGMSPHFH++SLVD LMREHGQ+DLCL
Subjt:  MLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCL

Query:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKK
        KLEMKWEAQILQKLCKQGQLEAA+EK+KSMLEKG  PPIYVRDAFESAFQKK
Subjt:  KLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKK

A0A6J1CV78 pentatricopeptide repeat-containing protein At5g39710-like1.8e-24879.86Show/hide
Query:  TRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLR
        +RNGS LLK  TLHFS FSSN      +   IAIAP+                     STDVV+SVCSLLSN    +TNLDLD  LK+F ++L+SDLVLR
Subjt:  TRNGSALLKSITLHFSAFSSN------STKPIAIAPQ---------------------STDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLR

Query:  ILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPD
        ILMNYR+LGRAKTLEFFSWSG QMG+RF  SVVEYMADF GRRKLFDDMKCLLVTVSS KGRLSCRTFSICIRFLGRQGRVREALCLFEEME KFGCKPD
Subjt:  ILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPD

Query:  NLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTR
        NLVFNN+LYALCKKE TGELID ALTIFRRIELPDKYSYSN+IIGLCKFGRF TALEVF EM+R G  PTRSAVNILIGDLCSLSAK+GAIEKVRVRSTR
Subjt:  NLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTR

Query:  RPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSR
        RPFTVLVPNVN KSG IEPAV VFWAANR+ LVPS+FV+VQLISELCRLG+MQEA+ VLKVVE GKLRC EEC+SIVMQALCE+R V+EASDLFG MLS+
Subjt:  RPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSR

Query:  GMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEM
        GMKPKLA+YN VICMLCKLGN+ DAE+VFKIMN+KRCVPD VTYSALIHAY E   W+AA++LLKEMLSLGMSPHFHL+S VD LMREHGQ+DLCLKLEM
Subjt:  GMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEM

Query:  KWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI
        KWEAQILQKLCKQGQLEAA+EKLKSMLEKG +PP YVRDAFE+AFQK GK KIARELL+KI
Subjt:  KWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI

A0A6J1GU90 pentatricopeptide repeat-containing protein At4g20090-like7.2e-25080.11Show/hide
Query:  SATRNGSALLKSITLHFSAFSSNST---KPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY
        S  RN SA LK +   FS++SS+++   K  AIAP++                 TD V SVCSLLSN    +TNL+LDH LK+FK++L+SD VL+ILMNY
Subjt:  SATRNGSALLKSITLHFSAFSSNST---KPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY

Query:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN
        RL GRAKTLEFFSWSG QMG+RF  SVVEYMADFLGRRKLFDDMKCLLVTVSS KGR+SCRTFSICIRFLGRQGRVREALCLFEEME KFGCKPDNLVFN
Subjt:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN

Query:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV
        NMLYALCKKEPTGELID ALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVF EM RAGL PTRSAVNILIGDLCSLSAK+GA+E+VRVRSTRRPFTV
Subjt:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV

Query:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK
        LVPNVNPKSG I+ AV VFWAANRL LVPS FVIV+LISELCRLG+MQEA+RVLKVVE  KLRC EECYSIVMQALCEHR V EASDLFG MLS+ MKPK
Subjt:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK

Query:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ
        LAIYN VICMLCKLGNLDDAE+VFKIMN+KRCVPDHVTYSALIHAYGE R W+AA++LLKEMLSLG+SPHFH++S+VD LMRE GQ DLCLKLEMKWE+Q
Subjt:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ

Query:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        ILQKLCKQGQL  A+EKLKSMLEKG YPPIYVRDAFESAFQKKGK KIARELLQ +DG
Subjt:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

A0A6J1IX53 pentatricopeptide repeat-containing protein At4g20090-like2.9e-25180.47Show/hide
Query:  SATRNGSALLKSITLHFSAFS---SNSTKPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY
        S  RN SA LK +   FS++S   S++TK  AIAP++                 TD V SVCSLLSN    +TNL+LDH LK+FK++L+SD VL+ILMNY
Subjt:  SATRNGSALLKSITLHFSAFS---SNSTKPIAIAPQS-----------------TDVVDSVCSLLSN---HSTNLDLDHSLKKFKQSLTSDLVLRILMNY

Query:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN
        RL GRAKTLEFFSWSG QMG+RF  SVVEYMADFLGRRKLFDDMKCLLVTVSS KGR+SCRTFSICIRFLGRQGRVREALCLFEEME  FGCKPDNLVFN
Subjt:  RLLGRAKTLEFFSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFN

Query:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV
        NMLYALCKKEPTGELID ALTIFRRIELPDKYSYSN+IIGLCKFGRF TA+EVF EM RA L PTRSAVNILIGDLCSLSAK+GA+E+VRVRSTRRPFTV
Subjt:  NMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTV

Query:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK
        LVPNVNPKSG IEPAV VFWAANR+ LVPSAFVIV+LISELCRLG+MQEA+RVLKVVE  KLRC EECYSIVMQALCEHR V EASDLFG MLS+ MKPK
Subjt:  LVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPK

Query:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ
        LAIYN VICMLCKLGNLDDAE+VFKIMN+KRCVPDHVTYSALIHAYGE R W+AA++LLKEMLSLG+SPHFH++S+VD LMRE GQ DLCLKLEMKWE+Q
Subjt:  LAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQ

Query:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG
        ILQKLCKQGQL AA+EKLKSMLEKG YPPIYVRDAFESAFQKKGK KIARELLQ +DG
Subjt:  ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDG

SwissProt top hitse value%identityAlignment
Q6NQ83 Pentatricopeptide repeat-containing protein At3g22470, mitochondrial4.9e-3023.91Show/hide
Query:  RILMNYRLLGRAKTLEF----FSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKF
        ++L  + +LGRA  L +     ++S    GF     V E +A              L+  +   K R    T S  I  L  +GRV EAL L + M  ++
Subjt:  RILMNYRLLGRAKTLEF----FSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIE----LPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAI
        G +PD + +  +L  LCK   +     +AL +FR++E          YS VI  LCK G F  AL +F EM+  G+       + LIG LC+    D   
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIE----LPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAI

Query:  EKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEAS
        + +R    R                               ++P       LI    + GK+ EA  +   +    +      Y+ ++   C+   + EA+
Subjt:  EKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEAS

Query:  DLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQ
         +F  M+S+G +P +  Y+ +I   CK   +DD  ++F+ ++ K  +P+ +TY+ L+  + ++ K  AA  L +EM+S G+ P    + ++   + ++G+
Subjt:  DLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQ

Query:  IDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI
        ++  L++  K +             I+  +C   +++ A+    S+ +KG  P +   +       KKG    A  L +K+
Subjt:  IDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745804.6e-2824.56Show/hide
Query:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEM
        L   TF+  +R L ++G V+E   L +++  K G  P+   +N  +  LC++      + +   +  +   PD  +Y+N+I GLCK  +F  A    G+M
Subjt:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEM

Query:  DRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVV
           GL P     N LI   C                              K G+++ A  +   A   G VP  F    LI  LC  G+   A+ +    
Subjt:  DRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVV

Query:  EGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHN
         G  ++     Y+ +++ L     + EA+ L   M  +G+ P++  +N ++  LCK+G + DA+ + K+M  K   PD  T++ LIH Y    K   A  
Subjt:  EGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHN

Query:  LLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI
        +L  ML  G+ P  + ++                         +L  LCK  + E   E  K+M+EKG  P ++  +    +  +  K   A  LL+++
Subjt:  LLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.1e-2924.86Show/hide
Query:  KSITLH-FSAFSSNSTKPIAIAPQSTDVVDSVCSLLSNHSTNLDLDHSLKKFKQSLTSDLVLRILMNYRLLGRAKTLEFFSWS-GFQMGFRFHHSVVEYM
        K ITLH  + F    T  I     +   +D   + L   S     D     +  S   DLV++      L+ +A ++   + + GF  G   +++V++  
Subjt:  KSITLH-FSAFSSNSTKPIAIAPQSTDVVDSVCSLLSNHSTNLDLDHSLKKFKQSLTSDLVLRILMNYRLLGRAKTLEFFSWS-GFQMGFRFHHSVVEYM

Query:  ADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIEL---
        A    +R +          + S+    +  T++I IR     G +  AL LF++METK GC P+ + +N ++   CK       ID    + R + L   
Subjt:  ADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIEL---

Query:  -PDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRL--
         P+  SY+ VI GLC+ GR      V  EM+R G +      N LI   C    K+G   +  V         L P+V   + +I  ++C     NR   
Subjt:  -PDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRL--

Query:  --------GLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNL
                GL P+      L+    + G M EA RVL+ +       +   Y+ ++   C    +++A  +   M  +G+ P +  Y+ V+   C+  ++
Subjt:  --------GLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNL

Query:  DDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEK
        D+A +V + M +K   PD +TYS+LI  + E R+   A +L +EML +G+ P    ++                         ++   C +G LE A + 
Subjt:  DDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEK

Query:  LKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI
           M+EKG  P +       +   K+ +T+ A+ LL K+
Subjt:  LKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI

Q9LFC5 Pentatricopeptide repeat-containing protein At5g011103.8e-3025.24Show/hide
Query:  SRKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTAL
        SR G  ++  T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  ++   +  +   P  Y+Y+ VI GLCK G++  A 
Subjt:  SRKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTAL

Query:  EVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVL--------VPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCR
        EVF EM R+GL+P  +    L+ + C    K   +E  +V S  R   V+        + ++  +SG ++ A+  F +    GL+P   +   LI   CR
Subjt:  EVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVL--------VPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCR

Query:  LGKMQEAMRVL-KVVEGGKLRCAEE--CYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYS
         G +  AM +  ++++ G   CA +   Y+ ++  LC+ + + EA  LF  M  R + P       +I   CKLGNL +A ++F+ M +KR   D VTY+
Subjt:  LGKMQEAMRVL-KVVEGGKLRCAEE--CYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYS

Query:  ALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAF
         L+  +G+      A  +  +M+S  + P    +S+                        ++  LC +G L  AF     M+ K   P + + ++    +
Subjt:  ALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAF

Query:  QKKGKTKIARELLQKI
         + G        L+K+
Subjt:  QKKGKTKIARELLQKI

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic1.2e-3125.87Show/hide
Query:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I++   + +    PD Y+Y++VI GLCK G    A+EV  +M     +P     N 
Subjt:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNI

Query:  LIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIE---------PAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKL
        LI  LC  +  + A E  RV +++     ++P+V   + +I+          A+ +F      G  P  F    LI  LC  GK+ EA+ +LK +E    
Subjt:  LIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIE---------PAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKL

Query:  RCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEM
          +   Y+ ++   C+    +EA ++F  M   G+      YN +I  LCK   ++DA ++   M  +   PD  TY++L+  +        A ++++ M
Subjt:  RCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEM

Query:  LSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKG-SYPPIYVRDAFESAFQKKGKTK---IARELLQKIDG
         S G  P    +                          ++  LCK G++E A + L+S+  KG +  P       +  F+K+  T+   + RE+L++ + 
Subjt:  LSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKG-SYPPIYVRDAFESAFQKKGKTK---IARELLQKIDG

Query:  SP
         P
Subjt:  SP

Arabidopsis top hitse value%identityAlignment
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-2924.56Show/hide
Query:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEM
        L   TF+  +R L ++G V+E   L +++  K G  P+   +N  +  LC++      + +   +  +   PD  +Y+N+I GLCK  +F  A    G+M
Subjt:  LSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEM

Query:  DRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVV
           GL P     N LI   C                              K G+++ A  +   A   G VP  F    LI  LC  G+   A+ +    
Subjt:  DRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVV

Query:  EGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHN
         G  ++     Y+ +++ L     + EA+ L   M  +G+ P++  +N ++  LCK+G + DA+ + K+M  K   PD  T++ LIH Y    K   A  
Subjt:  EGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHN

Query:  LLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI
        +L  ML  G+ P  + ++                         +L  LCK  + E   E  K+M+EKG  P ++  +    +  +  K   A  LL+++
Subjt:  LLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI

AT3G22470.1 Pentatricopeptide repeat (PPR) superfamily protein3.5e-3123.91Show/hide
Query:  RILMNYRLLGRAKTLEF----FSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKF
        ++L  + +LGRA  L +     ++S    GF     V E +A              L+  +   K R    T S  I  L  +GRV EAL L + M  ++
Subjt:  RILMNYRLLGRAKTLEF----FSWSGFQMGFRFHHSVVEYMADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKF

Query:  GCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIE----LPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAI
        G +PD + +  +L  LCK   +     +AL +FR++E          YS VI  LCK G F  AL +F EM+  G+       + LIG LC+    D   
Subjt:  GCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIE----LPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAI

Query:  EKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEAS
        + +R    R                               ++P       LI    + GK+ EA  +   +    +      Y+ ++   C+   + EA+
Subjt:  EKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEAS

Query:  DLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQ
         +F  M+S+G +P +  Y+ +I   CK   +DD  ++F+ ++ K  +P+ +TY+ L+  + ++ K  AA  L +EM+S G+ P    + ++   + ++G+
Subjt:  DLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQ

Query:  IDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI
        ++  L++  K +             I+  +C   +++ A+    S+ +KG  P +   +       KKG    A  L +K+
Subjt:  IDLCLKLEMKWEAQ-----------ILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein8.4e-3325.87Show/hide
Query:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNI
        ++GRV +AL   +EM  + G  PD   FN ++  LCK       I++   + +    PD Y+Y++VI GLCK G    A+EV  +M     +P     N 
Subjt:  RQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNI

Query:  LIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIE---------PAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKL
        LI  LC  +  + A E  RV +++     ++P+V   + +I+          A+ +F      G  P  F    LI  LC  GK+ EA+ +LK +E    
Subjt:  LIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIE---------PAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKL

Query:  RCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEM
          +   Y+ ++   C+    +EA ++F  M   G+      YN +I  LCK   ++DA ++   M  +   PD  TY++L+  +        A ++++ M
Subjt:  RCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEM

Query:  LSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKG-SYPPIYVRDAFESAFQKKGKTK---IARELLQKIDG
         S G  P    +                          ++  LCK G++E A + L+S+  KG +  P       +  F+K+  T+   + RE+L++ + 
Subjt:  LSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKG-SYPPIYVRDAFESAFQKKGKTK---IARELLQKIDG

Query:  SP
         P
Subjt:  SP

AT5G01110.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.7e-3125.24Show/hide
Query:  SRKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTAL
        SR G  ++  T +I +  L + G++ +      +++ K G  PD + +N ++ A   K    E  ++   +  +   P  Y+Y+ VI GLCK G++  A 
Subjt:  SRKG-RLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGLCKFGRFCTAL

Query:  EVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVL--------VPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCR
        EVF EM R+GL+P  +    L+ + C    K   +E  +V S  R   V+        + ++  +SG ++ A+  F +    GL+P   +   LI   CR
Subjt:  EVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVL--------VPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCR

Query:  LGKMQEAMRVL-KVVEGGKLRCAEE--CYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYS
         G +  AM +  ++++ G   CA +   Y+ ++  LC+ + + EA  LF  M  R + P       +I   CKLGNL +A ++F+ M +KR   D VTY+
Subjt:  LGKMQEAMRVL-KVVEGGKLRCAEE--CYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYS

Query:  ALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAF
         L+  +G+      A  +  +M+S  + P    +S+                        ++  LC +G L  AF     M+ K   P + + ++    +
Subjt:  ALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAF

Query:  QKKGKTKIARELLQKI
         + G        L+K+
Subjt:  QKKGKTKIARELLQKI

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.8e-3124.86Show/hide
Query:  KSITLH-FSAFSSNSTKPIAIAPQSTDVVDSVCSLLSNHSTNLDLDHSLKKFKQSLTSDLVLRILMNYRLLGRAKTLEFFSWS-GFQMGFRFHHSVVEYM
        K ITLH  + F    T  I     +   +D   + L   S     D     +  S   DLV++      L+ +A ++   + + GF  G   +++V++  
Subjt:  KSITLH-FSAFSSNSTKPIAIAPQSTDVVDSVCSLLSNHSTNLDLDHSLKKFKQSLTSDLVLRILMNYRLLGRAKTLEFFSWS-GFQMGFRFHHSVVEYM

Query:  ADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIEL---
        A    +R +          + S+    +  T++I IR     G +  AL LF++METK GC P+ + +N ++   CK       ID    + R + L   
Subjt:  ADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIEL---

Query:  -PDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRL--
         P+  SY+ VI GLC+ GR      V  EM+R G +      N LI   C    K+G   +  V         L P+V   + +I  ++C     NR   
Subjt:  -PDKYSYSNVIIGLCKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRL--

Query:  --------GLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNL
                GL P+      L+    + G M EA RVL+ +       +   Y+ ++   C    +++A  +   M  +G+ P +  Y+ V+   C+  ++
Subjt:  --------GLVPSAFVIVQLISELCRLGKMQEAMRVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNL

Query:  DDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEK
        D+A +V + M +K   PD +TYS+LI  + E R+   A +L +EML +G+ P    ++                         ++   C +G LE A + 
Subjt:  DDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKEMLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEK

Query:  LKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI
           M+EKG  P +       +   K+ +T+ A+ LL K+
Subjt:  LKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGCGCCACTAGAAACGGTTCCGCGTTGCTCAAATCCATAACCCTTCATTTCTCCGCCTTCTCTTCAAACTCCACAAAGCCAATCGCCATAGCTCCACAAAGCAC
CGATGTCGTCGATTCAGTATGTTCTTTACTTTCAAACCATTCAACTAATCTCGATCTTGATCATTCATTGAAAAAATTCAAACAATCCTTAACTTCCGATCTAGTTCTTC
GAATTCTAATGAATTATAGGCTGTTGGGTCGGGCTAAAACCTTGGAATTCTTCTCTTGGTCTGGATTCCAAATGGGGTTTCGATTTCATCACTCTGTTGTTGAGTACATG
GCTGATTTCTTAGGTAGGAGGAAATTGTTTGATGATATGAAGTGTCTTTTGGTTACGGTTTCGTCTCGTAAGGGTCGCCTTTCGTGTCGGACGTTTTCGATTTGTATTAG
GTTTTTGGGTAGGCAAGGGAGGGTTAGAGAAGCCCTTTGCTTGTTCGAAGAAATGGAGACGAAATTCGGGTGTAAACCGGATAATCTGGTGTTTAACAATATGCTTTATG
CACTTTGTAAGAAGGAACCAACTGGGGAATTGATTGATGTTGCTTTAACAATTTTCAGAAGGATTGAGTTGCCTGATAAATATTCATACAGTAATGTAATCATAGGGTTG
TGTAAATTTGGTAGGTTTTGTACTGCCCTTGAAGTGTTTGGTGAAATGGATAGGGCGGGTTTGGCGCCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTC
GTTGAGTGCTAAAGATGGGGCTATTGAAAAGGTTAGGGTCAGAAGTACTCGTCGGCCTTTTACCGTTCTAGTTCCGAATGTGAACCCGAAGAGCGGTGTGATTGAACCTG
CAGTTTGCGTTTTTTGGGCTGCTAATAGGTTGGGTTTAGTTCCTAGTGCATTTGTTATAGTTCAGCTCATCTCGGAGCTTTGTCGATTAGGTAAAATGCAAGAAGCAATG
AGAGTATTGAAGGTTGTTGAGGGTGGCAAGTTAAGATGTGCTGAAGAGTGTTACTCCATTGTGATGCAAGCATTGTGTGAGCATCGTCATGTCAAAGAAGCTAGTGATCT
ATTCGGGTGGATGCTTTCTCGGGGTATGAAGCCAAAGTTGGCTATTTACAATTATGTTATTTGCATGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAAGGTCTTCA
AGATTATGAACAAGAAAAGATGTGTACCGGACCATGTTACTTATTCGGCACTAATCCATGCCTATGGTGAAACTAGGAAATGGACAGCAGCCCACAATTTATTGAAGGAA
ATGTTGAGTTTAGGAATGTCTCCTCATTTTCATTTGTTTAGTTTAGTGGATACACTAATGAGGGAACATGGGCAAATTGATTTGTGTTTGAAGCTGGAAATGAAGTGGGA
AGCCCAAATTTTGCAGAAGCTTTGTAAACAAGGACAACTTGAGGCCGCGTTTGAGAAGCTAAAGTCGATGCTTGAAAAGGGTTCTTATCCTCCTATCTATGTGAGAGATG
CATTTGAGAGTGCATTTCAAAAGAAGGGTAAGACAAAGATTGCACGCGAGTTGTTGCAGAAGATAGATGGGAGTCCACGAACATGA
mRNA sequenceShow/hide mRNA sequence
TTATATTAGGGATTTCTCCAAACATGGAGGTTGGAAAATAAGAGAGCATCTTAAACATTATAATAAAAAAGAGAAATGAAAGAAACCCTAATTTTTCCAAATGAGAAAAT
TTTGGAATTTCTCCAGGTTGAAGCTATTGGTTCTGCAACTCGTCGATGTTGAGCGCCACTAGAAACGGTTCCGCGTTGCTCAAATCCATAACCCTTCATTTCTCCGCCTT
CTCTTCAAACTCCACAAAGCCAATCGCCATAGCTCCACAAAGCACCGATGTCGTCGATTCAGTATGTTCTTTACTTTCAAACCATTCAACTAATCTCGATCTTGATCATT
CATTGAAAAAATTCAAACAATCCTTAACTTCCGATCTAGTTCTTCGAATTCTAATGAATTATAGGCTGTTGGGTCGGGCTAAAACCTTGGAATTCTTCTCTTGGTCTGGA
TTCCAAATGGGGTTTCGATTTCATCACTCTGTTGTTGAGTACATGGCTGATTTCTTAGGTAGGAGGAAATTGTTTGATGATATGAAGTGTCTTTTGGTTACGGTTTCGTC
TCGTAAGGGTCGCCTTTCGTGTCGGACGTTTTCGATTTGTATTAGGTTTTTGGGTAGGCAAGGGAGGGTTAGAGAAGCCCTTTGCTTGTTCGAAGAAATGGAGACGAAAT
TCGGGTGTAAACCGGATAATCTGGTGTTTAACAATATGCTTTATGCACTTTGTAAGAAGGAACCAACTGGGGAATTGATTGATGTTGCTTTAACAATTTTCAGAAGGATT
GAGTTGCCTGATAAATATTCATACAGTAATGTAATCATAGGGTTGTGTAAATTTGGTAGGTTTTGTACTGCCCTTGAAGTGTTTGGTGAAATGGATAGGGCGGGTTTGGC
GCCTACTCGATCTGCTGTGAACATTCTCATTGGGGATTTGTGTTCGTTGAGTGCTAAAGATGGGGCTATTGAAAAGGTTAGGGTCAGAAGTACTCGTCGGCCTTTTACCG
TTCTAGTTCCGAATGTGAACCCGAAGAGCGGTGTGATTGAACCTGCAGTTTGCGTTTTTTGGGCTGCTAATAGGTTGGGTTTAGTTCCTAGTGCATTTGTTATAGTTCAG
CTCATCTCGGAGCTTTGTCGATTAGGTAAAATGCAAGAAGCAATGAGAGTATTGAAGGTTGTTGAGGGTGGCAAGTTAAGATGTGCTGAAGAGTGTTACTCCATTGTGAT
GCAAGCATTGTGTGAGCATCGTCATGTCAAAGAAGCTAGTGATCTATTCGGGTGGATGCTTTCTCGGGGTATGAAGCCAAAGTTGGCTATTTACAATTATGTTATTTGCA
TGCTATGCAAATTAGGAAATTTGGATGATGCTGAAAAGGTCTTCAAGATTATGAACAAGAAAAGATGTGTACCGGACCATGTTACTTATTCGGCACTAATCCATGCCTAT
GGTGAAACTAGGAAATGGACAGCAGCCCACAATTTATTGAAGGAAATGTTGAGTTTAGGAATGTCTCCTCATTTTCATTTGTTTAGTTTAGTGGATACACTAATGAGGGA
ACATGGGCAAATTGATTTGTGTTTGAAGCTGGAAATGAAGTGGGAAGCCCAAATTTTGCAGAAGCTTTGTAAACAAGGACAACTTGAGGCCGCGTTTGAGAAGCTAAAGT
CGATGCTTGAAAAGGGTTCTTATCCTCCTATCTATGTGAGAGATGCATTTGAGAGTGCATTTCAAAAGAAGGGTAAGACAAAGATTGCACGCGAGTTGTTGCAGAAGATA
GATGGGAGTCCACGAACATGAGCTGATCAGAAATTCTTCATGAGTCGTCGAAGAAATCACTGTGCAGTTTCAATTTAATGAGGCAAAATTGCTCGGATTGATCGTGTCTT
GATATCAAGAAAGAAAGTACGAAAGTTTAAATGTTGTACTTGAAAGGCTTGTACTAGCTTTGAGGAGGATCATATTGGGTATATACCCCAGGACAATAATAACAGCGGCA
AAAAGGGCAAGAAGGCAAGTAATTCCATTCCAACTAGTTCCTCGTGTAACTCCGCAGCCTGCACTGAAACACTGCTGGAGCAGTCGAGAAAACGCTCGACGCACCTTGTT
TATATTTATGATTATGGCAGTGGCCTTTGGATCAAGATAACCAAGATTTTAAAGTGCAAGGGAAAAAACAACAAAGAACTTTTTGTGTGCTCTTCTTTGAAATGCTGGAT
TGGAAAGCAGCAAAGAACTTTTCAGGTCGAATCAAATAGTGAAGGGGAAAAAAAAACTGTATGATGTTATTGTGTACCTGAGCTGTTTTCAAGAATACTTATGCTACAAC
ATATGATATCGAAGAACTCA
Protein sequenceShow/hide protein sequence
MLSATRNGSALLKSITLHFSAFSSNSTKPIAIAPQSTDVVDSVCSLLSNHSTNLDLDHSLKKFKQSLTSDLVLRILMNYRLLGRAKTLEFFSWSGFQMGFRFHHSVVEYM
ADFLGRRKLFDDMKCLLVTVSSRKGRLSCRTFSICIRFLGRQGRVREALCLFEEMETKFGCKPDNLVFNNMLYALCKKEPTGELIDVALTIFRRIELPDKYSYSNVIIGL
CKFGRFCTALEVFGEMDRAGLAPTRSAVNILIGDLCSLSAKDGAIEKVRVRSTRRPFTVLVPNVNPKSGVIEPAVCVFWAANRLGLVPSAFVIVQLISELCRLGKMQEAM
RVLKVVEGGKLRCAEECYSIVMQALCEHRHVKEASDLFGWMLSRGMKPKLAIYNYVICMLCKLGNLDDAEKVFKIMNKKRCVPDHVTYSALIHAYGETRKWTAAHNLLKE
MLSLGMSPHFHLFSLVDTLMREHGQIDLCLKLEMKWEAQILQKLCKQGQLEAAFEKLKSMLEKGSYPPIYVRDAFESAFQKKGKTKIARELLQKIDGSPRT