CuGenDBv2

MS013092 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS013092
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: scaffold459:134789..140731
RNA-Seq Expression: MS013092
Synteny: MS013092
Gene Ontology terms:
GO:0000963 - mitochondrial RNA processing (biological process)
GO:0008380 - RNA splicing (biological process)
GO:0032981 - mitochondrial respiratory chain complex I assembly (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
GO:1990825 - sequence-specific mRNA binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity
GAV66013.1 PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-containing protein, partial [Cephalotus follicularis] | 0.0e+00 | 63.59
Query:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD
        IIMLYGK  M KHA+DTFY+MHLYG +RTVKS NA LKVL ++RDLGAIE FL E P+KFDI LDI SVNIV+K  C++GIL +AYL+MLEMEKVG+RPD
Subjt:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD

Query:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS
        V+TYTTLISA  K NR EI NGLWNLMV +GC PNL +FNVRI+YLV RRRAW+AN L+ +M+N+G++PDE+TYNLVIKGFCQAG+F+MAKRVYSA+   
Subjt:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS

Query:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK
        GY+PN+KIY+TMIHYLC+ GDFNLAYTM KD M++NWFPN+DTIH+L++                            VRGLC E + E+G+KL+E+RWG 
Subjt:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK

Query:  GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLK
        GCVPN VFYN LIDGYCKK E+ SA +L  +LK+KGF+PTLET+G+M+NGFCK G+F+A+D LL+EMK+RG++VSV+VYNNIIDA+YK  C ++A +T+K
Subjt:  GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLK

Query:  EMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNA
         M E+ CEPD+ TYNTLI+  CR G+V +A ++LEQ  +RG++PNKFTYTPL+H YC  G++ RAS+ LIEM ++GH+ D+V+YGAL++GLV AGEVD A
Subjt:  EMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNA

Query:  MTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG
        +T+R +M+ERGVLPDA IYNVLM+GL KK  LS AK++L+EMLD N+ PDAF+YATLVDGFIR+  LDEAKKLF+LTIEKGIDPGVVGYN+MIKG+CK G
Subjt:  MTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG

Query:  MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRS
        MM+DA+ C+++M +  H PD F++STIIDGYVK  DL  AL+IF  M KQ+CKPNVVTYT LING+C KG+   AEK F  MQS GL P+VVTY +LI  
Subjt:  MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRS

Query:  LCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDK
         CKE KLA+AAS+FE+ML  RCIPNDV FH LV+ F   N A  +       +  KSMF EFF RMI DG+ +  AAYN ILICLC   +VKTALQL  K
Subjt:  LCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDK

Query:  MLSLGLCPDAVSFAALIHGICLGG
        M++ G+  D VSF AL+HG+CL G
Subjt:  MLSLGLCPDAVSFAALIHGICLGG

RXH68979.1 hypothetical protein DVH24_031312 [Malus domestica] | 0.0e+00 | 49.69
Query:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD
        II LYGKA M KHA+DTF DMHLYGC RTVKSFNA LKVL +TRDLGA+EAFLSE PEKFDIELDI SVNIV+KAFC++GIL +AY +M++MEK+GI+PD
Subjt:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD

Query:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS
        V+TYTTL+SAFYKDNR EI NGLWNLM+L+GCLPNLA+FNVRIQYLV RRRAWEAN+LM +M+NI I PDEVTYNLVIKGFCQAG+ +MAKRVYSAL G 
Subjt:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS

Query:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNG-----------------------------
        GYKPNVKIYQTMIHYLC+ GDF+LAYTMCKD M +NWFPN+DTI +L++GLKK  QLGKA       ++NG                             
Subjt:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNG-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------DDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRW
                                                                          +  C DNY+TCIMV+GLC EGR E+GRKLI  RW
Subjt:  ------------------------------------------------------------------DDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRW

Query:  GKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDT
        GK CVPN+VFYNTLIDGYCKKG+V SA  +F ELK KGF+PTLET+G+M+NG+CK G F+AID L MEMK+RGL ++VQV NNI+DA+ K G  ++  +T
Subjt:  GKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDT

Query:  LKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVD
        +K+M E+ CEPD+ TYN LI+  C+GG+V EAE+ +  A++RG+VPNKF+YTPL H Y +Q E+ RA DL  +++++G+K D+VSYGALIHGLVV+ EVD
Subjt:  LKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVD

Query:  NAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCK
         AM +RDRMME GV+PDA+IYNVLM+GL KKG L  AK++L +MLDQN+ PDA++YATLVDG IR+ +L+EAK +F LTIEKG++PGVVGYN+MIKGFCK
Subjt:  NAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCK

Query:  FGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLI
        FGMM+DA+ C ++MR   H PD FT+STIIDGYVKQ +L AAL  F LM+KQ CKPNVVTYTSLI G+CHKG+   A K F  M+S GLEP+VVTY +LI
Subjt:  FGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLI

Query:  RSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLR
         + CKE  LA AAS+FELML  +CIPNDV FHYLVNGF N    A+ K +N   +N KS+F   F RMI DGW +K A YN I+ICLC H MVKTALQL 
Subjt:  RSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLR

Query:  DKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKE
        +K ++ G+  D+VSFA L+HGICL G SKE K++I   L ++E
Subjt:  DKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKE

XP_022153568.1 pentatricopeptide repeat-containing protein At1g52620 [Momordica charantia] | 0.0e+00 | 95.45
Query:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY
        N  P+    +SL+  L K  +LG A         +DNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY
Subjt:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY

Query:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE
        ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKD LKE AENCCEPDLVTYNTLINYLCRGGE
Subjt:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE

Query:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL
        VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL
Subjt:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL

Query:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST
        FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST
Subjt:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST

Query:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND
        IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND
Subjt:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND

Query:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS
        VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS
Subjt:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS

Query:  KECKNVISCYLSEKEL
        KECKNVISCYLSEKEL
Subjt:  KECKNVISCYLSEKEL

XP_024956279.1 pentatricopeptide repeat-containing protein At1g52620 [Citrus sinensis] | 3.3e-296 | 48.27
Query:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD
        I+MLYGKA MIKHA+DTFYDMHLYGC+RTVKS NA LKVL ++RDL AI+AFL E PEKF I+ DI S NIV+KAFC++GIL +AYL+M+EM+K+G++PD
Subjt:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD

Query:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS
        V+TYTTLISAFYKDNR EI NGLWNLMV +GC PNLA+FNVRIQ+LV++RR+W+ANKLM +M+  GI PDEVTYNLVIKGFC++G  DMAK+VYSA+ G 
Subjt:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS

Query:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQ---------------------------------------
           PN KIYQTMIHYLC+ GDFNLAY MCKD+M +NW P++DTI +L++GLKK  Q  KA                                        
Subjt:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQ---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------DNG----------------------------------------------------
                                                      NG                                                    
Subjt:  ---------------------------------------------DNG----------------------------------------------------

Query:  ------------------------------------------DDKC-----TDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYC
                                                  D+ C      DNY+TCIMVRGLC EG+ E+G+ LIE R+GKGC+PNIVFYNTLIDGYC
Subjt:  ------------------------------------------DDKC-----TDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYC

Query:  KKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTL
        KKG+V +A +LF ELK+KGF+PTLET+G++++GFCK G+F+ ID L+MEMK R L+V+V+VYN+IID +YK G  + A +T++ M EN CEPD+VTYN L
Subjt:  KKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTL

Query:  INYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDAN
        I+  CR G+V EA ++LEQ  KRG+ PNK++YTPL+H Y K GEY +ASDLL++M+++GHK D+ ++GA+IHGLV AGEV  AMT++++MMER  +PDA 
Subjt:  INYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDAN

Query:  IYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARH
        IYNVLM+GL KK  L  AK++L EMLD N+  DA+IYATL+DGFIR+D+LDEAKKLF+LTI+KG+DPGVVG N+MIKG+CKFG+M+DA+ C++RM    H
Subjt:  IYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARH

Query:  VPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELM
         PD FT+STIIDGYVKQ DL  AL+ FG M+++ CKPNVVTYT+LI+G+C  G+   A++ F  MQ HGL P+VVTY ++I S CK+ +L +AAS+FELM
Subjt:  VPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELM

Query:  LINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALI
        L N+CIPND  FH L+NGF N    AVS   +   E  K +F EFF RMI DGW   AAAYN I+ICLC H MVK ALQL DKM+S G   D +SFAAL+
Subjt:  LINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALI

Query:  HGICLGGSSKECKNVISCYLSEKEL
        HGICL G SKE  N I C L+EKEL
Subjt:  HGICLGGSSKECKNVISCYLSEKEL

XP_038894903.1 pentatricopeptide repeat-containing protein At1g52620 [Benincasa hispida] | 6.9e-294 | 80.03
Query:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY
        N  P++   +SL+  L K  +L  A         +DNGDD C DNYTTCIMVRGLCLEGR EDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEV SAY
Subjt:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY

Query:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE
        +LF ELK+KGF+PTLETFGS+VNGFCK G FEAIDLLL+EMK+RGLSV+VQ+YN IIDA+YKLG D +AKDTLKEM ENCC PDLVTYNTLINYLC  GE
Subjt:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE

Query:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL
        V EAEK+LEQ I+RG+ P+KF YTPLVH Y KQGEY RASDL+IEMS +GH+VD VSYGA+IHGLVVAGEVD A+TIRDRMMERGVLPDANIYNVLMNGL
Subjt:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL

Query:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST
        FKKG LSMAK+ML+EMLDQ+IAPDAFIYATLVDGFIRH NLDEA K+FQLTIEKGIDPGVVGYN MIKGF KFGMM DA+LCIDRMRSA H PDVFTFST
Subjt:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST

Query:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND
        IIDGYVKQ D+YA LK+FGLM+KQ+CKPNV+TYTSLINGYC KGE+K+AEK FS+MQSHGLEPSVVTY +LIRS CKEAKL +AASYFELMLIN+C PND
Subjt:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND

Query:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS
        V+FHYLVNGF N NAAAVS+G NN  +N++SMFE+FF RMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSF ALIHGICL G S
Subjt:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS

Query:  KECKNVISCYLSEKEL
        KE +N+ISC L+E EL
Subjt:  KECKNVISCYLSEKEL

TrEMBL top hits | e value | % identity
A0A1Q3BDZ3 PPR domain-containing protein/PPR_1 domain-containing protein/PPR_2 domain-containing protein/PPR_3 domain-containing protein (Fragment) | 0.0e+00 | 63.59
Query:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD
        IIMLYGK  M KHA+DTFY+MHLYG +RTVKS NA LKVL ++RDLGAIE FL E P+KFDI LDI SVNIV+K  C++GIL +AYL+MLEMEKVG+RPD
Subjt:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD

Query:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS
        V+TYTTLISA  K NR EI NGLWNLMV +GC PNL +FNVRI+YLV RRRAW+AN L+ +M+N+G++PDE+TYNLVIKGFCQAG+F+MAKRVYSA+   
Subjt:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS

Query:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK
        GY+PN+KIY+TMIHYLC+ GDFNLAYTM KD M++NWFPN+DTIH+L++                            VRGLC E + E+G+KL+E+RWG 
Subjt:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGK

Query:  GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLK
        GCVPN VFYN LIDGYCKK E+ SA +L  +LK+KGF+PTLET+G+M+NGFCK G+F+A+D LL+EMK+RG++VSV+VYNNIIDA+YK  C ++A +T+K
Subjt:  GCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLK

Query:  EMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNA
         M E+ CEPD+ TYNTLI+  CR G+V +A ++LEQ  +RG++PNKFTYTPL+H YC  G++ RAS+ LIEM ++GH+ D+V+YGAL++GLV AGEVD A
Subjt:  EMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNA

Query:  MTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG
        +T+R +M+ERGVLPDA IYNVLM+GL KK  LS AK++L+EMLD N+ PDAF+YATLVDGFIR+  LDEAKKLF+LTIEKGIDPGVVGYN+MIKG+CK G
Subjt:  MTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFG

Query:  MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRS
        MM+DA+ C+++M +  H PD F++STIIDGYVK  DL  AL+IF  M KQ+CKPNVVTYT LING+C KG+   AEK F  MQS GL P+VVTY +LI  
Subjt:  MMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRS

Query:  LCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDK
         CKE KLA+AAS+FE+ML  RCIPNDV FH LV+ F   N A  +       +  KSMF EFF RMI DG+ +  AAYN ILICLC   +VKTALQL  K
Subjt:  LCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDK

Query:  MLSLGLCPDAVSFAALIHGICLGG
        M++ G+  D VSF AL+HG+CL G
Subjt:  MLSLGLCPDAVSFAALIHGICLGG

A0A1S3BYA4 pentatricopeptide repeat-containing protein At1g52620 | 4.5e-283 | 77.76
Query:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY
        N  P++   +SL+  L K  +   A         +DNGD    D YTTCIMVRGLCLEGR EDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEV SAY
Subjt:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY

Query:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE
        ELF ELK KGF+PTL+TFGS+VNGFCK G FEAIDLLL+EMKDRG SV+VQ+YNNIIDAQYKLGCDI+AKDTLKEM+EN C PDLVTYNTLINYLC  GE
Subjt:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE

Query:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL
        V EAEK+LEQ I+RG+ PN+FTYTPLVH YCK+GEY RA+DLLIEMS +G ++DM+SYGALIHGLVVAGEVD A+TIRDRMM +G+LPDANIYNVLMNGL
Subjt:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL

Query:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST
        FKKG LSMAKV+LSEMLDQNIAPDAF+YATLVDGFIR  NLDEAKKLFQL IEKG+DPGVVGYN MIKGF KFGMM++A+LCIDRMRSA HVPDVFTFST
Subjt:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST

Query:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND
        IIDGYVKQ ++ A LKIFGLM+KQ+CKPNVVTYTSLINGYC KGE ++AEKLFS+M+SHGLEPSVVTY +LI + CKEAKL +A SYFELMLIN+C PND
Subjt:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND

Query:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS
          FHYLVNGF N  A AVS G NN  EN++SMFE+FF RMIGDGWTRKAAAYNCILICLCQ RMVKTALQLR+KMLSLGLC DAVSF AL+HGICL G+S
Subjt:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS

Query:  KECKNVISCYLSEKEL
        KE +N+ISC L+E EL
Subjt:  KECKNVISCYLSEKEL

A0A498HD54 Uncharacterized protein | 0.0e+00 | 49.69
Query:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD
        II LYGKA M KHA+DTF DMHLYGC RTVKSFNA LKVL +TRDLGA+EAFLSE PEKFDIELDI SVNIV+KAFC++GIL +AY +M++MEK+GI+PD
Subjt:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD

Query:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS
        V+TYTTL+SAFYKDNR EI NGLWNLM+L+GCLPNLA+FNVRIQYLV RRRAWEAN+LM +M+NI I PDEVTYNLVIKGFCQAG+ +MAKRVYSAL G 
Subjt:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS

Query:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNG-----------------------------
        GYKPNVKIYQTMIHYLC+ GDF+LAYTMCKD M +NWFPN+DTI +L++GLKK  QLGKA       ++NG                             
Subjt:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNG-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------DDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRW
                                                                          +  C DNY+TCIMV+GLC EGR E+GRKLI  RW
Subjt:  ------------------------------------------------------------------DDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRW

Query:  GKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDT
        GK CVPN+VFYNTLIDGYCKKG+V SA  +F ELK KGF+PTLET+G+M+NG+CK G F+AID L MEMK+RGL ++VQV NNI+DA+ K G  ++  +T
Subjt:  GKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDT

Query:  LKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVD
        +K+M E+ CEPD+ TYN LI+  C+GG+V EAE+ +  A++RG+VPNKF+YTPL H Y +Q E+ RA DL  +++++G+K D+VSYGALIHGLVV+ EVD
Subjt:  LKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVD

Query:  NAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCK
         AM +RDRMME GV+PDA+IYNVLM+GL KKG L  AK++L +MLDQN+ PDA++YATLVDG IR+ +L+EAK +F LTIEKG++PGVVGYN+MIKGFCK
Subjt:  NAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCK

Query:  FGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLI
        FGMM+DA+ C ++MR   H PD FT+STIIDGYVKQ +L AAL  F LM+KQ CKPNVVTYTSLI G+CHKG+   A K F  M+S GLEP+VVTY +LI
Subjt:  FGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLI

Query:  RSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLR
         + CKE  LA AAS+FELML  +CIPNDV FHYLVNGF N    A+ K +N   +N KS+F   F RMI DGW +K A YN I+ICLC H MVKTALQL 
Subjt:  RSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLR

Query:  DKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKE
        +K ++ G+  D+VSFA L+HGICL G SKE K++I   L ++E
Subjt:  DKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKE

A0A5A7TLP1 Pentatricopeptide repeat-containing protein | 4.5e-283 | 77.76
Query:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY
        N  P++   +SL+  L K  +   A         +DNGD    D YTTCIMVRGLCLEGR EDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEV SAY
Subjt:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY

Query:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE
        ELF ELK KGF+PTL+TFGS+VNGFCK G FEAIDLLL+EMKDRG SV+VQ+YNNIIDAQYKLGCDI+AKDTLKEM+EN C PDLVTYNTLINYLC  GE
Subjt:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE

Query:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL
        V EAEK+LEQ I+RG+ PN+FTYTPLVH YCK+GEY RA+DLLIEMS +G ++DM+SYGALIHGLVVAGEVD A+TIRDRMM +G+LPDANIYNVLMNGL
Subjt:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL

Query:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST
        FKKG LSMAKV+LSEMLDQNIAPDAF+YATLVDGFIR  NLDEAKKLFQL IEKG+DPGVVGYN MIKGF KFGMM++A+LCIDRMRSA HVPDVFTFST
Subjt:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST

Query:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND
        IIDGYVKQ ++ A LKIFGLM+KQ+CKPNVVTYTSLINGYC KGE ++AEKLFS+M+SHGLEPSVVTY +LI + CKEAKL +A SYFELMLIN+C PND
Subjt:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND

Query:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS
          FHYLVNGF N  A AVS G NN  EN++SMFE+FF RMIGDGWTRKAAAYNCILICLCQ RMVKTALQLR+KMLSLGLC DAVSF AL+HGICL G+S
Subjt:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS

Query:  KECKNVISCYLSEKEL
        KE +N+ISC L+E EL
Subjt:  KECKNVISCYLSEKEL

A0A6J1DHT9 pentatricopeptide repeat-containing protein At1g52620 | 0.0e+00 | 95.45
Query:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY
        N  P+    +SL+  L K  +LG A         +DNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY
Subjt:  NWFPNIDTIHSLIKGLKKMGQLGKA---------QDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAY

Query:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE
        ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKD LKE AENCCEPDLVTYNTLINYLCRGGE
Subjt:  ELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGE

Query:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL
        VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL
Subjt:  VMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGL

Query:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST
        FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST
Subjt:  FKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFST

Query:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND
        IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND
Subjt:  IIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPND

Query:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS
        VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS
Subjt:  VIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSS

Query:  KECKNVISCYLSEKEL
        KECKNVISCYLSEKEL
Subjt:  KECKNVISCYLSEKEL

SwissProt top hits | e value | % identity
Q8GW57 Pentatricopeptide repeat-containing protein At1g80150, mitochondrial | 3.3e-97 | 63.22
Query:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD
        IIMLYGKA M K A+DTF++M LYGC+R+VKSFNA L+VL    DL  I  FL +AP K+ I++D +S NI +K+FC++GIL  AY+ M EMEK G+ PD
Subjt:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD

Query:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS
        VVTYTTLISA YK  RC I NGLWNLMVL+GC PNL +FNVRIQ+LV+RRRAW+AN L+ +M  + + PD +TYN+VIKGF  A F DMA+RVY+A+ G 
Subjt:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS

Query:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQ
        GYKPN+KIYQTMIHYLC++G+F+LAYTMCKD M + W+PN+DT+  L+KGL K GQL +A+
Subjt:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQ

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g74580 | 2.7e-83 | 30.35
Query:  YGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTY
        YG+   ++ AV+ F  M  Y C  TV S+NA++ VL+ +              ++  I  D+ S  I +K+FC       A  L+  M   G   +VV Y
Subjt:  YGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTY

Query:  TTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKP
         T++  FY++N       L+  M+  G    L++FN  ++ L  +    E  KL++ +   G++P+  TYNL I+G CQ G  D A R+   L   G KP
Subjt:  TTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKP

Query:  NVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQ-------LGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESR
        +V  Y  +I+ LC++  F  A       +N    P+  T ++LI G  K G        +G A  NG     D +T   ++ GLC EG T     L    
Subjt:  NVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQ-------LGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESR

Query:  WGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNII---DAQYKLGCDIR
         GKG  PN++ YNTLI G   +G +  A +L  E+  KG +P ++TF  +VNG CK G     D L+  M  +G    +  +N +I     Q K+     
Subjt:  WGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNII---DAQYKLGCDIR

Query:  AKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVA
        A + L  M +N  +PD+ TYN+L+N LC+  +  +  +  +  +++G  PN FT+  L+ + C+  +   A  LL EM  K    D V++G LI G    
Subjt:  AKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVA

Query:  GEVDNAMTIRDRMMER-GVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMI
        G++D A T+  +M E   V      YN++++   +K N++MA+ +  EM+D+ + PD + Y  +VDGF +  N++   K     +E G  P +     +I
Subjt:  GEVDNAMTIRDRMMER-GVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMI

Query:  KGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSC
           C    + +A   I RM     VP+    +TI D  V + ++ A   +   +LK+SC
Subjt:  KGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSC

Q9FJE6 Putative pentatricopeptide repeat-containing protein At5g59900 (e value: 9.3e-84; %identity: 27.69)
Query:  ILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG
        +L    +  + + KV + P+V T + L+    K     ++  L+N MV  G  P++  +   I+ L + +    A +++  M   G   + V YN++I G
Subjt:  ILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG

Query:  FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRG
         C+      A  +   L G   KP+V  Y T+++ LC+  +F +   M  + +   + P+   + SL++GL+K G++                       
Subjt:  FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRG

Query:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN
               E+   L++     G  PN+  YN LID  CK  +   A  LF  +   G  P   T+  +++ FC+ G  +     L EM D GL +SV  YN
Subjt:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN

Query:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD
        ++I+   K G    A+  + EM     EP +VTY +L+   C  G++ +A ++  +   +G+ P+ +T+T L+    + G    A  L  EM++   K +
Subjt:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD

Query:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK
         V+Y  +I G    G++  A      M E+G++PD   Y  L++GL   G  S AKV +  +   N   +   Y  L+ GF R   L+EA  + Q  +++
Subjt:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK

Query:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS
        G+D  +V Y  +I G  K    +     +  M      PD   ++++ID   K  D   A  I+ LM+ + C PN VTYT++ING C  G +  AE L S
Subjt:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS

Query:  LMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNC
         MQ     P+ VTYG  +  L K     + A      ++   + N   ++ L+ GF                E A     E   RMIGDG +     Y  
Subjt:  LMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNC

Query:  ILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGG
        ++  LC+   VK A++L + M   G+ PD V++  LIHG C+ G
Subjt:  ILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGG

Q9M907 Pentatricopeptide repeat-containing protein At3g06920 (e value: 1.1e-79; %identity: 27.59)
Query:  KSFNAVLKVLMKTRDLGAIEAFLSE---APEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLM
        +S+N++L V+ + R+  A++  L E   A     +   I  V   VKA      L   Y ++  M K   RP    YTTLI AF   N  ++   L+  M
Subjt:  KSFNAVLKVLMKTRDLGAIEAFLSE---APEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLM

Query:  VLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYT
           G  P +  F   I+      R   A  L++ M++  +  D V YN+ I  F + G  DMA + +  ++ +G KP+   Y +MI  LC++   + A  
Subjt:  VLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYT

Query:  MCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNGDDKCTDNYTTCIMVRGLCLE--GRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCK
        M +        P     +++I G    G+  +A       +  G       Y  CI+    CL   G+ ++  K+ E    K   PN+  YN LID  C+
Subjt:  MCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNGDDKCTDNYTTCIMVRGLCLE--GRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCK

Query:  KGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLI
         G++ +A+EL   ++  G  P + T   MV+  CK+   +    +  EM  +  +     + ++ID   K+G    A    ++M ++ C  + + Y +LI
Subjt:  KGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLI

Query:  NYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANI
              G   +  KI +  I +   P+       +    K GE  +   +  E+  +    D  SY  LIHGL+ AG  +    +   M E+G + D   
Subjt:  NYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANI

Query:  YNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHV
        YN++++G  K G ++ A  +L EM  +   P    Y +++DG  + D LDEA  LF+    K I+  VV Y+S+I GF K G +++A L ++ +      
Subjt:  YNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHV

Query:  PDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELML
        P+++T+++++D  VK  ++  AL  F  M +  C PN VTY  LING C   +   A   +  MQ  G++PS ++Y  +I  L K   +AEA + F+   
Subjt:  PDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELML

Query:  INRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEE
         N  +P+   ++ ++ G +N N A           +A S+FEE
Subjt:  INRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEE

Q9SSR4 Pentatricopeptide repeat-containing protein At1g52620 (e value: 3.2e-193; %identity: 50.08)
Query:  FDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMN-RNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKC-----TDNYTTCIMVRG
        F+  + V   L+    K   +    ++H    SG  + A  +    +   +  P++   +SL+  L K  +LG A+   D+ C      DNY+TCI+V+G
Subjt:  FDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMN-RNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKC-----TDNYTTCIMVRG

Query:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN
        +C EG+ E GRKLIE RWGKGC+PNIVFYNT+I GYCK G++ +AY +F ELKLKGF+PTLETFG+M+NGFCK G+F A D LL E+K+RGL VSV   N
Subjt:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN

Query:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD
        NIIDA+Y+ G  +   +++  +  N C+PD+ TYN LIN LC+ G+   A   L++A K+G++PN  +Y PL+ AYCK  EY  AS LL++M+++G K D
Subjt:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD

Query:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK
        +V+YG LIHGLVV+G +D+A+ ++ ++++RGV PDA IYN+LM+GL K G    AK++ SEMLD+NI PDA++YATL+DGFIR  + DEA+K+F L++EK
Subjt:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK

Query:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS
        G+   VV +N+MIKGFC+ GM+++A+ C++RM     VPD FT+STIIDGYVKQ D+  A+KIF  M K  CKPNVVTYTSLING+C +G+ K+AE+ F 
Subjt:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS

Query:  LMQSHGLEPSVVTYGVLIRSLCKEAK-LAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYN
         MQ   L P+VVTY  LIRSL KE+  L +A  Y+ELM+ N+C+PN+V F+ L+ GF    +  V    +  +    S+F EFF RM  DGW+  AAAYN
Subjt:  LMQSHGLEPSVVTYGVLIRSLCKEAK-LAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYN

Query:  CILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKEL
          L+CLC H MVKTA   +DKM+  G  PD VSFAA++HG C+ G+SK+ +N+  C L EK L
Subjt:  CILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKEL

Arabidopsis top hits (e value, %identity, alignment)
AT1G52620.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.3e-194; %identity: 50.08)
Query:  FDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMN-RNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKC-----TDNYTTCIMVRG
        F+  + V   L+    K   +    ++H    SG  + A  +    +   +  P++   +SL+  L K  +LG A+   D+ C      DNY+TCI+V+G
Subjt:  FDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMN-RNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKC-----TDNYTTCIMVRG

Query:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN
        +C EG+ E GRKLIE RWGKGC+PNIVFYNT+I GYCK G++ +AY +F ELKLKGF+PTLETFG+M+NGFCK G+F A D LL E+K+RGL VSV   N
Subjt:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN

Query:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD
        NIIDA+Y+ G  +   +++  +  N C+PD+ TYN LIN LC+ G+   A   L++A K+G++PN  +Y PL+ AYCK  EY  AS LL++M+++G K D
Subjt:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD

Query:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK
        +V+YG LIHGLVV+G +D+A+ ++ ++++RGV PDA IYN+LM+GL K G    AK++ SEMLD+NI PDA++YATL+DGFIR  + DEA+K+F L++EK
Subjt:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK

Query:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS
        G+   VV +N+MIKGFC+ GM+++A+ C++RM     VPD FT+STIIDGYVKQ D+  A+KIF  M K  CKPNVVTYTSLING+C +G+ K+AE+ F 
Subjt:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS

Query:  LMQSHGLEPSVVTYGVLIRSLCKEAK-LAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYN
         MQ   L P+VVTY  LIRSL KE+  L +A  Y+ELM+ N+C+PN+V F+ L+ GF    +  V    +  +    S+F EFF RM  DGW+  AAAYN
Subjt:  LMQSHGLEPSVVTYGVLIRSLCKEAK-LAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYN

Query:  CILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKEL
          L+CLC H MVKTA   +DKM+  G  PD VSFAA++HG C+ G+SK+ +N+  C L EK L
Subjt:  CILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKEL

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.9e-84; %identity: 30.35)
Query:  YGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTY
        YG+   ++ AV+ F  M  Y C  TV S+NA++ VL+ +              ++  I  D+ S  I +K+FC       A  L+  M   G   +VV Y
Subjt:  YGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTY

Query:  TTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKP
         T++  FY++N       L+  M+  G    L++FN  ++ L  +    E  KL++ +   G++P+  TYNL I+G CQ G  D A R+   L   G KP
Subjt:  TTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKP

Query:  NVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQ-------LGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESR
        +V  Y  +I+ LC++  F  A       +N    P+  T ++LI G  K G        +G A  NG     D +T   ++ GLC EG T     L    
Subjt:  NVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQ-------LGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESR

Query:  WGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNII---DAQYKLGCDIR
         GKG  PN++ YNTLI G   +G +  A +L  E+  KG +P ++TF  +VNG CK G     D L+  M  +G    +  +N +I     Q K+     
Subjt:  WGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNII---DAQYKLGCDIR

Query:  AKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVA
        A + L  M +N  +PD+ TYN+L+N LC+  +  +  +  +  +++G  PN FT+  L+ + C+  +   A  LL EM  K    D V++G LI G    
Subjt:  AKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVA

Query:  GEVDNAMTIRDRMMER-GVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMI
        G++D A T+  +M E   V      YN++++   +K N++MA+ +  EM+D+ + PD + Y  +VDGF +  N++   K     +E G  P +     +I
Subjt:  GEVDNAMTIRDRMMER-GVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMI

Query:  KGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSC
           C    + +A   I RM     VP+    +TI D  V + ++ A   +   +LK+SC
Subjt:  KGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSC

AT1G80150.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 2.3e-98; %identity: 63.22)
Query:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD
        IIMLYGKA M K A+DTF++M LYGC+R+VKSFNA L+VL    DL  I  FL +AP K+ I++D +S NI +K+FC++GIL  AY+ M EMEK G+ PD
Subjt:  IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPD

Query:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS
        VVTYTTLISA YK  RC I NGLWNLMVL+GC PNL +FNVRIQ+LV+RRRAW+AN L+ +M  + + PD +TYN+VIKGF  A F DMA+RVY+A+ G 
Subjt:  VVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGS

Query:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQ
        GYKPN+KIYQTMIHYLC++G+F+LAYTMCKD M + W+PN+DT+  L+KGL K GQL +A+
Subjt:  GYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQ

AT3G06920.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 7.5e-81; %identity: 27.59)
Query:  KSFNAVLKVLMKTRDLGAIEAFLSE---APEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLM
        +S+N++L V+ + R+  A++  L E   A     +   I  V   VKA      L   Y ++  M K   RP    YTTLI AF   N  ++   L+  M
Subjt:  KSFNAVLKVLMKTRDLGAIEAFLSE---APEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLM

Query:  VLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYT
           G  P +  F   I+      R   A  L++ M++  +  D V YN+ I  F + G  DMA + +  ++ +G KP+   Y +MI  LC++   + A  
Subjt:  VLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYT

Query:  MCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNGDDKCTDNYTTCIMVRGLCLE--GRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCK
        M +        P     +++I G    G+  +A       +  G       Y  CI+    CL   G+ ++  K+ E    K   PN+  YN LID  C+
Subjt:  MCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKA-------QDNGDDKCTDNYTTCIMVRGLCLE--GRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCK

Query:  KGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLI
         G++ +A+EL   ++  G  P + T   MV+  CK+   +    +  EM  +  +     + ++ID   K+G    A    ++M ++ C  + + Y +LI
Subjt:  KGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLI

Query:  NYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANI
              G   +  KI +  I +   P+       +    K GE  +   +  E+  +    D  SY  LIHGL+ AG  +    +   M E+G + D   
Subjt:  NYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANI

Query:  YNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHV
        YN++++G  K G ++ A  +L EM  +   P    Y +++DG  + D LDEA  LF+    K I+  VV Y+S+I GF K G +++A L ++ +      
Subjt:  YNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHV

Query:  PDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELML
        P+++T+++++D  VK  ++  AL  F  M +  C PN VTY  LING C   +   A   +  MQ  G++PS ++Y  +I  L K   +AEA + F+   
Subjt:  PDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELML

Query:  INRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEE
         N  +P+   ++ ++ G +N N A           +A S+FEE
Subjt:  INRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEE

AT5G59900.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.6e-85; %identity: 27.69)
Query:  ILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG
        +L    +  + + KV + P+V T + L+    K     ++  L+N MV  G  P++  +   I+ L + +    A +++  M   G   + V YN++I G
Subjt:  ILSRAYLLMLEMEKVGIRPDVVTYTTLISAFYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKG

Query:  FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRG
         C+      A  +   L G   KP+V  Y T+++ LC+  +F +   M  + +   + P+   + SL++GL+K G++                       
Subjt:  FCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSGDFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRG

Query:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN
               E+   L++     G  PN+  YN LID  CK  +   A  LF  +   G  P   T+  +++ FC+ G  +     L EM D GL +SV  YN
Subjt:  LCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFIELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYN

Query:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD
        ++I+   K G    A+  + EM     EP +VTY +L+   C  G++ +A ++  +   +G+ P+ +T+T L+    + G    A  L  EM++   K +
Subjt:  NIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKRGMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVD

Query:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK
         V+Y  +I G    G++  A      M E+G++PD   Y  L++GL   G  S AKV +  +   N   +   Y  L+ GF R   L+EA  + Q  +++
Subjt:  MVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPDAFIYATLVDGFIRHDNLDEAKKLFQLTIEK

Query:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS
        G+D  +V Y  +I G  K    +     +  M      PD   ++++ID   K  D   A  I+ LM+ + C PN VTYT++ING C  G +  AE L S
Subjt:  GIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYTSLINGYCHKGELKIAEKLFS

Query:  LMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNC
         MQ     P+ VTYG  +  L K     + A      ++   + N   ++ L+ GF                E A     E   RMIGDG +     Y  
Subjt:  LMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDGWTRKAAAYNC

Query:  ILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGG
        ++  LC+   VK A++L + M   G+ PD V++  LIHG C+ G
Subjt:  ILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGG


Sequences
CDS sequence
ATCATAATGCTATATGGTAAGGCTGAGATGATCAAGCATGCTGTTGATACTTTTTATGATATGCACTTATATGGGTGCCGTAGGACTGTGAAATCTTTTAATGCCGTGCT
TAAGGTTTTGATGAAGACCCGTGATTTGGGAGCCATTGAGGCATTTTTGAGTGAAGCTCCTGAAAAATTCGATATTGAGTTGGATATTATTTCTGTTAACATTGTTGTTA
AGGCTTTTTGTGATATAGGTATTCTAAGTAGAGCTTATCTTCTCATGTTAGAGATGGAGAAAGTGGGAATAAGACCTGATGTGGTTACCTATACGACGTTAATTTCAGCT
TTTTACAAGGATAATCGATGCGAAATTAGTAATGGACTGTGGAATCTAATGGTTTTGAGGGGTTGTTTGCCCAATCTTGCTTCTTTCAATGTGAGGATTCAATATTTGGT
TGACAGGAGACGAGCGTGGGAGGCTAATAAATTGATGAATGTGATGCGGAATATCGGTATTGTTCCCGATGAGGTTACTTACAATCTGGTAATAAAAGGCTTTTGCCAAG
CCGGTTTTTTTGATATGGCCAAAAGGGTTTATTCTGCTCTGCAAGGGAGCGGGTATAAACCGAACGTCAAAATTTACCAAACCATGATTCATTACCTGTGCAGAAGTGGA
GATTTCAACCTGGCATATACAATGTGCAAAGATGCCATGAACAGGAATTGGTTTCCAAATATTGATACAATTCATTCATTAATTAAAGGCCTGAAGAAGATGGGACAGCT
TGGGAAGGCTCAAGATAACGGCGACGATAAATGTACGGATAACTATACTACTTGTATTATGGTGAGGGGCTTATGTTTGGAAGGTAGAACCGAGGATGGAAGGAAGCTGA
TTGAATCCAGATGGGGGAAAGGCTGTGTACCGAACATTGTGTTTTACAATACACTCATTGATGGATATTGCAAGAAAGGTGAGGTTGGAAGTGCTTATGAACTTTTTATT
GAATTGAAGCTGAAAGGATTTGTACCTACATTAGAAACTTTTGGTTCCATGGTAAATGGCTTTTGCAAGACGGGAAACTTTGAAGCTATTGATCTTCTTTTGATGGAAAT
GAAAGATAGGGGCTTGAGTGTTAGTGTTCAAGTGTATAATAACATTATTGATGCTCAATATAAGCTTGGTTGTGACATTAGAGCAAAGGATACACTTAAAGAAATGGCTG
AGAATTGCTGTGAACCAGATCTTGTGACTTATAATACTCTAATCAACTATTTATGCAGAGGTGGGGAGGTCATGGAAGCTGAGAAGATCTTGGAACAAGCAATAAAGAGA
GGAATGGTGCCGAATAAGTTCACTTATACTCCGCTTGTTCATGCCTATTGTAAACAAGGGGAATATTATAGGGCCTCAGATTTACTTATTGAGATGTCAAAAAAAGGACA
TAAAGTTGATATGGTTTCGTATGGAGCTTTAATTCATGGACTTGTAGTTGCAGGGGAAGTCGATAATGCTATGACTATCCGGGACAGAATGATGGAAAGAGGGGTTTTAC
CTGATGCCAATATCTACAATGTTTTGATGAATGGACTTTTCAAGAAAGGGAATCTTTCCATGGCCAAGGTGATGCTTTCTGAGATGCTTGACCAAAATATAGCACCTGAT
GCATTTATTTATGCAACTTTAGTGGATGGGTTCATTAGGCATGACAACCTTGATGAGGCCAAGAAACTCTTTCAGCTCACTATAGAAAAGGGTATAGACCCAGGTGTTGT
GGGATATAATTCCATGATCAAAGGTTTCTGTAAATTCGGGATGATGGAAGATGCAGTTTTGTGCATTGATAGAATGAGGAGTGCCCGTCATGTTCCTGATGTCTTTACTT
TCTCCACCATAATTGATGGATATGTAAAACAATGTGACTTGTACGCTGCACTGAAGATCTTTGGACTGATGTTGAAGCAGAGTTGCAAACCAAATGTTGTCACTTACACA
TCTTTGATCAATGGATATTGCCATAAGGGAGAATTGAAGATAGCTGAAAAACTTTTTAGCTTAATGCAATCTCATGGTTTGGAGCCTAGTGTCGTCACATACGGTGTACT
TATACGGAGCCTTTGCAAAGAAGCTAAGCTTGCCGAAGCTGCATCGTATTTCGAGCTCATGTTGATTAACAGGTGCATCCCTAATGATGTTATATTTCATTATCTAGTCA
ATGGGTTTGCAAATATGAATGCTGCTGCAGTTTCCAAAGGACTAAATAATTTTAGCGAGAATGCTAAATCTATGTTTGAGGAGTTCTTTCTGAGAATGATCGGTGATGGA
TGGACACGAAAGGCTGCAGCTTACAATTGTATTCTCATTTGCCTTTGTCAGCATCGAATGGTTAAAACTGCCTTACAGTTGCGTGACAAGATGCTGTCTTTGGGACTTTG
TCCTGATGCTGTTTCTTTTGCTGCATTGATACATGGCATTTGCTTGGGAGGAAGCTCAAAAGAATGCAAGAATGTTATTTCCTGTTATCTGAGTGAAAAGGAGCTG
mRNA sequence
ATCATAATGCTATATGGTAAGGCTGAGATGATCAAGCATGCTGTTGATACTTTTTATGATATGCACTTATATGGGTGCCGTAGGACTGTGAAATCTTTTAATGCCGTGCT
TAAGGTTTTGATGAAGACCCGTGATTTGGGAGCCATTGAGGCATTTTTGAGTGAAGCTCCTGAAAAATTCGATATTGAGTTGGATATTATTTCTGTTAACATTGTTGTTA
AGGCTTTTTGTGATATAGGTATTCTAAGTAGAGCTTATCTTCTCATGTTAGAGATGGAGAAAGTGGGAATAAGACCTGATGTGGTTACCTATACGACGTTAATTTCAGCT
TTTTACAAGGATAATCGATGCGAAATTAGTAATGGACTGTGGAATCTAATGGTTTTGAGGGGTTGTTTGCCCAATCTTGCTTCTTTCAATGTGAGGATTCAATATTTGGT
TGACAGGAGACGAGCGTGGGAGGCTAATAAATTGATGAATGTGATGCGGAATATCGGTATTGTTCCCGATGAGGTTACTTACAATCTGGTAATAAAAGGCTTTTGCCAAG
CCGGTTTTTTTGATATGGCCAAAAGGGTTTATTCTGCTCTGCAAGGGAGCGGGTATAAACCGAACGTCAAAATTTACCAAACCATGATTCATTACCTGTGCAGAAGTGGA
GATTTCAACCTGGCATATACAATGTGCAAAGATGCCATGAACAGGAATTGGTTTCCAAATATTGATACAATTCATTCATTAATTAAAGGCCTGAAGAAGATGGGACAGCT
TGGGAAGGCTCAAGATAACGGCGACGATAAATGTACGGATAACTATACTACTTGTATTATGGTGAGGGGCTTATGTTTGGAAGGTAGAACCGAGGATGGAAGGAAGCTGA
TTGAATCCAGATGGGGGAAAGGCTGTGTACCGAACATTGTGTTTTACAATACACTCATTGATGGATATTGCAAGAAAGGTGAGGTTGGAAGTGCTTATGAACTTTTTATT
GAATTGAAGCTGAAAGGATTTGTACCTACATTAGAAACTTTTGGTTCCATGGTAAATGGCTTTTGCAAGACGGGAAACTTTGAAGCTATTGATCTTCTTTTGATGGAAAT
GAAAGATAGGGGCTTGAGTGTTAGTGTTCAAGTGTATAATAACATTATTGATGCTCAATATAAGCTTGGTTGTGACATTAGAGCAAAGGATACACTTAAAGAAATGGCTG
AGAATTGCTGTGAACCAGATCTTGTGACTTATAATACTCTAATCAACTATTTATGCAGAGGTGGGGAGGTCATGGAAGCTGAGAAGATCTTGGAACAAGCAATAAAGAGA
GGAATGGTGCCGAATAAGTTCACTTATACTCCGCTTGTTCATGCCTATTGTAAACAAGGGGAATATTATAGGGCCTCAGATTTACTTATTGAGATGTCAAAAAAAGGACA
TAAAGTTGATATGGTTTCGTATGGAGCTTTAATTCATGGACTTGTAGTTGCAGGGGAAGTCGATAATGCTATGACTATCCGGGACAGAATGATGGAAAGAGGGGTTTTAC
CTGATGCCAATATCTACAATGTTTTGATGAATGGACTTTTCAAGAAAGGGAATCTTTCCATGGCCAAGGTGATGCTTTCTGAGATGCTTGACCAAAATATAGCACCTGAT
GCATTTATTTATGCAACTTTAGTGGATGGGTTCATTAGGCATGACAACCTTGATGAGGCCAAGAAACTCTTTCAGCTCACTATAGAAAAGGGTATAGACCCAGGTGTTGT
GGGATATAATTCCATGATCAAAGGTTTCTGTAAATTCGGGATGATGGAAGATGCAGTTTTGTGCATTGATAGAATGAGGAGTGCCCGTCATGTTCCTGATGTCTTTACTT
TCTCCACCATAATTGATGGATATGTAAAACAATGTGACTTGTACGCTGCACTGAAGATCTTTGGACTGATGTTGAAGCAGAGTTGCAAACCAAATGTTGTCACTTACACA
TCTTTGATCAATGGATATTGCCATAAGGGAGAATTGAAGATAGCTGAAAAACTTTTTAGCTTAATGCAATCTCATGGTTTGGAGCCTAGTGTCGTCACATACGGTGTACT
TATACGGAGCCTTTGCAAAGAAGCTAAGCTTGCCGAAGCTGCATCGTATTTCGAGCTCATGTTGATTAACAGGTGCATCCCTAATGATGTTATATTTCATTATCTAGTCA
ATGGGTTTGCAAATATGAATGCTGCTGCAGTTTCCAAAGGACTAAATAATTTTAGCGAGAATGCTAAATCTATGTTTGAGGAGTTCTTTCTGAGAATGATCGGTGATGGA
TGGACACGAAAGGCTGCAGCTTACAATTGTATTCTCATTTGCCTTTGTCAGCATCGAATGGTTAAAACTGCCTTACAGTTGCGTGACAAGATGCTGTCTTTGGGACTTTG
TCCTGATGCTGTTTCTTTTGCTGCATTGATACATGGCATTTGCTTGGGAGGAAGCTCAAAAGAATGCAAGAATGTTATTTCCTGTTATCTGAGTGAAAAGGAGCTG
Protein sequence
IIMLYGKAEMIKHAVDTFYDMHLYGCRRTVKSFNAVLKVLMKTRDLGAIEAFLSEAPEKFDIELDIISVNIVVKAFCDIGILSRAYLLMLEMEKVGIRPDVVTYTTLISA
FYKDNRCEISNGLWNLMVLRGCLPNLASFNVRIQYLVDRRRAWEANKLMNVMRNIGIVPDEVTYNLVIKGFCQAGFFDMAKRVYSALQGSGYKPNVKIYQTMIHYLCRSG
DFNLAYTMCKDAMNRNWFPNIDTIHSLIKGLKKMGQLGKAQDNGDDKCTDNYTTCIMVRGLCLEGRTEDGRKLIESRWGKGCVPNIVFYNTLIDGYCKKGEVGSAYELFI
ELKLKGFVPTLETFGSMVNGFCKTGNFEAIDLLLMEMKDRGLSVSVQVYNNIIDAQYKLGCDIRAKDTLKEMAENCCEPDLVTYNTLINYLCRGGEVMEAEKILEQAIKR
GMVPNKFTYTPLVHAYCKQGEYYRASDLLIEMSKKGHKVDMVSYGALIHGLVVAGEVDNAMTIRDRMMERGVLPDANIYNVLMNGLFKKGNLSMAKVMLSEMLDQNIAPD
AFIYATLVDGFIRHDNLDEAKKLFQLTIEKGIDPGVVGYNSMIKGFCKFGMMEDAVLCIDRMRSARHVPDVFTFSTIIDGYVKQCDLYAALKIFGLMLKQSCKPNVVTYT
SLINGYCHKGELKIAEKLFSLMQSHGLEPSVVTYGVLIRSLCKEAKLAEAASYFELMLINRCIPNDVIFHYLVNGFANMNAAAVSKGLNNFSENAKSMFEEFFLRMIGDG
WTRKAAAYNCILICLCQHRMVKTALQLRDKMLSLGLCPDAVSFAALIHGICLGGSSKECKNVISCYLSEKEL