; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015774 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015774
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00005930:156711..167782
RNA-Seq ExpressionSgr015774
SyntenySgr015774
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0005515 - protein binding (molecular function)
GO:0042586 - peptide deformylase activity (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR023635 - Peptide deformylase
IPR036821 - Peptide deformylase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE7997340.1 hypothetical protein FH972_001984 [Carpinus fangiana]1.1e-22953.65Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKTLLSRIKPL   KP  SSSS   L     KL+ DTI+IL++H++W+QSLET F+ES + V DI+H VLDRI DVELGLKFFDWASK     SL GS 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER--
        YSSL+KLL+RF++FSEIE  L+ MK +E+ PTREALS ++ AYAD G V +ALE Y+ V ++HN +P  +ACNSLL++LV HRR E A +VYDEM+E   
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER--

Query:  -DNGNTYVWITILL-----------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLLLMEMK
         D+ +T + +  L                        V++  T I                             T+G+M+NGF K G FEAID LL+EMK
Subjt:  -DNGNTYVWITILL-----------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLLLMEMK

Query:  DRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDL
        +RGL V+VQVYN IIDAQYK GC ++  +T+  M E+ CEPD++TYNTLIN  CR G+VKE +K L++A + GL P+KF+YTPL+H YC+QGEY RASDL
Subjt:  DRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDL

Query:  LIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPRHGNLD
        LI+M E GH+ D+VSYGALIHGL+VAGEVD+ALTIRD+MMERGVLPDA IYNVLM+GL +KG  P                           + R+G ++
Subjt:  LIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPRHGNLD

Query:  EAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------------
        EAKKLF+  IEK IDPGVVGYNAMIKGFCKFGMMKDA+ CI RMR   H PDVFT++T+IDGY                                     
Subjt:  EAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------------

Query:  -GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLRMIG
         G+   AE  F  MQS  LEP+VVTYS  I SFCK  KL KAAS+FELML+++C PN VIFHYLVNGFANT  +AV K      EN KSMF DFF +M  
Subjt:  -GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLRMIG

Query:  DGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEASDIL
        DGW +  AA+N I+ICLCQH MVKTALQL +KM+S G   D+VSFAAL+HGICLEG S +W ++ISC+L+E ELQ A  Y H+L++ + QG  SEAS I+
Subjt:  DGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEASDIL

Query:  QAMVEDYES
        QA++EDY+S
Subjt:  QAMVEDYES

XP_004152354.1 pentatricopeptide repeat-containing protein At1g52620 [Cucumis sativus]5.9e-30065.41Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKTLLSRI PL N KP+ S     P + +IK+LVNDTI+ILKSHEKWEQSL+T F ESDIP++D++HFVLDRI+DVELGLKFFDWASKN+ S SL+G+ 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLSRFRVF EIEFTLE+MKTKE IPTREALS+VLCAYAD GLVD+ALE+YHGVVKLHNSLPSTYACNSLLNLLVKHRR+ETAHQ+YDEM++RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  G-----------------------------------------------------------NTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLL
        G                                                           + Y     L +      ++TFGS+VNGF KMG FEAIDLL
Subjt:  G-----------------------------------------------------------NTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLL

Query:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI
        L+EMKDRGL VNVQ+YNNIIDA+YKLG +IKAKDTL+EM+ENCCEPDLVTYNTLIN+ C  GEV+E EK+L+Q IR GLAPNK TYTPLVHGYCKQGEY 
Subjt:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI

Query:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR
        +A+D LIEM+  G +VDM+SYGALIHGLVVAGEVD ALTIRDRMM RG+LPDANIYNVLMNGLF+KG                              + R
Subjt:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR

Query:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------
        HGNLDEAKKLFQL IEKG+DPGVVGYN MIKGF K GMM +A+LCID+MR AHHVPD+FTFSTIIDGY                                
Subjt:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------

Query:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
              GE KMAE LFS+M+SH L+PSVVTYS LI SFCKEAKLGKA SYFELMLIN+CTPN   FHYLVNGF NT A AVS+  NN  EN++SMFEDFF
Subjt:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE
         RMIGDGWT+KAAA+NCILICLCQ RMVKTALQLR+KML+ GL  DAVSF ALIHGICLEGNSK+W+N+ISCDL+E ELQIALKYS ELDK I +GG SE
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE

Query:  ASDILQAMVEDYESPNQDMNNLKD
        AS ILQAM++ Y SPNQD+NNLK+
Subjt:  ASDILQAMVEDYESPNQDMNNLKD

XP_008454246.1 PREDICTED: pentatricopeptide repeat-containing protein At1g52620 [Cucumis melo]3.7e-30266.38Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKTLLSRI+ L N KP+ SS     L+ DIK+LVND+I+ILKSHE+WEQSL+T F ESDIP++D++HFVLDRIDDVELGLKFFDWASKN+ S SL+G+ 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLSRFRVF EIEFTLE+MKTKE IPTREALSNVLCAY D G VD+ALE+YHGV KLHNSLPS YACNSLLNLLVKHRR ETAHQ+YDEMV+RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  GN----TYVWITILL---------------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLL
        G+     Y    I++                           +++  T I                            +TFGS+VNGF KMG FEAIDLL
Subjt:  GN----TYVWITILL---------------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLL

Query:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI
        L+EMKDRG  VNVQ+YNNIIDAQYKLGC+IKAKDTL+EM+EN C PDLVTYNTLINYLC  GEVKE EK+L+Q IR GLAPN+FTYTPLVHGYCK+GEY 
Subjt:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI

Query:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR
        RA+DLLIEM+ RG ++DM+SYGALIHGLVVAGEVDIALTIRDRMM +G+LPDANIYNVLMNGLF+KG                              + R
Subjt:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR

Query:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------
         GNLDEAKKLFQL IEKG+DPGVVGYN MIKGF KFGMM +A+LCIDRMRSAHHVPDVFTFSTIIDGY                                
Subjt:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------

Query:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
              GE +MAE LFS+M+SH LEPSVVTY+ LI +FCKEAKLGKA SYFELMLIN+CTPN   FHYLVNGF NT A AVS G NN  EN++SMFEDFF
Subjt:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE
         RMIGDGWTRKAAA+NCILICLCQ RMVKTALQLR+KMLSLGL  DAVSF AL+HGICLEGNSK+W+N+ISCDL+E ELQIALKYS ELDK IT+GG SE
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE

Query:  ASDILQAMVEDYESPNQDMNNLKD
        AS ILQAM++ Y SPNQD+NNLK+
Subjt:  ASDILQAMVEDYESPNQDMNNLKD

XP_022153568.1 pentatricopeptide repeat-containing protein At1g52620 [Momordica charantia]0.0e+0072.94Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSK LLSRIKPL NLKP+PSS   FPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPV+DISHFVLDRIDDVELGLKFFDWASKN+ S SL+GS 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLSRFRVF EIEFTLEDM+TKEI+PTR+ALSNVLCAYAD G VD+AL  YHGVVKLHNSLPSTYACNSLLNLLVKHRRL TAHQ+YDEMV+RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  G---------------------------------------------NTYV---------------WITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDL
        G                                             NT +               +I + L  ++ T + TFGSMVNGF K GNFEAIDL
Subjt:  G---------------------------------------------NTYV---------------WITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDL

Query:  LLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEY
        LLMEMKDRGL V+VQVYNNIIDAQYKLGC+I+AKD L+E AENCCEPDLVTYNTLINYLCR GEV E EKIL+QAI+ G+ PNKFTYTPLVH YCKQGEY
Subjt:  LLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEY

Query:  IRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WP
         RASDLLIEM+++GHKVDMVSYGALIHGLVVAGEVD A+TIRDRMMERGVLPDANIYNVLMNGLF+KGN                             + 
Subjt:  IRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WP

Query:  RHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------
        RH NLDEAKKLFQLTIEKGIDPGVVGYN+MIKGFCKFGMM+DAVLCIDRMRSA HVPDVFTFSTIIDGY                               
Subjt:  RHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------

Query:  -------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDF
               GE+K+AE LFS+MQSH LEPSVVTY  LIRS CKEAKL +AASYFELMLINRC PN VIFHYLVNGFAN NAAAVSKGLNNFSENAKSMFE+F
Subjt:  -------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDF

Query:  FLRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFS
        FLRMIGDGWTRKAAA+NCILICLCQHRMVKTALQLRDKMLSLGL PDAVSFAALIHGICL G+SK+ KNVISC LSEKEL+IALKYS ELDKSITQGG S
Subjt:  FLRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFS

Query:  EASDILQAMVEDYESPNQDMNNLK
        EASDILQAMVEDYESPNQD+N+LK
Subjt:  EASDILQAMVEDYESPNQDMNNLK

XP_038894903.1 pentatricopeptide repeat-containing protein At1g52620 [Benincasa hispida]0.0e+0069.13Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKT+LSRIKP  N KPE SSS  FPL+CDIK+LVNDTI+ILKSHEKWEQSL T F ESDIPVVD++HFVLDRIDDV LGLKFFDWASKN+ S SL+GS 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLS+FRVF EIEFTLEDMKTKE IPTREALSNVLCAYAD G VD+ALE+YHGVVKLHNSLPS YACNSLLNLLVKHRRLETAHQ+YDEMV RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  G-----------------------------------------------------------NTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLL
        G                                                           + Y     L +      + TFGS+VNGF K+G FEAIDLL
Subjt:  G-----------------------------------------------------------NTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLL

Query:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI
        L+EMK+RGL VNVQ+YN IIDA+YKLG + KAKDTL+EM ENCC PDLVTYNTLINYLC  GEVKE EK+L+Q IR GLAP+KF YTPLVHGY KQGEYI
Subjt:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI

Query:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR
        RASDL+IEM+ RGH+VD VSYGA+IHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLF+KG                              + R
Subjt:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR

Query:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------
        HGNLDEA K+FQLTIEKGIDPGVVGYN MIKGF KFGMM DA+LCIDRMRSAHH PDVFTFSTIIDGY                                
Subjt:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------

Query:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
              GE+KMAE  FS+MQSH LEPSVVTYS LIRSFCKEAKLGKAASYFELMLIN+CTPN V+FHYLVNGF NTNAAAVS+G NN  +N++SMFEDFF
Subjt:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE
         RMIGDGWTRKAAA+NCILICLCQHRMVKTALQLRDKMLSLGL PDAVSF ALIHGICLEG SK+W+N+ISCDL+E ELQIALKYS ELDKSITQGG SE
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE

Query:  ASDILQAMVEDYESPNQDMNNLKDEQ
        AS+ILQAM++ YESPNQD+NNL++ Q
Subjt:  ASDILQAMVEDYESPNQDMNNLKDEQ

TrEMBL top hitse value%identityAlignment
A0A0A0KTD1 Uncharacterized protein2.9e-30065.41Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKTLLSRI PL N KP+ S     P + +IK+LVNDTI+ILKSHEKWEQSL+T F ESDIP++D++HFVLDRI+DVELGLKFFDWASKN+ S SL+G+ 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLSRFRVF EIEFTLE+MKTKE IPTREALS+VLCAYAD GLVD+ALE+YHGVVKLHNSLPSTYACNSLLNLLVKHRR+ETAHQ+YDEM++RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  G-----------------------------------------------------------NTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLL
        G                                                           + Y     L +      ++TFGS+VNGF KMG FEAIDLL
Subjt:  G-----------------------------------------------------------NTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLL

Query:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI
        L+EMKDRGL VNVQ+YNNIIDA+YKLG +IKAKDTL+EM+ENCCEPDLVTYNTLIN+ C  GEV+E EK+L+Q IR GLAPNK TYTPLVHGYCKQGEY 
Subjt:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI

Query:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR
        +A+D LIEM+  G +VDM+SYGALIHGLVVAGEVD ALTIRDRMM RG+LPDANIYNVLMNGLF+KG                              + R
Subjt:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR

Query:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------
        HGNLDEAKKLFQL IEKG+DPGVVGYN MIKGF K GMM +A+LCID+MR AHHVPD+FTFSTIIDGY                                
Subjt:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------

Query:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
              GE KMAE LFS+M+SH L+PSVVTYS LI SFCKEAKLGKA SYFELMLIN+CTPN   FHYLVNGF NT A AVS+  NN  EN++SMFEDFF
Subjt:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE
         RMIGDGWT+KAAA+NCILICLCQ RMVKTALQLR+KML+ GL  DAVSF ALIHGICLEGNSK+W+N+ISCDL+E ELQIALKYS ELDK I +GG SE
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE

Query:  ASDILQAMVEDYESPNQDMNNLKD
        AS ILQAM++ Y SPNQD+NNLK+
Subjt:  ASDILQAMVEDYESPNQDMNNLKD

A0A1S3BYA4 pentatricopeptide repeat-containing protein At1g526201.8e-30266.38Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKTLLSRI+ L N KP+ SS     L+ DIK+LVND+I+ILKSHE+WEQSL+T F ESDIP++D++HFVLDRIDDVELGLKFFDWASKN+ S SL+G+ 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLSRFRVF EIEFTLE+MKTKE IPTREALSNVLCAY D G VD+ALE+YHGV KLHNSLPS YACNSLLNLLVKHRR ETAHQ+YDEMV+RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  GN----TYVWITILL---------------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLL
        G+     Y    I++                           +++  T I                            +TFGS+VNGF KMG FEAIDLL
Subjt:  GN----TYVWITILL---------------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLL

Query:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI
        L+EMKDRG  VNVQ+YNNIIDAQYKLGC+IKAKDTL+EM+EN C PDLVTYNTLINYLC  GEVKE EK+L+Q IR GLAPN+FTYTPLVHGYCK+GEY 
Subjt:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI

Query:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR
        RA+DLLIEM+ RG ++DM+SYGALIHGLVVAGEVDIALTIRDRMM +G+LPDANIYNVLMNGLF+KG                              + R
Subjt:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR

Query:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------
         GNLDEAKKLFQL IEKG+DPGVVGYN MIKGF KFGMM +A+LCIDRMRSAHHVPDVFTFSTIIDGY                                
Subjt:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------

Query:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
              GE +MAE LFS+M+SH LEPSVVTY+ LI +FCKEAKLGKA SYFELMLIN+CTPN   FHYLVNGF NT A AVS G NN  EN++SMFEDFF
Subjt:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE
         RMIGDGWTRKAAA+NCILICLCQ RMVKTALQLR+KMLSLGL  DAVSF AL+HGICLEGNSK+W+N+ISCDL+E ELQIALKYS ELDK IT+GG SE
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE

Query:  ASDILQAMVEDYESPNQDMNNLKD
        AS ILQAM++ Y SPNQD+NNLK+
Subjt:  ASDILQAMVEDYESPNQDMNNLKD

A0A5A7TLP1 Pentatricopeptide repeat-containing protein1.8e-30266.38Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKTLLSRI+ L N KP+ SS     L+ DIK+LVND+I+ILKSHE+WEQSL+T F ESDIP++D++HFVLDRIDDVELGLKFFDWASKN+ S SL+G+ 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLSRFRVF EIEFTLE+MKTKE IPTREALSNVLCAY D G VD+ALE+YHGV KLHNSLPS YACNSLLNLLVKHRR ETAHQ+YDEMV+RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  GN----TYVWITILL---------------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLL
        G+     Y    I++                           +++  T I                            +TFGS+VNGF KMG FEAIDLL
Subjt:  GN----TYVWITILL---------------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLL

Query:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI
        L+EMKDRG  VNVQ+YNNIIDAQYKLGC+IKAKDTL+EM+EN C PDLVTYNTLINYLC  GEVKE EK+L+Q IR GLAPN+FTYTPLVHGYCK+GEY 
Subjt:  LMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYI

Query:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR
        RA+DLLIEM+ RG ++DM+SYGALIHGLVVAGEVDIALTIRDRMM +G+LPDANIYNVLMNGLF+KG                              + R
Subjt:  RASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPR

Query:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------
         GNLDEAKKLFQL IEKG+DPGVVGYN MIKGF KFGMM +A+LCIDRMRSAHHVPDVFTFSTIIDGY                                
Subjt:  HGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY--------------------------------

Query:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
              GE +MAE LFS+M+SH LEPSVVTY+ LI +FCKEAKLGKA SYFELMLIN+CTPN   FHYLVNGF NT A AVS G NN  EN++SMFEDFF
Subjt:  ------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE
         RMIGDGWTRKAAA+NCILICLCQ RMVKTALQLR+KMLSLGL  DAVSF AL+HGICLEGNSK+W+N+ISCDL+E ELQIALKYS ELDK IT+GG SE
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSE

Query:  ASDILQAMVEDYESPNQDMNNLKD
        AS ILQAM++ Y SPNQD+NNLK+
Subjt:  ASDILQAMVEDYESPNQDMNNLKD

A0A5N6QGR8 Uncharacterized protein5.3e-23053.65Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSKTLLSRIKPL   KP  SSSS   L     KL+ DTI+IL++H++W+QSLET F+ES + V DI+H VLDRI DVELGLKFFDWASK     SL GS 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER--
        YSSL+KLL+RF++FSEIE  L+ MK +E+ PTREALS ++ AYAD G V +ALE Y+ V ++HN +P  +ACNSLL++LV HRR E A +VYDEM+E   
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER--

Query:  -DNGNTYVWITILL-----------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLLLMEMK
         D+ +T + +  L                        V++  T I                             T+G+M+NGF K G FEAID LL+EMK
Subjt:  -DNGNTYVWITILL-----------------------VLWIYTYI----------------------------RTFGSMVNGFGKMGNFEAIDLLLMEMK

Query:  DRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDL
        +RGL V+VQVYN IIDAQYK GC ++  +T+  M E+ CEPD++TYNTLIN  CR G+VKE +K L++A + GL P+KF+YTPL+H YC+QGEY RASDL
Subjt:  DRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDL

Query:  LIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPRHGNLD
        LI+M E GH+ D+VSYGALIHGL+VAGEVD+ALTIRD+MMERGVLPDA IYNVLM+GL +KG  P                           + R+G ++
Subjt:  LIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WPRHGNLD

Query:  EAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------------
        EAKKLF+  IEK IDPGVVGYNAMIKGFCKFGMMKDA+ CI RMR   H PDVFT++T+IDGY                                     
Subjt:  EAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------------

Query:  -GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLRMIG
         G+   AE  F  MQS  LEP+VVTYS  I SFCK  KL KAAS+FELML+++C PN VIFHYLVNGFANT  +AV K      EN KSMF DFF +M  
Subjt:  -GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLRMIG

Query:  DGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEASDIL
        DGW +  AA+N I+ICLCQH MVKTALQL +KM+S G   D+VSFAAL+HGICLEG S +W ++ISC+L+E ELQ A  Y H+L++ + QG  SEAS I+
Subjt:  DGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEASDIL

Query:  QAMVEDYES
        QA++EDY+S
Subjt:  QAMVEDYES

A0A6J1DHT9 pentatricopeptide repeat-containing protein At1g526200.0e+0072.94Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV
        MSK LLSRIKPL NLKP+PSS   FPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPV+DISHFVLDRIDDVELGLKFFDWASKN+ S SL+GS 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSV

Query:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN
        YSSL+KLLSRFRVF EIEFTLEDM+TKEI+PTR+ALSNVLCAYAD G VD+AL  YHGVVKLHNSLPSTYACNSLLNLLVKHRRL TAHQ+YDEMV+RDN
Subjt:  YSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVERDN

Query:  G---------------------------------------------NTYV---------------WITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDL
        G                                             NT +               +I + L  ++ T + TFGSMVNGF K GNFEAIDL
Subjt:  G---------------------------------------------NTYV---------------WITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDL

Query:  LLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEY
        LLMEMKDRGL V+VQVYNNIIDAQYKLGC+I+AKD L+E AENCCEPDLVTYNTLINYLCR GEV E EKIL+QAI+ G+ PNKFTYTPLVH YCKQGEY
Subjt:  LLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEY

Query:  IRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WP
         RASDLLIEM+++GHKVDMVSYGALIHGLVVAGEVD A+TIRDRMMERGVLPDANIYNVLMNGLF+KGN                             + 
Subjt:  IRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFP---------------------------WP

Query:  RHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------
        RH NLDEAKKLFQLTIEKGIDPGVVGYN+MIKGFCKFGMM+DAVLCIDRMRSA HVPDVFTFSTIIDGY                               
Subjt:  RHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-------------------------------

Query:  -------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDF
               GE+K+AE LFS+MQSH LEPSVVTY  LIRS CKEAKL +AASYFELMLINRC PN VIFHYLVNGFAN NAAAVSKGLNNFSENAKSMFE+F
Subjt:  -------GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDF

Query:  FLRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFS
        FLRMIGDGWTRKAAA+NCILICLCQHRMVKTALQLRDKMLSLGL PDAVSFAALIHGICL G+SK+ KNVISC LSEKEL+IALKYS ELDKSITQGG S
Subjt:  FLRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFS

Query:  EASDILQAMVEDYESPNQDMNNLK
        EASDILQAMVEDYESPNQD+N+LK
Subjt:  EASDILQAMVEDYESPNQDMNNLK

SwissProt top hitse value%identityAlignment
Q9CAN0 Pentatricopeptide repeat-containing protein At1g63130, mitochondrial1.0e-5227.18Show/hide
Query:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA
        F  +++   KM  F+ +  L  +M++ G+  N+  Y+ +I+   +      A   L +M +   EPD+VT N+L+N  C    + +   ++ Q +  G  
Subjt:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA

Query:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA
        P+ FT+  L+HG  +      A  L+  M  +G + D+V+YG +++GL   G++D+AL++  +M +  + P   IYN +++ L          + N+++A
Subjt:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA

Query:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA
          LF     KGI P VV YN++I+  C +G   DA   +  M      P+V TFS +ID +   G++  AE L+  M    ++P + TYS+LI  FC   
Subjt:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA

Query:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL
        +L +A   FELM+   C PN V ++ L+ GF    A  V +G+  F E ++           ++   F            F +M+ DG       ++ +L
Subjt:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL

Query:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ
          LC +  V+TAL + + +    + PD  ++  +I G+C  G  +  W   + C LS K ++   + Y+  +     +G   EA  + + M E+   P+ 
Subjt:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ

Query:  DMNN
           N
Subjt:  DMNN

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397104.7e-5823.74Show/hide
Query:  YMSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGS
        +++KTL+ R   L      PS S           L +  +  LK H      L   F          S+ +L   +D  L LKF +WA+ +   +     
Subjt:  YMSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGS

Query:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEI-------------------IPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLV
        +    + +L++F+++   +   ED+  K +                     T      V+ +Y+   L+D+AL + H + + H  +P   + N++L+  +
Subjt:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEI-------------------IPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLV

Query:  KHRR-LETAHQVYDEMVERDNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEE
        + +R +  A  V+ EM+E                 +   + T+  ++ GF   GN +    L  +M+ +G   NV  YN +ID   KL         L  
Subjt:  KHRR-LETAHQVYDEMVERDNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEE

Query:  MAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIAL
        MA    EP+L++YN +IN LCR G +KE   +L +  R G + ++ TY  L+ GYCK+G + +A  +  EM   G    +++Y +LIH +  AG ++ A+
Subjt:  MAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIAL

Query:  TIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTII
           D+M  RG+ P+   Y  L++G  +K        G ++EA ++ +   + G  P VV YNA+I G C  G M+DA+  ++ M+     PDV ++ST++
Subjt:  TIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTII

Query:  DGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
         G+    +V  A  +   M    ++P  +TYS+LI+ FC++ +  +A   +E ML     P+   +  L+N +                           
Subjt:  DGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELD-------KSI
                              C    ++ ALQL ++M+  G+ PD V+++ LI+G+  +  +++ K ++     E+ +   + Y   ++       KS+
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELD-------KSI

Query:  T--------QGGFSEASDILQAMVEDYESPNQDMNNL
                 +G  +EA  + ++M+     P+    N+
Subjt:  T--------QGGFSEASDILQAMVEDYESPNQDMNNL

Q9LQ16 Pentatricopeptide repeat-containing protein At1g629103.0e-5227.18Show/hide
Query:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA
        F  +++   KM  FE +  L  +M+  G+  ++  Y+  I+   +      A   L +M +   EPD+VT ++L+N  C +  + +   ++ Q +  G  
Subjt:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA

Query:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA
        P+ FT+T L+HG     +   A  L+ +M +RG + D+V+YG +++GL   G++D+AL++  +M +  +  D  IYN +++GL         ++ ++D+A
Subjt:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA

Query:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA
          LF     KGI P V  Y+++I   C +G   DA   +  M      P+V TFS +ID +   G++  AE L+  M    ++P + TYS+LI  FC   
Subjt:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA

Query:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL
        +L +A   FELM+   C PN V +  L+ GF    A  V +G+  F E ++           ++   F            F +M+  G       +N +L
Subjt:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL

Query:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ
          LC++  +  A+ + + +    + PD  ++  +I G+C  G  +  W+  + C+LS K +    + Y+  +     +G   EA  +L+ M ED   PN 
Subjt:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ

Query:  DMNN
           N
Subjt:  DMNN

Q9SSR4 Pentatricopeptide repeat-containing protein At1g526208.3e-17242.2Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDW-ASKNAGSSSLHGS
        MSKTLLSRIKPL N     S  S+ P+   IKKLV+DT+ ILK+ + W Q L+  F + ++  VDIS FV DRI DVE+G+K FDW +S+       +G 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDW-ASKNAGSSSLHGS

Query:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER-
          SS +KLL+R+R+F+EIE  L +++ + +  T EALS+VL AYA+ G + +A+E+Y  VV+L++S+P   ACNSLL+LLVK RRL  A +VYDEM +R 
Subjt:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER-

Query:  ------------------------------------------------------DNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLME
                                                              D  N Y+    L +      + TFG+M+NGF K G+F A D LL E
Subjt:  ------------------------------------------------------DNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLME

Query:  MKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRAS
        +K+RGLRV+V   NNIIDA+Y+ G ++   +++  +  N C+PD+ TYN LIN LC+ G+ +     L +A + GL PN  +Y PL+  YCK  EY  AS
Subjt:  MKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRAS

Query:  DLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNF---------------------------PWPRHGN
         LL++M ERG K D+V+YG LIHGLVV+G +D A+ ++ ++++RGV PDA IYN+LM+GL + G F                            + R G+
Subjt:  DLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNF---------------------------PWPRHGN

Query:  LDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-----------------------------------
         DEA+K+F L++EKG+   VV +NAMIKGFC+ GM+ +A+ C++RM   H VPD FT+STIIDGY                                   
Subjt:  LDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-----------------------------------

Query:  ---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAK-LGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLR
           G+ KMAE  F  MQ  DL P+VVTY+TLIRS  KE+  L KA  Y+ELM+ N+C PN V F+ L+ GF    +  V    +  +    S+F +FF R
Subjt:  ---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAK-LGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLR

Query:  MIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEAS
        M  DGW+  AAA+N  L+CLC H MVKTA   +DKM+  G  PD VSFAA++HG C+ GNSKQW+N+  C+L EK L++A++YS  L++ + Q    EAS
Subjt:  MIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEAS

Query:  DILQAMVE
         IL AMVE
Subjt:  DILQAMVE

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial3.9e-5226.79Show/hide
Query:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA
        F  +++   KM  F+ +  L  +M++ G+  N   Y+ +I+   +      A   L +M +   EP++VT ++L+N  C +  + E   ++ Q    G  
Subjt:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA

Query:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA
        PN  T+  L+HG     +   A  L+  M  +G + D+V+YG +++GL   G+ D+A  + ++M +  + P   IYN +++GL         ++ ++D+A
Subjt:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA

Query:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA
          LF+    KGI P VV Y+++I   C +G   DA   +  M      PDVFTFS +ID +   G++  AE L+  M    ++PS+VTYS+LI  FC   
Subjt:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA

Query:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------------------SMFEDFFLRMIGDGWTRKAAAHNCIL
        +L +A   FE M+   C P+ V ++ L+ GF       V +G+  F E ++                        M ++ F  M+ DG       +N +L
Subjt:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------------------SMFEDFFLRMIGDGWTRKAAAHNCIL

Query:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ
          LC++  ++ A+ + + +    + P   ++  +I G+C  G  +  W   + C+LS K ++   + Y+  +     +G   EA  + + M ED   PN 
Subjt:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ

Query:  DMNN
           N
Subjt:  DMNN

Arabidopsis top hitse value%identityAlignment
AT1G52620.1 Pentatricopeptide repeat (PPR) superfamily protein5.9e-17342.2Show/hide
Query:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDW-ASKNAGSSSLHGS
        MSKTLLSRIKPL N     S  S+ P+   IKKLV+DT+ ILK+ + W Q L+  F + ++  VDIS FV DRI DVE+G+K FDW +S+       +G 
Subjt:  MSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDW-ASKNAGSSSLHGS

Query:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER-
          SS +KLL+R+R+F+EIE  L +++ + +  T EALS+VL AYA+ G + +A+E+Y  VV+L++S+P   ACNSLL+LLVK RRL  A +VYDEM +R 
Subjt:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLVKHRRLETAHQVYDEMVER-

Query:  ------------------------------------------------------DNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLME
                                                              D  N Y+    L +      + TFG+M+NGF K G+F A D LL E
Subjt:  ------------------------------------------------------DNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLME

Query:  MKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRAS
        +K+RGLRV+V   NNIIDA+Y+ G ++   +++  +  N C+PD+ TYN LIN LC+ G+ +     L +A + GL PN  +Y PL+  YCK  EY  AS
Subjt:  MKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRAS

Query:  DLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNF---------------------------PWPRHGN
         LL++M ERG K D+V+YG LIHGLVV+G +D A+ ++ ++++RGV PDA IYN+LM+GL + G F                            + R G+
Subjt:  DLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNF---------------------------PWPRHGN

Query:  LDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-----------------------------------
         DEA+K+F L++EKG+   VV +NAMIKGFC+ GM+ +A+ C++RM   H VPD FT+STIIDGY                                   
Subjt:  LDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY-----------------------------------

Query:  ---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAK-LGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLR
           G+ KMAE  F  MQ  DL P+VVTY+TLIRS  KE+  L KA  Y+ELM+ N+C PN V F+ L+ GF    +  V    +  +    S+F +FF R
Subjt:  ---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAK-LGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLR

Query:  MIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEAS
        M  DGW+  AAA+N  L+CLC H MVKTA   +DKM+  G  PD VSFAA++HG C+ GNSKQW+N+  C+L EK L++A++YS  L++ + Q    EAS
Subjt:  MIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEAS

Query:  DILQAMVE
         IL AMVE
Subjt:  DILQAMVE

AT1G62670.1 rna processing factor 22.7e-5326.79Show/hide
Query:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA
        F  +++   KM  F+ +  L  +M++ G+  N   Y+ +I+   +      A   L +M +   EP++VT ++L+N  C +  + E   ++ Q    G  
Subjt:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA

Query:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA
        PN  T+  L+HG     +   A  L+  M  +G + D+V+YG +++GL   G+ D+A  + ++M +  + P   IYN +++GL         ++ ++D+A
Subjt:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA

Query:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA
          LF+    KGI P VV Y+++I   C +G   DA   +  M      PDVFTFS +ID +   G++  AE L+  M    ++PS+VTYS+LI  FC   
Subjt:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA

Query:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------------------SMFEDFFLRMIGDGWTRKAAAHNCIL
        +L +A   FE M+   C P+ V ++ L+ GF       V +G+  F E ++                        M ++ F  M+ DG       +N +L
Subjt:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------------------SMFEDFFLRMIGDGWTRKAAAHNCIL

Query:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ
          LC++  ++ A+ + + +    + P   ++  +I G+C  G  +  W   + C+LS K ++   + Y+  +     +G   EA  + + M ED   PN 
Subjt:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ

Query:  DMNN
           N
Subjt:  DMNN

AT1G62910.1 Pentatricopeptide repeat (PPR) superfamily protein2.1e-5327.18Show/hide
Query:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA
        F  +++   KM  FE +  L  +M+  G+  ++  Y+  I+   +      A   L +M +   EPD+VT ++L+N  C +  + +   ++ Q +  G  
Subjt:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA

Query:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA
        P+ FT+T L+HG     +   A  L+ +M +RG + D+V+YG +++GL   G++D+AL++  +M +  +  D  IYN +++GL         ++ ++D+A
Subjt:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA

Query:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA
          LF     KGI P V  Y+++I   C +G   DA   +  M      P+V TFS +ID +   G++  AE L+  M    ++P + TYS+LI  FC   
Subjt:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA

Query:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL
        +L +A   FELM+   C PN V +  L+ GF    A  V +G+  F E ++           ++   F            F +M+  G       +N +L
Subjt:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL

Query:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ
          LC++  +  A+ + + +    + PD  ++  +I G+C  G  +  W+  + C+LS K +    + Y+  +     +G   EA  +L+ M ED   PN 
Subjt:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ

Query:  DMNN
           N
Subjt:  DMNN

AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.2e-5427.18Show/hide
Query:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA
        F  +++   KM  F+ +  L  +M++ G+  N+  Y+ +I+   +      A   L +M +   EPD+VT N+L+N  C    + +   ++ Q +  G  
Subjt:  FGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLA

Query:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA
        P+ FT+  L+HG  +      A  L+  M  +G + D+V+YG +++GL   G++D+AL++  +M +  + P   IYN +++ L          + N+++A
Subjt:  PNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEA

Query:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA
          LF     KGI P VV YN++I+  C +G   DA   +  M      P+V TFS +ID +   G++  AE L+  M    ++P + TYS+LI  FC   
Subjt:  KKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEA

Query:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL
        +L +A   FELM+   C PN V ++ L+ GF    A  V +G+  F E ++           ++   F            F +M+ DG       ++ +L
Subjt:  KLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAK-----------SMFEDF------------FLRMIGDGWTRKAAAHNCIL

Query:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ
          LC +  V+TAL + + +    + PD  ++  +I G+C  G  +  W   + C LS K ++   + Y+  +     +G   EA  + + M E+   P+ 
Subjt:  ICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQ-WKNVISCDLSEKELQ-IALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQ

Query:  DMNN
           N
Subjt:  DMNN

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.3e-5923.74Show/hide
Query:  YMSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGS
        +++KTL+ R   L      PS S           L +  +  LK H      L   F          S+ +L   +D  L LKF +WA+ +   +     
Subjt:  YMSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPVVDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGS

Query:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEI-------------------IPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLV
        +    + +L++F+++   +   ED+  K +                     T      V+ +Y+   L+D+AL + H + + H  +P   + N++L+  +
Subjt:  VYSSLIKLLSRFRVFSEIEFTLEDMKTKEI-------------------IPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACNSLLNLLV

Query:  KHRR-LETAHQVYDEMVERDNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEE
        + +R +  A  V+ EM+E                 +   + T+  ++ GF   GN +    L  +M+ +G   NV  YN +ID   KL         L  
Subjt:  KHRR-LETAHQVYDEMVERDNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEE

Query:  MAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIAL
        MA    EP+L++YN +IN LCR G +KE   +L +  R G + ++ TY  L+ GYCK+G + +A  +  EM   G    +++Y +LIH +  AG ++ A+
Subjt:  MAENCCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIAL

Query:  TIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTII
           D+M  RG+ P+   Y  L++G  +K        G ++EA ++ +   + G  P VV YNA+I G C  G M+DA+  ++ M+     PDV ++ST++
Subjt:  TIRDRMMERGVLPDANIYNVLMNGLFRKGNFPWPRHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTII

Query:  DGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF
         G+    +V  A  +   M    ++P  +TYS+LI+ FC++ +  +A   +E ML     P+   +  L+N +                           
Subjt:  DGY---GEVKMAEMLFSVMQSHDLEPSVVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFF

Query:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELD-------KSI
                              C    ++ ALQL ++M+  G+ PD V+++ LI+G+  +  +++ K ++     E+ +   + Y   ++       KS+
Subjt:  LRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDKMLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELD-------KSI

Query:  T--------QGGFSEASDILQAMVEDYESPNQDMNNL
                 +G  +EA  + ++M+     P+    N+
Subjt:  T--------QGGFSEASDILQAMVEDYESPNQDMNNL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCTATCACCAATGTCTATGGGGAACGTTCATCCTGTCTGCTGTTGGTCTGACAAGGTTCTTAAGCTGTTCATGGGCCGATCATTGTTGTAAAACTTGGCAAAAGCT
TTCAGTTGGTGTGCATCTCGAAAGGCCGTTGGTTGATCTCTCGAGTAAAGGGGAAATTGTTCAATGTAGATTACAGCAAGAGCCTGCCATCTTTTGTATGAAGAAAGTAA
CCGTCTTACCAGTATACCAGTACCACCAACTTGGATCAACTTTAAAGACCCACCATGGAAATAGAAGGCATCTTGAATCTTCCCGCTCGGAGCTTTATACATGCCTAGCG
TGCATAGTTTTTAAGTTGGCTTTCTTCTTCTGCATCACCAGGAACACAGGCATTTTCCTGAGCGAGGAAACTCCTTTTTTTTGCAGAGCGATCTCCTTCAGACCCAGAAA
TCTCATCCTTGAGATAATCGACTCCTCTTACATGTCTAAAACTCTTCTCTCTCGCATCAAGCCATTACTCAACCTCAAACCGGAACCATCTTCCTCTTCCTACTTTCCTC
TCAAATGCGACATCAAGAAGCTTGTCAATGACACCATTAAAATTCTCAAGTCCCACGAGAAGTGGGAGCAATCCCTTGAAACCCAGTTCAATGAATCCGATATACCCGTC
GTTGACATTAGTCATTTCGTTTTAGACCGAATTGATGATGTAGAACTGGGTTTGAAGTTCTTCGATTGGGCGTCAAAGAATGCGGGCTCCAGTTCTTTACATGGGAGTGT
CTACTCTTCGCTGATAAAACTTCTATCAAGGTTTAGAGTCTTTTCGGAGATTGAATTTACACTCGAAGATATGAAAACTAAAGAAATCATCCCGACCCGTGAAGCGTTGA
GTAATGTACTTTGTGCATATGCGGATTTTGGGTTGGTTGACAGAGCTCTTGAATTGTATCATGGCGTTGTCAAGTTGCACAACAGTCTTCCAAGTACCTATGCTTGCAAT
TCCTTGCTCAATTTGCTTGTTAAACACCGTAGGCTTGAAACTGCACACCAAGTATATGATGAAATGGTCGAAAGAGATAATGGCAACACATATGTATGGATAACTATACT
ACTTGTATTATGGATTTATACCTACATTAGAACTTTTGGTTCCATGGTAAATGGTTTTGGCAAGATGGGAAACTTTGAAGCTATTGATCTTCTTTTGATGGAAATGAAAG
ATAGGGGCTTGAGAGTTAATGTTCAAGTGTACAATAACATTATTGATGCTCAATATAAGCTTGGTTGTGAAATTAAAGCAAAGGATACGCTTGAAGAAATGGCTGAGAAT
TGCTGTGAACCAGATCTTGTGACTTATAATACTTTAATCAACTATTTATGTAGAGCTGGGGAGGTCAAGGAAGGTGAGAAGATCTTGCAACAAGCAATAAGGGGAGGATT
GGCACCGAATAAGTTCACCTATACTCCTCTTGTTCATGGCTACTGTAAACAAGGGGAATATATTAGGGCCTCAGATTTGCTAATTGAGATGACAGAAAGAGGACATAAAG
TTGATATGGTTTCGTATGGAGCTTTAATTCATGGACTTGTTGTTGCAGGGGAGGTTGATATTGCATTGACAATCCGGGACAGAATGATGGAAAGGGGCGTCTTACCTGAT
GCCAATATCTACAATGTTTTGATGAATGGACTTTTCAGGAAGGGAAACTTTCCATGGCCAAGGCATGGAAATCTTGATGAGGCCAAGAAACTCTTTCAGCTCACTATTGA
AAAGGGTATAGACCCTGGTGTTGTAGGATATAATGCCATGATCAAAGGTTTCTGTAAATTTGGGATGATGAAAGATGCAGTTTTATGCATTGATAGAATGAGGAGTGCCC
ATCATGTTCCTGATGTATTTACTTTCTCCACCATAATTGATGGATATGGAGAAGTTAAGATGGCTGAAATGCTTTTTAGCGTGATGCAATCTCATGATTTGGAGCCTAGT
GTCGTCACATACAGTACACTTATAAGAAGCTTTTGCAAAGAAGCTAAGCTTGGAAAAGCTGCATCATATTTCGAGCTAATGTTGATTAACAGGTGCACCCCCAATGGTGT
AATATTTCATTATTTAGTCAATGGGTTTGCAAATACAAATGCTGCTGCAGTTTCCAAAGGACTTAATAATTTTAGTGAGAATGCGAAATCTATGTTTGAGGACTTCTTTT
TGAGAATGATCGGTGATGGATGGACACGAAAGGCTGCAGCTCACAATTGTATTCTCATTTGCCTTTGTCAGCATAGAATGGTTAAAACTGCCTTACAGTTGCGTGATAAG
ATGCTGTCTTTGGGCCTTTATCCTGATGCTGTTTCTTTTGCAGCATTGATACATGGCATATGCTTGGAAGGAAACTCAAAACAATGGAAGAATGTTATTTCCTGTGATTT
GAGTGAAAAAGAGCTGCAAATTGCCTTGAAATACTCACATGAGTTAGATAAGTCCATAACTCAGGGAGGGTTTTCCGAGGCTTCAGACATTTTGCAGGCTATGGTTGAGG
ATTACGAGTCTCCTAACCAAGATATGAACAATTTGAAGGATGAACAAGAGCTGATCAAGATCGTCACCACCGGGCCTTATTCCAAACTCCACCATTCGATTGAAAGCCCG
AATAGCGTCTTGAGGCAAGTTAGCTCTACTATACGCTCTGAAAACGAGCCAGAAAAGTTCCGGGCTGATTACAGAACATCGGAATGCGATAGCTATCGGCACTGGGTTCA
AAACCCGGAATTCTTTTCGCCCAGAGAAGAACCTGTGAGCAGAGAATCCAATATGCTTGCACTTTTCAGAACCTGCTCGACAAGGTCGGTCGATACATTGGATGAGAAAG
AGGTAAGACCGAGCTCTAAGTCATGGAATGGGAGCGGTGGGTCTGACCCTGAGTGTGCTGGGGCGAGAGAGTTGAATGGGCGCTGGAGAGTAAAAGGATTGGTTCTGAGG
AGAGAGCGAAAGAGTGAAATGGAGGTTCGGAGTTTGGAGGAGAAGGCTCTGATGGCCATTAGAGATCTCGGTTGCAGAGGATGGTCGTCGTCGACATGTCAGAGGCAGAG
CTGCACTGGAAACTCCTTGAAATTTACGCCTGCTTCAGCCACACCCAGATTTCGGAAACTTATTTACAGAGGCAGTTCTTCCTGGACGGCATTCCCAAATTTATTCCGCC
CGGAAAAGTTCAGCACGTCGTCGACTTCAATTGCGAGAGCCGGTTGGTTTCTGGGTCTCGGAGAACAAAAGAAAATGAACTTACCGACGATTGTGACGGCGGGAGACCCA
GTTTTGCACGAACCTGCCCGGGAAGTTGATCCTAAGGAAATACGGTCGGAGAAAATTCAGAAGATAATCGACGACATGGTTCAAGCCATGAGGAAGGCTCCTGGAGTGGG
TCTTGCAGCTCCCCAGATTGGAATCCCTTTAAGGATAATTGTTTTGGAAGACACTAAAGAATATATAAGTTATGCACCCAAGGAGGAGATCAAAGCACAAGATAGACGCC
CTTTTGATCTTCTGTTGTTGAGAAGTATCAATTGTCCAGTCCCTTCTTTCATTCAGATGATCATAAAAATATGCAGCTTTCTTGGTCTTCTTGCCAGATTTCTGCATCTG
CTTGTGGGTGTTCTTCCTTGTCCTTTGCATATTACCCATGATCTATCTTTACCTTGTAGTGTTGATGGATTTAGAGCAGTGGTGGAGCGCTACCTTGAGGTTGAGGTAGT
CGGTCTGGATCGAGATGGGAATCCTATCAAGGTGGATGCTTCAGAAAATCTTGCTGCAGCTCGGATGCCGGAGCTCCGGCTGCCAGTGGATAGCTCGGAGAAAGAGGAGA
GAGAGATTGAATTTTCTTCCTCCGTTGCTGCTGCCGCTTTCACTGATATGTTAATTATAATTGTTGCTAATAACACTGCCATTTTCACCACCCTCATTCTGCTCACTTTC
TCTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCTATCACCAATGTCTATGGGGAACGTTCATCCTGTCTGCTGTTGGTCTGACAAGGTTCTTAAGCTGTTCATGGGCCGATCATTGTTGTAAAACTTGGCAAAAGCT
TTCAGTTGGTGTGCATCTCGAAAGGCCGTTGGTTGATCTCTCGAGTAAAGGGGAAATTGTTCAATGTAGATTACAGCAAGAGCCTGCCATCTTTTGTATGAAGAAAGTAA
CCGTCTTACCAGTATACCAGTACCACCAACTTGGATCAACTTTAAAGACCCACCATGGAAATAGAAGGCATCTTGAATCTTCCCGCTCGGAGCTTTATACATGCCTAGCG
TGCATAGTTTTTAAGTTGGCTTTCTTCTTCTGCATCACCAGGAACACAGGCATTTTCCTGAGCGAGGAAACTCCTTTTTTTTGCAGAGCGATCTCCTTCAGACCCAGAAA
TCTCATCCTTGAGATAATCGACTCCTCTTACATGTCTAAAACTCTTCTCTCTCGCATCAAGCCATTACTCAACCTCAAACCGGAACCATCTTCCTCTTCCTACTTTCCTC
TCAAATGCGACATCAAGAAGCTTGTCAATGACACCATTAAAATTCTCAAGTCCCACGAGAAGTGGGAGCAATCCCTTGAAACCCAGTTCAATGAATCCGATATACCCGTC
GTTGACATTAGTCATTTCGTTTTAGACCGAATTGATGATGTAGAACTGGGTTTGAAGTTCTTCGATTGGGCGTCAAAGAATGCGGGCTCCAGTTCTTTACATGGGAGTGT
CTACTCTTCGCTGATAAAACTTCTATCAAGGTTTAGAGTCTTTTCGGAGATTGAATTTACACTCGAAGATATGAAAACTAAAGAAATCATCCCGACCCGTGAAGCGTTGA
GTAATGTACTTTGTGCATATGCGGATTTTGGGTTGGTTGACAGAGCTCTTGAATTGTATCATGGCGTTGTCAAGTTGCACAACAGTCTTCCAAGTACCTATGCTTGCAAT
TCCTTGCTCAATTTGCTTGTTAAACACCGTAGGCTTGAAACTGCACACCAAGTATATGATGAAATGGTCGAAAGAGATAATGGCAACACATATGTATGGATAACTATACT
ACTTGTATTATGGATTTATACCTACATTAGAACTTTTGGTTCCATGGTAAATGGTTTTGGCAAGATGGGAAACTTTGAAGCTATTGATCTTCTTTTGATGGAAATGAAAG
ATAGGGGCTTGAGAGTTAATGTTCAAGTGTACAATAACATTATTGATGCTCAATATAAGCTTGGTTGTGAAATTAAAGCAAAGGATACGCTTGAAGAAATGGCTGAGAAT
TGCTGTGAACCAGATCTTGTGACTTATAATACTTTAATCAACTATTTATGTAGAGCTGGGGAGGTCAAGGAAGGTGAGAAGATCTTGCAACAAGCAATAAGGGGAGGATT
GGCACCGAATAAGTTCACCTATACTCCTCTTGTTCATGGCTACTGTAAACAAGGGGAATATATTAGGGCCTCAGATTTGCTAATTGAGATGACAGAAAGAGGACATAAAG
TTGATATGGTTTCGTATGGAGCTTTAATTCATGGACTTGTTGTTGCAGGGGAGGTTGATATTGCATTGACAATCCGGGACAGAATGATGGAAAGGGGCGTCTTACCTGAT
GCCAATATCTACAATGTTTTGATGAATGGACTTTTCAGGAAGGGAAACTTTCCATGGCCAAGGCATGGAAATCTTGATGAGGCCAAGAAACTCTTTCAGCTCACTATTGA
AAAGGGTATAGACCCTGGTGTTGTAGGATATAATGCCATGATCAAAGGTTTCTGTAAATTTGGGATGATGAAAGATGCAGTTTTATGCATTGATAGAATGAGGAGTGCCC
ATCATGTTCCTGATGTATTTACTTTCTCCACCATAATTGATGGATATGGAGAAGTTAAGATGGCTGAAATGCTTTTTAGCGTGATGCAATCTCATGATTTGGAGCCTAGT
GTCGTCACATACAGTACACTTATAAGAAGCTTTTGCAAAGAAGCTAAGCTTGGAAAAGCTGCATCATATTTCGAGCTAATGTTGATTAACAGGTGCACCCCCAATGGTGT
AATATTTCATTATTTAGTCAATGGGTTTGCAAATACAAATGCTGCTGCAGTTTCCAAAGGACTTAATAATTTTAGTGAGAATGCGAAATCTATGTTTGAGGACTTCTTTT
TGAGAATGATCGGTGATGGATGGACACGAAAGGCTGCAGCTCACAATTGTATTCTCATTTGCCTTTGTCAGCATAGAATGGTTAAAACTGCCTTACAGTTGCGTGATAAG
ATGCTGTCTTTGGGCCTTTATCCTGATGCTGTTTCTTTTGCAGCATTGATACATGGCATATGCTTGGAAGGAAACTCAAAACAATGGAAGAATGTTATTTCCTGTGATTT
GAGTGAAAAAGAGCTGCAAATTGCCTTGAAATACTCACATGAGTTAGATAAGTCCATAACTCAGGGAGGGTTTTCCGAGGCTTCAGACATTTTGCAGGCTATGGTTGAGG
ATTACGAGTCTCCTAACCAAGATATGAACAATTTGAAGGATGAACAAGAGCTGATCAAGATCGTCACCACCGGGCCTTATTCCAAACTCCACCATTCGATTGAAAGCCCG
AATAGCGTCTTGAGGCAAGTTAGCTCTACTATACGCTCTGAAAACGAGCCAGAAAAGTTCCGGGCTGATTACAGAACATCGGAATGCGATAGCTATCGGCACTGGGTTCA
AAACCCGGAATTCTTTTCGCCCAGAGAAGAACCTGTGAGCAGAGAATCCAATATGCTTGCACTTTTCAGAACCTGCTCGACAAGGTCGGTCGATACATTGGATGAGAAAG
AGGTAAGACCGAGCTCTAAGTCATGGAATGGGAGCGGTGGGTCTGACCCTGAGTGTGCTGGGGCGAGAGAGTTGAATGGGCGCTGGAGAGTAAAAGGATTGGTTCTGAGG
AGAGAGCGAAAGAGTGAAATGGAGGTTCGGAGTTTGGAGGAGAAGGCTCTGATGGCCATTAGAGATCTCGGTTGCAGAGGATGGTCGTCGTCGACATGTCAGAGGCAGAG
CTGCACTGGAAACTCCTTGAAATTTACGCCTGCTTCAGCCACACCCAGATTTCGGAAACTTATTTACAGAGGCAGTTCTTCCTGGACGGCATTCCCAAATTTATTCCGCC
CGGAAAAGTTCAGCACGTCGTCGACTTCAATTGCGAGAGCCGGTTGGTTTCTGGGTCTCGGAGAACAAAAGAAAATGAACTTACCGACGATTGTGACGGCGGGAGACCCA
GTTTTGCACGAACCTGCCCGGGAAGTTGATCCTAAGGAAATACGGTCGGAGAAAATTCAGAAGATAATCGACGACATGGTTCAAGCCATGAGGAAGGCTCCTGGAGTGGG
TCTTGCAGCTCCCCAGATTGGAATCCCTTTAAGGATAATTGTTTTGGAAGACACTAAAGAATATATAAGTTATGCACCCAAGGAGGAGATCAAAGCACAAGATAGACGCC
CTTTTGATCTTCTGTTGTTGAGAAGTATCAATTGTCCAGTCCCTTCTTTCATTCAGATGATCATAAAAATATGCAGCTTTCTTGGTCTTCTTGCCAGATTTCTGCATCTG
CTTGTGGGTGTTCTTCCTTGTCCTTTGCATATTACCCATGATCTATCTTTACCTTGTAGTGTTGATGGATTTAGAGCAGTGGTGGAGCGCTACCTTGAGGTTGAGGTAGT
CGGTCTGGATCGAGATGGGAATCCTATCAAGGTGGATGCTTCAGAAAATCTTGCTGCAGCTCGGATGCCGGAGCTCCGGCTGCCAGTGGATAGCTCGGAGAAAGAGGAGA
GAGAGATTGAATTTTCTTCCTCCGTTGCTGCTGCCGCTTTCACTGATATGTTAATTATAATTGTTGCTAATAACACTGCCATTTTCACCACCCTCATTCTGCTCACTTTC
TCTCACTAA
Protein sequenceShow/hide protein sequence
MSYHQCLWGTFILSAVGLTRFLSCSWADHCCKTWQKLSVGVHLERPLVDLSSKGEIVQCRLQQEPAIFCMKKVTVLPVYQYHQLGSTLKTHHGNRRHLESSRSELYTCLA
CIVFKLAFFFCITRNTGIFLSEETPFFCRAISFRPRNLILEIIDSSYMSKTLLSRIKPLLNLKPEPSSSSYFPLKCDIKKLVNDTIKILKSHEKWEQSLETQFNESDIPV
VDISHFVLDRIDDVELGLKFFDWASKNAGSSSLHGSVYSSLIKLLSRFRVFSEIEFTLEDMKTKEIIPTREALSNVLCAYADFGLVDRALELYHGVVKLHNSLPSTYACN
SLLNLLVKHRRLETAHQVYDEMVERDNGNTYVWITILLVLWIYTYIRTFGSMVNGFGKMGNFEAIDLLLMEMKDRGLRVNVQVYNNIIDAQYKLGCEIKAKDTLEEMAEN
CCEPDLVTYNTLINYLCRAGEVKEGEKILQQAIRGGLAPNKFTYTPLVHGYCKQGEYIRASDLLIEMTERGHKVDMVSYGALIHGLVVAGEVDIALTIRDRMMERGVLPD
ANIYNVLMNGLFRKGNFPWPRHGNLDEAKKLFQLTIEKGIDPGVVGYNAMIKGFCKFGMMKDAVLCIDRMRSAHHVPDVFTFSTIIDGYGEVKMAEMLFSVMQSHDLEPS
VVTYSTLIRSFCKEAKLGKAASYFELMLINRCTPNGVIFHYLVNGFANTNAAAVSKGLNNFSENAKSMFEDFFLRMIGDGWTRKAAAHNCILICLCQHRMVKTALQLRDK
MLSLGLYPDAVSFAALIHGICLEGNSKQWKNVISCDLSEKELQIALKYSHELDKSITQGGFSEASDILQAMVEDYESPNQDMNNLKDEQELIKIVTTGPYSKLHHSIESP
NSVLRQVSSTIRSENEPEKFRADYRTSECDSYRHWVQNPEFFSPREEPVSRESNMLALFRTCSTRSVDTLDEKEVRPSSKSWNGSGGSDPECAGARELNGRWRVKGLVLR
RERKSEMEVRSLEEKALMAIRDLGCRGWSSSTCQRQSCTGNSLKFTPASATPRFRKLIYRGSSSWTAFPNLFRPEKFSTSSTSIARAGWFLGLGEQKKMNLPTIVTAGDP
VLHEPAREVDPKEIRSEKIQKIIDDMVQAMRKAPGVGLAAPQIGIPLRIIVLEDTKEYISYAPKEEIKAQDRRPFDLLLLRSINCPVPSFIQMIIKICSFLGLLARFLHL
LVGVLPCPLHITHDLSLPCSVDGFRAVVERYLEVEVVGLDRDGNPIKVDASENLAAARMPELRLPVDSSEKEEREIEFSSSVAAAAFTDMLIIIVANNTAIFTTLILLTF
SH