CuGenDBv2

Sed0023016 (gene) of Chayote v1 genome

Gene ID: Sed0023016
Organism: Sechium edule (Chayote v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG06:7006840..7008939
RNA-Seq Expression: Sed0023016
Synteny: Sed0023016
Gene Ontology terms:
GO:0006629 - lipid metabolic process (biological process)
GO:0140042 - lipid droplet formation (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology
GenBank top hits | e value | % identity | Alignment
KAG6592321.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 81.97
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MI+IRC SLL  RRL H +LQ P+    IDS SCT YIK CSTIKSLKCVHASIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  YR+A++LYAQMLDLGI+PDNFTFPFV KAC   +DLDFG RVH++AV FGY+LDVFVANSLIAMYGRCGR +LA+EVFDK+  RNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA NGQY +GVSLFS ML++GF+ NR V+LNVMAC+H+EKEADD+ R+ M+H+LGL+QSVQNAAVGMYARCGRID AQ+IFNGIHNKDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ DLPLKA+EIFR+MI K + PDSITLLGVI ACLALGSFS AC +HGFV+RRF  NQ+VVETAI+DLYVKCGSLIYAR VFDNM+ERNVISWSTM
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
        ISGYGLHG GR+AI LF+EMKN+T PDH+T VSILAACSHAGLV EGWDCFNAM RDF+LKP SEHYACMVDL GRVGKLKEA +FISKMPIRPNAGVWG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRI+SN+EMAEVAA +LLELD ENPGRYVLLYNIYLSSGKRKEA  IRALMKQRGLRKIAGHT I++KNK+HTFVAGD+SHPQTEMIYSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE+GYTPDLNFVLHDVEEETKEK+L VHSEKLAIVFGLLNSGSG++IRL+KNLRVCGDCH+FTKFVS VA R+IIVRD+ RFH FK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

KAG7025143.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 81.83
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MI+IRC SLL  RRL H +LQ P+    IDS SCT YIK CSTIKSLKCVHASIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  YR+A++LYAQMLDLGI+PDNFTFPFV KAC   +DLDFG RVH++AV FGY+LDVFVANSLIAMYGRC R +LA+EVFDKM  RNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA NGQY +GVSLFS ML++GF+ NR V+LNVMAC+H+EKEADD+ R+ M+H+LGL+QSVQNAAVGMYARCGRID A++IFNGIHNKDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ DLPLKA+EIFR+MI K + PDSITLLGVI ACLALGSFS AC +HGFV+RRF  NQ+VVETAI+DLYVKCGSLIYAR VFDNM+ERNVISWSTM
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
        ISGYGLHG GR+AI LF+EMKN+T PDH+T VSILAACSHAGLV EGWDCFNAM RDF+LKP SEHYACMVDL GRVGKLKEA +FISKMPIRPNAGVWG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRIHSN+EMAEVAA +LLELD ENPGRYVLLYNIYLSSGKRKEA  IRALMKQRGLRKIAGHT I++KNK+HTFVAGD+SHPQTEMIYSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE+GYTPDLNFVLHDVEEETKEK+L VHSEKLAIVFGLLNSGSG+++RL+KNLRVCGDCH+FTKFVS VA R+IIVRD+ RFH FK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

XP_022925426.1 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 81.97
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MI+IRC SLL  RRL H +LQ P+    ID  SCT YIK CSTIKSLKCVHASIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  YR+A++LYAQMLDLGI+PDNFTFPFV KAC   +DLDFG RVH++AV FGY+LDVFVANSLIAMYGRCGR +LA+EVFDKM  RNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA NGQY +GVSLFS ML++GF+ NR V+LNVMAC+H+EKEADD+ R+ M+H+LGL+QSVQNAAVGMYARCGRID A++IFNGIHNKDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ DLPLKA+EIFR+MI K + PDSITLLGVI ACLALGSFS AC +HGFV+RRF  NQ+VVETAI+DLYVKCGSLIYAR VFDNM+ERNVISWSTM
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
        ISGYGLHG GR+AI LF+EMKN+T PDH+T VSILAACSHAGLV EGWDCFNAM RDF+LKP SEHYACMVDL GRVGKLKEA +FISKMPIRPNAGVWG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRIHSN+EMAEVAA +LLELD ENPGRYVLLYNIYLSSGKRKEA  IRALMKQRGLRKIAGHT I++KNK+HTFVAGD+SHPQTEMIYSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE+GYTPDLNFVLHDVEEETKEK+L VHSEKLAIVFGLLNSGSG++IRL+KNLRVCGDCH+FTKFVS VA R+IIVRD+ RFH FK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

XP_023535836.1 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 0.0e+00 | % identity: 81.97
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MI+IRC SLL  RRL H +LQLP+    IDS SCT YIK CSTI+SLKCVHASIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  YR+ + LYAQMLDLGI+PDNFTFPFV KAC   +DLDFG RVH++AV FGY+LDVFVANSLIAMYGRCGR +LA+EVFDKM  RNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA NGQY +GVSLFS ML +GF+ NR V+LNVMACIH+EKEADD+ R+ M+H+LGL+QSVQNAAVGMYARCGRID AQ+IFNGI NKDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ +LPLKALEIFR++I K I PDSITLLGVI ACLALGSFS AC +HGFV+RR   NQIVVETAI+DLYVKCGSLIYAR VFDNM+ERNVISWSTM
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
        ISGYGLHG GR+AI LF+EMKNST PDH+T VSILAACSHAGLV EGWDCFNAM RDF+LKP  EHYACMVDL GRVG+LKEA DFISKMPIRPNAGVWG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRIHSN+EMAEVAA +LLELD ENPGRYVLLYNIYLSSGKRKEA  IRALMKQRGLRKIAGHT I++KNK+HTFVAGD+SHPQTEMIYSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE+GYTPDLNFVLHDVEEETKEK+L VHSEKLAIVFGLLNSGSG+++RL+KNLRVCGDCH+FTKFVS VA R+IIVRD+ RFH FK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

XP_038884398.1 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like [Benincasa hispida] | e value: 0.0e+00 | % identity: 82.26
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MIL R  SLL  RRLYH +LQL      IDS SCTFYIK CSTIKSLKC+HASIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  +RRA++LY QMLDLGI PDNFTFPFV KAC   EDLDFG RVH++AV FGY+LDVFVANSLIAMYGRCGR +LA+EVFDKM GRNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA N QY +GVSLFS ML +GF+PNR V+LNVMACI +EKEADD+ R+ +++KLGLDQSVQNAAVGMYARCG+ID AQ IFNGIHNKDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ DLPL AL+ FR+MI   I PDSITLLGVIHACLALG FS AC +HGFV+RR   NQIVVETAI+DLYVKCGSLIYAR VFDNM+ERNVISWSTM
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
        ISGYGLHG GR+AI LF+EMKNST PDH+T VS+LAACSHAGLV EGWDCFN+M RDF+LKP  EHYACMVDLFGRVGKLKEAHDFISKMPIRPNAG+WG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRIHSN+EMAEVAAKHLLELD ENPGRYVLLYNIYLSSG+RKEA  IRALMK+RGLRKIAGHTTI+IKNKIHTFVAGDQSHPQTEMIYSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE GYTPDLNFVLHDVEEETKEK+L VHSEKLAIVFGLLNS  G++IRL+KNLRVCGDCH+FTKF+S VA R+IIVRD+ RFHHFK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

TrEMBL top hits | e value | % identity | Alignment
A0A2I4EAK9 pentatricopeptide repeat-containing protein At1g11290, chloroplastic-like | e value: 8.3e-283 | % identity: 67.59
Query:  LYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAV
        LYH   Q+  L   I+ ++C   IK+C TI+SLK VHAS+LR+HLH NLFF T LISQY  LGS+S AYSLFS  +S DVFLWNVMLRGFVD  LY R++
Subjt:  LYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAV

Query:  ILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVG
        +LY +ML  GIQPDNFT+PF+LKAC    DL+FG  VH N +  GY  DV V NSL+ MYG+C RL +++ VFDK+  R++VSWSS+IGA A NGQY  G
Subjt:  ILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVG

Query:  VSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALE
        +SLFS+ML +G  PNR ++LNVM+C+H E +ADD+CR+V+NH + LD+ V+NAA+GMYARCGRID+A++ F+GI  KDL+SWA+MIEAYVQ DLPL ALE
Subjt:  VSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALE

Query:  IFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAI
        +F++M+ + I  DS++LL VIHAC  L SF  A  IHGF+ R FL NQI VETA++DLYVKCG+L+YAR +FDNM+ERN+ISWST+ISGYG+HG GREA+
Subjt:  IFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAI

Query:  RLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEM
         LFD+MK+S  PDH+  +S+L+ACSH GL+VEGW+CFN+M RDF + PRSEHYACMVDL GR G+L EAHDFI +MP+ P+AGVWGALLGACRIH N+E+
Subjt:  RLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEM

Query:  AEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNF
        AEVAAK L  LDS+N GRYVLL NIY+SSGKR +A  IRALMKQRGLRKIAGHT IEIKNKI+TFVAGDQSHPQT +IYSEL+KVM+R+++ GYTPDLNF
Subjt:  AEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNF

Query:  VLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        VLHDVEEE KEK+L  HSEKLAIVFGLLN+GS + IR+RKNLRVCGDCH+ TK++S   GRDIIVRD+HRFHHF +GTCSCGDYW
Subjt:  VLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

A0A6J1EC65 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 | e value: 0.0e+00 | % identity: 81.97
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MI+IRC SLL  RRL H +LQ P+    ID  SCT YIK CSTIKSLKCVHASIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  YR+A++LYAQMLDLGI+PDNFTFPFV KAC   +DLDFG RVH++AV FGY+LDVFVANSLIAMYGRCGR +LA+EVFDKM  RNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA NGQY +GVSLFS ML++GF+ NR V+LNVMAC+H+EKEADD+ R+ M+H+LGL+QSVQNAAVGMYARCGRID A++IFNGIHNKDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ DLPLKA+EIFR+MI K + PDSITLLGVI ACLALGSFS AC +HGFV+RRF  NQ+VVETAI+DLYVKCGSLIYAR VFDNM+ERNVISWSTM
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
        ISGYGLHG GR+AI LF+EMKN+T PDH+T VSILAACSHAGLV EGWDCFNAM RDF+LKP SEHYACMVDL GRVGKLKEA +FISKMPIRPNAGVWG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRIHSN+EMAEVAA +LLELD ENPGRYVLLYNIYLSSGKRKEA  IRALMKQRGLRKIAGHT I++KNK+HTFVAGD+SHPQTEMIYSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE+GYTPDLNFVLHDVEEETKEK+L VHSEKLAIVFGLLNSGSG++IRL+KNLRVCGDCH+FTKFVS VA R+IIVRD+ RFH FK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

A0A6J1ID68 pentatricopeptide repeat-containing protein At3g26782, mitochondrial-like isoform X1 | e value: 0.0e+00 | % identity: 81.26
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MI+IRC SLL  RRL H +LQLP+    IDS SCT  IK CSTIKSLKCVH SIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  YR+A++LYAQMLDLGI+PDNFTFPFV KAC   +DLDFG RVH+++V FGY+LDVFVANSLIAMYGRCGR +LA+EVFDKM  RNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA NGQY +GVSLFS ML +GF+ NR V+LNVMACIH+EKEADD+CR+ M+H+LGL+QSVQNAAVGMYARCGRID AQ+IFNGIH+KDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ DLPLKA+EIFR+M  K + PDSITLLGVI ACLALGSFS AC +HGFV+RR   NQIVVETAI+DLYVKCGSLIYAR VFDN++ERNVISWSTM
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
        ISGYGLHG GR+AI LF+EMKNST PDH+T VSILAACSHAGLV EGWDCFNAM RDF+LKP  EHYACMVDL GRVGKLKEA +FISKMPIRPNAGVWG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRIHSN+EMAEVAA +LLELD ENPGRYVLLYNIYLSSGKRKEA  IRALMKQRGLRKIAGHT I+IKNK+HTFVAGD+SHPQTEM+YSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE+GYTPDLNFVLHDVEEETKE++L VHSEKLAIVFGLLNSGSG+++RL+KNLRVC DCH+F KFVS VA R+IIVRD+ RFH FK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

A0A6J1IEQ3 pentatricopeptide repeat-containing protein At2g01510, mitochondrial-like isoform X2 | e value: 1.1e-287 | % identity: 71.96
Query:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM
        MI+IRC SLL  RRL H +LQLP+    IDS SCT  IK CSTIKSLKCVH SIL+A+LHLNLFFCTTLISQY  LGSVS AYSLFSLL+SLDVFLWNVM
Subjt:  MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVM

Query:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS
        LRGFVDA  YR+A++LYAQMLDLGI+PDNFTFPFV KAC   +DLDFG RVH+++V FGY+LDVFVANSLIAMYGRCGR +LA+EVFDKM  RNVVSWSS
Subjt:  LRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSS

Query:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI
        IIGAYA NGQY +GVSLFS ML +GF+ NR V+LNVMACIH+EKEADD+CR+ M+H+LGL+QSVQNAAVGMYARCGRID AQ+IFNGIH+KDLVSWASMI
Subjt:  IIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMI

Query:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM
        EAYVQ DLPLKA+EIFR+M  K                                                                              
Subjt:  EAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTM

Query:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG
          GYGLHG GR+AI LF+EMKNST PDH+T VSILAACSHAGLV EGWDCFNAM RDF+LKP  EHYACMVDL GRVGKLKEA +FISKMPIRPNAGVWG
Subjt:  ISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWG

Query:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM
        ALLGACRIHSN+EMAEVAA +LLELD ENPGRYVLLYNIYLSSGKRKEA  IRALMKQRGLRKIAGHT I+IKNK+HTFVAGD+SHPQTEM+YSELDKV+
Subjt:  ALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVM

Query:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        YR+QE+GYTPDLNFVLHDVEEETKE++L VHSEKLAIVFGLLNSGSG+++RL+KNLRVC DCH+F KFVS VA R+IIVRD+ RFH FK+GTCSCGDYW
Subjt:  YRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

A0A7N2LQK3 DYW_deaminase domain-containing protein | e value: 2.4e-274 | % identity: 65.79
Query:  YHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVI
        YH   Q   L   I+ ++C   IK+C T +SL+ VH S+L++HLHLNLFF T LI+QY  LGS+S AY+LFS  +S DVFLWNVM++GFVD  LY R+++
Subjt:  YHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVI

Query:  LYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGV
        LY+QM +LGIQPDNFTFPFVLKAC   +D+ FG  VH N + FGY  DV+V NSLI+MYG+C  L  ++ VFDKM  +N+VSWSS+IGAY  NG Y  GV
Subjt:  LYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGV

Query:  SLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEI
         LFSQML  G  PNR V+LNVMAC++   EADDICRVV+++ L  D+SVQNAA+ MYARCGRID+A++ F+GI  KDLVSWASMIEAY Q DL L+AL++
Subjt:  SLFSQMLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEI

Query:  FRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIR
        F++MI + I  DS+TLL VIHAC  L  F +A  +HG + RRF +NQI +ET+++DLYVKCGSL+YAR  FD MQERN++SWST+ISGYG+HG GREA+ 
Subjt:  FRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIR

Query:  LFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMA
        LFD+MK S  PDHV  +S+L+ACSH GL+VEGW+CFN+M RDFQ+ PR EHYACMVDL GR G+L EA DFI +MP RP+AGVWGALLGACRIH N+E+A
Subjt:  LFDEMKNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMA

Query:  EVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFV
        EVAAK+L ELD++NPGRYVLL NIY SSGKR +A  +RALMK+RG+RKIAGHT IEI+NK++TFVAGDQS+PQT +IY+EL+K++ R++E GY PDLNFV
Subjt:  EVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFV

Query:  LHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        LHDVEEE KEKML VHSEKLAIVFGLLN+   ++IR++KNLRVCGDCH+ TKF+S V  R+IIVRD+HRFHHFKEG CSCGDYW
Subjt:  LHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

SwissProt top hits | e value | % identity | Alignment
P0C899 Putative pentatricopeptide repeat-containing protein At3g49142 | e value: 1.1e-146 | % identity: 39.91
Query:  IKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTE
        I++L+ VH+ I+   L  N      L+  Y  L  V+ A  +F  +   +V + NVM+R +V+   Y   V ++  M    ++PD++TFP VLKAC  + 
Subjt:  IKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTE

Query:  DLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAE
         +  G ++H +A   G    +FV N L++MYG+CG L  A+ V D+M  R+VVSW+S++  YA N ++                                
Subjt:  DLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAE

Query:  KEADDICRVVMNHKLGLDQSVQNAAVGMYAR--CGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLAL
         +A ++CR + + K+  D     + +   +      +   + +F  +  K LVSW  MI  Y++  +P++A+E++ +M      PD++++  V+ AC   
Subjt:  KEADDICRVVMNHKLGLDQSVQNAAVGMYAR--CGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLAL

Query:  GSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNPDHVTLVSILAACSH
         + S    IHG++ R+ L   +++E A++D+Y KCG L  AR+VF+NM+ R+V+SW+ MIS YG  G+G +A+ LF ++++S   PD +  V+ LAACSH
Subjt:  GSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNPDHVTLVSILAACSH

Query:  AGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIY
        AGL+ EG  CF  M   +++ PR EH ACMVDL GR GK+KEA+ FI  M + PN  VWGALLGACR+HS+ ++  +AA  L +L  E  G YVLL NIY
Subjt:  AGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIY

Query:  LSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFG
          +G+ +E  +IR +MK +GL+K  G + +E+   IHTF+ GD+SHPQ++ IY ELD ++ +++E GY PD    LHDVEEE KE  L VHSEKLAIVF 
Subjt:  LSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFG

Query:  LLNS-----GSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        L+N+      S   IR+ KNLR+CGDCH   K +S +  R+II+RD++RFH F+ G CSCGDYW
Subjt:  LLNS-----GSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | e value: 3.9e-152 | % identity: 40.89
Query:  TFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFT
        T+ +K C     L   K +H  ++++   L+LF  T L + Y     V+ A  +F  +   D+  WN ++ G+    + R A+ +   M +  ++P   T
Subjt:  TFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFT

Query:  FPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRP
           VL A  +   +  G  +H  A+  G+   V ++ +L+ MY +CG L+ A+++FD M  RNVVSW+S+I AY  N      + +F +ML +G    +P
Subjt:  FPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRP

Query:  VVLNVMACIHAEKEADDICRVVMNHK----LGLDQ--SVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIR
          ++VM  +HA  +  D+ R    HK    LGLD+  SV N+ + MY +C  +D A  +F  + ++ LVSW +MI  + Q   P+ AL  F +M  ++++
Subjt:  VVLNVMACIHAEKEADDICRVVMNHK----LGLDQ--SVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIR

Query:  PDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNST-
        PD+ T + VI A   L    +A  IHG VMR  L   + V TA++D+Y KCG+++ AR +FD M ER+V +W+ MI GYG HG G+ A+ LF+EM+  T 
Subjt:  PDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNST-

Query:  NPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLE
         P+ VT +S+++ACSH+GLV  G  CF  M+ ++ ++   +HY  MVDL GR G+L EA DFI +MP++P   V+GA+LGAC+IH N+  AE AA+ L E
Subjt:  NPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLE

Query:  LDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETK
        L+ ++ G +VLL NIY ++   ++   +R  M ++GLRK  G + +EIKN++H+F +G  +HP ++ IY+ L+K++  ++E GY PD N VL  VE + K
Subjt:  LDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETK

Query:  EKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        E++L  HSEKLAI FGLLN+ +GT I +RKNLRVC DCH+ TK++S V GR+I+VRD  RFHHFK G CSCGDYW
Subjt:  EKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | e value: 3.4e-148 | % identity: 38.03
Query:  LQLPILQHTIDSNSCTFY---IKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVIL
        L  P+L      +S +FY   I   +    LK +HA +L   L  + F  T LI      G ++ A  +F  L    +F WN ++RG+     ++ A+++
Subjt:  LQLPILQHTIDSNSCTFY---IKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVIL

Query:  YAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFD--KMHGRNVVSWSSIIGAYAYNGQYVVG
        Y+ M    + PD+FTFP +LKAC     L  G  VH      G+  DVFV N LIA+Y +C RL  A+ VF+   +  R +VSW++I+ AYA NG+ +  
Subjt:  YAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFD--KMHGRNVVSWSSIIGAYAYNGQYVVG

Query:  VSLFSQMLLQGFEPNRPV---VLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLK
        + +FSQM     +P+      VLN   C+   K+   I   V+   L ++  +  +   MYA+CG++  A+ +F+ + + +L+ W +MI  Y +     +
Subjt:  VSLFSQMLLQGFEPNRPV---VLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLK

Query:  ALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGR
        A+++F +MI K +RPD+I++   I AC  +GS   A S++ +V R   R+ + + +A++D++ KCGS+  AR VFD   +R+V+ WS MI GYGLHG+ R
Subjt:  ALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGR

Query:  EAIRLFDEM-KNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHS
        EAI L+  M +   +P+ VT + +L AC+H+G+V EGW  FN M  D ++ P+ +HYAC++DL GR G L +A++ I  MP++P   VWGALL AC+ H 
Subjt:  EAIRLFDEM-KNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHS

Query:  NIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTP
        ++E+ E AA+ L  +D  N G YV L N+Y ++        +R  MK++GL K  G + +E++ ++  F  GD+SHP+ E I  +++ +  RL+E G+  
Subjt:  NIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTP

Query:  DLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        + +  LHD+ +E  E+ LC HSE++AI +GL+++  GT +R+ KNLR C +CH+ TK +S +  R+I+VRD++RFHHFK+G CSCGDYW
Subjt:  DLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial | e value: 4.4e-148 | % identity: 40.71
Query:  KCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKAC
        K  + K+L C  + +L    H      TTL ++YV                  DVF WN ++     +     A++ ++ M  L + P   +FP  +KAC
Subjt:  KCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKAC

Query:  RSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVM--
         S  D+  G + H  A VFGYQ D+FV+++LI MY  CG+L+ A++VFD++  RN+VSW+S+I  Y  NG  +  VSLF  +L+   + +  + L+ M  
Subjt:  RSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVM--

Query:  -----AC--IHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGR--IDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQ-KSIRPD
             AC  + A+   + I   V+        SV N  +  YA+ G   + +A+KIF+ I +KD VS+ S++  Y Q  +  +A E+FR++++ K +  +
Subjt:  -----AC--IHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGR--IDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQ-KSIRPD

Query:  SITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNP
        +ITL  V+ A    G+      IH  V+R  L + ++V T+I+D+Y KCG +  AR  FD M+ +NV SW+ MI+GYG+HG   +A+ LF  M +S   P
Subjt:  SITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNP

Query:  DHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELD
        +++T VS+LAACSHAGL VEGW  FNAM+  F ++P  EHY CMVDL GR G L++A+D I +M ++P++ +W +LL ACRIH N+E+AE++   L ELD
Subjt:  DHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELD

Query:  SENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEK
        S N G Y+LL +IY  +G+ K+   +R +MK RGL K  G + +E+  ++H F+ GD+ HPQ E IY  L ++  +L E GY  + + V HDV+EE KE 
Subjt:  SENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEK

Query:  MLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
         L VHSEKLAI FG++N+  G+ + + KNLRVC DCH+  K +S +  R+ +VRD+ RFHHFK+G CSCGDYW
Subjt:  MLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic | e value: 4.7e-142 | %identity: 39.18
Query:  IDSNSCTFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGI
        +DS + +   K  S+++S+   + +H  IL++          +L++ Y+    V  A  +F  +   DV  WN ++ G+V   L  + + ++ QML  GI
Subjt:  IDSNSCTFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGI

Query:  QPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQG
        + D  T   V   C  +  +  G  VH   V   +  +    N+L+ MY +CG L  A+ VF +M  R+VVS++S+I  YA  G     V LF +M  +G
Subjt:  QPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQG

Query:  FEPNRPVVLNVMACIHAEKEADDICRV---VMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMI-Q
          P+   V  V+ C    +  D+  RV   +  + LG D  V NA + MYA+CG +  A+ +F+ +  KD++SW ++I  Y +     +AL +F  ++ +
Subjt:  FEPNRPVVLNVMACIHAEKEADDICRV---VMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMI-Q

Query:  KSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMK
        K   PD  T+  V+ AC +L +F     IHG++MR    +   V  +++D+Y KCG+L+ A  +FD++  ++++SW+ MI+GYG+HG G+EAI LF++M+
Subjt:  KSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMK

Query:  NS-TNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAK
         +    D ++ VS+L ACSH+GLV EGW  FN M  + +++P  EHYAC+VD+  R G L +A+ FI  MPI P+A +WGALL  CRIH ++++AE  A+
Subjt:  NS-TNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAK

Query:  HLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVE
         + EL+ EN G YVL+ NIY  + K ++   +R  + QRGLRK  G + IEIK +++ FVAGD S+P+TE I + L KV  R+ E+GY+P   + L D E
Subjt:  HLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVE

Query:  EETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        E  KE+ LC HSEKLA+  G+++SG G IIR+ KNLRVCGDCH   KF+S +  R+I++RDS+RFH FK+G CSC  +W
Subjt:  EETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

Arabidopsis top hits | e value | %identity | Alignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.7e-153 | %identity: 40.89
Query:  TFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFT
        T+ +K C     L   K +H  ++++   L+LF  T L + Y     V+ A  +F  +   D+  WN ++ G+    + R A+ +   M +  ++P   T
Subjt:  TFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFT

Query:  FPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRP
           VL A  +   +  G  +H  A+  G+   V ++ +L+ MY +CG L+ A+++FD M  RNVVSW+S+I AY  N      + +F +ML +G    +P
Subjt:  FPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRP

Query:  VVLNVMACIHAEKEADDICRVVMNHK----LGLDQ--SVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIR
          ++VM  +HA  +  D+ R    HK    LGLD+  SV N+ + MY +C  +D A  +F  + ++ LVSW +MI  + Q   P+ AL  F +M  ++++
Subjt:  VVLNVMACIHAEKEADDICRVVMNHK----LGLDQ--SVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIR

Query:  PDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNST-
        PD+ T + VI A   L    +A  IHG VMR  L   + V TA++D+Y KCG+++ AR +FD M ER+V +W+ MI GYG HG G+ A+ LF+EM+  T 
Subjt:  PDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNST-

Query:  NPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLE
         P+ VT +S+++ACSH+GLV  G  CF  M+ ++ ++   +HY  MVDL GR G+L EA DFI +MP++P   V+GA+LGAC+IH N+  AE AA+ L E
Subjt:  NPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLE

Query:  LDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETK
        L+ ++ G +VLL NIY ++   ++   +R  M ++GLRK  G + +EIKN++H+F +G  +HP ++ IY+ L+K++  ++E GY PD N VL  VE + K
Subjt:  LDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETK

Query:  EKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        E++L  HSEKLAI FGLLN+ +GT I +RKNLRVC DCH+ TK++S V GR+I+VRD  RFHHFK G CSCGDYW
Subjt:  EKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

AT3G12770.1 mitochondrial editing factor 22 | e value: 2.4e-149 | %identity: 38.03
Query:  LQLPILQHTIDSNSCTFY---IKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVIL
        L  P+L      +S +FY   I   +    LK +HA +L   L  + F  T LI      G ++ A  +F  L    +F WN ++RG+     ++ A+++
Subjt:  LQLPILQHTIDSNSCTFY---IKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVIL

Query:  YAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFD--KMHGRNVVSWSSIIGAYAYNGQYVVG
        Y+ M    + PD+FTFP +LKAC     L  G  VH      G+  DVFV N LIA+Y +C RL  A+ VF+   +  R +VSW++I+ AYA NG+ +  
Subjt:  YAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFD--KMHGRNVVSWSSIIGAYAYNGQYVVG

Query:  VSLFSQMLLQGFEPNRPV---VLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLK
        + +FSQM     +P+      VLN   C+   K+   I   V+   L ++  +  +   MYA+CG++  A+ +F+ + + +L+ W +MI  Y +     +
Subjt:  VSLFSQMLLQGFEPNRPV---VLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLK

Query:  ALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGR
        A+++F +MI K +RPD+I++   I AC  +GS   A S++ +V R   R+ + + +A++D++ KCGS+  AR VFD   +R+V+ WS MI GYGLHG+ R
Subjt:  ALEIFRKMIQKSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGR

Query:  EAIRLFDEM-KNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHS
        EAI L+  M +   +P+ VT + +L AC+H+G+V EGW  FN M  D ++ P+ +HYAC++DL GR G L +A++ I  MP++P   VWGALL AC+ H 
Subjt:  EAIRLFDEM-KNSTNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHS

Query:  NIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTP
        ++E+ E AA+ L  +D  N G YV L N+Y ++        +R  MK++GL K  G + +E++ ++  F  GD+SHP+ E I  +++ +  RL+E G+  
Subjt:  NIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTP

Query:  DLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        + +  LHD+ +E  E+ LC HSE++AI +GL+++  GT +R+ KNLR C +CH+ TK +S +  R+I+VRD++RFHHFK+G CSCGDYW
Subjt:  DLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 3.1e-149 | %identity: 40.71
Query:  KCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKAC
        K  + K+L C  + +L    H      TTL ++YV                  DVF WN ++     +     A++ ++ M  L + P   +FP  +KAC
Subjt:  KCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKAC

Query:  RSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVM--
         S  D+  G + H  A VFGYQ D+FV+++LI MY  CG+L+ A++VFD++  RN+VSW+S+I  Y  NG  +  VSLF  +L+   + +  + L+ M  
Subjt:  RSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVM--

Query:  -----AC--IHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGR--IDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQ-KSIRPD
             AC  + A+   + I   V+        SV N  +  YA+ G   + +A+KIF+ I +KD VS+ S++  Y Q  +  +A E+FR++++ K +  +
Subjt:  -----AC--IHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGR--IDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQ-KSIRPD

Query:  SITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNP
        +ITL  V+ A    G+      IH  V+R  L + ++V T+I+D+Y KCG +  AR  FD M+ +NV SW+ MI+GYG+HG   +A+ LF  M +S   P
Subjt:  SITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNP

Query:  DHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELD
        +++T VS+LAACSHAGL VEGW  FNAM+  F ++P  EHY CMVDL GR G L++A+D I +M ++P++ +W +LL ACRIH N+E+AE++   L ELD
Subjt:  DHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELD

Query:  SENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEK
        S N G Y+LL +IY  +G+ K+   +R +MK RGL K  G + +E+  ++H F+ GD+ HPQ E IY  L ++  +L E GY  + + V HDV+EE KE 
Subjt:  SENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEK

Query:  MLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
         L VHSEKLAI FG++N+  G+ + + KNLRVC DCH+  K +S +  R+ +VRD+ RFHHFK+G CSCGDYW
Subjt:  MLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

AT3G49142.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 7.7e-148 | %identity: 39.91
Query:  IKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTE
        I++L+ VH+ I+   L  N      L+  Y  L  V+ A  +F  +   +V + NVM+R +V+   Y   V ++  M    ++PD++TFP VLKAC  + 
Subjt:  IKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGIQPDNFTFPFVLKACRSTE

Query:  DLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAE
         +  G ++H +A   G    +FV N L++MYG+CG L  A+ V D+M  R+VVSW+S++  YA N ++                                
Subjt:  DLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQGFEPNRPVVLNVMACIHAE

Query:  KEADDICRVVMNHKLGLDQSVQNAAVGMYAR--CGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLAL
         +A ++CR + + K+  D     + +   +      +   + +F  +  K LVSW  MI  Y++  +P++A+E++ +M      PD++++  V+ AC   
Subjt:  KEADDICRVVMNHKLGLDQSVQNAAVGMYAR--CGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIRPDSITLLGVIHACLAL

Query:  GSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNPDHVTLVSILAACSH
         + S    IHG++ R+ L   +++E A++D+Y KCG L  AR+VF+NM+ R+V+SW+ MIS YG  G+G +A+ LF ++++S   PD +  V+ LAACSH
Subjt:  GSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNS-TNPDHVTLVSILAACSH

Query:  AGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIY
        AGL+ EG  CF  M   +++ PR EH ACMVDL GR GK+KEA+ FI  M + PN  VWGALLGACR+HS+ ++  +AA  L +L  E  G YVLL NIY
Subjt:  AGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIY

Query:  LSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFG
          +G+ +E  +IR +MK +GL+K  G + +E+   IHTF+ GD+SHPQ++ IY ELD ++ +++E GY PD    LHDVEEE KE  L VHSEKLAIVF 
Subjt:  LSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFG

Query:  LLNS-----GSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        L+N+      S   IR+ KNLR+CGDCH   K +S +  R+II+RD++RFH F+ G CSCGDYW
Subjt:  LLNS-----GSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 3.4e-143 | %identity: 39.18
Query:  IDSNSCTFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGI
        +DS + +   K  S+++S+   + +H  IL++          +L++ Y+    V  A  +F  +   DV  WN ++ G+V   L  + + ++ QML  GI
Subjt:  IDSNSCTFYIKKCSTIKSL---KCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLYRRAVILYAQMLDLGI

Query:  QPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQG
        + D  T   V   C  +  +  G  VH   V   +  +    N+L+ MY +CG L  A+ VF +M  R+VVS++S+I  YA  G     V LF +M  +G
Subjt:  QPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQMLLQG

Query:  FEPNRPVVLNVMACIHAEKEADDICRV---VMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMI-Q
          P+   V  V+ C    +  D+  RV   +  + LG D  V NA + MYA+CG +  A+ +F+ +  KD++SW ++I  Y +     +AL +F  ++ +
Subjt:  FEPNRPVVLNVMACIHAEKEADDICRV---VMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMI-Q

Query:  KSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMK
        K   PD  T+  V+ AC +L +F     IHG++MR    +   V  +++D+Y KCG+L+ A  +FD++  ++++SW+ MI+GYG+HG G+EAI LF++M+
Subjt:  KSIRPDSITLLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMK

Query:  NS-TNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAK
         +    D ++ VS+L ACSH+GLV EGW  FN M  + +++P  EHYAC+VD+  R G L +A+ FI  MPI P+A +WGALL  CRIH ++++AE  A+
Subjt:  NS-TNPDHVTLVSILAACSHAGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAK

Query:  HLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVE
         + EL+ EN G YVL+ NIY  + K ++   +R  + QRGLRK  G + IEIK +++ FVAGD S+P+TE I + L KV  R+ E+GY+P   + L D E
Subjt:  HLLELDSENPGRYVLLYNIYLSSGKRKEAIHIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVE

Query:  EETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
        E  KE+ LC HSEKLA+  G+++SG G IIR+ KNLRVCGDCH   KF+S +  R+I++RDS+RFH FK+G CSC  +W
Subjt:  EETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCGDCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW


Sequences
CDS sequence
ATGATTCTGATACGTTGTTGGAGCCTCCTTTTACCCCGTCGTCTCTACCACTTGAATCTCCAACTTCCAATTTTGCAACATACCATTGATTCTAATTCATGCACCTTCTA
CATCAAAAAGTGCAGTACAATCAAATCCTTGAAGTGTGTTCACGCTTCAATTCTCAGAGCTCACCTCCACCTCAACTTGTTCTTTTGCACCACTCTCATATCTCAATATG
TTTTGCTTGGCTCTGTTTCTTGCGCATATTCATTGTTCTCTTTGTTGCGGTCGCTTGATGTTTTTCTGTGGAACGTGATGCTTCGTGGTTTTGTCGATGCTAGGCTTTAT
CGCAGGGCCGTGATTTTATATGCCCAAATGTTGGATTTGGGCATTCAACCTGATAATTTTACATTTCCATTTGTGTTGAAGGCGTGTCGGTCTACGGAGGATTTGGATTT
TGGGGCTAGGGTTCATCATAATGCTGTGGTTTTTGGGTATCAGTTGGATGTTTTCGTTGCAAATTCGCTCATTGCAATGTATGGTAGATGTGGGCGTTTAAAGCTTGCAC
AAGAGGTTTTTGATAAAATGCATGGGAGAAATGTGGTGTCTTGGAGTTCAATAATTGGTGCTTATGCATATAATGGGCAGTATGTTGTGGGAGTTTCATTGTTCTCGCAG
ATGTTGCTCCAAGGATTTGAACCAAACCGGCCGGTAGTGTTGAATGTGATGGCGTGCATTCATGCAGAGAAGGAAGCCGATGATATTTGTCGAGTTGTTATGAATCACAA
GCTTGGTTTGGATCAATCAGTTCAAAATGCAGCAGTTGGTATGTATGCTCGATGTGGAAGAATTGACCTTGCTCAAAAGATCTTTAATGGAATTCATAATAAGGATTTGG
TTTCGTGGGCATCAATGATTGAAGCTTACGTGCAGGGTGATCTTCCTTTAAAAGCTTTGGAGATTTTTAGAAAAATGATACAAAAAAGTATTCGGCCTGATTCTATCACC
CTTTTGGGTGTGATTCATGCTTGTTTAGCATTAGGATCCTTTAGCAATGCATGCTCAATACATGGTTTTGTTATGAGAAGATTTTTAAGAAACCAAATAGTGGTTGAAAC
TGCTATTCTTGATCTCTATGTCAAATGTGGAAGTTTAATATATGCCAGAAATGTTTTCGATAACATGCAGGAAAGAAACGTCATCTCATGGAGCACCATGATTTCAGGGT
ATGGTTTACACGGACAAGGACGAGAAGCTATCCGTCTCTTCGATGAGATGAAGAACTCAACGAATCCTGACCACGTAACACTTGTATCGATATTGGCAGCGTGTAGTCAT
GCTGGATTGGTTGTTGAAGGATGGGATTGCTTCAATGCCATGGAAAGAGATTTTCAGCTGAAACCAAGATCCGAACATTATGCGTGTATGGTCGATCTCTTTGGTCGAGT
TGGAAAGCTTAAGGAAGCTCATGATTTTATCTCAAAAATGCCAATTAGGCCCAATGCTGGTGTTTGGGGTGCATTGCTTGGGGCATGTAGAATACATTCAAATATAGAAA
TGGCTGAAGTTGCTGCAAAGCACTTACTTGAGTTGGACTCGGAGAATCCCGGTAGATATGTTCTCTTGTACAATATTTATTTGTCATCTGGAAAAAGAAAAGAAGCAATT
CATATTAGGGCTCTTATGAAACAGAGAGGTTTGAGAAAAATTGCAGGCCACACAACCATTGAGATTAAAAACAAGATTCATACATTTGTGGCTGGGGATCAATCTCATCC
ACAAACTGAAATGATATACTCTGAGTTGGATAAAGTGATGTATAGGCTGCAAGAAAAAGGGTACACCCCTGATTTGAACTTTGTATTGCATGATGTGGAGGAAGAGACAA
AGGAAAAGATGTTGTGTGTGCATAGTGAGAAGCTTGCTATTGTTTTCGGGCTCTTGAACTCAGGATCAGGGACCATCATTAGACTACGGAAGAATCTCCGAGTTTGTGGC
GATTGTCATTCATTTACAAAGTTCGTGTCAAATGTTGCAGGAAGAGATATTATAGTTAGAGACTCTCATAGGTTTCACCATTTCAAGGAGGGAACTTGCTCGTGTGGGGA
TTATTGGTGA
mRNA sequence
ATGATTCTGATACGTTGTTGGAGCCTCCTTTTACCCCGTCGTCTCTACCACTTGAATCTCCAACTTCCAATTTTGCAACATACCATTGATTCTAATTCATGCACCTTCTA
CATCAAAAAGTGCAGTACAATCAAATCCTTGAAGTGTGTTCACGCTTCAATTCTCAGAGCTCACCTCCACCTCAACTTGTTCTTTTGCACCACTCTCATATCTCAATATG
TTTTGCTTGGCTCTGTTTCTTGCGCATATTCATTGTTCTCTTTGTTGCGGTCGCTTGATGTTTTTCTGTGGAACGTGATGCTTCGTGGTTTTGTCGATGCTAGGCTTTAT
CGCAGGGCCGTGATTTTATATGCCCAAATGTTGGATTTGGGCATTCAACCTGATAATTTTACATTTCCATTTGTGTTGAAGGCGTGTCGGTCTACGGAGGATTTGGATTT
TGGGGCTAGGGTTCATCATAATGCTGTGGTTTTTGGGTATCAGTTGGATGTTTTCGTTGCAAATTCGCTCATTGCAATGTATGGTAGATGTGGGCGTTTAAAGCTTGCAC
AAGAGGTTTTTGATAAAATGCATGGGAGAAATGTGGTGTCTTGGAGTTCAATAATTGGTGCTTATGCATATAATGGGCAGTATGTTGTGGGAGTTTCATTGTTCTCGCAG
ATGTTGCTCCAAGGATTTGAACCAAACCGGCCGGTAGTGTTGAATGTGATGGCGTGCATTCATGCAGAGAAGGAAGCCGATGATATTTGTCGAGTTGTTATGAATCACAA
GCTTGGTTTGGATCAATCAGTTCAAAATGCAGCAGTTGGTATGTATGCTCGATGTGGAAGAATTGACCTTGCTCAAAAGATCTTTAATGGAATTCATAATAAGGATTTGG
TTTCGTGGGCATCAATGATTGAAGCTTACGTGCAGGGTGATCTTCCTTTAAAAGCTTTGGAGATTTTTAGAAAAATGATACAAAAAAGTATTCGGCCTGATTCTATCACC
CTTTTGGGTGTGATTCATGCTTGTTTAGCATTAGGATCCTTTAGCAATGCATGCTCAATACATGGTTTTGTTATGAGAAGATTTTTAAGAAACCAAATAGTGGTTGAAAC
TGCTATTCTTGATCTCTATGTCAAATGTGGAAGTTTAATATATGCCAGAAATGTTTTCGATAACATGCAGGAAAGAAACGTCATCTCATGGAGCACCATGATTTCAGGGT
ATGGTTTACACGGACAAGGACGAGAAGCTATCCGTCTCTTCGATGAGATGAAGAACTCAACGAATCCTGACCACGTAACACTTGTATCGATATTGGCAGCGTGTAGTCAT
GCTGGATTGGTTGTTGAAGGATGGGATTGCTTCAATGCCATGGAAAGAGATTTTCAGCTGAAACCAAGATCCGAACATTATGCGTGTATGGTCGATCTCTTTGGTCGAGT
TGGAAAGCTTAAGGAAGCTCATGATTTTATCTCAAAAATGCCAATTAGGCCCAATGCTGGTGTTTGGGGTGCATTGCTTGGGGCATGTAGAATACATTCAAATATAGAAA
TGGCTGAAGTTGCTGCAAAGCACTTACTTGAGTTGGACTCGGAGAATCCCGGTAGATATGTTCTCTTGTACAATATTTATTTGTCATCTGGAAAAAGAAAAGAAGCAATT
CATATTAGGGCTCTTATGAAACAGAGAGGTTTGAGAAAAATTGCAGGCCACACAACCATTGAGATTAAAAACAAGATTCATACATTTGTGGCTGGGGATCAATCTCATCC
ACAAACTGAAATGATATACTCTGAGTTGGATAAAGTGATGTATAGGCTGCAAGAAAAAGGGTACACCCCTGATTTGAACTTTGTATTGCATGATGTGGAGGAAGAGACAA
AGGAAAAGATGTTGTGTGTGCATAGTGAGAAGCTTGCTATTGTTTTCGGGCTCTTGAACTCAGGATCAGGGACCATCATTAGACTACGGAAGAATCTCCGAGTTTGTGGC
GATTGTCATTCATTTACAAAGTTCGTGTCAAATGTTGCAGGAAGAGATATTATAGTTAGAGACTCTCATAGGTTTCACCATTTCAAGGAGGGAACTTGCTCGTGTGGGGA
TTATTGGTGA
Protein sequence
MILIRCWSLLLPRRLYHLNLQLPILQHTIDSNSCTFYIKKCSTIKSLKCVHASILRAHLHLNLFFCTTLISQYVLLGSVSCAYSLFSLLRSLDVFLWNVMLRGFVDARLY
RRAVILYAQMLDLGIQPDNFTFPFVLKACRSTEDLDFGARVHHNAVVFGYQLDVFVANSLIAMYGRCGRLKLAQEVFDKMHGRNVVSWSSIIGAYAYNGQYVVGVSLFSQ
MLLQGFEPNRPVVLNVMACIHAEKEADDICRVVMNHKLGLDQSVQNAAVGMYARCGRIDLAQKIFNGIHNKDLVSWASMIEAYVQGDLPLKALEIFRKMIQKSIRPDSIT
LLGVIHACLALGSFSNACSIHGFVMRRFLRNQIVVETAILDLYVKCGSLIYARNVFDNMQERNVISWSTMISGYGLHGQGREAIRLFDEMKNSTNPDHVTLVSILAACSH
AGLVVEGWDCFNAMERDFQLKPRSEHYACMVDLFGRVGKLKEAHDFISKMPIRPNAGVWGALLGACRIHSNIEMAEVAAKHLLELDSENPGRYVLLYNIYLSSGKRKEAI
HIRALMKQRGLRKIAGHTTIEIKNKIHTFVAGDQSHPQTEMIYSELDKVMYRLQEKGYTPDLNFVLHDVEEETKEKMLCVHSEKLAIVFGLLNSGSGTIIRLRKNLRVCG
DCHSFTKFVSNVAGRDIIVRDSHRFHHFKEGTCSCGDYW
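
As a quick consistency check on the sequences listed above, the first and last codons of the CDS can be translated and compared with the two ends of the protein sequence. The following is a minimal sketch, assuming the standard nuclear genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB, and only the first 30 and last 24 nucleotides of the CDS are copied in.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each position (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    if len(nt) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, len(nt), 3))


# Excerpts copied from the CDS sequence above (not the full sequence).
CDS_START = "ATGATTCTGATACGTTGTTGGAGCCTCCTT"  # first 30 nt
CDS_END = "TGCTCGTGTGGGGATTATTGGTGA"          # last 24 nt, including the stop codon

# Matching excerpts from the protein sequence above.
PROTEIN_START = "MILIRCWSLL"
PROTEIN_END = "CSCGDYW"

start_ok = translate(CDS_START) == PROTEIN_START
end_ok = translate(CDS_END) == PROTEIN_END + "*"
print("N-terminus matches:", start_ok)
print("C-terminus matches:", end_ok)
```

The same `translate` helper could be applied to the full CDS after joining its lines into one string. The excerpt check only confirms that both ends of the CDS are in frame with the listed protein.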