CuGenDBv2

Chy2G040100 (gene) of Cucumber (hystrix) v1 genome

Gene ID: Chy2G040100
Organism: Cucumis hystrix (Cucumber (hystrix) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chrH02:18341933..18343954
RNA-Seq Expression: Chy2G040100
Synteny: Chy2G040100
Gene Ontology terms:
  GO:0009451 - RNA modification (biological process)
  GO:0043231 - intracellular membrane-bounded organelle (cellular component)
  GO:0003723 - RNA binding (molecular function)
  GO:0005515 - protein binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR032867 - DYW domain


Homology
GenBank top hits (e value, % identity, alignment)
TYJ96962.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 0.0; % identity: 94.42)
Query:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ
        MLKQRYLDCRRIT  LLATSAS LPAAPSN  DYQ+L+RVLEACRLF MNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQ VGCLNRVH+LLDILC K 
Subjt:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLG+LTACNHGGLID GRRYFELM+S Y
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMP+EPDVVTWRTLLSGC+IYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_008443769.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50990 [Cucumis melo] (e value: 0.0; % identity: 94.42)
Query:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ
        MLKQRYLDCRRIT  LLATSAS LPAAPSN  DYQ+L+RVLEACRLF MNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQ VGCLNRVH+LLDILC K 
Subjt:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLG+LTACNHGGLID GRRYFELM+S Y
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH KSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_011660258.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucumis sativus] (e value: 0.0; % identity: 97.96)
Query:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ
        MLKQRYLDCRRITF LLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQRVGCLNRVHQLLDILC KQ
Subjt:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY
        ELNSLL+CALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLGVLTACNHG LID GRRYFELMKSHY
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_031736189.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucumis sativus] (e value: 0.0; % identity: 97.83)
Query:  DYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFR
        +YQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQRVGCLNRVHQLLDILC KQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFR
Subjt:  DYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFR

Query:  DVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPH
        DVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLL+CALIDAYSKCGSIQIAKEIFSNIPH
Subjt:  DVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPH

Query:  SDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTM
        SDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLGVLTACNHG LID GRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TM
Subjt:  SDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTM

Query:  PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIE
        PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIE
Subjt:  PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIE

Query:  KVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC
        KVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC
Subjt:  KVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC

Query:  SCGDRW
        SCGDRW
Subjt:  SCGDRW

XP_031736190.1 pentatricopeptide repeat-containing protein At5g50990 isoform X3 [Cucumis sativus] (e value: 0.0; % identity: 97.96)
Query:  MNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNA
        MNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQRVGCLNRVHQLLDILC KQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNA
Subjt:  MNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNA

Query:  RYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIH
        RYDEAFRFFR+MLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLL+CALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIH
Subjt:  RYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIH

Query:  GLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGC
        GLAMDALSLFLRMEHE+VLPDAITFLGVLTACNHG LID GRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMPIEPDVVTWRTLLSGC
Subjt:  GLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGC

Query:  RIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYM
        RIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYM
Subjt:  RIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYM

Query:  PVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        PVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  PVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXZ9 DYW_deaminase domain-containing protein (e value: 0.0e+00; % identity: 97.96)
Query:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ
        MLKQRYLDCRRITF LLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQRVGCLNRVHQLLDILC KQ
Subjt:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY
        ELNSLL+CALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLGVLTACNHG LID GRRYFELMKSHY
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A1S4DV22 pentatricopeptide repeat-containing protein At5g50990 (e value: 5.2e-301; % identity: 94.42)
Query:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ
        MLKQRYLDCRRIT  LLATSAS LPAAPSN  DYQ+L+RVLEACRLF MNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQ VGCLNRVH+LLDILC K 
Subjt:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLG+LTACNHGGLID GRRYFELM+S Y
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH KSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A5A7T674 Pentatricopeptide repeat-containing protein (e value: 5.2e-301; % identity: 94.42)
Query:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ
        MLKQRYLDCRRIT  LLATSAS LPAAPSN  DYQ+L+RVLEACRLF MNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQ VGCLNRVH+LLDILC K 
Subjt:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLG+LTACNHGGLID GRRYFELM+S Y
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH KSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A5D3BAU8 Pentatricopeptide repeat-containing protein (e value: 1.1e-301; % identity: 94.42)
Query:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ
        MLKQRYLDCRRIT  LLATSAS LPAAPSN  DYQ+L+RVLEACRLF MNSKTVIETHARIIKFGYGNYPTLI+SLVSTYQ VGCLNRVH+LLDILC K 
Subjt:  MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFR+MLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLG+LTACNHGGLID GRRYFELM+S Y
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFL+EAYSLI+TMP+EPDVVTWRTLLSGC+IYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A6J1E1Y0 pentatricopeptide repeat-containing protein At5g50990 isoform X1 (e value: 2.7e-249; % identity: 82.38)
Query:  YQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRD
        YQ+LYRVLEACRL   NSKT  ETHAR+IKFGYGNYPTL++SLVS YQR  CLNRVHQLL++LC K LDLVAMNL I NFMKIGECK AK+VF KMP+RD
Subjt:  YQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRD

Query:  VVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHS
        VVTWNSIIGGCVKNARY+EAF+FFR+ML SNIQPDGFTFAS+LNACAQLGAPSNT WVHA MTQKKIELNS+LSCALIDAYSKCGSIQIAKEIFS++P S
Subjt:  VVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHS

Query:  DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMP
        + SVWN MIKGLAIHGL+MDALS+F  ME ENVLPDA+TFLG+LTACNHGGLI++GRR+F+ MK+ YSIQPQLEHYGVMVDLYSRAGFL+EAYS+I+ MP
Subjt:  DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMP

Query:  IEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEK
        IE DVVTWR LLSGCRIY+N +LAEVAIANMSHR SGDYVLLSNIYCSLNRW+ AE VR+ MK N VRK  GKSWIELGG+IQ FKSGDR HPESDA+ K
Subjt:  IEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEK

Query:  VLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        V+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEK+ALAYAILKT PGAKISISKNLR+CDDCH WIKLVS +LCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDRW
        CGDRW
Subjt:  CGDRW

SwissProt top hits (e value, % identity, alignment)
A8MQA3 Pentatricopeptide repeat-containing protein At4g21065 (e value: 7.2e-106; % identity: 43.18)
Query:  LVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIEL
        +   N L+  +   G+   A KVF KMP +D+V WNS+I G  +N + +EA   +  M +  I+PDGFT  S+L+ACA++GA +    VH  M +  +  
Subjt:  LVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIEL

Query:  NSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYS
        N   S  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +LP  ITF+G+L AC+H G++  G  YF  M+  Y 
Subjt:  NSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYS

Query:  IQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKIN
        I+P++EH+G MVDL +RAG + +AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +    SGDYVLLSN+Y S  RW + + +RK M  +
Subjt:  IQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKIN

Query:  RVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRIC
         V+K  G S +E+G  +  F  GD+ HP+SDAI   L  +  R R+EGY+P    V++D+ EEEKE  + +HSEK+A+A+ ++ T   + I++ KNLR+C
Subjt:  RVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRIC

Query:  DDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
         DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  DDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q683I9 Pentatricopeptide repeat-containing protein At3g62890 (e value: 3.2e-106; % identity: 40.24)
Query:  THARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF
        THA+I+ FG    P + +SL++ Y   G L    ++ D    K  DL A N ++  + K G    A+K+F +MP R+V++W+ +I G V   +Y EA   
Subjt:  THARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF

Query:  FRRMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL
        FR M         ++P+ FT +++L+AC +LGA     WVHA + +  +E++ +L  ALID Y+KCGS++ AK +F+ +    D   ++ MI  LA++GL
Subjt:  FRRMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL

Query:  AMDALSLFLRM-EHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCR
          +   LF  M   +N+ P+++TF+G+L AC H GLI+ G+ YF++M   + I P ++HYG MVDLY R+G + EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALSLFLRM-EHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCR

Query:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG
        +  + K  E A   +  +    SG YVLLSN+Y    RW E + +R  M++  + K  G S++E+ G +  F  GD    ES+ I  +L  +M+R R  G
Subjt:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        Y+  T+ V +D++E++KE  LS+HSEK+A+A+ ++KT PG  + I KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FI49 Pentatricopeptide repeat-containing protein At5g50990 (e value: 8.9e-157; % identity: 52.33)
Query:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV
        ++ SN  D+  L +VLE+C+    NSK V++ HA+I K GYG YP+L+ S V+ Y+R        +LL         +  +NL+I + MKIGE   AKKV
Subjt:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV

Query:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAK
              ++V+TWN +IGG V+N +Y+EA +  + ML+ ++I+P+ F+FAS L ACA+LG   +  WVH+ M    IELN++LS AL+D Y+KCG I  ++
Subjt:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAK

Query:  EIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDE
        E+F ++  +D S+WN MI G A HGLA +A+ +F  ME E+V PD+ITFLG+LT C+H GL++ G+ YF LM   +SIQP+LEHYG MVDL  RAG + E
Subjt:  EIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDE

Query:  AYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRL
        AY LI +MPIEPDVV WR+LLS  R YKN +L E+AI N+S  KSGDYVLLSNIY S  +W+ A+ VR++M    +RK +GKSW+E GG I  FK+GD  
Subjt:  AYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRL

Query:  HPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRF
        H E+ AI KVL  L+++T+++G++  T+LV MD+SEEEKEENL++HSEK+ALAY ILK+SPG +I I KN+R+C DCH WIK VS++L RVI++RDRIRF
Subjt:  HPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRF

Query:  HQFEGGMCSCGDRW
        H+FE G+CSC D W
Subjt:  HQFEGGMCSCGDRW

Q9FI80 Pentatricopeptide repeat-containing protein At5g48910 (e value: 4.5e-108; % identity: 39.96)
Query:  ETHARIIKFGYGNYPTLISSLVSTYQRVGCL---------NRVHQLLDILCPKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG
        + H   +K+G+G    ++S+LV  Y   G +         N + + + ++  ++    ++V  N++I  +M++G+CK A+ +F KM  R VV+WN++I G
Subjt:  ETHARIIKFGYGNYPTLISSLVSTYQRVGCL---------NRVHQLLDILCPKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG

Query:  CVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK
           N  + +A   FR M   +I+P+  T  S+L A ++LG+     W+H       I ++ +L  ALID YSKCG I+ A  +F  +P  +   W+ MI 
Subjt:  CVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK

Query:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRT
        G AIHG A DA+  F +M    V P  + ++ +LTAC+HGGL++ GRRYF  M S   ++P++EHYG MVDL  R+G LDEA   IL MPI+PD V W+ 
Subjt:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRT

Query:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK
        LL  CR+  N ++ + VA  + +M    SG YV LSN+Y S   W E   +R  MK   +RK  G S I++ G +  F   D  HP++  I  +L  +  
Subjt:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK

Query:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH+ IKL+S+V  R I VRDR RFH F+ G CSC D W
Subjt:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 (e value: 8.5e-107; % identity: 38.61)
Query:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV
        +AP N+  + SL   L+AC       +T  + HA+I K GY N    ++SL+++Y   G     H L D +   + D V+ N +I  ++K G+   A  +
Subjt:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV

Query:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKE
        F KM  ++ ++W ++I G V+     EA + F  M  S+++PD  + A+ L+ACAQLGA     W+H+ + + +I ++S+L C LID Y+KCG ++ A E
Subjt:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKE

Query:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEA
        +F NI       W  +I G A HG   +A+S F+ M+   + P+ ITF  VLTAC++ GL++ G+  F  M+  Y+++P +EHYG +VDL  RAG LDEA
Subjt:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEA

Query:  YSLILTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSG
           I  MP++P+ V W  LL  CRI+KN     ++ E+ IA +     G YV  +NI+    +W +A   R++MK   V K  G S I L GT   F +G
Subjt:  YSLILTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSG

Query:  DRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRD
        DR HPE + I+     + ++    GY+P  E + +D + ++E+E  +  HSEK+A+ Y ++KT PG  I I KNLR+C DCH   KL+S++  R IV+RD
Subjt:  DRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRD

Query:  RIRFHQFEGGMCSCGDRW
        R RFH F  G CSCGD W
Subjt:  RIRFHQFEGGMCSCGDRW

Arabidopsis top hits (e value, % identity, alignment)
AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 2.3e-107; % identity: 40.24)
Query:  THARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF
        THA+I+ FG    P + +SL++ Y   G L    ++ D    K  DL A N ++  + K G    A+K+F +MP R+V++W+ +I G V   +Y EA   
Subjt:  THARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF

Query:  FRRMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL
        FR M         ++P+ FT +++L+AC +LGA     WVHA + +  +E++ +L  ALID Y+KCGS++ AK +F+ +    D   ++ MI  LA++GL
Subjt:  FRRMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL

Query:  AMDALSLFLRM-EHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCR
          +   LF  M   +N+ P+++TF+G+L AC H GLI+ G+ YF++M   + I P ++HYG MVDLY R+G + EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALSLFLRM-EHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCR

Query:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG
        +  + K  E A   +  +    SG YVLLSN+Y    RW E + +R  M++  + K  G S++E+ G +  F  GD    ES+ I  +L  +M+R R  G
Subjt:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        Y+  T+ V +D++E++KE  LS+HSEK+A+A+ ++KT PG  + I KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 5.1e-107; % identity: 43.18)
Query:  LVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIEL
        +   N L+  +   G+   A KVF KMP +D+V WNS+I G  +N + +EA   +  M +  I+PDGFT  S+L+ACA++GA +    VH  M +  +  
Subjt:  LVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIEL

Query:  NSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYS
        N   S  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +LP  ITF+G+L AC+H G++  G  YF  M+  Y 
Subjt:  NSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYS

Query:  IQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKIN
        I+P++EH+G MVDL +RAG + +AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +    SGDYVLLSN+Y S  RW + + +RK M  +
Subjt:  IQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKIN

Query:  RVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRIC
         V+K  G S +E+G  +  F  GD+ HP+SDAI   L  +  R R+EGY+P    V++D+ EEEKE  + +HSEK+A+A+ ++ T   + I++ KNLR+C
Subjt:  RVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRIC

Query:  DDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
         DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  DDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.2e-109; % identity: 39.96)
Query:  ETHARIIKFGYGNYPTLISSLVSTYQRVGCL---------NRVHQLLDILCPKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG
        + H   +K+G+G    ++S+LV  Y   G +         N + + + ++  ++    ++V  N++I  +M++G+CK A+ +F KM  R VV+WN++I G
Subjt:  ETHARIIKFGYGNYPTLISSLVSTYQRVGCL---------NRVHQLLDILCPKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG

Query:  CVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK
           N  + +A   FR M   +I+P+  T  S+L A ++LG+     W+H       I ++ +L  ALID YSKCG I+ A  +F  +P  +   W+ MI 
Subjt:  CVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK

Query:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRT
        G AIHG A DA+  F +M    V P  + ++ +LTAC+HGGL++ GRRYF  M S   ++P++EHYG MVDL  R+G LDEA   IL MPI+PD V W+ 
Subjt:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLILTMPIEPDVVTWRT

Query:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK
        LL  CR+  N ++ + VA  + +M    SG YV LSN+Y S   W E   +R  MK   +RK  G S I++ G +  F   D  HP++  I  +L  +  
Subjt:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK

Query:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH+ IKL+S+V  R I VRDR RFH F+ G CSC D W
Subjt:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT5G50990.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.4e-158 | %identity: 52.33
Query:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV
        ++ SN  D+  L +VLE+C+    NSK V++ HA+I K GYG YP+L+ S V+ Y+R        +LL         +  +NL+I + MKIGE   AKKV
Subjt:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV

Query:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAK
              ++V+TWN +IGG V+N +Y+EA +  + ML+ ++I+P+ F+FAS L ACA+LG   +  WVH+ M    IELN++LS AL+D Y+KCG I  ++
Subjt:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAK

Query:  EIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDE
        E+F ++  +D S+WN MI G A HGLA +A+ +F  ME E+V PD+ITFLG+LT C+H GL++ G+ YF LM   +SIQP+LEHYG MVDL  RAG + E
Subjt:  EIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDE

Query:  AYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRL
        AY LI +MPIEPDVV WR+LLS  R YKN +L E+AI N+S  KSGDYVLLSNIY S  +W+ A+ VR++M    +RK +GKSW+E GG I  FK+GD  
Subjt:  AYSLILTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRL

Query:  HPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRF
        H E+ AI KVL  L+++T+++G++  T+LV MD+SEEEKEENL++HSEK+ALAY ILK+SPG +I I KN+R+C DCH WIK VS++L RVI++RDRIRF
Subjt:  HPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRF

Query:  HQFEGGMCSCGDRW
        H+FE G+CSC D W
Subjt:  HQFEGGMCSCGDRW

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.0e-108 | %identity: 38.61
Query:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV
        +AP N+  + SL   L+AC       +T  + HA+I K GY N    ++SL+++Y   G     H L D +   + D V+ N +I  ++K G+   A  +
Subjt:  AAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLIGNFMKIGECKFAKKV

Query:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKE
        F KM  ++ ++W ++I G V+     EA + F  M  S+++PD  + A+ L+ACAQLGA     W+H+ + + +I ++S+L C LID Y+KCG ++ A E
Subjt:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKE

Query:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEA
        +F NI       W  +I G A HG   +A+S F+ M+   + P+ ITF  VLTAC++ GL++ G+  F  M+  Y+++P +EHYG +VDL  RAG LDEA
Subjt:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEA

Query:  YSLILTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSG
           I  MP++P+ V W  LL  CRI+KN     ++ E+ IA +     G YV  +NI+    +W +A   R++MK   V K  G S I L GT   F +G
Subjt:  YSLILTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSG

Query:  DRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRD
        DR HPE + I+     + ++    GY+P  E + +D + ++E+E  +  HSEK+A+ Y ++KT PG  I I KNLR+C DCH   KL+S++  R IV+RD
Subjt:  DRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRD

Query:  RIRFHQFEGGMCSCGDRW
        R RFH F  G CSCGD W
Subjt:  RIRFHQFEGGMCSCGDRW
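
The %identity figures in the hit rows can be roughly cross-checked from the alignment blocks. The following is a minimal sketch, assuming the middle row of each block follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that the `Query:  ` prefix has already been stripped from each row. The function name `percent_identity` is hypothetical and not part of CuGenDB.

```python
def percent_identity(query_rows, midline_rows):
    """Estimate percent identity from BLAST-style alignment rows.

    query_rows   -- query strings of each block (gaps '-' included)
    midline_rows -- matching middle rows; letters mark identical columns
    """
    identical = 0
    aligned = 0
    for query, midline in zip(query_rows, midline_rows):
        # Trailing spaces of the midline are often trimmed on export,
        # so pad it back to the width of the query row.
        midline = midline.ljust(len(query))[:len(query)]
        aligned += len(query)
        identical += sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned if aligned else 0.0


# Toy example (not taken from the record): 6 identical columns out of 10.
print(percent_identity(["MLKQR", "AC-DE"], ["MLK R", "A  D"]))  # 60.0
```

The value computed this way covers only the rows supplied, so it matches the reported %identity only when every block of a hit is included.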


Sequences
CDS sequence
ATGCTGAAGCAGAGATACTTGGACTGCAGAAGGATTACTTTTGACCTTCTTGCTACCTCTGCCTCTGGTTTACCAGCGGCGCCTTCGAATTCCAAAGATTATCAAAGTCT
TTACCGTGTTCTTGAAGCCTGCAGACTCTTCCACATGAATTCCAAAACTGTTATCGAAACGCATGCACGAATTATTAAATTTGGATATGGAAACTACCCTACTCTCATCT
CCTCTCTAGTATCTACTTATCAACGTGTTGGTTGCCTTAATCGGGTTCATCAACTTCTTGATATACTCTGCCCTAAGCAGCTTGATTTAGTTGCAATGAACTTACTCATT
GGAAACTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGGTATTTTATAAAATGCCTTTCCGTGATGTGGTAACATGGAACTCAATCATTGGAGGTTGTGTAAAAAA
TGCACGGTATGATGAGGCATTTAGATTCTTTAGACGGATGCTCACGTCAAATATTCAGCCGGATGGATTTACATTTGCTTCTATATTGAATGCATGTGCTCAACTCGGAG
CTCCTAGTAACACTCATTGGGTTCATGCTCAGATGACTCAGAAAAAAATTGAGCTTAATTCTTTATTAAGTTGTGCACTCATAGACGCATACTCAAAATGTGGTAGCATC
CAAATTGCAAAGGAAATATTTAGTAATATTCCTCACAGTGATACTTCAGTTTGGAATGTGATGATCAAAGGGCTTGCGATTCATGGGCTTGCAATGGATGCATTATCGTT
ATTTTTGAGGATGGAGCATGAGAATGTTCTGCCTGATGCCATCACCTTTTTGGGTGTTTTAACAGCTTGCAACCATGGTGGTTTAATTGACCGTGGTCGTAGATATTTTG
AGCTGATGAAAAGTCATTATTCAATTCAGCCGCAGCTTGAGCATTATGGTGTCATGGTTGATCTTTATAGTCGAGCTGGGTTTCTGGACGAGGCCTATTCTCTAATTCTG
ACAATGCCGATAGAGCCTGATGTTGTCACGTGGCGGACACTTTTAAGTGGTTGTAGAATTTACAAAAATCATAAACTTGCAGAAGTTGCTATTGCAAATATGTCTCATCG
TAAGAGTGGGGATTATGTATTATTATCAAATATATATTGTTCTCTCAACAGATGGAAGGAAGCAGAAACAGTTAGAAAGATGATGAAAATCAATAGAGTTCGTAAGAAAC
GTGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTCAACACTTCAAGTCAGGTGATCGATTGCATCCAGAAAGCGATGCAATAGAAAAAGTGCTATGCAGTTTGATGAAG
AGAACTCGGACGGAGGGATATATGCCTGTGACGGAGTTGGTTTTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTATCATTTCATAGCGAAAAGATGGCATTGGC
TTATGCGATCTTGAAAACTAGTCCTGGGGCAAAGATCAGTATATCAAAGAACCTGCGGATCTGTGATGATTGTCATACATGGATTAAATTAGTTTCAAGAGTGCTGTGTA
GAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAGGTGGCATGTGTTCTTGTGGTGATCGTTGGTAG
mRNA sequence
ATGCTGAAGCAGAGATACTTGGACTGCAGAAGGATTACTTTTGACCTTCTTGCTACCTCTGCCTCTGGTTTACCAGCGGCGCCTTCGAATTCCAAAGATTATCAAAGTCT
TTACCGTGTTCTTGAAGCCTGCAGACTCTTCCACATGAATTCCAAAACTGTTATCGAAACGCATGCACGAATTATTAAATTTGGATATGGAAACTACCCTACTCTCATCT
CCTCTCTAGTATCTACTTATCAACGTGTTGGTTGCCTTAATCGGGTTCATCAACTTCTTGATATACTCTGCCCTAAGCAGCTTGATTTAGTTGCAATGAACTTACTCATT
GGAAACTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGGTATTTTATAAAATGCCTTTCCGTGATGTGGTAACATGGAACTCAATCATTGGAGGTTGTGTAAAAAA
TGCACGGTATGATGAGGCATTTAGATTCTTTAGACGGATGCTCACGTCAAATATTCAGCCGGATGGATTTACATTTGCTTCTATATTGAATGCATGTGCTCAACTCGGAG
CTCCTAGTAACACTCATTGGGTTCATGCTCAGATGACTCAGAAAAAAATTGAGCTTAATTCTTTATTAAGTTGTGCACTCATAGACGCATACTCAAAATGTGGTAGCATC
CAAATTGCAAAGGAAATATTTAGTAATATTCCTCACAGTGATACTTCAGTTTGGAATGTGATGATCAAAGGGCTTGCGATTCATGGGCTTGCAATGGATGCATTATCGTT
ATTTTTGAGGATGGAGCATGAGAATGTTCTGCCTGATGCCATCACCTTTTTGGGTGTTTTAACAGCTTGCAACCATGGTGGTTTAATTGACCGTGGTCGTAGATATTTTG
AGCTGATGAAAAGTCATTATTCAATTCAGCCGCAGCTTGAGCATTATGGTGTCATGGTTGATCTTTATAGTCGAGCTGGGTTTCTGGACGAGGCCTATTCTCTAATTCTG
ACAATGCCGATAGAGCCTGATGTTGTCACGTGGCGGACACTTTTAAGTGGTTGTAGAATTTACAAAAATCATAAACTTGCAGAAGTTGCTATTGCAAATATGTCTCATCG
TAAGAGTGGGGATTATGTATTATTATCAAATATATATTGTTCTCTCAACAGATGGAAGGAAGCAGAAACAGTTAGAAAGATGATGAAAATCAATAGAGTTCGTAAGAAAC
GTGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTCAACACTTCAAGTCAGGTGATCGATTGCATCCAGAAAGCGATGCAATAGAAAAAGTGCTATGCAGTTTGATGAAG
AGAACTCGGACGGAGGGATATATGCCTGTGACGGAGTTGGTTTTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTATCATTTCATAGCGAAAAGATGGCATTGGC
TTATGCGATCTTGAAAACTAGTCCTGGGGCAAAGATCAGTATATCAAAGAACCTGCGGATCTGTGATGATTGTCATACATGGATTAAATTAGTTTCAAGAGTGCTGTGTA
GAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAGGTGGCATGTGTTCTTGTGGTGATCGTTGGTAG
Protein sequence
MLKQRYLDCRRITFDLLATSASGLPAAPSNSKDYQSLYRVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLISSLVSTYQRVGCLNRVHQLLDILCPKQLDLVAMNLLI
GNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRRMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSI
QIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGVLTACNHGGLIDRGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLDEAYSLIL
TMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK
RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
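
As a consistency check between the CDS and protein sequences listed above, the CDS can be translated and compared with the protein. The following is a minimal sketch, assuming the standard nuclear genetic code and a CDS already joined into a single string with line breaks removed. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base cycling fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases and last 15 bases of the CDS listed above.
print(translate("ATGCTGAAGCAGAGATACTTGGACTGCAGAAGG"))  # MLKQRYLDCRR
print(translate("GGTGATCGTTGGTAG"))                    # GDRW
```

Both fragments agree with the start (`MLKQRYLDCRR`) and end (`GDRW`) of the protein sequence in this record.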