CuGenDBv2

CSPI01G33040 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G33040
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr1:27925425..27931609
RNA-Seq Expression: CSPI01G33040
Synteny: CSPI01G33040
Gene Ontology terms:
GO:0009451 - RNA modification (biological process)
GO:0015074 - DNA integration (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000953 - Chromo/chromo shadow domain
IPR001584 - Integrase, catalytic core
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR032867 - DYW domain
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
TYJ96962.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 4.7e-306; % identity: 95.22)
Query:  TDRAEVMLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLD
        TDRAEVMLKQRYLDCRRIT ALLATSAS LPAAPSN  DYQ+L+ VLEACRLF MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLNRVH+LLD
Subjt:  TDRAEVMLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLD

Query:  ILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQ
        ILCSK LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQ
Subjt:  ILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQ

Query:  MTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFE
        MTQKKIELNSLL+CALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHGGLIDHGRRYFE
Subjt:  MTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFE

Query:  LMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKM
        LM+S YSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMP+EPDVVTWRTLLSGC+IYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRW+EAETVRKM
Subjt:  LMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKM

Query:  MKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKN
        MKINRVRKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKN
Subjt:  MKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKN

Query:  LRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        LRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  LRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

TYJ96962.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 3.7e-08; % identity: 57.97)
Query:  SLSLINVDGKNPWILDSGATDHLIGSSENFVSYILCTGIERTKIANGSLGSLNYCNLFLKPFQLLLSYS
        SLS INVD KNPWILD  AT+HLIGSSENFVSYI   G E+ +I NGSL  + +    + PF+ L  Y+
Subjt:  SLSLINVDGKNPWILDSGATDHLIGSSENFVSYILCTGIERTKIANGSLGSLNYCNLFLKPFQLLLSYS

TYJ96962.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] (e value: 7.1e-302; % identity: 95.17)
Query:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ
        MLKQRYLDCRRIT ALLATSAS LPAAPSN  DYQ+L+ VLEACRLF MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLNRVH+LLDILCSK 
Subjt:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY
        ELNSLL+CALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHGGLIDHGRRYFELM+S Y
Subjt:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH KSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_011660258.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucumis sativus] (e value: 0.0e+00; % identity: 99.44)
Query:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ
        MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLY VLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ
Subjt:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY
        ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHG LIDHGRRYFELMKSHY
Subjt:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_031736189.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucumis sativus] (e value: 3.8e-295; % identity: 99.21)
Query:  DYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFR
        +YQSLY VLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFR
Subjt:  DYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFR

Query:  DVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPH
        DVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPH
Subjt:  DVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPH

Query:  SDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTM
        SDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHG LIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTM
Subjt:  SDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTM

Query:  PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIE
        PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIE
Subjt:  PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIE

Query:  KVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC
        KVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC
Subjt:  KVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC

Query:  SCGDRW
        SCGDRW
Subjt:  SCGDRW

XP_031736190.1 pentatricopeptide repeat-containing protein At5g50990 isoform X3 [Cucumis sativus] (e value: 3.2e-286; % identity: 99.59)
Query:  MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNA
        MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNA
Subjt:  MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNA

Query:  RYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIH
        RYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIH
Subjt:  RYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIH

Query:  GLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC
        GLAMDALSLFLRMEHESVLPDAITFLGVLTACNHG LIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC
Subjt:  GLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC

Query:  RIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYM
        RIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYM
Subjt:  RIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYM

Query:  PVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        PVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  PVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXZ9 DYW_deaminase domain-containing protein (e value: 0.0e+00; % identity: 99.44)
Query:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ
        MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLY VLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ
Subjt:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY
        ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHG LIDHGRRYFELMKSHY
Subjt:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS RKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A5A7T674 Pentatricopeptide repeat-containing protein (e value: 3.4e-302; % identity: 95.17)
Query:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ
        MLKQRYLDCRRIT ALLATSAS LPAAPSN  DYQ+L+ VLEACRLF MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLNRVH+LLDILCSK 
Subjt:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY
        ELNSLL+CALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHGGLIDHGRRYFELM+S Y
Subjt:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH KSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A5D3BAU8 Pentatricopeptide repeat-containing protein (e value: 2.3e-306; % identity: 95.22)
Query:  TDRAEVMLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLD
        TDRAEVMLKQRYLDCRRIT ALLATSAS LPAAPSN  DYQ+L+ VLEACRLF MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLNRVH+LLD
Subjt:  TDRAEVMLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLD

Query:  ILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQ
        ILCSK LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQ
Subjt:  ILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQ

Query:  MTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFE
        MTQKKIELNSLL+CALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHGGLIDHGRRYFE
Subjt:  MTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFE

Query:  LMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKM
        LM+S YSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMP+EPDVVTWRTLLSGC+IYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRW+EAETVRKM
Subjt:  LMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKM

Query:  MKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKN
        MKINRVRKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKN
Subjt:  MKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKN

Query:  LRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        LRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  LRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A5D3BAU8 Pentatricopeptide repeat-containing protein (e value: 1.8e-08; % identity: 57.97)
Query:  SLSLINVDGKNPWILDSGATDHLIGSSENFVSYILCTGIERTKIANGSLGSLNYCNLFLKPFQLLLSYS
        SLS INVD KNPWILD  AT+HLIGSSENFVSYI   G E+ +I NGSL  + +    + PF+ L  Y+
Subjt:  SLSLINVDGKNPWILDSGATDHLIGSSENFVSYILCTGIERTKIANGSLGSLNYCNLFLKPFQLLLSYS

A0A5D3BAU8 Pentatricopeptide repeat-containing protein (e value: 3.4e-302; % identity: 95.17)
Query:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ
        MLKQRYLDCRRIT ALLATSAS LPAAPSN  DYQ+L+ VLEACRLF MNSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLNRVH+LLDILCSK 
Subjt:  MLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQ

Query:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY
        ELNSLL+CALIDAYSKCGSIQIAKEIFSN+PHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHGGLIDHGRRYFELM+S Y
Subjt:  ELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHY

Query:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV
        SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH KSGDYVLLSNIYCSLNRW+EAETVRKMMKINRV
Subjt:  SIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRV

Query:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDRLHPESDAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A6J1JL20 pentatricopeptide repeat-containing protein At5g50990 isoform X2 (e value: 2.6e-249; % identity: 71.29)
Query:  TDRAEVMLKQRYLDCRRITFALLATSASGLP------------------------------------------------------AAPSNSKD-------
        T + ++  KQRYLD +RIT ALLATSA  LP                                                      + P NS +       
Subjt:  TDRAEVMLKQRYLDCRRITFALLATSASGLP------------------------------------------------------AAPSNSKD-------

Query:  --------YQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKV
                YQ+LY VLEACRL   NSKT  ETHAR+IKFGYGNYPTL+ SLVS YQR  CLNRVHQLL++LCSK LDLVAMNL I NFMKIGECK AK+V
Subjt:  --------YQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKV

Query:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKE
        F KMP+RDVVTWNSIIGGCVKNARY+EAF+FFRQML SNIQPDGFTFAS+LNACAQLGAPSNT WVHA MTQKKI+LNS+L+CALIDAYSKCGSIQIAKE
Subjt:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKE

Query:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEA
        IFS++P S+ SVWN MIKGLAIHGL+MDALS+F  ME E+VLPDA+TFLG+LTACNHGGLI+ GRR+F+ MK+ YSIQPQLEHYGVMVDLYSRAGFLEEA
Subjt:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLH
        YS+IV MPIE DVVTWR LLSGCRIY+N +LAEVAIANMSHR SGDYVLLSNIYCSLNRW+ AE VR+ MK N VRK  GKSWIELGG+IQ F+SGDR H
Subjt:  YSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLH

Query:  PESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFH
        PESDA+ KV+  LMKR+R+EGYMPVT+LV MDISEEEKEENLS+HSEK+ALAYAILKTSPGAKISISKNLR+CDDCH WIK+VS +LCRV+VVRDRIRFH
Subjt:  PESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFH

Query:  QFEGGMCSCGDRW
        QFEGGMCSCGDRW
Subjt:  QFEGGMCSCGDRW

SwissProt top hits (e value, % identity, alignment)
A8MQA3 Pentatricopeptide repeat-containing protein At4g21065 (e value: 2.0e-105; % identity: 41.77)
Query:  YPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPD
        YP LI + V+T   V     +H ++ I       +   N L+  +   G+   A KVF KMP +D+V WNS+I G  +N + +EA   + +M +  I+PD
Subjt:  YPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPD

Query:  GFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ESVL
        GFT  S+L+ACA++GA +    VH  M +  +  N   +  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +L
Subjt:  GFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ESVL

Query:  PDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANM
        P  ITF+G+L AC+H G++  G  YF  M+  Y I+P++EH+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +
Subjt:  PDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANM

Query:  SHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKE
            SGDYVLLSN+Y S  RW + + +RK M  + V+K  G S +E+G  +  F  GD+ HP+SDAI   L  +  R R+EGY+P    V++D+ EEEKE
Subjt:  SHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKE

Query:  ENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
          + +HSEK+A+A+ ++ T   + I++ KNLR+C DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  ENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q683I9 Pentatricopeptide repeat-containing protein At3g62890 (e value: 1.2e-105; % identity: 40.24)
Query:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF
        THA+I+ FG    P +  SL++ Y   G L    ++ D   SK  DL A N ++  + K G    A+K+F +MP R+V++W+ +I G V   +Y EA   
Subjt:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF

Query:  FRQMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL
        FR+M         ++P+ FT +++L+AC +LGA     WVHA + +  +E++ +L  ALID Y+KCGS++ AK +F+ +    D   ++ MI  LA++GL
Subjt:  FRQMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL

Query:  AMDALSLFLRM-EHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR
          +   LF  M   +++ P+++TF+G+L AC H GLI+ G+ YF++M   + I P ++HYG MVDLY R+G ++EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALSLFLRM-EHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR

Query:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG
        +  + K  E A   +  +    SG YVLLSN+Y    RW E + +R  M++  + K  G S++E+ G +  F  GD    ES+ I  +L  +M+R R  G
Subjt:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        Y+  T+ V +D++E++KE  LS+HSEK+A+A+ ++KT PG  + I KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FI49 Pentatricopeptide repeat-containing protein At5g50990 (e value: 1.9e-156; % identity: 51.51)
Query:  RRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLL
        RR     L++S++      SN  D+  L  VLE+C+    NSK V++ HA+I K GYG YP+L+ S V+ Y+R        +LL    S    +  +NL+
Subjt:  RRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLL

Query:  IGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTC
        I + MKIGE   AKKV      ++V+TWN +IGG V+N +Y+EA +  + ML+ ++I+P+ F+FAS L ACA+LG   +  WVH+ M    IELN++L+ 
Subjt:  IGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTC

Query:  ALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEH
        AL+D Y+KCG I  ++E+F ++  +D S+WN MI G A HGLA +A+ +F  ME E V PD+ITFLG+LT C+H GL++ G+ YF LM   +SIQP+LEH
Subjt:  ALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEH

Query:  YGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSW
        YG MVDL  RAG ++EAY LI +MPIEPDVV WR+LLS  R YKN +L E+AI N+S  KSGDYVLLSNIY S  +W+ A+ VR++M    +RK +GKSW
Subjt:  YGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSW

Query:  IELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV
        +E GG I  FK+GD  H E+ AI KVL  L+++T+++G++  T+LV MD+SEEEKEENL++HSEK+ALAY ILK+SPG +I I KN+R+C DCH WIK V
Subjt:  IELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV

Query:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        S++L RVI++RDRIRFH+FE G+CSC D W
Subjt:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FI80 Pentatricopeptide repeat-containing protein At5g48910 (e value: 3.0e-106; % identity: 39.36)
Query:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCL---------NRVHQLLDILCSKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG
        + H   +K+G+G    ++++LV  Y   G +         N + + + ++  ++    ++V  N++I  +M++G+CK A+ +F KM  R VV+WN++I G
Subjt:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCL---------NRVHQLLDILCSKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG

Query:  CVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK
           N  + +A   FR+M   +I+P+  T  S+L A ++LG+     W+H       I ++ +L  ALID YSKCG I+ A  +F  +P  +   W+ MI 
Subjt:  CVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK

Query:  GLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT
        G AIHG A DA+  F +M    V P  + ++ +LTAC+HGGL++ GRRYF  M S   ++P++EHYG MVDL  R+G L+EA   I+ MPI+PD V W+ 
Subjt:  GLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT

Query:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK
        LL  CR+  N ++ + VA  + +M    SG YV LSN+Y S   W E   +R  MK   +RK  G S I++ G +  F   D  HP++  I  +L  +  
Subjt:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK

Query:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH+ IKL+S+V  R I VRDR RFH F+ G CSC D W
Subjt:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 (e value: 2.6e-105; % identity: 38.42)
Query:  AAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKV
        +AP N+  + SL   L+AC       +T  + HA+I K GY N    + SL+++Y   G     H L D +   + D V+ N +I  ++K G+   A  +
Subjt:  AAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKV

Query:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKE
        F KM  ++ ++W ++I G V+     EA + F +M  S+++PD  + A+ L+ACAQLGA     W+H+ + + +I ++S+L C LID Y+KCG ++ A E
Subjt:  FYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKE

Query:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEA
        +F NI       W  +I G A HG   +A+S F+ M+   + P+ ITF  VLTAC++ GL++ G+  F  M+  Y+++P +EHYG +VDL  RAG L+EA
Subjt:  IFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEA

Query:  YSLIVTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSG
           I  MP++P+ V W  LL  CRI+KN     ++ E+ IA +     G YV  +NI+    +W +A   R++MK   V K  G S I L GT   F +G
Subjt:  YSLIVTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSG

Query:  DRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRD
        DR HPE + I+     + ++    GY+P  E + +D + ++E+E  +  HSEK+A+ Y ++KT PG  I I KNLR+C DCH   KL+S++  R IV+RD
Subjt:  DRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRD

Query:  RIRFHQFEGGMCSCGDRW
        R RFH F  G CSCGD W
Subjt:  RIRFHQFEGGMCSCGDRW

Arabidopsis top hits (e value, % identity, alignment)
AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 8.2e-107; % identity: 40.24)
Query:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF
        THA+I+ FG    P +  SL++ Y   G L    ++ D   SK  DL A N ++  + K G    A+K+F +MP R+V++W+ +I G V   +Y EA   
Subjt:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRF

Query:  FRQMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL
        FR+M         ++P+ FT +++L+AC +LGA     WVHA + +  +E++ +L  ALID Y+KCGS++ AK +F+ +    D   ++ MI  LA++GL
Subjt:  FRQMLTSN-----IQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNI-PHSDTSVWNVMIKGLAIHGL

Query:  AMDALSLFLRM-EHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR
          +   LF  M   +++ P+++TF+G+L AC H GLI+ G+ YF++M   + I P ++HYG MVDLY R+G ++EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALSLFLRM-EHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR

Query:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG
        +  + K  E A   +  +    SG YVLLSN+Y    RW E + +R  M++  + K  G S++E+ G +  F  GD    ES+ I  +L  +M+R R  G
Subjt:  IYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        Y+  T+ V +D++E++KE  LS+HSEK+A+A+ ++KT PG  + I KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.4e-106; % identity: 41.77)
Query:  YPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPD
        YP LI + V+T   V     +H ++ I       +   N L+  +   G+   A KVF KMP +D+V WNS+I G  +N + +EA   + +M +  I+PD
Subjt:  YPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPD

Query:  GFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ESVL
        GFT  S+L+ACA++GA +    VH  M +  +  N   +  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +L
Subjt:  GFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ESVL

Query:  PDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANM
        P  ITF+G+L AC+H G++  G  YF  M+  Y I+P++EH+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +
Subjt:  PDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANM

Query:  SHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKE
            SGDYVLLSN+Y S  RW + + +RK M  + V+K  G S +E+G  +  F  GD+ HP+SDAI   L  +  R R+EGY+P    V++D+ EEEKE
Subjt:  SHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKE

Query:  ENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
          + +HSEK+A+A+ ++ T   + I++ KNLR+C DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  ENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT4G21065.2 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.8e-106 | %identity: 42.95
Query:  LVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIEL
        +   N L+  +   G+   A KVF KMP +D+V WNS+I G  +N + +EA   + +M +  I+PDGFT  S+L+ACA++GA +    VH  M +  +  
Subjt:  LVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIEL

Query:  NSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYS
        N   +  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +LP  ITF+G+L AC+H G++  G  YF  M+  Y 
Subjt:  NSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYS

Query:  IQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKIN
        I+P++EH+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +    SGDYVLLSN+Y S  RW + + +RK M  +
Subjt:  IQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKIN

Query:  RVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRIC
         V+K  G S +E+G  +  F  GD+ HP+SDAI   L  +  R R+EGY+P    V++D+ EEEKE  + +HSEK+A+A+ ++ T   + I++ KNLR+C
Subjt:  RVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRIC

Query:  DDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
         DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  DDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.2e-107 | %identity: 39.36
Query:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCL---------NRVHQLLDILCSKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG
        + H   +K+G+G    ++++LV  Y   G +         N + + + ++  ++    ++V  N++I  +M++G+CK A+ +F KM  R VV+WN++I G
Subjt:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCL---------NRVHQLLDILCSKQL---DLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGG

Query:  CVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK
           N  + +A   FR+M   +I+P+  T  S+L A ++LG+     W+H       I ++ +L  ALID YSKCG I+ A  +F  +P  +   W+ MI 
Subjt:  CVKNARYDEAFRFFRQMLTSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIK

Query:  GLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT
        G AIHG A DA+  F +M    V P  + ++ +LTAC+HGGL++ GRRYF  M S   ++P++EHYG MVDL  R+G L+EA   I+ MPI+PD V W+ 
Subjt:  GLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT

Query:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK
        LL  CR+  N ++ + VA  + +M    SG YV LSN+Y S   W E   +R  MK   +RK  G S I++ G +  F   D  HP++  I  +L  +  
Subjt:  LLSGCRIYKNHKLAE-VA--IANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMK

Query:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH+ IKL+S+V  R I VRDR RFH F+ G CSC D W
Subjt:  RTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT5G50990.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 1.3e-157 | %identity: 51.51
Query:  RRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLL
        RR     L++S++      SN  D+  L  VLE+C+    NSK V++ HA+I K GYG YP+L+ S V+ Y+R        +LL    S    +  +NL+
Subjt:  RRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLL

Query:  IGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTC
        I + MKIGE   AKKV      ++V+TWN +IGG V+N +Y+EA +  + ML+ ++I+P+ F+FAS L ACA+LG   +  WVH+ M    IELN++L+ 
Subjt:  IGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLT-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTC

Query:  ALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEH
        AL+D Y+KCG I  ++E+F ++  +D S+WN MI G A HGLA +A+ +F  ME E V PD+ITFLG+LT C+H GL++ G+ YF LM   +SIQP+LEH
Subjt:  ALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGGLIDHGRRYFELMKSHYSIQPQLEH

Query:  YGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSW
        YG MVDL  RAG ++EAY LI +MPIEPDVV WR+LLS  R YKN +L E+AI N+S  KSGDYVLLSNIY S  +W+ A+ VR++M    +RK +GKSW
Subjt:  YGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKMMKINRVRKKRGKSW

Query:  IELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV
        +E GG I  FK+GD  H E+ AI KVL  L+++T+++G++  T+LV MD+SEEEKEENL++HSEK+ALAY ILK+SPG +I I KN+R+C DCH WIK V
Subjt:  IELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV

Query:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        S++L RVI++RDRIRFH+FE G+CSC D W
Subjt:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW


Sequences
CDS sequence
ATGGTGTTATCTAAAACATCCAAATTGATTCCAACAATCCTGCATCCTTACCATGATTCAGTATTCGGTGGTCATTTGGGGTTCTTACGAACTTATAAGAGGATGGCTGG
AGAGTTGTATTGGGAAGGAATGAAACAGGAAGTGAAGAAATATTATGAGGAGTGTATGATCTGCCAACGCAATAAGACCTTGGCTCTTTCACCAGCTGGTTTATTAACTC
CTCTAGAAGTGCCAAATAGAGTGTGGGGGGATATATCAATGAATTTTGTGGAAGGATTACCTAAGGCAGGGGGTTATAAGGTGATATTTGTAGTGGTTGGCCGATTTAGT
AAATATGGGCATTTCATCCCTATGAAACATCCATATACAACAAAAGTCGTGTCTGAAATATTTGTGAAGGAAATCGTGCGTTTACATGGATATCCCAAATCCATAGTATC
AGATAGAGACAAGGTGTTTTTGAGTCACTTTTGGAGACAATTGTTTTGTTTGGCAGGAACCAAATTGAACCACAGTATAGCATATCATCCACAAACCGATGGGCAGACGG
AGGTTGTTAACAGAGGAGTGGAGAGTTTTTTGCGCTGTTTTTGTGGGGAAAAGCCGAGGGAGTGGATTAAATGGATTCCTTGGGCAGAATACTGGTATAATACAACATAT
AAGTGTTCTTTGGGAGTCACACCATTTCAAGCGGTGTATGGGAGATTGCCACCTCCTTTGATATATTATAGAGTGATAGCTAAGATTGGACCGGTAGCTTATAAATTGGA
ATTGCCAGAGAATGCTAGTATACATCCAGTATTCCATGTCTCTCAGTCAAAGAAAGTATTTGGAAAGCATGATGAGAATAGAAATGACGTGCCCTGTTTAACTGAAAATC
ATGAATGGAAAGATGTTCCGGAGGAGGTTTACGGATATTCGAAAAACAAAGCGGGAAGTTGGGATGTATTGGTGCAGTGGAAAGGTCTACCGCGACATGAGGCTACCTGG
GAGTTATATGAAGATTTGAAGCAGCGATTTCTAGATTTTCACCTTGAGGACAAGAGAGACAGTCTAGTACTGAATGATGACATACAGTATGCTAAACTTGAAGAAGTTGA
TCGGATTTATGACTTTCGTGCAGGTTTCAACCTCAAGTTTCACATTAGACCCATTTCCTCCTTAATGAAAGTGTGTTCTGAAGTTCGTCTTGTAGAAGATCATACAAGTG
CCATGAGTGTTCTGACTATCCCGCTATTGATTTTGCTACCTTCAGCGCTAGATCCTTTACCCATCCTAGTGACAAGAACAACGGAAAACCGATTCCTATCTTCCCTTAGT
CTTATCAATGTCGATGGGAAGAATCCCTGGATCCTAGACTCAGGTGCTACAGATCACTTGATAGGTTCCTCTGAAAATTTTGTTTCTTATATTTTGTGCACTGGTATTGA
AAGGACAAAGATAGCTAATGGTTCTTTGGGTAGCCTCAATTACTGCAATTTGTTTTTGAAGCCCTTTCAGTTGTTGCTGAGTTATTCCCGATGCAGAAGCTTGCACGATA
GGGTTTTGGAATGTGCCGAAAATCCACTAACCTCAAATGTGGCAGAACACGAATCGGCACGTGGAGAACACTACAGACAACAGATGAAGACGAGGGTGGCACAGTTAGGT
TGTTCGGTGGCGTTTGAACAGGATGGACCGGCAGCGAAGCCATTCGTCCATGGCAACGTTGATCTGAGGATCGATGGTGGCACTAAAACTGGCAGCTGCCTCCTCTATGT
GATTTCCATCACTGACAGAGCTGAAGTAATGCTGAAGCAGAGATACTTGGACTGCAGAAGGATTACTTTTGCCCTTCTTGCTACCTCTGCCTCTGGTTTACCAGCGGCCC
CTTCGAATTCCAAAGATTATCAAAGTCTTTACCATGTTCTTGAAGCCTGCAGACTCTTCCACATGAATTCCAAAACTGTTATCGAAACGCATGCACGAATTATTAAATTT
GGATATGGAAACTACCCTACTCTCATCGCCTCTCTAGTATCTACTTATCAACGTGTTGGTTGCCTTAATCGGGTTCATCAACTTCTTGATATACTCTGCTCTAAGCAGCT
TGATTTAGTTGCAATGAACTTACTCATTGGAAACTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGGTGTTTTATAAAATGCCTTTTCGTGATGTGGTAACATGGA
ACTCAATCATTGGAGGTTGTGTGAAGAATGCACGGTATGATGAGGCATTTAGATTCTTTAGACAGATGCTGACGTCAAATATTCAGCCGGATGGATTTACATTTGCTTCT
ATATTGAATGCATGTGCTCAACTCGGAGCTCCTAGTAACACTCATTGGGTTCATGCTCAGATGACTCAGAAAAAAATTGAGCTTAATTCTTTATTAACTTGTGCACTCAT
AGACGCATACTCGAAATGTGGTAGCATCCAAATTGCAAAGGAAATATTTAGTAATATTCCTCACAGTGATACTTCCGTTTGGAATGTGATGATCAAAGGGCTTGCGATTC
ATGGGCTTGCAATGGATGCATTATCGTTATTTTTGAGGATGGAGCATGAGAGTGTTCTGCCTGATGCCATCACCTTTTTGGGTGTTTTAACAGCTTGCAACCATGGTGGT
TTAATTGACCATGGTCGTAGATATTTTGAGCTGATGAAAAGTCATTATTCAATTCAGCCGCAGCTTGAGCATTATGGTGTCATGGTTGACCTTTATAGTCGAGCTGGGTT
TCTGGAAGAGGCCTATTCTCTAATCGTGACAATGCCGATAGAGCCTGATGTTGTCACGTGGCGGACACTTTTAAGTGGTTGTAGAATTTACAAAAATCATAAACTTGCAG
AAGTTGCTATTGCAAATATGTCTCATCGTAAGAGTGGGGATTACGTATTATTATCAAATATATATTGTTCTCTCAACAGATGGAAGGAAGCAGAAACAGTTAGAAAGATG
ATGAAAATCAATAGAGTTCGTAAGAAACGTGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTCAACACTTCAAGTCAGGTGATCGATTGCATCCAGAAAGCGATGCAAT
AGAAAAAGTGCTATGCAGTTTGATGAAGAGAACTCGGACGGAGGGATATATGCCTGTGACGGAGTTGGTTTTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTAT
CATTTCATAGTGAAAAGATGGCATTGGCTTATGCGATCTTGAAAACTAGTCCTGGGGCAAAGATCAGTATTTCAAAGAACCTGCGGATCTGTGATGATTGCCATACATGG
ATTAAATTAGTTTCAAGAGTGCTGTGTAGAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAGGTGGCATGTGTTCTTGTGGTGATCGTTGGTAG
mRNA sequence
ATGGTGTTATCTAAAACATCCAAATTGATTCCAACAATCCTGCATCCTTACCATGATTCAGTATTCGGTGGTCATTTGGGGTTCTTACGAACTTATAAGAGGATGGCTGG
AGAGTTGTATTGGGAAGGAATGAAACAGGAAGTGAAGAAATATTATGAGGAGTGTATGATCTGCCAACGCAATAAGACCTTGGCTCTTTCACCAGCTGGTTTATTAACTC
CTCTAGAAGTGCCAAATAGAGTGTGGGGGGATATATCAATGAATTTTGTGGAAGGATTACCTAAGGCAGGGGGTTATAAGGTGATATTTGTAGTGGTTGGCCGATTTAGT
AAATATGGGCATTTCATCCCTATGAAACATCCATATACAACAAAAGTCGTGTCTGAAATATTTGTGAAGGAAATCGTGCGTTTACATGGATATCCCAAATCCATAGTATC
AGATAGAGACAAGGTGTTTTTGAGTCACTTTTGGAGACAATTGTTTTGTTTGGCAGGAACCAAATTGAACCACAGTATAGCATATCATCCACAAACCGATGGGCAGACGG
AGGTTGTTAACAGAGGAGTGGAGAGTTTTTTGCGCTGTTTTTGTGGGGAAAAGCCGAGGGAGTGGATTAAATGGATTCCTTGGGCAGAATACTGGTATAATACAACATAT
AAGTGTTCTTTGGGAGTCACACCATTTCAAGCGGTGTATGGGAGATTGCCACCTCCTTTGATATATTATAGAGTGATAGCTAAGATTGGACCGGTAGCTTATAAATTGGA
ATTGCCAGAGAATGCTAGTATACATCCAGTATTCCATGTCTCTCAGTCAAAGAAAGTATTTGGAAAGCATGATGAGAATAGAAATGACGTGCCCTGTTTAACTGAAAATC
ATGAATGGAAAGATGTTCCGGAGGAGGTTTACGGATATTCGAAAAACAAAGCGGGAAGTTGGGATGTATTGGTGCAGTGGAAAGGTCTACCGCGACATGAGGCTACCTGG
GAGTTATATGAAGATTTGAAGCAGCGATTTCTAGATTTTCACCTTGAGGACAAGAGAGACAGTCTAGTACTGAATGATGACATACAGTATGCTAAACTTGAAGAAGTTGA
TCGGATTTATGACTTTCGTGCAGGTTTCAACCTCAAGTTTCACATTAGACCCATTTCCTCCTTAATGAAAGTGTGTTCTGAAGTTCGTCTTGTAGAAGATCATACAAGTG
CCATGAGTGTTCTGACTATCCCGCTATTGATTTTGCTACCTTCAGCGCTAGATCCTTTACCCATCCTAGTGACAAGAACAACGGAAAACCGATTCCTATCTTCCCTTAGT
CTTATCAATGTCGATGGGAAGAATCCCTGGATCCTAGACTCAGGTGCTACAGATCACTTGATAGGTTCCTCTGAAAATTTTGTTTCTTATATTTTGTGCACTGGTATTGA
AAGGACAAAGATAGCTAATGGTTCTTTGGGTAGCCTCAATTACTGCAATTTGTTTTTGAAGCCCTTTCAGTTGTTGCTGAGTTATTCCCGATGCAGAAGCTTGCACGATA
GGGTTTTGGAATGTGCCGAAAATCCACTAACCTCAAATGTGGCAGAACACGAATCGGCACGTGGAGAACACTACAGACAACAGATGAAGACGAGGGTGGCACAGTTAGGT
TGTTCGGTGGCGTTTGAACAGGATGGACCGGCAGCGAAGCCATTCGTCCATGGCAACGTTGATCTGAGGATCGATGGTGGCACTAAAACTGGCAGCTGCCTCCTCTATGT
GATTTCCATCACTGACAGAGCTGAAGTAATGCTGAAGCAGAGATACTTGGACTGCAGAAGGATTACTTTTGCCCTTCTTGCTACCTCTGCCTCTGGTTTACCAGCGGCCC
CTTCGAATTCCAAAGATTATCAAAGTCTTTACCATGTTCTTGAAGCCTGCAGACTCTTCCACATGAATTCCAAAACTGTTATCGAAACGCATGCACGAATTATTAAATTT
GGATATGGAAACTACCCTACTCTCATCGCCTCTCTAGTATCTACTTATCAACGTGTTGGTTGCCTTAATCGGGTTCATCAACTTCTTGATATACTCTGCTCTAAGCAGCT
TGATTTAGTTGCAATGAACTTACTCATTGGAAACTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGGTGTTTTATAAAATGCCTTTTCGTGATGTGGTAACATGGA
ACTCAATCATTGGAGGTTGTGTGAAGAATGCACGGTATGATGAGGCATTTAGATTCTTTAGACAGATGCTGACGTCAAATATTCAGCCGGATGGATTTACATTTGCTTCT
ATATTGAATGCATGTGCTCAACTCGGAGCTCCTAGTAACACTCATTGGGTTCATGCTCAGATGACTCAGAAAAAAATTGAGCTTAATTCTTTATTAACTTGTGCACTCAT
AGACGCATACTCGAAATGTGGTAGCATCCAAATTGCAAAGGAAATATTTAGTAATATTCCTCACAGTGATACTTCCGTTTGGAATGTGATGATCAAAGGGCTTGCGATTC
ATGGGCTTGCAATGGATGCATTATCGTTATTTTTGAGGATGGAGCATGAGAGTGTTCTGCCTGATGCCATCACCTTTTTGGGTGTTTTAACAGCTTGCAACCATGGTGGT
TTAATTGACCATGGTCGTAGATATTTTGAGCTGATGAAAAGTCATTATTCAATTCAGCCGCAGCTTGAGCATTATGGTGTCATGGTTGACCTTTATAGTCGAGCTGGGTT
TCTGGAAGAGGCCTATTCTCTAATCGTGACAATGCCGATAGAGCCTGATGTTGTCACGTGGCGGACACTTTTAAGTGGTTGTAGAATTTACAAAAATCATAAACTTGCAG
AAGTTGCTATTGCAAATATGTCTCATCGTAAGAGTGGGGATTACGTATTATTATCAAATATATATTGTTCTCTCAACAGATGGAAGGAAGCAGAAACAGTTAGAAAGATG
ATGAAAATCAATAGAGTTCGTAAGAAACGTGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTCAACACTTCAAGTCAGGTGATCGATTGCATCCAGAAAGCGATGCAAT
AGAAAAAGTGCTATGCAGTTTGATGAAGAGAACTCGGACGGAGGGATATATGCCTGTGACGGAGTTGGTTTTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTAT
CATTTCATAGTGAAAAGATGGCATTGGCTTATGCGATCTTGAAAACTAGTCCTGGGGCAAAGATCAGTATTTCAAAGAACCTGCGGATCTGTGATGATTGCCATACATGG
ATTAAATTAGTTTCAAGAGTGCTGTGTAGAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAGGTGGCATGTGTTCTTGTGGTGATCGTTGGTAG
Protein sequence
MVLSKTSKLIPTILHPYHDSVFGGHLGFLRTYKRMAGELYWEGMKQEVKKYYEECMICQRNKTLALSPAGLLTPLEVPNRVWGDISMNFVEGLPKAGGYKVIFVVVGRFS
KYGHFIPMKHPYTTKVVSEIFVKEIVRLHGYPKSIVSDRDKVFLSHFWRQLFCLAGTKLNHSIAYHPQTDGQTEVVNRGVESFLRCFCGEKPREWIKWIPWAEYWYNTTY
KCSLGVTPFQAVYGRLPPPLIYYRVIAKIGPVAYKLELPENASIHPVFHVSQSKKVFGKHDENRNDVPCLTENHEWKDVPEEVYGYSKNKAGSWDVLVQWKGLPRHEATW
ELYEDLKQRFLDFHLEDKRDSLVLNDDIQYAKLEEVDRIYDFRAGFNLKFHIRPISSLMKVCSEVRLVEDHTSAMSVLTIPLLILLPSALDPLPILVTRTTENRFLSSLS
LINVDGKNPWILDSGATDHLIGSSENFVSYILCTGIERTKIANGSLGSLNYCNLFLKPFQLLLSYSRCRSLHDRVLECAENPLTSNVAEHESARGEHYRQQMKTRVAQLG
CSVAFEQDGPAAKPFVHGNVDLRIDGGTKTGSCLLYVISITDRAEVMLKQRYLDCRRITFALLATSASGLPAAPSNSKDYQSLYHVLEACRLFHMNSKTVIETHARIIKF
GYGNYPTLIASLVSTYQRVGCLNRVHQLLDILCSKQLDLVAMNLLIGNFMKIGECKFAKKVFYKMPFRDVVTWNSIIGGCVKNARYDEAFRFFRQMLTSNIQPDGFTFAS
ILNACAQLGAPSNTHWVHAQMTQKKIELNSLLTCALIDAYSKCGSIQIAKEIFSNIPHSDTSVWNVMIKGLAIHGLAMDALSLFLRMEHESVLPDAITFLGVLTACNHGG
LIDHGRRYFELMKSHYSIQPQLEHYGVMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRKSGDYVLLSNIYCSLNRWKEAETVRKM
MKINRVRKKRGKSWIELGGTIQHFKSGDRLHPESDAIEKVLCSLMKRTRTEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTW
IKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW