; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0020385 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0020385
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr02:12341796..12343813
RNA-Seq ExpressionPI0020385
SyntenyPI0020385
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0043231 - intracellular membrane-bounded organelle (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ96962.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]6.0e-28892.01Show/hide
Query:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH
        MLKQRYLDCRRIT ALLATSASALPAAPSN  DYQTL RVLEACRLFP NSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLN+VH+LLDILCSKH
Subjt:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH

Query:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NARYDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH+DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELM+SRY
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY

Query:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV
        SIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMP+EPDVVTWRTLLSGC+IYKNHKLAEVAIANMSHR+SGDYVLLSNIYCSLNRWEEAETVRKMMK+N+V
Subjt:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV

Query:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDR HPE DAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_008443769.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50990 [Cucumis melo]3.0e-28792.01Show/hide
Query:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH
        MLKQRYLDCRRIT ALLATSASALPAAPSN  DYQTL RVLEACRLFP NSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLN+VH+LLDILCSKH
Subjt:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH

Query:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NARYDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH+DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELM+SRY
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY

Query:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV
        SIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH +SGDYVLLSNIYCSLNRWEEAETVRKMMK+N+V
Subjt:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV

Query:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDR HPE DAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_011660258.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucumis sativus]3.9e-28791.64Show/hide
Query:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH
        MLKQRYLDCRRITFALLATSAS LPAAPSNSKDYQ+L RVLEACRLF  NSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLN+VHQLLDILCSK 
Subjt:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH

Query:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NARYDEAFRFFRQML SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY
        ELNSLL+CALIDAYSKCGSIQIAKEIFSN+PH+DTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHG LIDHGRRYFELMKS Y
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY

Query:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV
        SIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS R+SGDYVLLSNIYCSLNRW+EAETVRKMMK+N+V
Subjt:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV

Query:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGTIQHFKSGDR HPE DAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

XP_031736189.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucumis sativus]3.6e-26991.11Show/hide
Query:  DYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK----
        +YQ+L RVLEACRLF  NSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLN+VHQLLDILCSK LDLVAMNLLIGNFMKIGECKFAKK F K    
Subjt:  DYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK----

Query:  -----------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH
                   C+ NARYDEAFRFFRQML SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLL+CALIDAYSKCGSIQIAKEIFSN+PH
Subjt:  -----------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH

Query:  NDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTM
        +DTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHG LIDHGRRYFELMKS YSIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTM
Subjt:  NDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTM

Query:  PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAID
        PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS R+SGDYVLLSNIYCSLNRW+EAETVRKMMK+N+VRKKRGKSWIELGGTIQHFKSGDR HPE DAI+
Subjt:  PIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAID

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC
        KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMC

Query:  SCGDRW
        SCGDRW
Subjt:  SCGDRW

XP_031736190.1 pentatricopeptide repeat-containing protein At5g50990 isoform X3 [Cucumis sativus]4.3e-26291.82Show/hide
Query:  NSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNAR
        NSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLN+VHQLLDILCSK LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NAR
Subjt:  NSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNAR

Query:  YDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHG
        YDEAFRFFRQML SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLL+CALIDAYSKCGSIQIAKEIFSN+PH+DTSVWNVMIKGLAIHG
Subjt:  YDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHG

Query:  LAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR
        LAMDALSLFLRMEHE+VLPDAITFLG+LTACNHG LIDHGRRYFELMKS YSIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR
Subjt:  LAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR

Query:  IYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMP
        IYKNHKLAEVAIANMS R+SGDYVLLSNIYCSLNRW+EAETVRKMMK+N+VRKKRGKSWIELGGTIQHFKSGDR HPE DAI+KVLCSLMKRTR+EGYMP
Subjt:  IYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMP

Query:  VTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        VTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  VTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

TrEMBL top hitse value%identityAlignment
A0A0A0LXZ9 DYW_deaminase domain-containing protein1.9e-28791.64Show/hide
Query:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH
        MLKQRYLDCRRITFALLATSAS LPAAPSNSKDYQ+L RVLEACRLF  NSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLN+VHQLLDILCSK 
Subjt:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH

Query:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NARYDEAFRFFRQML SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY
        ELNSLL+CALIDAYSKCGSIQIAKEIFSN+PH+DTSVWNVMIKGLAIHGLAMDALSLFLRMEHE+VLPDAITFLG+LTACNHG LIDHGRRYFELMKS Y
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY

Query:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV
        SIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMS R+SGDYVLLSNIYCSLNRW+EAETVRKMMK+N+V
Subjt:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV

Query:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGTIQHFKSGDR HPE DAI+KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A1S4DV22 pentatricopeptide repeat-containing protein At5g509901.4e-28792.01Show/hide
Query:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH
        MLKQRYLDCRRIT ALLATSASALPAAPSN  DYQTL RVLEACRLFP NSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLN+VH+LLDILCSKH
Subjt:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH

Query:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NARYDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH+DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELM+SRY
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY

Query:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV
        SIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH +SGDYVLLSNIYCSLNRWEEAETVRKMMK+N+V
Subjt:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV

Query:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDR HPE DAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A5A7T674 Pentatricopeptide repeat-containing protein1.4e-28792.01Show/hide
Query:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH
        MLKQRYLDCRRIT ALLATSASALPAAPSN  DYQTL RVLEACRLFP NSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLN+VH+LLDILCSKH
Subjt:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH

Query:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NARYDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH+DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELM+SRY
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY

Query:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV
        SIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGC+IYKNHKLAEVAIANMSH +SGDYVLLSNIYCSLNRWEEAETVRKMMK+N+V
Subjt:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV

Query:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDR HPE DAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A5D3BAU8 Pentatricopeptide repeat-containing protein2.9e-28892.01Show/hide
Query:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH
        MLKQRYLDCRRIT ALLATSASALPAAPSN  DYQTL RVLEACRLFP NSKTVIETHARIIKFGYGNYPTLIASLVSTYQ VGCLN+VH+LLDILCSKH
Subjt:  MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKH

Query:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI
        LDLVAMNLLIGNFMKIGECKFAKK F K               C+ NARYDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPSNTHWV AQMTQKKI
Subjt:  LDLVAMNLLIGNFMKIGECKFAKKYFIK---------------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKI

Query:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY
        ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPH+DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELM+SRY
Subjt:  ELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRY

Query:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV
        SIQPQL+HYG+MVDLYSRAGFLEEAYSLIVTMP+EPDVVTWRTLLSGC+IYKNHKLAEVAIANMSHR+SGDYVLLSNIYCSLNRWEEAETVRKMMK+N+V
Subjt:  SIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKV

Query:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
        RKKRGKSWIELGGT Q+FKSGDR HPE DAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD
Subjt:  RKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDD

Query:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
Subjt:  CHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

A0A6J1JL20 pentatricopeptide repeat-containing protein At5g50990 isoform X21.4e-23770.41Show/hide
Query:  KQRYLDCRRITFALLATSASALP------------------------------------------------------AAPSNSKD---------------
        KQRYLD +RIT ALLATSA ALP                                                      + P NS +               
Subjt:  KQRYLDCRRITFALLATSASALP------------------------------------------------------AAPSNSKD---------------

Query:  YQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK-----
        YQTL RVLEACRL P NSKT  ETHAR+IKFGYGNYPTL+ SLVS YQR  CLN+VHQLL++LCSKHLDLVAMNL I NFMKIGECK AK+ F K     
Subjt:  YQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK-----

Query:  ----------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHN
                  C+ NARY+EAF+FFRQML SNIQPDGFTFAS+LNACAQLGAPSNT WVHA MTQKKI+LNS+LSCALIDAYSKCGSIQIAKEIFS+VP +
Subjt:  ----------CLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHN

Query:  DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMP
        + SVWN MIKGLAIHGL+MDALS+F  ME ENVLPDA+TFLGILTACNHGGLI+ GRR+F+ MK+RYSIQPQL+HYG+MVDLYSRAGFLEEAYS+IV MP
Subjt:  DTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMP

Query:  IEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDK
        IE DVVTWR LLSGCRIY+N +LAEVAIANMSHR SGDYVLLSNIYCSLNRWE AE VR+ MK N VRK  GKSWIELGG+IQ F+SGDRSHPE DA+ K
Subjt:  IEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDK

Query:  VLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS
        V+  LMKR+RSEGYMPVT+LV MDISEEEKEENLS+HSEK+ALAYAILKTSPGAKISISKNLR+CDDCH WIK+VS +LCRV+VVRDRIRFHQFEGGMCS
Subjt:  VLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCS

Query:  CGDRW
        CGDRW
Subjt:  CGDRW

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210657.5e-10040.64Show/hide
Query:  HARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLSNARYDEAFRFFRQMLKSNIQPDGFTF
        H+ +I+ G+G+   +  SL+  Y   G +   +++ D +  K  DLVA N +I  F +                N + +EA   + +M    I+PDGFT 
Subjt:  HARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLSNARYDEAFRFFRQMLKSNIQPDGFTF

Query:  ASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAI
         S+L+ACA++GA +    VH  M +  +  N   S  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +LP  I
Subjt:  ASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAI

Query:  TFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRR
        TF+GIL AC+H G++  G  YF  M+  Y I+P+++H+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +    
Subjt:  TFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRR

Query:  SGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLS
        SGDYVLLSN+Y S  RW + + +RK M  + V+K  G S +E+G  +  F  GD+SHP+ DAI   L  +  R RSEGY+P    V++D+ EEEKE  + 
Subjt:  SGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLS

Query:  FHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        +HSEK+A+A+ ++ T   + I++ KNLR+C DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  FHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q683I9 Pentatricopeptide repeat-containing protein At3g628901.4e-9839.84Show/hide
Query:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK----------CLSN-----ARYDEAFRF
        THA+I+ FG    P +  SL++ Y   G L    ++ D   SK  DL A N ++  + K G    A+K F +          CL N      +Y EA   
Subjt:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK----------CLSN-----ARYDEAFRF

Query:  FRQML-----KSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNV-PHNDTSVWNVMIKGLAIHGL
        FR+M      ++ ++P+ FT +++L+AC +LGA     WVHA + +  +E++ +L  ALID Y+KCGS++ AK +F+ +    D   ++ MI  LA++GL
Subjt:  FRQML-----KSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNV-PHNDTSVWNVMIKGLAIHGL

Query:  AMDALSLFLRM-EHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR
          +   LF  M   +N+ P+++TF+GIL AC H GLI+ G+ YF++M   + I P +QHYG MVDLY R+G ++EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALSLFLRM-EHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR

Query:  IYKNHKLAEVA---IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEG
        +  + K  E A   +  +    SG YVLLSN+Y    RW E + +R  M++  + K  G S++E+ G +  F  GD S  E + I  +L  +M+R R  G
Subjt:  IYKNHKLAEVA---IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        Y+  T+ V +D++E++KE  LS+HSEK+A+A+ ++KT PG  + I KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FI49 Pentatricopeptide repeat-containing protein At5g509903.5e-15050.94Show/hide
Query:  RRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLL
        RR     L++S++      SN  D+  L +VLE+C+  P NSK V++ HA+I K GYG YP+L+ S V+ Y+R        +LL    S    +  +NL+
Subjt:  RRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLL

Query:  IGNFMKIGECKFAKKYFIKC---------------LSNARYDEAFRFFRQMLK-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSC
        I + MKIGE   AKK                    + N +Y+EA +  + ML  ++I+P+ F+FAS L ACA+LG   +  WVH+ M    IELN++LS 
Subjt:  IGNFMKIGECKFAKKYFIKC---------------LSNARYDEAFRFFRQMLK-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSC

Query:  ALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQH
        AL+D Y+KCG I  ++E+F +V  ND S+WN MI G A HGLA +A+ +F  ME E+V PD+ITFLG+LT C+H GL++ G+ YF LM  R+SIQP+L+H
Subjt:  ALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQH

Query:  YGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSW
        YG MVDL  RAG ++EAY LI +MPIEPDVV WR+LLS  R YKN +L E+AI N+S  +SGDYVLLSNIY S  +WE A+ VR++M    +RK +GKSW
Subjt:  YGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSW

Query:  IELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV
        +E GG I  FK+GD SH E  AI KVL  L+++T+S+G++  T+LV MD+SEEEKEENL++HSEK+ALAY ILK+SPG +I I KN+R+C DCH WIK V
Subjt:  IELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV

Query:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        S++L RVI++RDRIRFH+FE G+CSC D W
Subjt:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FI80 Pentatricopeptide repeat-containing protein At5g489108.3e-9938.15Show/hide
Query:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLL-------DILC-----SKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLS------------
        + H   +K+G+G    ++++LV  Y   G +     L        D++       +  ++V  N++I  +M++G+CK A+  F K               
Subjt:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLL-------DILC-----SKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLS------------

Query:  ---NARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIK
           N  + +A   FR+M K +I+P+  T  S+L A ++LG+     W+H       I ++ +L  ALID YSKCG I+ A  +F  +P  +   W+ MI 
Subjt:  ---NARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIK

Query:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT
        G AIHG A DA+  F +M    V P  + ++ +LTAC+HGGL++ GRRYF  M S   ++P+++HYG MVDL  R+G L+EA   I+ MPI+PD V W+ 
Subjt:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT

Query:  LLSGCRIYKNHKLAE-VA--IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMK
        LL  CR+  N ++ + VA  + +M    SG YV LSN+Y S   W E   +R  MK   +RK  G S I++ G +  F   D SHP+   I+ +L  +  
Subjt:  LLSGCRIYKNHKLAE-VA--IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMK

Query:  RTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH+ IKL+S+V  R I VRDR RFH F+ G CSC D W
Subjt:  RTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665202.0e-9736.48Show/hide
Query:  ALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFM
        +LL        +AP N+  + +L   L+AC       +T  + HA+I K GY N    + SL+++Y   G     H L D +     D V+ N +I  ++
Subjt:  ALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFM

Query:  KIGECKFAKKYFIKCLSNARYD---------------EAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAY
        K G+   A   F K                       EA + F +M  S+++PD  + A+ L+ACAQLGA     W+H+ + + +I ++S+L C LID Y
Subjt:  KIGECKFAKKYFIKCLSNARYD---------------EAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAY

Query:  SKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVD
        +KCG ++ A E+F N+       W  +I G A HG   +A+S F+ M+   + P+ ITF  +LTAC++ GL++ G+  F  M+  Y+++P ++HYG +VD
Subjt:  SKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVD

Query:  LYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIE
        L  RAG L+EA   I  MP++P+ V W  LL  CRI+KN     ++ E+ IA +     G YV  +NI+    +W++A   R++MK   V K  G S I 
Subjt:  LYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKN----HKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIE

Query:  LGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVS
        L GT   F +GDRSHPE + I      + ++    GY+P  E + +D + ++E+E  +  HSEK+A+ Y ++KT PG  I I KNLR+C DCH   KL+S
Subjt:  LGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMD-ISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVS

Query:  RVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        ++  R IV+RDR RFH F  G CSCGD W
Subjt:  RVLCRVIVVRDRIRFHQFEGGMCSCGDRW

Arabidopsis top hitse value%identityAlignment
AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-9939.84Show/hide
Query:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK----------CLSN-----ARYDEAFRF
        THA+I+ FG    P +  SL++ Y   G L    ++ D   SK  DL A N ++  + K G    A+K F +          CL N      +Y EA   
Subjt:  THARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIK----------CLSN-----ARYDEAFRF

Query:  FRQML-----KSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNV-PHNDTSVWNVMIKGLAIHGL
        FR+M      ++ ++P+ FT +++L+AC +LGA     WVHA + +  +E++ +L  ALID Y+KCGS++ AK +F+ +    D   ++ MI  LA++GL
Subjt:  FRQML-----KSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNV-PHNDTSVWNVMIKGLAIHGL

Query:  AMDALSLFLRM-EHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR
          +   LF  M   +N+ P+++TF+GIL AC H GLI+ G+ YF++M   + I P +QHYG MVDLY R+G ++EA S I +MP+EPDV+ W +LLSG R
Subjt:  AMDALSLFLRM-EHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCR

Query:  IYKNHKLAEVA---IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEG
        +  + K  E A   +  +    SG YVLLSN+Y    RW E + +R  M++  + K  G S++E+ G +  F  GD S  E + I  +L  +M+R R  G
Subjt:  IYKNHKLAEVA---IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEG

Query:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        Y+  T+ V +D++E++KE  LS+HSEK+A+A+ ++KT PG  + I KNLRIC DCH  +K++S++  R IVVRD  RFH F  G CSC D W
Subjt:  YMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.3e-10140.64Show/hide
Query:  HARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLSNARYDEAFRFFRQMLKSNIQPDGFTF
        H+ +I+ G+G+   +  SL+  Y   G +   +++ D +  K  DLVA N +I  F +                N + +EA   + +M    I+PDGFT 
Subjt:  HARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLSNARYDEAFRFFRQMLKSNIQPDGFTF

Query:  ASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAI
         S+L+ACA++GA +    VH  M +  +  N   S  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +LP  I
Subjt:  ASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAI

Query:  TFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRR
        TF+GIL AC+H G++  G  YF  M+  Y I+P+++H+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +    
Subjt:  TFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRR

Query:  SGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLS
        SGDYVLLSN+Y S  RW + + +RK M  + V+K  G S +E+G  +  F  GD+SHP+ DAI   L  +  R RSEGY+P    V++D+ EEEKE  + 
Subjt:  SGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLS

Query:  FHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        +HSEK+A+A+ ++ T   + I++ KNLR+C DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  FHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT4G21065.2 Tetratricopeptide repeat (TPR)-like superfamily protein5.3e-10140.64Show/hide
Query:  HARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLSNARYDEAFRFFRQMLKSNIQPDGFTF
        H+ +I+ G+G+   +  SL+  Y   G +   +++ D +  K  DLVA N +I  F +                N + +EA   + +M    I+PDGFT 
Subjt:  HARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLSNARYDEAFRFFRQMLKSNIQPDGFTF

Query:  ASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAI
         S+L+ACA++GA +    VH  M +  +  N   S  L+D Y++CG ++ AK +F  +   ++  W  +I GLA++G   +A+ LF  ME  E +LP  I
Subjt:  ASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEH-ENVLPDAI

Query:  TFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRR
        TF+GIL AC+H G++  G  YF  M+  Y I+P+++H+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ +  LAE A   I  +    
Subjt:  TFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVA---IANMSHRR

Query:  SGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLS
        SGDYVLLSN+Y S  RW + + +RK M  + V+K  G S +E+G  +  F  GD+SHP+ DAI   L  +  R RSEGY+P    V++D+ EEEKE  + 
Subjt:  SGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLS

Query:  FHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        +HSEK+A+A+ ++ T   + I++ KNLR+C DCH  IKLVS+V  R IVVRDR RFH F+ G CSC D W
Subjt:  FHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT5G48910.1 Pentatricopeptide repeat (PPR) superfamily protein5.9e-10038.15Show/hide
Query:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLL-------DILC-----SKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLS------------
        + H   +K+G+G    ++++LV  Y   G +     L        D++       +  ++V  N++I  +M++G+CK A+  F K               
Subjt:  ETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLL-------DILC-----SKHLDLVAMNLLIGNFMKIGECKFAKKYFIKCLS------------

Query:  ---NARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIK
           N  + +A   FR+M K +I+P+  T  S+L A ++LG+     W+H       I ++ +L  ALID YSKCG I+ A  +F  +P  +   W+ MI 
Subjt:  ---NARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIK

Query:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT
        G AIHG A DA+  F +M    V P  + ++ +LTAC+HGGL++ GRRYF  M S   ++P+++HYG MVDL  R+G L+EA   I+ MPI+PD V W+ 
Subjt:  GLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRT

Query:  LLSGCRIYKNHKLAE-VA--IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMK
        LL  CR+  N ++ + VA  + +M    SG YV LSN+Y S   W E   +R  MK   +RK  G S I++ G +  F   D SHP+   I+ +L  +  
Subjt:  LLSGCRIYKNHKLAE-VA--IANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMK

Query:  RTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        + R  GY P+T  V +++ EE+KE  L +HSEK+A A+ ++ TSPG  I I KNLRIC+DCH+ IKL+S+V  R I VRDR RFH F+ G CSC D W
Subjt:  RTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW

AT5G50990.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.5e-15150.94Show/hide
Query:  RRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLL
        RR     L++S++      SN  D+  L +VLE+C+  P NSK V++ HA+I K GYG YP+L+ S V+ Y+R        +LL    S    +  +NL+
Subjt:  RRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLL

Query:  IGNFMKIGECKFAKKYFIKC---------------LSNARYDEAFRFFRQMLK-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSC
        I + MKIGE   AKK                    + N +Y+EA +  + ML  ++I+P+ F+FAS L ACA+LG   +  WVH+ M    IELN++LS 
Subjt:  IGNFMKIGECKFAKKYFIKC---------------LSNARYDEAFRFFRQMLK-SNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSC

Query:  ALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQH
        AL+D Y+KCG I  ++E+F +V  ND S+WN MI G A HGLA +A+ +F  ME E+V PD+ITFLG+LT C+H GL++ G+ YF LM  R+SIQP+L+H
Subjt:  ALIDAYSKCGSIQIAKEIFSNVPHNDTSVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQH

Query:  YGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSW
        YG MVDL  RAG ++EAY LI +MPIEPDVV WR+LLS  R YKN +L E+AI N+S  +SGDYVLLSNIY S  +WE A+ VR++M    +RK +GKSW
Subjt:  YGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLLSGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSW

Query:  IELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV
        +E GG I  FK+GD SH E  AI KVL  L+++T+S+G++  T+LV MD+SEEEKEENL++HSEK+ALAY ILK+SPG +I I KN+R+C DCH WIK V
Subjt:  IELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLV

Query:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW
        S++L RVI++RDRIRFH+FE G+CSC D W
Subjt:  SRVLCRVIVVRDRIRFHQFEGGMCSCGDRW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGAAGCAGAGATACTTGGACTGCAGAAGGATTACTTTTGCTCTTCTTGCTACCTCTGCCTCTGCTTTACCGGCGGCCCCTTCGAATTCCAAAGATTATCAAACCCT
TTGCCGTGTTCTTGAAGCCTGCAGACTCTTCCCCTGGAATTCCAAAACCGTTATTGAAACACATGCACGAATCATTAAATTTGGATATGGAAACTACCCTACTCTCATCG
CCTCTTTAGTATCGACTTATCAACGTGTCGGTTGCCTGAATCAGGTCCATCAACTTCTTGATATACTCTGCTCTAAGCATCTTGATTTAGTTGCAATGAACTTACTCATT
GGAAATTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGTATTTTATAAAATGCCTTTCCAATGCACGGTATGATGAGGCATTTAGATTCTTTAGACAGATGCTGAA
GTCAAATATTCAGCCGGATGGATTTACATTTGCTTCCATATTGAATGCATGTGCTCAACTCGGAGCTCCTAGTAACACTCATTGGGTTCATGCTCAGATGACTCAGAAAA
AAATTGAACTTAATTCTTTATTAAGTTGTGCACTCATAGATGCATACTCAAAATGTGGTAGCATCCAAATTGCAAAGGAAATATTTAGTAATGTCCCTCACAATGATACT
TCAGTTTGGAATGTGATGATCAAAGGGCTTGCGATTCATGGGCTTGCAATGGATGCATTATCGTTATTTTTGAGGATGGAGCATGAGAATGTTCTGCCAGATGCTATCAC
CTTTTTGGGTATTTTAACAGCTTGCAACCATGGTGGTTTAATTGACCATGGTCGTCGATACTTTGAGCTGATGAAAAGTCGCTATTCAATTCAGCCGCAGCTTCAGCATT
ATGGCATCATGGTTGATCTTTATAGTCGAGCTGGGTTTCTGGAAGAGGCCTATTCTCTAATCGTGACAATGCCGATAGAGCCTGATGTTGTCACATGGAGGACGCTTCTA
AGTGGTTGTAGAATTTACAAAAATCATAAACTTGCAGAAGTTGCTATTGCAAATATGTCTCATCGTAGGAGTGGTGATTACGTATTGTTATCAAATATCTATTGTTCTCT
CAACAGATGGGAGGAAGCAGAAACAGTTAGAAAGATGATGAAACTCAATAAAGTTCGTAAGAAACGCGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTCAACACTTCA
AGTCAGGTGATCGATCGCATCCAGAAAGGGATGCAATAGACAAAGTGCTATGCAGTTTGATGAAGAGAACTCGGTCGGAGGGATATATGCCTGTTACGGAGTTGGTTTTC
ATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTGTCATTTCATAGCGAAAAGATGGCATTGGCTTATGCGATCTTGAAAACTAGTCCTGGGGCAAAGATCAGTATATC
AAAGAACCTGCGGATCTGTGATGATTGTCATACATGGATTAAATTAGTTTCAAGAGTGCTGTGTAGAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAG
GTGGCATGTGTTCTTGTGGTGATCGTTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGCTGAAGCAGAGATACTTGGACTGCAGAAGGATTACTTTTGCTCTTCTTGCTACCTCTGCCTCTGCTTTACCGGCGGCCCCTTCGAATTCCAAAGATTATCAAACCCT
TTGCCGTGTTCTTGAAGCCTGCAGACTCTTCCCCTGGAATTCCAAAACCGTTATTGAAACACATGCACGAATCATTAAATTTGGATATGGAAACTACCCTACTCTCATCG
CCTCTTTAGTATCGACTTATCAACGTGTCGGTTGCCTGAATCAGGTCCATCAACTTCTTGATATACTCTGCTCTAAGCATCTTGATTTAGTTGCAATGAACTTACTCATT
GGAAATTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGTATTTTATAAAATGCCTTTCCAATGCACGGTATGATGAGGCATTTAGATTCTTTAGACAGATGCTGAA
GTCAAATATTCAGCCGGATGGATTTACATTTGCTTCCATATTGAATGCATGTGCTCAACTCGGAGCTCCTAGTAACACTCATTGGGTTCATGCTCAGATGACTCAGAAAA
AAATTGAACTTAATTCTTTATTAAGTTGTGCACTCATAGATGCATACTCAAAATGTGGTAGCATCCAAATTGCAAAGGAAATATTTAGTAATGTCCCTCACAATGATACT
TCAGTTTGGAATGTGATGATCAAAGGGCTTGCGATTCATGGGCTTGCAATGGATGCATTATCGTTATTTTTGAGGATGGAGCATGAGAATGTTCTGCCAGATGCTATCAC
CTTTTTGGGTATTTTAACAGCTTGCAACCATGGTGGTTTAATTGACCATGGTCGTCGATACTTTGAGCTGATGAAAAGTCGCTATTCAATTCAGCCGCAGCTTCAGCATT
ATGGCATCATGGTTGATCTTTATAGTCGAGCTGGGTTTCTGGAAGAGGCCTATTCTCTAATCGTGACAATGCCGATAGAGCCTGATGTTGTCACATGGAGGACGCTTCTA
AGTGGTTGTAGAATTTACAAAAATCATAAACTTGCAGAAGTTGCTATTGCAAATATGTCTCATCGTAGGAGTGGTGATTACGTATTGTTATCAAATATCTATTGTTCTCT
CAACAGATGGGAGGAAGCAGAAACAGTTAGAAAGATGATGAAACTCAATAAAGTTCGTAAGAAACGCGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTCAACACTTCA
AGTCAGGTGATCGATCGCATCCAGAAAGGGATGCAATAGACAAAGTGCTATGCAGTTTGATGAAGAGAACTCGGTCGGAGGGATATATGCCTGTTACGGAGTTGGTTTTC
ATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTGTCATTTCATAGCGAAAAGATGGCATTGGCTTATGCGATCTTGAAAACTAGTCCTGGGGCAAAGATCAGTATATC
AAAGAACCTGCGGATCTGTGATGATTGTCATACATGGATTAAATTAGTTTCAAGAGTGCTGTGTAGAGTTATAGTAGTGAGGGATCGGATCCGGTTTCATCAATTTGAAG
GTGGCATGTGTTCTTGTGGTGATCGTTGGTAG
Protein sequenceShow/hide protein sequence
MLKQRYLDCRRITFALLATSASALPAAPSNSKDYQTLCRVLEACRLFPWNSKTVIETHARIIKFGYGNYPTLIASLVSTYQRVGCLNQVHQLLDILCSKHLDLVAMNLLI
GNFMKIGECKFAKKYFIKCLSNARYDEAFRFFRQMLKSNIQPDGFTFASILNACAQLGAPSNTHWVHAQMTQKKIELNSLLSCALIDAYSKCGSIQIAKEIFSNVPHNDT
SVWNVMIKGLAIHGLAMDALSLFLRMEHENVLPDAITFLGILTACNHGGLIDHGRRYFELMKSRYSIQPQLQHYGIMVDLYSRAGFLEEAYSLIVTMPIEPDVVTWRTLL
SGCRIYKNHKLAEVAIANMSHRRSGDYVLLSNIYCSLNRWEEAETVRKMMKLNKVRKKRGKSWIELGGTIQHFKSGDRSHPERDAIDKVLCSLMKRTRSEGYMPVTELVF
MDISEEEKEENLSFHSEKMALAYAILKTSPGAKISISKNLRICDDCHTWIKLVSRVLCRVIVVRDRIRFHQFEGGMCSCGDRW