CuGenDBv2

Sgr028517 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr028517
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153204:1869299..1872377
RNA-Seq Expression: Sgr028517
Synteny: Sgr028517
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR044657 - Pentatricopeptide repeat-containing protein NFD5-like


Homology
GenBank top hits | e value | % identity | Alignment
KAG7029395.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] | 7.6e-263 | 70.1
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSL  LLLR+T KNF  ING+LL +Q ++ IN TR FIT+PSF LLD  Y   SS+PV+N EL KS+SIFSRC+H T+TK SD A+ PKLESADVE+DDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISE FPDY+K+TVDAMLLMIVEKVVSEMEKG+ EQ+L+AST + DWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI
        VQEMCRF+GEV  +G   +E                        E   +  NN SPS  EA SE KSE VSLPKRRGKIKYKIYGLDLSD KWS+VAD++
Subjt:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI

Query:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC
        HEAEEVLWPQEP    G                      A W        +DWI LLDKLNE NRFLYLKVAEL+LSEESFQT+IRDYSKL+D HAKEN 
Subjt:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC

Query:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------
        LEDAERIL KM EKGI PDILTA+VLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEK                                            
Subjt:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------

Query:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV
                + RI+ATMQFAGF PSLESCTLL+E YGQAGDPDQARNNFD+MIKIGHRPDDRCTASM+AAYEKKNLLDKALNLLLQLEKDGF PGVATYAV
Subjt:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV

Query:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        LVDWLGKLQLVDEAEQ+LGKIGAQGD LPFKVHISLCDMYSRAGIEKKALQALGVLEAKKE+LGHGD+ERIINGLIAGGFVQDAKR+Q +MEAQGFTAS
Subjt:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

XP_022144194.1 pentatricopeptide repeat-containing protein At5g39710 [Momordica charantia] | 7.3e-282 | 74.79
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSLKRLLLRQT K FSGI+GSLLHRQ +I+INATRTF TTPSF LLDP YGR SS+  QNLELCKSS IFSRC+HFT+TK SDTAI PKLESAD EDDDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISEAFPDY+KQTVDAMLLMIVE+VVSEMEKGNI QTL AS  S DWDLSEDLWKTVSEVSNMVL+DMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE---------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVA
        VQEMCRF+GEV  +G   +E                           EN  EGNN  SPSG EA SEEKSEVVSLPKRRGKIKYKIYGLDLSD KW+KVA
Subjt:  VQEMCRFSGEVAQKGSPCQE---------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVA

Query:  DRIHEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAK
        D+IHEAEEVLWPQEP    G                      A W        IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKL+DAHAK
Subjt:  DRIHEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAK

Query:  ENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-----------------------------------------
        EN LEDAERIL KM EKGI PDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK                                         
Subjt:  ENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-----------------------------------------

Query:  ----------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVAT
                   + RI+ATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFD+MIKIGHRPDDRCTASM+AAYEKKNLLDKALNLLLQLEKDGF PG +T
Subjt:  ----------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVAT

Query:  YAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFT
        YAVL+DWLGKLQLVDEAEQ+LGKIGAQG+ LPFKVHISLCDMYSRAGIEKKALQALGVLEA+KEQLGHGDY RIINGLIAGGFVQDAKRVQGLMEAQGFT
Subjt:  YAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFT

Query:  AS
        AS
Subjt:  AS

XP_022961855.1 putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita moschata] | 6.2e-265 | 70.39
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSL  LLLR+T KNF  ING+LL +Q ++ IN TRTFIT+PSF LLDP Y   SS+PV+N EL KS+SIFSRC+H T+TK SD A+ PKLESADVE+DDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISE FPDY+K+TVDAMLLMIVEK+VSEMEKG+ EQ+L+AST + DWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI
        VQEMCRF+GEV  +G   +E                        E   +  NN SPS  EA SE KSE VSLPKRRGKIKYKIYGLDLSD KWS+VAD++
Subjt:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI

Query:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC
        HEAEEVLWPQEP    G                      A W        +DWI LLDKLNE NRFLYLKVAEL+LSEESFQT+IRDYSKL+D HAKEN 
Subjt:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC

Query:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------
        LEDAERIL KM EKGI PDILTA+VLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEK                                            
Subjt:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------

Query:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV
                + RI+ATMQFAGF PSLESCTLLVE YGQAGDPDQARNNFD+MIKIGHRPDDRCTASM+AAYEKKNLLDKALNLLLQLEKDGF PGVATYAV
Subjt:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV

Query:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        LVDWLGKLQLVDEAEQ+LGKIGAQGD LPFKVHISLCDMYSRAGIEKKALQALGVLEAKKE+LGHGD+ERIINGL+AGGFVQDAKR+QG+MEAQGFTAS
Subjt:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

XP_022996674.1 putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita maxima] | 4.8e-265 | 70.67
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSL  LLLR+T KNF  ING+LL  Q ++ INATRTFIT+PSF LLDP YG  SS+P++N EL KS+SIFSRC+H T+TK SD A+ PKLESADVE+DDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
         MNEFLSRFVWI+RGKISE FPDY+K+TVDAMLLMIVEKVVSEMEKG+ EQ+L+AST + DWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI
        VQEMCRF+GEV  +G   +E                        E   +  NN SPS  E  SE KSE VSLPKRRGKIKYKIYGLDLSD KWS+VAD++
Subjt:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI

Query:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC
        HEAEEVLWPQEP    G                      A W        +DWITLLDKLNE NRFLYLKVAEL+LSEESFQTNIRDYSKL+D HAKEN 
Subjt:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC

Query:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------
        LEDAERIL KM EKGI PDILTATVLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEK                                            
Subjt:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------

Query:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV
                + RI+ATMQFAGF PSLESCTLLVE YGQAGDPDQARNNFD+MIKIGHRPDDRCTASM+AAYEKKNLLDKALNLLLQLEKDGF PGVATYAV
Subjt:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV

Query:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        LVDWLGKLQLVDEAEQ+LGKIGAQGD LPFKVHISLCDMYSRAGIEKKALQAL VLEAKKE+LGHGD+ERIINGLIAGGFVQDAKR+QG+MEAQGFTAS
Subjt:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

XP_023002246.1 putative pentatricopeptide repeat-containing protein At2g02150 [Cucurbita maxima] | 1.0e-267 | 71.47
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        M L RLLLRQT K FS ING++LHRQ  +K N TRTF+  PSF LLDP++ RCSS+PV+NLELCKS+SIFSRC+HFT TK SDTAI PKLES+DVED DG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISEAFPDY+KQTVDAMLLMIVEKVVSEMEKG+ EQ+LR ST +PDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE-------ENGGEG-----------NNNASPSG-EEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEE
        VQEMCRF+GEV  +G   ++       E   E              NASPSG E A S EKSE VS+PKRRGKIKYKIYGLDLSD KWS+VAD+IHEAEE
Subjt:  VQEMCRFSGEVAQKGSPCQE-------ENGGEG-----------NNNASPSG-EEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEE

Query:  VLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAE
        V+WPQEP    G                      A W        IDWI LLDKLN+KNRFLYLKVAEL+L+EESF+TNIRDYSKL+D HAKEN LEDAE
Subjt:  VLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAE

Query:  RILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-------------------------------------------------
        RIL  MTEKGI PDILTATVLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEK                                                 
Subjt:  RILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-------------------------------------------------

Query:  --WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWL
           + RIAATMQF+G  PSLESCTLLVETYG AGDPDQARNNFD+MIKIGHRPDDRCTASM+AAY KKNLLDKALNLLLQLEKDGF PGV TYA LVDWL
Subjt:  --WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWL

Query:  GKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        GKLQLVDEAEQLLGKIGAQGD +P KVHISLCDMYSRAG+EKKALQALGVLEAKK++LGHG++ERIINGLIAGGFVQDAKRVQGLMEA+GFTAS
Subjt:  GKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

TrEMBL top hits | e value | % identity | Alignment
A0A5A7VHK4 Pentatricopeptide repeat-containing protein | 9.8e-248 | 67.38
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSLK LLLRQT KNFS ING+LL RQ S  INAT  FIT PSF LLD  +G  SS+  +N EL KS+SIFSRC+HFT+TK ++ AI  K ESA+VEDDDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISEAFPDY+KQTV+AMLLMIVEKVVSEMEKG+ EQTL++ST +PDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLS E
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKG------------SPCQEENGGEG------------NNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI
        VQEMCRF+GEV  +G               +E    EG                S SG EA SE KSE  SLPKRRGK+KYKIYGLDLSD+KWS+VAD+I
Subjt:  VQEMCRFSGEVAQKG------------SPCQEENGGEG------------NNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI

Query:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC
        HEA ++LWP EP    G                      A W        IDWITLLD+LNEKNRFLY KVAEL+L+EESFQTNIRDYSKL+D +AKE+ 
Subjt:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC

Query:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------
        LEDAERIL KM EKGI PDILTATVLVHMYSKVGNLDRAKEAF+TL+SHGFQPDEK                                            
Subjt:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------

Query:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV
                + RIAATMQFAG  P+LESCTLLVE YGQAGDPDQARNNFD+MIK+GH PDDRCTASMIAAYEKKNLLDKAL+LLLQLEKDGF PG+ TYAV
Subjt:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV

Query:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        LVDWLGKLQLVDEAEQ+LGKIGA+G + P KV ISLCDMYSRAGIEKKALQAL +LEAKKE+LGH D+ERIINGL+AGGF+QDAKR+ G+MEAQGFTAS
Subjt:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

A0A6J1CRK9 pentatricopeptide repeat-containing protein At5g39710 | 3.5e-282 | 74.79
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSLKRLLLRQT K FSGI+GSLLHRQ +I+INATRTF TTPSF LLDP YGR SS+  QNLELCKSS IFSRC+HFT+TK SDTAI PKLESAD EDDDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISEAFPDY+KQTVDAMLLMIVE+VVSEMEKGNI QTL AS  S DWDLSEDLWKTVSEVSNMVL+DMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE---------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVA
        VQEMCRF+GEV  +G   +E                           EN  EGNN  SPSG EA SEEKSEVVSLPKRRGKIKYKIYGLDLSD KW+KVA
Subjt:  VQEMCRFSGEVAQKGSPCQE---------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVA

Query:  DRIHEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAK
        D+IHEAEEVLWPQEP    G                      A W        IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKL+DAHAK
Subjt:  DRIHEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAK

Query:  ENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-----------------------------------------
        EN LEDAERIL KM EKGI PDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK                                         
Subjt:  ENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-----------------------------------------

Query:  ----------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVAT
                   + RI+ATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFD+MIKIGHRPDDRCTASM+AAYEKKNLLDKALNLLLQLEKDGF PG +T
Subjt:  ----------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVAT

Query:  YAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFT
        YAVL+DWLGKLQLVDEAEQ+LGKIGAQG+ LPFKVHISLCDMYSRAGIEKKALQALGVLEA+KEQLGHGDY RIINGLIAGGFVQDAKRVQGLMEAQGFT
Subjt:  YAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFT

Query:  AS
        AS
Subjt:  AS

A0A6J1HDE2 putative pentatricopeptide repeat-containing protein At2g02150 | 3.0e-265 | 70.39
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSL  LLLR+T KNF  ING+LL +Q ++ IN TRTFIT+PSF LLDP Y   SS+PV+N EL KS+SIFSRC+H T+TK SD A+ PKLESADVE+DDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISE FPDY+K+TVDAMLLMIVEK+VSEMEKG+ EQ+L+AST + DWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI
        VQEMCRF+GEV  +G   +E                        E   +  NN SPS  EA SE KSE VSLPKRRGKIKYKIYGLDLSD KWS+VAD++
Subjt:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI

Query:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC
        HEAEEVLWPQEP    G                      A W        +DWI LLDKLNE NRFLYLKVAEL+LSEESFQT+IRDYSKL+D HAKEN 
Subjt:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC

Query:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------
        LEDAERIL KM EKGI PDILTA+VLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEK                                            
Subjt:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------

Query:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV
                + RI+ATMQFAGF PSLESCTLLVE YGQAGDPDQARNNFD+MIKIGHRPDDRCTASM+AAYEKKNLLDKALNLLLQLEKDGF PGVATYAV
Subjt:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV

Query:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        LVDWLGKLQLVDEAEQ+LGKIGAQGD LPFKVHISLCDMYSRAGIEKKALQALGVLEAKKE+LGHGD+ERIINGL+AGGFVQDAKR+QG+MEAQGFTAS
Subjt:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

A0A6J1K9D9 putative pentatricopeptide repeat-containing protein At2g02150 | 2.3e-265 | 70.67
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        MSL  LLLR+T KNF  ING+LL  Q ++ INATRTFIT+PSF LLDP YG  SS+P++N EL KS+SIFSRC+H T+TK SD A+ PKLESADVE+DDG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
         MNEFLSRFVWI+RGKISE FPDY+K+TVDAMLLMIVEKVVSEMEKG+ EQ+L+AST + DWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI
        VQEMCRF+GEV  +G   +E                        E   +  NN SPS  E  SE KSE VSLPKRRGKIKYKIYGLDLSD KWS+VAD++
Subjt:  VQEMCRFSGEVAQKGSPCQE------------------------ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRI

Query:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC
        HEAEEVLWPQEP    G                      A W        +DWITLLDKLNE NRFLYLKVAEL+LSEESFQTNIRDYSKL+D HAKEN 
Subjt:  HEAEEVLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENC

Query:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------
        LEDAERIL KM EKGI PDILTATVLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEK                                            
Subjt:  LEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK--------------------------------------------

Query:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV
                + RI+ATMQFAGF PSLESCTLLVE YGQAGDPDQARNNFD+MIKIGHRPDDRCTASM+AAYEKKNLLDKALNLLLQLEKDGF PGVATYAV
Subjt:  -------WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAV

Query:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        LVDWLGKLQLVDEAEQ+LGKIGAQGD LPFKVHISLCDMYSRAGIEKKALQAL VLEAKKE+LGHGD+ERIINGLIAGGFVQDAKR+QG+MEAQGFTAS
Subjt:  LVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

A0A6J1KKS4 putative pentatricopeptide repeat-containing protein At2g02150 | 5.0e-268 | 71.47
Query:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG
        M L RLLLRQT K FS ING++LHRQ  +K N TRTF+  PSF LLDP++ RCSS+PV+NLELCKS+SIFSRC+HFT TK SDTAI PKLES+DVED DG
Subjt:  MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDG

Query:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
        SMNEFLSRFVWIMRGKISEAFPDY+KQTVDAMLLMIVEKVVSEMEKG+ EQ+LR ST +PDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE
Subjt:  SMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEE

Query:  VQEMCRFSGEVAQKGSPCQE-------ENGGEG-----------NNNASPSG-EEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEE
        VQEMCRF+GEV  +G   ++       E   E              NASPSG E A S EKSE VS+PKRRGKIKYKIYGLDLSD KWS+VAD+IHEAEE
Subjt:  VQEMCRFSGEVAQKGSPCQE-------ENGGEG-----------NNNASPSG-EEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEE

Query:  VLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAE
        V+WPQEP    G                      A W        IDWI LLDKLN+KNRFLYLKVAEL+L+EESF+TNIRDYSKL+D HAKEN LEDAE
Subjt:  VLWPQEPSQFLGN---------------------ANW--------IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAE

Query:  RILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-------------------------------------------------
        RIL  MTEKGI PDILTATVLVHMYSKVGNLDRAKEAF+TLRSHGFQPDEK                                                 
Subjt:  RILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK-------------------------------------------------

Query:  --WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWL
           + RIAATMQF+G  PSLESCTLLVETYG AGDPDQARNNFD+MIKIGHRPDDRCTASM+AAY KKNLLDKALNLLLQLEKDGF PGV TYA LVDWL
Subjt:  --WSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWL

Query:  GKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        GKLQLVDEAEQLLGKIGAQGD +P KVHISLCDMYSRAG+EKKALQALGVLEAKK++LGHG++ERIINGLIAGGFVQDAKRVQGLMEA+GFTAS
Subjt:  GKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

SwissProt top hits | e value | % identity | Alignment
A0A1D6IEG9 Pentatricopeptide repeat-containing protein CRP1, chloroplastic | 2.7e-21 | 23.3
Query:  RDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPD------------EKWSWR----IAATMQFAG
        R Y+ L+  + +   L++AE++L++M++ G+ PD  T ++LV  Y++ G  + A+   + + + G +P             ++  W+    +   MQ +G
Subjt:  RDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPD------------EKWSWR----IAATMQFAG

Query:  FPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLLGK
          P      ++++T+G+      A + F+ M + G  PD     ++I A+ K    D+A  L  ++ +    PG  TY ++++ LG+ +  +  E +L +
Subjt:  FPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLLGK

Query:  IGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS
        +  QG       + +L D+Y R+G  K+A+  +  ++A   +     Y  ++N     G    A  V   M+A G   S
Subjt:  IGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTAS

O04647 Pentatricopeptide repeat-containing protein At5g27270 | 9.7e-19 | 23.36
Query:  KLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDE---
        KL E  R LYL   E   S+   ++ IR    +IDA+ +   LEDA  +  +  EKG  P  +T ++LV+  +  G    A+    T      + D    
Subjt:  KLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDE---

Query:  -------------KWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHP
                     + +  I   M  +G P S+++   ++  YG+    D+A   F +  + G   D++   +MI  Y K   + +AL+L  +++K G  P
Subjt:  -------------KWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHP

Query:  GVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEA
        G  +Y ++V      +L  E ++LL  +   G       +++L  +Y+ +    +A + + +++ K   L H  +  +++ L+  G +++A+R    M  
Subjt:  GVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEA

Query:  QGFT
         G +
Subjt:  QGFT

Q8LEZ4 Protein NUCLEAR FUSION DEFECTIVE 5, mitochondrial | 3.0e-52 | 40.68
Query:  PVQNLELCKSSSIFSRCVHFTITK--SSDTAIAPKLESADVEDDDGSMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLR
        P QN+E+ +  S F+R  HFT     S  +A        + +D+DG+ NEFLSRFVWIMRGK+SEA+PD +K+ +D MLL+IVEKVV E+E+G   + + 
Subjt:  PVQNLELCKSSSIFSRCVHFTITK--SSDTAIAPKLESADVEDDDGSMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLR

Query:  ASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEEVQEMCRFSGEVAQKGSPCQE-------------------ENGGEGNNNASPS--
        ++  SP  + S+DLW T+ EVSN VL DM+K  KKEKMK ++ S EV EMCRF+GE+  +G   +E                   E   + +N+   S  
Subjt:  ASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEEVQEMCRFSGEVAQKGSPCQE-------------------ENGGEGNNNASPS--

Query:  --------GEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEEVLWPQEPSQFLGN---------------------ANW-------
                G   + E +S  +SLPKR+GK+KYKIYGL+LSD KW ++AD+IHEAEE    +EP    G                      A W       
Subjt:  --------GEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEEVLWPQEPSQFLGN---------------------ANW-------

Query:  -IDWITLLDKLNEKNRFLYLKV
         +DWI L+++L E N   YLKV
Subjt:  -IDWITLLDKLNEKNRFLYLKV

Q940Z1 Pentatricopeptide repeat-containing protein At1g19525 | 1.2e-64 | 45.58
Query:  MTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK---------------------------------------------------WSW
        M++ GI PDILTAT LVHMYSK GN +RA EAFE L+S+G +PDEK                                                    + 
Subjt:  MTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEK---------------------------------------------------WSW

Query:  RIAATMQFAGFPP-SLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQ
         I+++MQ+A   P S E+ +L VE YG+AG  D+A++NFD M K+GH+PDD+C A+++ AY+ +N LDKAL LLLQLEKDG   GV TY VLVDW+  L 
Subjt:  RIAATMQFAGFPP-SLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQ

Query:  LVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTASNRFR
        L++EAEQLL KI   G+  PF++ +SLC MYS    EKK LQALGVLEAK++Q+G  +++++I+ L  GGF +DA+R+   MEA+ F  S R +
Subjt:  LVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTASNRFR

Q9LPC4 Pentatricopeptide repeat-containing protein At1g01970 | 1.3e-31 | 32.21
Query:  DWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFE-------
        DW+++L +L   +   Y+KVAE  L ++SF+ N RDY+K+I  + K N +EDAER L  M  +G   D +T T +V +YSK G    A+E F        
Subjt:  DWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFE-------

Query:  --------------------------------------------TLRSHGFQPDEKWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHM
                                                     LR +    D + + R+   +Q AG  P ++ C LL+  Y  +G    AR  F++M
Subjt:  --------------------------------------------TLRSHGFQPDEKWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHM

Query:  IKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLL
         K G +  D+C A ++AAYEK+  L++AL  L++LEKD    G    AVL  W  KL +V+E E LL
Subjt:  IKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLL

Arabidopsis top hits | e value | % identity | Alignment
AT1G01970.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 9.3e-33 | 32.21
Query:  DWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFE-------
        DW+++L +L   +   Y+KVAE  L ++SF+ N RDY+K+I  + K N +EDAER L  M  +G   D +T T +V +YSK G    A+E F        
Subjt:  DWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFE-------

Query:  --------------------------------------------TLRSHGFQPDEKWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHM
                                                     LR +    D + + R+   +Q AG  P ++ C LL+  Y  +G    AR  F++M
Subjt:  --------------------------------------------TLRSHGFQPDEKWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHM

Query:  IKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLL
         K G +  D+C A ++AAYEK+  L++AL  L++LEKD    G    AVL  W  KL +V+E E LL
Subjt:  IKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLL

AT1G03560.1 Pentatricopeptide repeat (PPR-like) superfamily protein | 1.9e-17 | 26.67
Query:  NIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEKWSWRIAATMQFAGFPPSLESCTLLVET
        N+  Y+ LID +AK   +EDA R+L++M ++G  PD++T +V+V+   K G ++ A + F T R  G          +A    F          + L++ 
Subjt:  NIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEKWSWRIAATMQFAGFPPSLESCTLLVET

Query:  YGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQL-EKDGFHPGVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVH
         G+AG  D+A   F+ M + G   D  C  ++I A+ K   +D+A+ L  ++ E++G    V TY +L+  + K    +EA +L   +  +G T      
Subjt:  YGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQL-EKDGFHPGVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVH

Query:  ISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTASNRFR
         +L      +G   +A + L  L A    +     E +IN L   G +++A ++   +  +G     R R
Subjt:  ISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTASNRFR

AT1G19520.1 pentatricopeptide (PPR) repeat-containing protein    6.0e-141    44.43
Query:  PVQNLELCKSSSIFSRCVHFTITK--SSDTAIAPKLESADVEDDDGSMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLR
        P QN+E+ +  S F+R  HFT     S  +A        + +D+DG+ NEFLSRFVWIMRGK+SEA+PD +K+ +D MLL+IVEKVV E+E+G   + + 
Subjt:  PVQNLELCKSSSIFSRCVHFTITK--SSDTAIAPKLESADVEDDDGSMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLR

Query:  ASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEEVQEMCRFSGEVAQKGSPCQE-------------------ENGGEGNNNASPS--
        ++  SP  + S+DLW T+ EVSN VL DM+K  KKEKMK ++ S EV EMCRF+GE+  +G   +E                   E   + +N+   S  
Subjt:  ASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEEVQEMCRFSGEVAQKGSPCQE-------------------ENGGEGNNNASPS--

Query:  --------GEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEEVLWPQEPSQFLGN---------------------ANW-------
                G   + E +S  +SLPKR+GK+KYKIYGL+LSD KW ++AD+IHEAEE    +EP    G                      A W       
Subjt:  --------GEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEEVLWPQEPSQFLGN---------------------ANW-------

Query:  -IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSH
         +DWI L+++L E N   YLKVAE VL E+SF  +I DYSKLI  HAKEN +ED ERIL KM++ GI PDILTAT LVHMYSK GN +RA EAFE L+S+
Subjt:  -IDWITLLDKLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSH

Query:  GFQPDEK---------------------------------------------------WSWRIAATMQFAGFPP-SLESCTLLVETYGQAGDPDQARNNF
        G +PDEK                                                    +  I+++MQ+A   P S E+ +L VE YG+AG  D+A++NF
Subjt:  GFQPDEK---------------------------------------------------WSWRIAATMQFAGFPP-SLESCTLLVETYGQAGDPDQARNNF

Query:  DHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKK
        D M K+GH+PDD+C A+++ AY+ +N LDKAL LLLQLEKDG   GV TY VLVDW+  L L++EAEQLL KI   G+  PF++ +SLC MYS    EKK
Subjt:  DHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKK

Query:  ALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTASNRFR
         LQALGVLEAK++Q+G  +++++I+ L  GGF +DA+R+   MEA+ F  S R +
Subjt:  ALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTASNRFR

AT1G19520.2 pentatricopeptide (PPR) repeat-containing protein    2.1e-53    40.68
Query:  PVQNLELCKSSSIFSRCVHFTITK--SSDTAIAPKLESADVEDDDGSMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLR
        P QN+E+ +  S F+R  HFT     S  +A        + +D+DG+ NEFLSRFVWIMRGK+SEA+PD +K+ +D MLL+IVEKVV E+E+G   + + 
Subjt:  PVQNLELCKSSSIFSRCVHFTITK--SSDTAIAPKLESADVEDDDGSMNEFLSRFVWIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLR

Query:  ASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEEVQEMCRFSGEVAQKGSPCQE-------------------ENGGEGNNNASPS--
        ++  SP  + S+DLW T+ EVSN VL DM+K  KKEKMK ++ S EV EMCRF+GE+  +G   +E                   E   + +N+   S  
Subjt:  ASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEEVQEMCRFSGEVAQKGSPCQE-------------------ENGGEGNNNASPS--

Query:  --------GEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEEVLWPQEPSQFLGN---------------------ANW-------
                G   + E +S  +SLPKR+GK+KYKIYGL+LSD KW ++AD+IHEAEE    +EP    G                      A W       
Subjt:  --------GEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEEVLWPQEPSQFLGN---------------------ANW-------

Query:  -IDWITLLDKLNEKNRFLYLKV
         +DWI L+++L E N   YLKV
Subjt:  -IDWITLLDKLNEKNRFLYLKV

AT5G27270.1 Tetratricopeptide repeat (TPR)-like superfamily protein    6.9e-20    23.36
Query:  KLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDE---
        KL E  R LYL   E   S+   ++ IR    +IDA+ +   LEDA  +  +  EKG  P  +T ++LV+  +  G    A+    T      + D    
Subjt:  KLNEKNRFLYLKVAELVLSEESFQTNIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDE---

Query:  -------------KWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHP
                     + +  I   M  +G P S+++   ++  YG+    D+A   F +  + G   D++   +MI  Y K   + +AL+L  +++K G  P
Subjt:  -------------KWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQARNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHP

Query:  GVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEA
        G  +Y ++V      +L  E ++LL  +   G       +++L  +Y+ +    +A + + +++ K   L H  +  +++ L+  G +++A+R    M  
Subjt:  GVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALGVLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEA

Query:  QGFT
         G +
Subjt:  QGFT


Sequences
CDS sequence
ATGAGCCTAAAGCGTTTACTCCTTCGTCAAACATTCAAGAATTTTTCGGGAATCAATGGCAGCTTACTACATCGCCAACCATCTATCAAGATCAATGCCACCCGCACATT
CATCACGACACCTTCATTTCCTTTGCTTGATCCACAGTACGGTCGTTGTTCTTCACTCCCTGTCCAGAATCTTGAGCTCTGCAAATCGAGTTCAATTTTCAGTAGGTGCG
TTCACTTCACTATAACTAAGTCGAGCGATACAGCAATTGCGCCGAAACTCGAGTCTGCGGATGTTGAGGATGATGATGGATCAATGAACGAGTTCTTATCCAGATTTGTC
TGGATAATGCGTGGGAAGATCTCTGAAGCTTTTCCGGACTATGAAAAGCAAACAGTTGATGCAATGCTTTTGATGATTGTGGAGAAAGTAGTCTCTGAAATGGAAAAGGG
GAACATTGAGCAGACGTTAAGAGCTTCAACGGGCAGTCCGGATTGGGACCTAAGCGAGGATTTGTGGAAGACAGTAAGCGAAGTTAGCAACATGGTTTTGGATGATATGA
AGAAGGCTACAAAGAAGGAGAAAATGAAGGGTTTTCTGCTATCTGAGGAAGTTCAGGAAATGTGTAGGTTTTCTGGGGAAGTTGCTCAAAAAGGAAGCCCGTGCCAGGAA
GAAAATGGTGGAGAAGGTAATAACAACGCTTCTCCGAGTGGTGAAGAGGCTACATCCGAGGAGAAATCCGAGGTTGTTTCTCTTCCCAAGAGGCGAGGGAAGATAAAGTA
CAAGATTTATGGTCTTGATTTATCTGATTCCAAGTGGAGTAAAGTAGCAGACAGAATCCACGAGGCAGAGGAGGTGCTATGGCCGCAGGAACCAAGCCAATTTCTGGGAA
ATGCAAACTGGATTGACTGGATTACCTTACTTGATAAATTGAATGAGAAGAATAGATTCTTATACTTAAAGGTAGCAGAACTAGTTTTGAGTGAAGAGTCTTTCCAGACC
AACATCCGTGACTACTCTAAGCTTATTGATGCCCATGCTAAAGAGAACTGCCTAGAGGATGCTGAGAGGATCCTTAATAAGATGACTGAGAAGGGCATTCCACCAGACAT
TTTGACAGCCACAGTTTTAGTTCATATGTATAGCAAGGTAGGCAATCTTGATCGTGCAAAGGAAGCATTTGAGACTTTGAGGAGTCACGGCTTTCAACCAGATGAGAAGT
GGAGCTGGCGAATTGCTGCTACTATGCAGTTTGCTGGCTTCCCGCCAAGTTTGGAGTCTTGTACATTGCTCGTTGAGACATATGGGCAAGCTGGCGATCCTGATCAGGCA
AGGAACAATTTTGACCACATGATAAAAATTGGGCACAGGCCTGATGACAGGTGCACTGCAAGTATGATTGCAGCCTATGAAAAGAAGAATCTGTTGGACAAGGCTTTGAA
TCTTTTACTACAGCTTGAAAAGGATGGGTTTCACCCTGGGGTTGCAACTTATGCTGTTCTTGTAGATTGGTTGGGTAAGTTGCAGCTGGTTGATGAAGCTGAGCAGCTAT
TAGGCAAGATTGGTGCGCAGGGAGATACCCTACCTTTTAAGGTTCATATTAGCCTATGTGATATGTACTCAAGAGCTGGGATAGAGAAAAAGGCGCTGCAAGCCTTGGGG
GTATTGGAAGCTAAAAAGGAGCAGTTGGGACATGGTGATTATGAGAGGATCATAAATGGGCTTATAGCAGGTGGTTTTGTGCAGGATGCTAAAAGAGTGCAGGGTCTTAT
GGAGGCCCAGGGTTTTACTGCATCCAACCGCTTCAGATGGCTCTGA
mRNA sequence
ATGAGCCTAAAGCGTTTACTCCTTCGTCAAACATTCAAGAATTTTTCGGGAATCAATGGCAGCTTACTACATCGCCAACCATCTATCAAGATCAATGCCACCCGCACATT
CATCACGACACCTTCATTTCCTTTGCTTGATCCACAGTACGGTCGTTGTTCTTCACTCCCTGTCCAGAATCTTGAGCTCTGCAAATCGAGTTCAATTTTCAGTAGGTGCG
TTCACTTCACTATAACTAAGTCGAGCGATACAGCAATTGCGCCGAAACTCGAGTCTGCGGATGTTGAGGATGATGATGGATCAATGAACGAGTTCTTATCCAGATTTGTC
TGGATAATGCGTGGGAAGATCTCTGAAGCTTTTCCGGACTATGAAAAGCAAACAGTTGATGCAATGCTTTTGATGATTGTGGAGAAAGTAGTCTCTGAAATGGAAAAGGG
GAACATTGAGCAGACGTTAAGAGCTTCAACGGGCAGTCCGGATTGGGACCTAAGCGAGGATTTGTGGAAGACAGTAAGCGAAGTTAGCAACATGGTTTTGGATGATATGA
AGAAGGCTACAAAGAAGGAGAAAATGAAGGGTTTTCTGCTATCTGAGGAAGTTCAGGAAATGTGTAGGTTTTCTGGGGAAGTTGCTCAAAAAGGAAGCCCGTGCCAGGAA
GAAAATGGTGGAGAAGGTAATAACAACGCTTCTCCGAGTGGTGAAGAGGCTACATCCGAGGAGAAATCCGAGGTTGTTTCTCTTCCCAAGAGGCGAGGGAAGATAAAGTA
CAAGATTTATGGTCTTGATTTATCTGATTCCAAGTGGAGTAAAGTAGCAGACAGAATCCACGAGGCAGAGGAGGTGCTATGGCCGCAGGAACCAAGCCAATTTCTGGGAA
ATGCAAACTGGATTGACTGGATTACCTTACTTGATAAATTGAATGAGAAGAATAGATTCTTATACTTAAAGGTAGCAGAACTAGTTTTGAGTGAAGAGTCTTTCCAGACC
AACATCCGTGACTACTCTAAGCTTATTGATGCCCATGCTAAAGAGAACTGCCTAGAGGATGCTGAGAGGATCCTTAATAAGATGACTGAGAAGGGCATTCCACCAGACAT
TTTGACAGCCACAGTTTTAGTTCATATGTATAGCAAGGTAGGCAATCTTGATCGTGCAAAGGAAGCATTTGAGACTTTGAGGAGTCACGGCTTTCAACCAGATGAGAAGT
GGAGCTGGCGAATTGCTGCTACTATGCAGTTTGCTGGCTTCCCGCCAAGTTTGGAGTCTTGTACATTGCTCGTTGAGACATATGGGCAAGCTGGCGATCCTGATCAGGCA
AGGAACAATTTTGACCACATGATAAAAATTGGGCACAGGCCTGATGACAGGTGCACTGCAAGTATGATTGCAGCCTATGAAAAGAAGAATCTGTTGGACAAGGCTTTGAA
TCTTTTACTACAGCTTGAAAAGGATGGGTTTCACCCTGGGGTTGCAACTTATGCTGTTCTTGTAGATTGGTTGGGTAAGTTGCAGCTGGTTGATGAAGCTGAGCAGCTAT
TAGGCAAGATTGGTGCGCAGGGAGATACCCTACCTTTTAAGGTTCATATTAGCCTATGTGATATGTACTCAAGAGCTGGGATAGAGAAAAAGGCGCTGCAAGCCTTGGGG
GTATTGGAAGCTAAAAAGGAGCAGTTGGGACATGGTGATTATGAGAGGATCATAAATGGGCTTATAGCAGGTGGTTTTGTGCAGGATGCTAAAAGAGTGCAGGGTCTTAT
GGAGGCCCAGGGTTTTACTGCATCCAACCGCTTCAGATGGCTCTGA
Protein sequence
MSLKRLLLRQTFKNFSGINGSLLHRQPSIKINATRTFITTPSFPLLDPQYGRCSSLPVQNLELCKSSSIFSRCVHFTITKSSDTAIAPKLESADVEDDDGSMNEFLSRFV
WIMRGKISEAFPDYEKQTVDAMLLMIVEKVVSEMEKGNIEQTLRASTGSPDWDLSEDLWKTVSEVSNMVLDDMKKATKKEKMKGFLLSEEVQEMCRFSGEVAQKGSPCQE
ENGGEGNNNASPSGEEATSEEKSEVVSLPKRRGKIKYKIYGLDLSDSKWSKVADRIHEAEEVLWPQEPSQFLGNANWIDWITLLDKLNEKNRFLYLKVAELVLSEESFQT
NIRDYSKLIDAHAKENCLEDAERILNKMTEKGIPPDILTATVLVHMYSKVGNLDRAKEAFETLRSHGFQPDEKWSWRIAATMQFAGFPPSLESCTLLVETYGQAGDPDQA
RNNFDHMIKIGHRPDDRCTASMIAAYEKKNLLDKALNLLLQLEKDGFHPGVATYAVLVDWLGKLQLVDEAEQLLGKIGAQGDTLPFKVHISLCDMYSRAGIEKKALQALG
VLEAKKEQLGHGDYERIINGLIAGGFVQDAKRVQGLMEAQGFTASNRFRWL