CuGenDBv2

Sgr013083 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr013083
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00153666:260209..262091
RNA-Seq Expression: Sgr013083
Synteny: Sgr013083
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat; IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
XP_022155480.1 pentatricopeptide repeat-containing protein At2g01390 isoform X1 [Momordica charantia] | 9.0e-301 | 87.41
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        MHYSNSF  LLSNYVV+SAI KKIY NI IKA+HS  QYKQ+KPIKLF RKL+KG KVVEKEE+DPKLYTRDTVRNIYNILRN+SWSSAQEHLERLP+RW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQV+KTHPPLEKAWLFFNWA RL  FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKG+KIDAVTYTSLMHWRS SGDVDGAIKVWKEMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILI KCC+SGEMLV+T ILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKE R VLRYPVFVEAH+TLK CSVS++LLRQVNPHIETESV  DEV+HV TSS +I SNVDHEL+ ILLKKE LIA+DY+LTG++DKNIQLDSAIISTI
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCKHNRP GALL FD+CLK+GVNM RNLYLGLIG+L RSSIYSKLLEIV EMYR GHCLGLYHATLILYRLGKAGKPQYA KIFN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS
        TALVGAYFSAGS GKGLKIYETMRKKGF+P LGTYNVLLTGL KSGRV ELEIYRREKKSFEI Y  HH+ ILEE+ ICDLL+GEM+S
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS

XP_022928072.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita moschata] | 4.2e-274 | 81.46
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        M  SN F FL+SNYVV SAI K+IYQNI  K +HS HQYKQ+KP   F RKL+KG K V+KEE++   YTRDTVRNIYNILRN SW+SAQ H+E LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEKAWLFFNWASRL  F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKG+KIDAVTYTSLMHWRSNSGDVDGAI+VW+EMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLDN +V++ATD YKEMLQSGLSPNCCTYTVLMEYLIG  K KEALDIFHKMQDAG YPDKAACNILIQKCCKSGEMLV+TQILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKEKRLVLRYPVFVEAHE LK CSVS +LL QVNPHIE ESV   EV+ V+TS NVI  +VD+ELVA LLK+E LIA+D+IL G+ DKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCK NRP+GALLAFDYCLKNGV + RNLYL LIG+L RSSIYS LLEIV EMY  GHCLGLYHATLILYRLGKAGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS
        TALV AYFSAGS+GKGLKIYETMRKKGFTP LGTYNVLL+GL KS RV EL+IYRREKK FEIS+  HH TILEEE ICDLLFGE+VS
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS

XP_022971714.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita maxima] | 7.7e-276 | 81.97
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        M  SNSF FL+SNYVV SAI K+IYQNI  K +HS HQYKQ+KP   F RKL+KG K V+KEE++   YTRDTVRNIYNILRN SW SAQ H+E LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEKAWLFFNWASRL  FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKG+KIDAVTYTSLMHWRSNSGDVDGAI+VW+EMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLDN +V++ATDTYKEMLQSGLSPNCCTYTVLMEYLIG  K KEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLV+TQILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKEKRLVLRYPVFVEAHE LK CSVS +LL QVNPHIE ESV   EV+ V+TS NVI  +VD+ELVA LLK+E LIA+D+IL G+ DKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCK NRP+GALLAFDYCLKNGV + RNLYL LIG+L RSSIYS LLEIV +MY  GHCLGLYHATLILYRLGKAGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS
        TALV AYFSAGS+GKGLKIYETMRKKGFTP LGTYNVLL+GL KS RV EL+IYRREKK FEIS+  HH TILEEE ICDLLFGE+VS
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS

XP_023512200.1 pentatricopeptide repeat-containing protein At2g01390 [Cucurbita pepo subsp. pepo] | 3.8e-275 | 81.29
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        M  SN F F +SNYVV SAI K++YQNI  K +HS HQYKQ+KP   F RKL+KG K V+KEE+DP  YTRDTVRNIYNILRN SW  AQ H+E LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEKAWLFFNWASRL  F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKG+KIDAVTYTSLMHWRSNSGDVDGAI+VW+EMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLDN +V++ATD YKEMLQSGLSPNCCTYTVLMEYLIG  K KEALDIFHKMQDAG YPDKAACNILIQKCCKSGEMLV+TQILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKEKRLVLRYPVFVEAHE LK CSVS +LL QVNPHIE ESV   EV+ V+TS NVI  +VD+ELVA LLK+E LIA+D+IL G+ DKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCK NRP+GALLAFDYCLKNGV + RNLYLGLIG+L RSSIYSKLLE+V EMY  GHCLGLYHATL LYRLGKAGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS
        TALV AYFSAGS+GKGLKIYETMRKKGFTP LGTYNVLL+GL KS RV EL+IYRREKK FEIS+  HH TILEEE ICDLLFGE VS
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS

XP_038901985.1 pentatricopeptide repeat-containing protein At2g01390 [Benincasa hispida] | 3.2e-282 | 82.45
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        MH SNSF FLLSNYVV SAIGK+IYQNI  K +HSFHQYKQ+KPIK F RK +KG KVV+KEE+D + YTRDTVRNIYNILR  SW SAQEHLE LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEK WLFFNWASRL +FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEK +KIDAVTYTSLMHWRSNSGDV+GAIKVWKEMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLD+DQ+KEATDTYKEMLQSGL PNCCTYT+LMEYLIG GKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGE LV+TQILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MK+KRLVLRYPVFVEAHETLK CSVS +LLRQVNPHIE ESV   EV++V+T SN++  NVDHEL+AILLK+  L AIDY+LTG++D+NIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
         EVNCK NRP+GALLAF+YCLK+GVN+ R LYL LIGIL RSSIY KLLEIV +MY  GHCLGLYHATLILYRLGKAGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISYHHN--TILEEETICDLLFGEMV
        TALV AYFSAGS GKGLKIYETMRKKGF P LGTYNVLL GLAK GR+DEL IYR+E+KSFEIS+H +  TILEEE ICDLL+GE+V
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISYHHN--TILEEETICDLLFGEMV

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LJM3 Uncharacterized protein | 1.5e-269 | 78.91
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        MH+ N F  LLSNYVV SAI K+IYQNI  K +HS HQYK+ KPI  F R+ +KG KV +KEE+ P+LYTRDTVRNI NILRN SW+SAQ+HLE LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEK WLFFNWAS L +FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKG+KIDAVTYTSLMHWRSNSGDVDGAIK+WKEMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        C+PTVVSYTAYIKILLDN Q+ EAT TYK+MLQSGLSPNCCTYT+LMEYLIG GKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGE LV+TQILE+
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKE R VLRYPVFVEAHETLK CSVS +LL+QVNPH+E ES+   EV+ V+T SN +  NVD+EL+A+LLK   L A+D++L G++DKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCK NRP+ ALLAFDYCLKN VN+ R LYL LIGIL RSSIY KLLEIV EMY  GHCLGLYHATLIL  LGKAGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISYHH--NTILEEETICDLLFGEMVS
        TALV  YFSAGS GKGLKI+ETMRKKGFTP LGTYNVLL GLAK+GR  EL IYRREKKSFEIS+H   NTIL++E ICDLLFGE+VS
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISYHH--NTILEEETICDLLFGEMVS

A0A1S4E1N2 pentatricopeptide repeat-containing protein At2g01390-like | 7.2e-272 | 79.08
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        MH+ N F  LLSNYVV+SAI K+IYQNI  K +HS HQYK++KPI  F R  +KG KVV+KEE+ P++YTRDTV NI NILRN SW+SAQ+HLE LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEK WLFFNWASRL +FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKG+KIDA TYTSLMHWRSNSGDVDGAIKVWKEMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        C+PTVVSYTAYIKILLDN Q KEAT TYKEML++GLSPNCCTYT+LMEYLIG GKCKEALDIF KMQDAGVYPDKAACNILIQKCCKSGE LV+TQILE+
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKE R VLRYPVFVEAHE LK CSV  +LLRQVNPHIE ES+   EV+ V+T SN +  NVD+EL+A+LLK   L AID++L G++DKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCK NRP+ A+LAFDYCLKNGVN+GR LYL LIGIL RSSIY KLLEIV EMY  GHC+GLYHATLILY LG+AGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISYHH--NTILEEETICDLLFGEMVS
        T+LV AYFSAGS GKGLKI+ETMRKKGFTP LGTYNVLL GLAKSGR  EL IYRREKKSFEIS+H   NTIL++E ICDLLFGE+VS
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISYHH--NTILEEETICDLLFGEMVS

A0A6J1DMJ2 pentatricopeptide repeat-containing protein At2g01390 isoform X1 | 4.3e-301 | 87.41
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        MHYSNSF  LLSNYVV+SAI KKIY NI IKA+HS  QYKQ+KPIKLF RKL+KG KVVEKEE+DPKLYTRDTVRNIYNILRN+SWSSAQEHLERLP+RW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQV+KTHPPLEKAWLFFNWA RL  FKHDQYTYTTMLDIFGEAGRISSMNY+FQQMKEKG+KIDAVTYTSLMHWRS SGDVDGAIKVWKEMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILI KCC+SGEMLV+T ILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKE R VLRYPVFVEAH+TLK CSVS++LLRQVNPHIETESV  DEV+HV TSS +I SNVDHEL+ ILLKKE LIA+DY+LTG++DKNIQLDSAIISTI
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCKHNRP GALL FD+CLK+GVNM RNLYLGLIG+L RSSIYSKLLEIV EMYR GHCLGLYHATLILYRLGKAGKPQYA KIFN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS
        TALVGAYFSAGS GKGLKIYETMRKKGF+P LGTYNVLLTGL KSGRV ELEIYRREKKSFEI Y  HH+ ILEE+ ICDLL+GEM+S
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS

A0A6J1EIW0 pentatricopeptide repeat-containing protein At2g01390 | 2.0e-274 | 81.46
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        M  SN F FL+SNYVV SAI K+IYQNI  K +HS HQYKQ+KP   F RKL+KG K V+KEE++   YTRDTVRNIYNILRN SW+SAQ H+E LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEKAWLFFNWASRL  F+HD YTYTTMLDIFGEAGRISSMNYVFQQMKEKG+KIDAVTYTSLMHWRSNSGDVDGAI+VW+EMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLDN +V++ATD YKEMLQSGLSPNCCTYTVLMEYLIG  K KEALDIFHKMQDAG YPDKAACNILIQKCCKSGEMLV+TQILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKEKRLVLRYPVFVEAHE LK CSVS +LL QVNPHIE ESV   EV+ V+TS NVI  +VD+ELVA LLK+E LIA+D+IL G+ DKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCK NRP+GALLAFDYCLKNGV + RNLYL LIG+L RSSIYS LLEIV EMY  GHCLGLYHATLILYRLGKAGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS
        TALV AYFSAGS+GKGLKIYETMRKKGFTP LGTYNVLL+GL KS RV EL+IYRREKK FEIS+  HH TILEEE ICDLLFGE+VS
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS

A0A6J1I9C5 pentatricopeptide repeat-containing protein At2g01390 | 3.7e-276 | 81.97
Query:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW
        M  SNSF FL+SNYVV SAI K+IYQNI  K +HS HQYKQ+KP   F RKL+KG K V+KEE++   YTRDTVRNIYNILRN SW SAQ H+E LPIRW
Subjt:  MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRW

Query:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG
        DSYLINQVLKTHPPLEKAWLFFNWASRL  FKHD YTYTTMLDIFGEAGRISSMNYVFQQMKEKG+KIDAVTYTSLMHWRSNSGDVDGAI+VW+EMK+NG
Subjt:  DSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNG

Query:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY
        CYPTVVSYTAYIKILLDN +V++ATDTYKEMLQSGLSPNCCTYTVLMEYLIG  K KEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLV+TQILEY
Subjt:  CYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEY

Query:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI
        MKEKRLVLRYPVFVEAHE LK CSVS +LL QVNPHIE ESV   EV+ V+TS NVI  +VD+ELVA LLK+E LIA+D+IL G+ DKNIQLDS+II +I
Subjt:  MKEKRLVLRYPVFVEAHETLKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTI

Query:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY
        IEVNCK NRP+GALLAFDYCLKNGV + RNLYL LIG+L RSSIYS LLEIV +MY  GHCLGLYHATLILYRLGKAGKPQYA+K+FN+LPEELKCTA Y
Subjt:  IEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAY

Query:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS
        TALV AYFSAGS+GKGLKIYETMRKKGFTP LGTYNVLL+GL KS RV EL+IYRREKK FEIS+  HH TILEEE ICDLLFGE+VS
Subjt:  TALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFEISY--HHNTILEEETICDLLFGEMVS

SwissProt top hits | e value | % identity | Alignment
Q76C99 Protein Rf1, mitochondrial | 5.0e-28 | 23.49
Query:  GMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFH
        G   D V+YT++++     GD D A   + EM   G  P VV+Y + I  L     + +A +    M+++G+ P+C TY  ++     +G+ KEA+    
Subjt:  GMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFH

Query:  KMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRL---VLRYPVFVEAHET----LKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVI
        KM+  GV PD    ++L+   CK+G  +   +I + M ++ L   +  Y   ++ + T    +++  + D ++R                       N I
Subjt:  KMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRL---VLRYPVFVEAHET----LKICSVSDSLLRQVNPHIETESVCHDEVMHVNTSSNVI

Query:  HSNVDHELVAILL----KKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVP
        H   DH + +IL+    K+  +     + + +  + +  ++     +I + CK  R   A+L F+  +  G++ G  +Y  LI  L   + + +  E++ 
Subjt:  HSNVDHELVAILL----KKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILTRSSIYSKLLEIVP

Query:  EMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPE--ELKCTAAYTALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDEL
        EM   G CL       I+    K G+   ++K+F L+           Y  L+  Y  AG   + +K+   M   G  P   TY+ L+ G  K  R+++ 
Subjt:  EMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPE--ELKCTAAYTALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDEL

Query:  EIYRREKKSFEIS
         +  +E +S  +S
Subjt:  EIYRREKKSFEIS

Q8GYP6 Pentatricopeptide repeat-containing protein At1g18900 | 8.5e-28 | 34.26
Query:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT
        V N+ ++LR + W  +A+E L+ L +R D+Y  NQVLK       A  FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  AG    A  +F +M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

Q9LYZ9 Pentatricopeptide repeat-containing protein At5g02860 | 3.6e-26 | 21.33
Query:  KHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEM
        K D +TYTT+L  F  AG++ S   +F++M+  G K +  T+ + +    N G     +K++ E+   G  P +V++   + +   N    E +  +KEM
Subjt:  KHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEM

Query:  LQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRL---VLRYPVFVEAHETLKICSVSDS
         ++G  P   T+  L+      G  ++A+ ++ +M DAGV PD +  N ++    + G      ++L  M++ R     L Y   + A+   K   +  S
Subjt:  LQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRL---VLRYPVFVEAHETLKICSVSDS

Query:  LLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMG
        L  +V        V     + + T            LV +  K + L   +   + L ++    D   +++++ +  +    + A    DY  + G    
Subjt:  LLRQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMG

Query:  RNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEE--LKCTAAYTALVGAYFSAGSYGKGLKIYETMRKK
           Y  L+ + +RS+ + K  EI+ E+   G    +     ++Y   +  + + A +IF+ +     +     Y   +G+Y +   + + + +   M K 
Subjt:  RNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEE--LKCTAAYTALVGAYFSAGSYGKGLKIYETMRKK

Query:  GFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFE
        G  P   TYN ++ G  K  R DE +++  + ++ +
Subjt:  GFTPCLGTYNVLLTGLAKSGRVDELEIYRREKKSFE

Q9SSF9 Pentatricopeptide repeat-containing protein At1g74750 | 7.7e-29 | 32.3
Query:  CIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRD--TVRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWA
        C  +VHS         ++ FG+  ++ +KV  +    P+ +      V N+ +ILR + W  +A+E L     R D+Y  NQVLK       A  FF W 
Subjt:  CIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRD--TVRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWA

Query:  SRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEAT
         R   FKHD +TYTTM+   G A +   +N +  +M   G K + VTY  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I      +  A 
Subjt:  SRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEAT

Query:  DTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILI
        D Y+ M ++GLSP+  TY+V++  L  AG    A  +F +M   G  P+    NI+I
Subjt:  DTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILI

Q9ZU29 Pentatricopeptide repeat-containing protein At2g01390 | 1.1e-160 | 52.43
Query:  AVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEI-DPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHI
        +V   H   + KP     ++  +  K+V+ + + DP +YTRD V NIYNIL+  +W SAQE L  L +RWDS++IN+VLK HPP++KAWLFFNWA+++  
Subjt:  AVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEI-DPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHI

Query:  FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKE
        FKHD +TYTTMLDIFGEAGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDVDGA+++W+EM+ NGC PTVVSYTAY+K+L  + +V+EAT+ YKE
Subjt:  FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKE

Query:  MLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRLVLRYPVFVEAHETLKICSVSDSLL
        ML+S +SPNC TYTVLMEYL+  GKC+EALDIF KMQ+ GV PDKAACNILI K  K GE   +T++L YMKE  +VLRYP+FVEA ETLK    SD LL
Subjt:  MLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRLVLRYPVFVEAHETLKICSVSDSLL

Query:  RQVNPHIETESVCHDEVMHVNTS--SNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMG
        R+VN HI  ES+C  ++    T+  ++  +S+    + ++LL K+NL+A+D +L  + D+NI+LDS ++S IIE NC   R  GA LAFDY L+ G+++ 
Subjt:  RQVNPHIETESVCHDEVMHVNTS--SNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMG

Query:  RNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAYTALVGAYFSAGSYGKGLKIYETMRKKGF
        ++ YL LIG   RS+   K++E+V EM +  H LG Y   ++++RLG   +P+ A  +F+LLP++ K  AAYTAL+  Y SAGS  K +KI   MR++  
Subjt:  RNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAYTALVGAYFSAGSYGKGLKIYETMRKKGF

Query:  TPCLGTYNVLLTGLAKSGRVD-ELEIYRREKKSFEISYH-HNTILEEETICDLLF
         P LGTY+VLL+GL K+     E+ + R+EKKS   S      +  E+ ICDLLF
Subjt:  TPCLGTYNVLLTGLAKSGRVD-ELEIYRREKKSFEISYH-HNTILEEETICDLLF

Arabidopsis top hits | e value | % identity | Alignment
AT1G18900.1 Pentatricopeptide repeat (PPR) superfamily protein | 6.1e-29 | 34.26
Query:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT
        V N+ ++LR + W  +A+E L+ L +R D+Y  NQVLK       A  FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  AG    A  +F +M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

AT1G18900.2 Pentatricopeptide repeat (PPR) superfamily protein | 6.1e-29 | 34.26
Query:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT
        V N+ ++LR + W  +A+E L+ L +R D+Y  NQVLK       A  FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  AG    A  +F +M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

AT1G18900.3 Pentatricopeptide repeat (PPR) superfamily protein | 6.1e-29 | 34.26
Query:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT
        V N+ ++LR + W  +A+E L+ L +R D+Y  NQVLK       A  FF W  R   FKHD +TYTTM+   G A +  ++N +  +M   G + + VT
Subjt:  VRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVT

Query:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY
        Y  L+H    +  ++ A+ V+ +M+  GC P  V+Y   I I      +  A D Y+ M   GLSP+  TY+V++  L  AG    A  +F +M D G  
Subjt:  YTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVY

Query:  PDKAACNILIQKCCKS
        P+    NI++    K+
Subjt:  PDKAACNILIQKCCKS

AT1G74750.1 Pentatricopeptide repeat (PPR) superfamily protein | 5.5e-30 | 32.3
Query:  CIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRD--TVRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWA
        C  +VHS         ++ FG+  ++ +KV  +    P+ +      V N+ +ILR + W  +A+E L     R D+Y  NQVLK       A  FF W 
Subjt:  CIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRD--TVRNIYNILRNWSWS-SAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWA

Query:  SRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEAT
         R   FKHD +TYTTM+   G A +   +N +  +M   G K + VTY  L+H    +  +  A+ V+ +M+  GC P  V+Y   I I      +  A 
Subjt:  SRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEAT

Query:  DTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILI
        D Y+ M ++GLSP+  TY+V++  L  AG    A  +F +M   G  P+    NI+I
Subjt:  DTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILI

AT2G01390.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 7.9e-162 | 52.43
Query:  AVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEI-DPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHI
        +V   H   + KP     ++  +  K+V+ + + DP +YTRD V NIYNIL+  +W SAQE L  L +RWDS++IN+VLK HPP++KAWLFFNWA+++  
Subjt:  AVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEI-DPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRWDSYLINQVLKTHPPLEKAWLFFNWASRLHI

Query:  FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKE
        FKHD +TYTTMLDIFGEAGRI SM  VF  MKEKG+ ID VTYTSL+HW S+SGDVDGA+++W+EM+ NGC PTVVSYTAY+K+L  + +V+EAT+ YKE
Subjt:  FKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQVKEATDTYKE

Query:  MLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRLVLRYPVFVEAHETLKICSVSDSLL
        ML+S +SPNC TYTVLMEYL+  GKC+EALDIF KMQ+ GV PDKAACNILI K  K GE   +T++L YMKE  +VLRYP+FVEA ETLK    SD LL
Subjt:  MLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRLVLRYPVFVEAHETLKICSVSDSLL

Query:  RQVNPHIETESVCHDEVMHVNTS--SNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMG
        R+VN HI  ES+C  ++    T+  ++  +S+    + ++LL K+NL+A+D +L  + D+NI+LDS ++S IIE NC   R  GA LAFDY L+ G+++ 
Subjt:  RQVNPHIETESVCHDEVMHVNTS--SNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMG

Query:  RNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAYTALVGAYFSAGSYGKGLKIYETMRKKGF
        ++ YL LIG   RS+   K++E+V EM +  H LG Y   ++++RLG   +P+ A  +F+LLP++ K  AAYTAL+  Y SAGS  K +KI   MR++  
Subjt:  RNLYLGLIGILTRSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAYTALVGAYFSAGSYGKGLKIYETMRKKGF

Query:  TPCLGTYNVLLTGLAKSGRVD-ELEIYRREKKSFEISYH-HNTILEEETICDLLF
         P LGTY+VLL+GL K+     E+ + R+EKKS   S      +  E+ ICDLLF
Subjt:  TPCLGTYNVLLTGLAKSGRVD-ELEIYRREKKSFEISYH-HNTILEEETICDLLF


Sequences
CDS sequence
ATGCATTATTCTAACAGTTTCTGTTTCCTTCTGAGTAATTATGTGGTTGTCTCTGCCATCGGTAAGAAAATTTATCAGAATATTTGCATTAAAGCTGTGCATTCCTTTCA
CCAATATAAACAAAAGAAACCCATCAAATTATTCGGTAGAAAGTTGAAGAAGGGAGTTAAGGTAGTTGAGAAGGAAGAAATAGATCCAAAGCTTTACACGAGAGATACAG
TTAGGAATATCTACAATATTCTGAGAAATTGGTCCTGGAGTTCTGCTCAAGAACACCTAGAGAGGCTCCCTATAAGATGGGATTCTTATCTCATCAACCAGGTTCTGAAA
ACTCATCCACCATTGGAGAAGGCATGGCTGTTCTTCAATTGGGCATCAAGGCTGCACATCTTCAAGCATGACCAGTATACTTACACGACGATGTTGGATATTTTTGGAGA
AGCTGGGAGGATTTCATCCATGAATTATGTATTTCAACAAATGAAGGAGAAGGGGATGAAGATAGATGCAGTTACTTATACTTCATTAATGCACTGGCGTTCAAACTCAG
GGGATGTTGATGGGGCTATAAAGGTTTGGAAGGAAATGAAATCCAATGGCTGCTATCCGACAGTAGTTTCATATACTGCTTATATAAAGATTTTGTTGGACAATGACCAA
GTTAAGGAGGCCACTGATACATACAAGGAGATGCTTCAATCTGGGCTTTCTCCAAATTGCTGTACTTACACCGTCTTAATGGAATACCTTATTGGGGCAGGTAAATGCAA
AGAAGCCCTTGATATTTTTCATAAAATGCAAGATGCTGGAGTATATCCTGATAAAGCTGCTTGCAATATTTTGATTCAGAAATGCTGTAAATCTGGGGAGATGCTGGTAA
TTACACAAATCCTTGAGTACATGAAAGAAAAACGCCTTGTCCTTCGCTACCCTGTGTTTGTTGAAGCCCATGAAACTTTAAAAATTTGCTCCGTAAGTGATAGCCTACTC
AGGCAAGTTAATCCTCATATAGAAACTGAATCAGTCTGTCACGATGAGGTTATGCATGTTAATACAAGTTCTAATGTTATTCACTCAAATGTAGATCATGAGCTTGTGGC
AATTCTGTTGAAGAAGGAAAATCTTATTGCTATCGACTACATACTCACTGGGCTGATAGATAAGAACATACAATTGGATTCTGCAATTATTTCAACCATCATTGAGGTAA
ATTGCAAACATAATAGACCTAGCGGAGCTCTGTTGGCTTTTGACTACTGTTTGAAAAATGGAGTTAACATGGGGAGAAATCTGTATCTTGGCTTGATAGGGATTCTGACC
CGATCAAGTATATATTCGAAGTTGCTGGAAATTGTACCAGAAATGTATAGGCATGGGCATTGTCTTGGACTCTATCATGCCACACTTATACTTTATAGGCTTGGCAAAGC
TGGAAAACCCCAATATGCGAAGAAAATTTTTAACCTGTTGCCTGAAGAATTGAAGTGCACTGCAGCTTACACTGCTCTGGTTGGTGCTTATTTCTCTGCTGGAAGTTATG
GTAAAGGGCTTAAAATTTACGAAACCATGCGAAAGAAAGGATTTACACCGTGTTTAGGCACGTATAATGTTCTGTTAACTGGTCTTGCGAAGAGTGGAAGAGTTGACGAA
TTAGAAATTTATAGAAGGGAGAAGAAGAGTTTTGAGATCAGTTATCATCATAATACAATTCTGGAGGAAGAAACGATTTGTGATCTTCTTTTTGGAGAAATGGTATCTTG
A
mRNA sequence
ATGCATTATTCTAACAGTTTCTGTTTCCTTCTGAGTAATTATGTGGTTGTCTCTGCCATCGGTAAGAAAATTTATCAGAATATTTGCATTAAAGCTGTGCATTCCTTTCA
CCAATATAAACAAAAGAAACCCATCAAATTATTCGGTAGAAAGTTGAAGAAGGGAGTTAAGGTAGTTGAGAAGGAAGAAATAGATCCAAAGCTTTACACGAGAGATACAG
TTAGGAATATCTACAATATTCTGAGAAATTGGTCCTGGAGTTCTGCTCAAGAACACCTAGAGAGGCTCCCTATAAGATGGGATTCTTATCTCATCAACCAGGTTCTGAAA
ACTCATCCACCATTGGAGAAGGCATGGCTGTTCTTCAATTGGGCATCAAGGCTGCACATCTTCAAGCATGACCAGTATACTTACACGACGATGTTGGATATTTTTGGAGA
AGCTGGGAGGATTTCATCCATGAATTATGTATTTCAACAAATGAAGGAGAAGGGGATGAAGATAGATGCAGTTACTTATACTTCATTAATGCACTGGCGTTCAAACTCAG
GGGATGTTGATGGGGCTATAAAGGTTTGGAAGGAAATGAAATCCAATGGCTGCTATCCGACAGTAGTTTCATATACTGCTTATATAAAGATTTTGTTGGACAATGACCAA
GTTAAGGAGGCCACTGATACATACAAGGAGATGCTTCAATCTGGGCTTTCTCCAAATTGCTGTACTTACACCGTCTTAATGGAATACCTTATTGGGGCAGGTAAATGCAA
AGAAGCCCTTGATATTTTTCATAAAATGCAAGATGCTGGAGTATATCCTGATAAAGCTGCTTGCAATATTTTGATTCAGAAATGCTGTAAATCTGGGGAGATGCTGGTAA
TTACACAAATCCTTGAGTACATGAAAGAAAAACGCCTTGTCCTTCGCTACCCTGTGTTTGTTGAAGCCCATGAAACTTTAAAAATTTGCTCCGTAAGTGATAGCCTACTC
AGGCAAGTTAATCCTCATATAGAAACTGAATCAGTCTGTCACGATGAGGTTATGCATGTTAATACAAGTTCTAATGTTATTCACTCAAATGTAGATCATGAGCTTGTGGC
AATTCTGTTGAAGAAGGAAAATCTTATTGCTATCGACTACATACTCACTGGGCTGATAGATAAGAACATACAATTGGATTCTGCAATTATTTCAACCATCATTGAGGTAA
ATTGCAAACATAATAGACCTAGCGGAGCTCTGTTGGCTTTTGACTACTGTTTGAAAAATGGAGTTAACATGGGGAGAAATCTGTATCTTGGCTTGATAGGGATTCTGACC
CGATCAAGTATATATTCGAAGTTGCTGGAAATTGTACCAGAAATGTATAGGCATGGGCATTGTCTTGGACTCTATCATGCCACACTTATACTTTATAGGCTTGGCAAAGC
TGGAAAACCCCAATATGCGAAGAAAATTTTTAACCTGTTGCCTGAAGAATTGAAGTGCACTGCAGCTTACACTGCTCTGGTTGGTGCTTATTTCTCTGCTGGAAGTTATG
GTAAAGGGCTTAAAATTTACGAAACCATGCGAAAGAAAGGATTTACACCGTGTTTAGGCACGTATAATGTTCTGTTAACTGGTCTTGCGAAGAGTGGAAGAGTTGACGAA
TTAGAAATTTATAGAAGGGAGAAGAAGAGTTTTGAGATCAGTTATCATCATAATACAATTCTGGAGGAAGAAACGATTTGTGATCTTCTTTTTGGAGAAATGGTATCTTG
A
Protein sequence
MHYSNSFCFLLSNYVVVSAIGKKIYQNICIKAVHSFHQYKQKKPIKLFGRKLKKGVKVVEKEEIDPKLYTRDTVRNIYNILRNWSWSSAQEHLERLPIRWDSYLINQVLK
THPPLEKAWLFFNWASRLHIFKHDQYTYTTMLDIFGEAGRISSMNYVFQQMKEKGMKIDAVTYTSLMHWRSNSGDVDGAIKVWKEMKSNGCYPTVVSYTAYIKILLDNDQ
VKEATDTYKEMLQSGLSPNCCTYTVLMEYLIGAGKCKEALDIFHKMQDAGVYPDKAACNILIQKCCKSGEMLVITQILEYMKEKRLVLRYPVFVEAHETLKICSVSDSLL
RQVNPHIETESVCHDEVMHVNTSSNVIHSNVDHELVAILLKKENLIAIDYILTGLIDKNIQLDSAIISTIIEVNCKHNRPSGALLAFDYCLKNGVNMGRNLYLGLIGILT
RSSIYSKLLEIVPEMYRHGHCLGLYHATLILYRLGKAGKPQYAKKIFNLLPEELKCTAAYTALVGAYFSAGSYGKGLKIYETMRKKGFTPCLGTYNVLLTGLAKSGRVDE
LEIYRREKKSFEISYHHNTILEEETICDLLFGEMVS