; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0406 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0406
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationMC04:3236733..3238340
RNA-Seq ExpressionMC04g0406
SyntenyMC04g0406
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011402.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]5.94e-30678.36Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAK KS KGGK +H +LK TG KRPTTIVANHLIGMYF+CG+D EARKVFDKMS RNLYSWNHMLAGYAKLGN+ QA+KLFD+M EKDV+SWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGC NEAIG                                                                     RLFDEMPVKDIL+WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLAS LFHQMPEKNPVSWTALISGYARNSLGHEALDYFT+MM FR+NPDQ+TFSSCLCACASIAALKHGKQVHA LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEA C VFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDMV+SGL PDRITFIVILSACSHSGLVQEGL+FFKAM YDHG+LPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG F ELVNELEKM CKPDDR+WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASLYAFLGKWESVE+VRE+MEER VRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWID  NKVHSFIASDR HPLKEEIYSLLEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

XP_022136244.1 pentatricopeptide repeat-containing protein At2g21090 [Momordica charantia]0.087.13Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGCLNEAIG                                                                     RLFDEMPVKDILSWTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

XP_022967585.1 pentatricopeptide repeat-containing protein At2g21090-like [Cucurbita maxima]1.70e-30578.17Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAK KS KGGK +H +LK TG KRPTTI+ANHLIGMYF+CG+DIEARKVFDKMS RNLYSWNHMLAGYAKLGN+ QA+K+FD+M EKDV+SWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGC NEAIG                                                                     RLFDEMPVKDIL+WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLAS LFHQMPEKNPVSWTALISGYARNSLGHEALDYFT+MM FR+NPDQ+TFSSCLCACASIAALKHGKQVHA LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEA C VFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDMV+SGLKPDRITFIVILSACSHSGLV EGL+FFKAM YDH VLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG F ELVNELEKM CKPDDR+WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASLYAFLGKWESVE+VRE+MEER VRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWID  NKVHSFIASDR HPLKEEIYSLLEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

XP_023553643.1 pentatricopeptide repeat-containing protein At2g21090 [Cucurbita pepo subsp. pepo]5.94e-30678.36Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAK KS KGGK +H +LK TG KRPTTI+ANHLIGMYF+CG+DIEARKVFDKMS RNLYSWNHMLAGYAKLGN+ QA+KLFD+M EKDV+SWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGCLNEAIG                                                                     RLFDEMPVKDIL+WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLAS LFHQMPEKNPVSWTALISGYARNSLGHEALDYFT+MM FR+NPDQ+TFSSCLCACASIAALKHGKQVHA LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEA C VFYL+GNKQDVVLWNTMISALAQHGHG++AMQMF+DMV+SGLKPDRITFIVILSACSHSGLV EGL+FFKAM YDHGVLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG F ELVNELEKM CKPDDR+WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASLYAFLGKWESVE+VRE+MEER VRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWID  NKVHSFIASDR HPLKEEIYSLLEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

XP_038886822.1 pentatricopeptide repeat-containing protein At2g21090 [Benincasa hispida]1.39e-31180.41Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLC K K  KGGK +H +LKHTG KRPTTIVANHLIGMYFECGNDIEARKVFDKMS RNLYSWNHMLAGYAKLGN++ A+KLFD M EKDVVSWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKG  NEAIG                                                                     RLFDEM VKDIL+WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMM FRINPDQYTFSSCLCACASIAALKHGKQVHA LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        +IDMYSKCGMLEA CHVFYLMGNKQDVVLWNTMISALAQHGHGE+AMQMFNDMV+SGLKPDRITFIVILSACSHSGLVQEGLRFFKAM YDHGVLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG FIELVNELEKM CKPDDRVWNALLGVCRIHGNIELGRKVAEHVI LEPQSSAAYVSLASLYA LGKWESVEKVRELMEERFVRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWI  GNKVHSFIASDR HPLKEEIYS+LEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

TrEMBL top hitse value%identityAlignment
A0A1S3BZY1 pentatricopeptide repeat-containing protein At2g210905.81e-29976.68Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAK K FKGGK +H +LKHTG KRPTTIVANHLIGMYFECG D+EARKVFDKMS RNLYSWNHMLAGYAKLG ++ A+KLFD M EKDVVSWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGC NEAIG                                                                     RLFDEM VKDI  WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMN ASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMM   INP+QYTFSSCLCACASIAALKHGKQVH  LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEA C+VF+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN MV+SG+KPDRITFIVILSACSHSGLVQEGL+FFKAM YDHGVLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG FIELVNELE M CKPDDRVW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA LYAFLGKWESVEKVRELM+E+F+RKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWID GNK+HSFIASDR HPLKEEIY LLEQLA H
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

A0A5D3C6K7 Pentatricopeptide repeat-containing protein5.81e-29976.68Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAK K FKGGK +H +LKHTG KRPTTIVANHLIGMYFECG D+EARKVFDKMS RNLYSWNHMLAGYAKLG ++ A+KLFD M EKDVVSWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGC NEAIG                                                                     RLFDEM VKDI  WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMN ASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMM   INP+QYTFSSCLCACASIAALKHGKQVH  LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEA C+VF+LMGNKQDVV+WNTMIS LAQ+GHGE+AMQMFN MV+SG+KPDRITFIVILSACSHSGLVQEGL+FFKAM YDHGVLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG FIELVNELE M CKPDDRVW+ALLGVCRIH NIELGRKVAEHVIEL+PQSSAAYVSLA LYAFLGKWESVEKVRELM+E+F+RKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWID GNK+HSFIASDR HPLKEEIY LLEQLA H
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

A0A6J1C3S4 pentatricopeptide repeat-containing protein At2g210900.087.13Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGCLNEAIG                                                                     RLFDEMPVKDILSWTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

A0A6J1HLP5 pentatricopeptide repeat-containing protein At2g210901.17e-30578.17Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAK KS KGGK +H +LK TG KRPTTIVANHLIGMYF+CG+D EARKVFDKMS RNLYSWNHMLAGYAKLGN+ QA+KLFD+M EKDV+SWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGC NEAIG                                                                     RLFDEMPVKDIL+WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLAS LFHQMPEKNPVSWTALISGYARNSLGHEALDYFT+MM FR+NPDQ+TFSSCLCACASIAALKHGKQVHA LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEA C VFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDMV+SGL PDRITFIVILSACSHSGLVQEGL+FFKAM YDHG+LPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG F ELVNELEKM CKPDDR+WN LLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASLYAFLGKWESVE+VRE+MEER VRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWID  NKVHSFIASDR HPLKEEIYSLLEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

A0A6J1HUW8 pentatricopeptide repeat-containing protein At2g21090-like8.24e-30678.17Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LLRLCAK KS KGGK +H +LK TG KRPTTI+ANHLIGMYF+CG+DIEARKVFDKMS RNLYSWNHMLAGYAKLGN+ QA+K+FD+M EKDV+SWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM
        LAYAKKGC NEAIG                                                                     RLFDEMPVKDIL+WTTM
Subjt:  LAYAKKGCLNEAIG---------------------------------------------------------------------RLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        VSGYAKWGDMNLAS LFHQMPEKNPVSWTALISGYARNSLGHEALDYFT+MM FR+NPDQ+TFSSCLCACASIAALKHGKQVHA LIRTNFRCNTIVVSS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSKCGMLEA C VFYL+GNKQDVVLWNTMISALAQHGHGE+AMQMFNDMV+SGLKPDRITFIVILSACSHSGLV EGL+FFKAM YDH VLPDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG F ELVNELEKM CKPDDR+WNALLGVCRIHGNIELGRKVAEHVIELEP+SSAAYVSLASLYAFLGKWESVE+VRE+MEER VRKERA
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH
        ISWID  NKVHSFIASDR HPLKEEIYSLLEQLASH
Subjt:  ISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQLASH

SwissProt top hitse value%identityAlignment
O23169 Pentatricopeptide repeat-containing protein At4g371703.1e-9735.59Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        L+++C++ ++ + GK +H +++ +G   P  ++ N L+ MY +CG+ ++ARKVFD+M  R+L SWN M+ GYA++G + +A+KLFD MTEKD  SW  +V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAI---------------------------------------GRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTA
          Y KK    EA+                                       G +       D + W++++  Y K G ++ A  +F ++ EK+ VSWT+
Subjt:  LAYAKKGCLNEAI---------------------------------------GRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTA

Query:  LISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLW
        +I  Y ++S   E    F++++     P++YTF+  L ACA +   + GKQVH  + R  F   +   SSL+DMY+KCG +E+  HV      K D+V W
Subjt:  LISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLW

Query:  NTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDD
         ++I   AQ+G  ++A++ F+ ++KSG KPD +TF+ +LSAC+H+GLV++GL FF ++   H +    +HY CL+DLL R+G F +L + + +M  KP  
Subjt:  NTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDD

Query:  RVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLL
         +W ++LG C  +GNI+L  + A+ + ++EP++   YV++A++YA  GKWE   K+R+ M+E  V K    SW +   K H FIA+D SHP+  +I   L
Subjt:  RVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLL

Query:  EQL
         +L
Subjt:  EQL

Q9LS72 Pentatricopeptide repeat-containing protein At3g292302.4e-9737.39Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECG--NDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNT
        LL+ C+        K +H +++  GL      V N LI  Y  CG     +A K+F+KMS R+  SWN ML G  K G +  A++LFD M ++D++SWNT
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECG--NDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNT

Query:  VVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQM--PEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSS
        ++  YA+   +++A   LF++MP ++ +SW+TMV GY+K GDM +A  +F +M  P KN V+WT +I+GYA   L  EA     +M+   +  D     S
Subjt:  VVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQM--PEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSS

Query:  CLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITF
         L AC     L  G ++H+ L R+N   N  V+++L+DMY+KCG L+    VF  +  K+D+V WNTM+  L  HGHG++A+++F+ M + G++PD++TF
Subjt:  CLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITF

Query:  IVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSA
        I +L +C+H+GL+ EG+ +F +M   + ++P  EHY CL+DLLGR G   E +  ++ M  +P+  +W ALLG CR+H  +++ ++V +++++L+P    
Subjt:  IVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSA

Query:  AYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL
         Y  L+++YA    WE V  +R  M+   V K    S ++  + +H F   D+SHP  ++IY +L  L
Subjt:  AYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL

Q9M4P3 Pentatricopeptide repeat-containing protein At4g16835, mitochondrial1.5e-9639.55Show/hide
Query:  LKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDI
        +  P T   N ++  Y    N  +A+  FD+M  ++  SWN M+ GYA+ G + +A++LF SM EK+ VSWN ++  Y + G L +A    F   PV+ +
Subjt:  LKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDI

Query:  LSWTTMVSGYAKWGDMNLASELFHQMP-EKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRC
        ++WT M++GY K   + LA  +F  M   KN V+W A+ISGY  NS   + L  F  M+   I P+    SS L  C+ ++AL+ G+Q+H  + ++    
Subjt:  LSWTTMVSGYAKWGDMNLASELFHQMP-EKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRC

Query:  NTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHG
        +   ++SLI MY KCG L     +F +M  K+DVV WN MIS  AQHG+ ++A+ +F +M+ + ++PD ITF+ +L AC+H+GLV  G+ +F++M+ D+ 
Subjt:  NTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHG

Query:  VLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEER
        V P  +HY C++DLLGRAG   E +  +  M  +P   V+  LLG CR+H N+EL    AE +++L  Q++A YV LA++YA   +WE V +VR+ M+E 
Subjt:  VLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEER

Query:  FVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL
         V K    SWI+  NKVH F +SDR HP  + I+  L++L
Subjt:  FVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL

Q9SKQ4 Pentatricopeptide repeat-containing protein At2g210906.8e-16152.51Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LL+ C   KS K GKWIH +LK TG KRP T+++NHLIGMY +CG  I+A KVFD+M  RNLYSWN+M++GY K G + +A+ +FDSM E+DVVSWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAI---------------------------------------------------------------------GRLFDEMPVKDILSWTTM
        + YA+ G L+EA+                                                                      R FDEM VKDI  WTT+
Subjt:  LAYAKKGCLNEAI---------------------------------------------------------------------GRLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        +SGYAK GDM  A +LF +MPEKNPVSWTALI+GY R   G+ ALD F KM+   + P+Q+TFSSCLCA ASIA+L+HGK++H  +IRTN R N IV+SS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSK G LEA   VF +  +K D V WNTMISALAQHG G +A++M +DM+K  ++P+R T +VIL+ACSHSGLV+EGLR+F++M   HG++PDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG F EL+ ++E+M  +PD  +WNA+LGVCRIHGN ELG+K A+ +I+L+P+SSA Y+ L+S+YA  GKWE VEK+R +M++R V KE+A
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRS--HPLKEEIYSLLEQLAS
        +SWI+   KV +F  SD S  H  KEEIY +L  LA+
Subjt:  ISWIDTGNKVHSFIASDRS--HPLKEEIYSLLEQLAS

Q9SY02 Pentatricopeptide repeat-containing protein At4g027501.4e-10240.7Show/hide
Query:  NHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSG
        N +I  Y + G   EAR++FD+   +++++W  M++GY +   + +A++LFD M E++ VSWN ++  Y  +G   E    LFD MP +++ +W TM++G
Subjt:  NHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSG

Query:  YAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLID
        YA+ G ++ A  LF +MP+++PVSW A+I+GY+++    EAL  F +M       ++ +FSS L  CA + AL+ GKQ+H  L++  +     V ++L+ 
Subjt:  YAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLID

Query:  MYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYAC
        MY KCG +E    +F  M  K D+V WNTMI+  ++HG GE A++ F  M + GLKPD  T + +LSACSH+GLV +G ++F  M  D+GV+P+ +HYAC
Subjt:  MYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYAC

Query:  LIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISW
        ++DLLGRAG   +  N ++ M  +PD  +W  LLG  R+HGN EL    A+ +  +EP++S  YV L++LYA  G+W  V K+R  M ++ V+K    SW
Subjt:  LIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISW

Query:  IDTGNKVHSFIASDRSHPLKEEIYSLLEQL
        I+  NK H+F   D  HP K+EI++ LE+L
Subjt:  IDTGNKVHSFIASDRSHPLKEEIYSLLEQL

Arabidopsis top hitse value%identityAlignment
AT2G21090.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.8e-16252.51Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        LL+ C   KS K GKWIH +LK TG KRP T+++NHLIGMY +CG  I+A KVFD+M  RNLYSWN+M++GY K G + +A+ +FDSM E+DVVSWNT+V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAI---------------------------------------------------------------------GRLFDEMPVKDILSWTTM
        + YA+ G L+EA+                                                                      R FDEM VKDI  WTT+
Subjt:  LAYAKKGCLNEAI---------------------------------------------------------------------GRLFDEMPVKDILSWTTM

Query:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS
        +SGYAK GDM  A +LF +MPEKNPVSWTALI+GY R   G+ ALD F KM+   + P+Q+TFSSCLCA ASIA+L+HGK++H  +IRTN R N IV+SS
Subjt:  VSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSS

Query:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH
        LIDMYSK G LEA   VF +  +K D V WNTMISALAQHG G +A++M +DM+K  ++P+R T +VIL+ACSHSGLV+EGLR+F++M   HG++PDQEH
Subjt:  LIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEH

Query:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA
        YACLIDLLGRAG F EL+ ++E+M  +PD  +WNA+LGVCRIHGN ELG+K A+ +I+L+P+SSA Y+ L+S+YA  GKWE VEK+R +M++R V KE+A
Subjt:  YACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERA

Query:  ISWIDTGNKVHSFIASDRS--HPLKEEIYSLLEQLAS
        +SWI+   KV +F  SD S  H  KEEIY +L  LA+
Subjt:  ISWIDTGNKVHSFIASDRS--HPLKEEIYSLLEQLAS

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-9837.39Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECG--NDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNT
        LL+ C+        K +H +++  GL      V N LI  Y  CG     +A K+F+KMS R+  SWN ML G  K G +  A++LFD M ++D++SWNT
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECG--NDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNT

Query:  VVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQM--PEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSS
        ++  YA+   +++A   LF++MP ++ +SW+TMV GY+K GDM +A  +F +M  P KN V+WT +I+GYA   L  EA     +M+   +  D     S
Subjt:  VVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQM--PEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSS

Query:  CLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITF
         L AC     L  G ++H+ L R+N   N  V+++L+DMY+KCG L+    VF  +  K+D+V WNTM+  L  HGHG++A+++F+ M + G++PD++TF
Subjt:  CLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITF

Query:  IVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSA
        I +L +C+H+GL+ EG+ +F +M   + ++P  EHY CL+DLLGR G   E +  ++ M  +P+  +W ALLG CR+H  +++ ++V +++++L+P    
Subjt:  IVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSA

Query:  AYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL
         Y  L+++YA    WE V  +R  M+   V K    S ++  + +H F   D+SHP  ++IY +L  L
Subjt:  AYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-10340.7Show/hide
Query:  NHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSG
        N +I  Y + G   EAR++FD+   +++++W  M++GY +   + +A++LFD M E++ VSWN ++  Y  +G   E    LFD MP +++ +W TM++G
Subjt:  NHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDILSWTTMVSG

Query:  YAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLID
        YA+ G ++ A  LF +MP+++PVSW A+I+GY+++    EAL  F +M       ++ +FSS L  CA + AL+ GKQ+H  L++  +     V ++L+ 
Subjt:  YAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLID

Query:  MYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYAC
        MY KCG +E    +F  M  K D+V WNTMI+  ++HG GE A++ F  M + GLKPD  T + +LSACSH+GLV +G ++F  M  D+GV+P+ +HYAC
Subjt:  MYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYAC

Query:  LIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISW
        ++DLLGRAG   +  N ++ M  +PD  +W  LLG  R+HGN EL    A+ +  +EP++S  YV L++LYA  G+W  V K+R  M ++ V+K    SW
Subjt:  LIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISW

Query:  IDTGNKVHSFIASDRSHPLKEEIYSLLEQL
        I+  NK H+F   D  HP K+EI++ LE+L
Subjt:  IDTGNKVHSFIASDRSHPLKEEIYSLLEQL

AT4G16835.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-9739.55Show/hide
Query:  LKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDI
        +  P T   N ++  Y    N  +A+  FD+M  ++  SWN M+ GYA+ G + +A++LF SM EK+ VSWN ++  Y + G L +A    F   PV+ +
Subjt:  LKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLNEAIGRLFDEMPVKDI

Query:  LSWTTMVSGYAKWGDMNLASELFHQMP-EKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRC
        ++WT M++GY K   + LA  +F  M   KN V+W A+ISGY  NS   + L  F  M+   I P+    SS L  C+ ++AL+ G+Q+H  + ++    
Subjt:  LSWTTMVSGYAKWGDMNLASELFHQMP-EKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRC

Query:  NTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHG
        +   ++SLI MY KCG L     +F +M  K+DVV WN MIS  AQHG+ ++A+ +F +M+ + ++PD ITF+ +L AC+H+GLV  G+ +F++M+ D+ 
Subjt:  NTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHG

Query:  VLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEER
        V P  +HY C++DLLGRAG   E +  +  M  +P   V+  LLG CR+H N+EL    AE +++L  Q++A YV LA++YA   +WE V +VR+ M+E 
Subjt:  VLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEER

Query:  FVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL
         V K    SWI+  NKVH F +SDR HP  + I+  L++L
Subjt:  FVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLLEQL

AT4G37170.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-9835.59Show/hide
Query:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV
        L+++C++ ++ + GK +H +++ +G   P  ++ N L+ MY +CG+ ++ARKVFD+M  R+L SWN M+ GYA++G + +A+KLFD MTEKD  SW  +V
Subjt:  LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVV

Query:  LAYAKKGCLNEAI---------------------------------------GRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTA
          Y KK    EA+                                       G +       D + W++++  Y K G ++ A  +F ++ EK+ VSWT+
Subjt:  LAYAKKGCLNEAI---------------------------------------GRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTA

Query:  LISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLW
        +I  Y ++S   E    F++++     P++YTF+  L ACA +   + GKQVH  + R  F   +   SSL+DMY+KCG +E+  HV      K D+V W
Subjt:  LISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRTNFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLW

Query:  NTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDD
         ++I   AQ+G  ++A++ F+ ++KSG KPD +TF+ +LSAC+H+GLV++GL FF ++   H +    +HY CL+DLL R+G F +L + + +M  KP  
Subjt:  NTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQEHYACLIDLLGRAGWFIELVNELEKMCCKPDD

Query:  RVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLL
         +W ++LG C  +GNI+L  + A+ + ++EP++   YV++A++YA  GKWE   K+R+ M+E  V K    SW +   K H FIA+D SHP+  +I   L
Subjt:  RVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNKVHSFIASDRSHPLKEEIYSLL

Query:  EQL
         +L
Subjt:  EQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTCTTGCGACTCTGTGCGAAAGTCAAGTCCTTCAAAGGAGGCAAATGGATTCATTTCTATTTGAAACACACGGGGTTGAAGCGCCCCACCACTATTGTAGCCAACCATTT
GATCGGTATGTACTTTGAATGCGGCAATGACATAGAGGCACGCAAGGTGTTTGATAAAATGTCTGCGAGGAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTA
AGTTGGGGAACATTAATCAAGCTAAGAAGTTGTTCGATAGTATGACCGAGAAGGATGTTGTTTCGTGGAATACTGTGGTTCTTGCTTATGCGAAGAAGGGGTGTCTCAAT
GAAGCTATTGGGAGATTGTTTGATGAAATGCCGGTGAAAGATATCCTTTCGTGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGACATGAATTTGGCTAGCGAATT
GTTTCACCAAATGCCTGAAAAGAATCCTGTCTCATGGACAGCTCTGATATCAGGCTATGCGAGAAACAGTTTGGGGCATGAAGCACTTGATTATTTTACAAAAATGATGA
CGTTTCGTATTAATCCCGACCAATATACATTTAGTAGTTGTCTTTGTGCTTGCGCCAGCATTGCTGCACTAAAGCATGGTAAACAGGTACATGCCTGTTTGATAAGAACC
AATTTCAGATGCAACACAATAGTCGTGAGCTCTCTCATTGACATGTATTCAAAGTGTGGCATGTTAGAAGCTGGCTGCCACGTTTTCTACCTTATGGGAAACAAGCAGGA
TGTTGTTTTGTGGAATACAATGATATCTGCCCTGGCACAACATGGTCATGGGGAACAGGCTATGCAGATGTTCAATGACATGGTTAAATCGGGTCTAAAGCCTGATAGGA
TCACTTTTATCGTGATTCTTAGTGCGTGTAGTCATTCAGGTCTCGTGCAAGAAGGACTCCGGTTTTTCAAGGCCATGATGTATGATCATGGTGTTCTCCCTGATCAAGAA
CACTATGCATGCTTAATTGACCTCTTAGGTCGAGCTGGATGGTTTATCGAGTTGGTAAACGAACTAGAGAAGATGTGTTGTAAACCCGATGATCGGGTATGGAATGCCTT
ACTTGGCGTCTGTAGGATACACGGTAATATAGAGCTTGGAAGAAAAGTGGCTGAACATGTAATTGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTTTCTCTTGCAAGTT
TGTATGCTTTTCTTGGGAAATGGGAGTCAGTAGAGAAGGTCAGGGAACTAATGGAAGAGAGATTTGTGAGGAAGGAGCGTGCAATTAGTTGGATTGACACTGGAAATAAG
GTCCATTCTTTCATTGCATCTGATAGATCACATCCATTGAAAGAAGAAATATACTCGCTACTGGAGCAATTAGCCAGCCAT
mRNA sequenceShow/hide mRNA sequence
CTCTTGCGACTCTGTGCGAAAGTCAAGTCCTTCAAAGGAGGCAAATGGATTCATTTCTATTTGAAACACACGGGGTTGAAGCGCCCCACCACTATTGTAGCCAACCATTT
GATCGGTATGTACTTTGAATGCGGCAATGACATAGAGGCACGCAAGGTGTTTGATAAAATGTCTGCGAGGAATTTGTACTCTTGGAACCATATGCTTGCTGGGTATGCTA
AGTTGGGGAACATTAATCAAGCTAAGAAGTTGTTCGATAGTATGACCGAGAAGGATGTTGTTTCGTGGAATACTGTGGTTCTTGCTTATGCGAAGAAGGGGTGTCTCAAT
GAAGCTATTGGGAGATTGTTTGATGAAATGCCGGTGAAAGATATCCTTTCGTGGACCACAATGGTCTCTGGATATGCTAAATGGGGTGACATGAATTTGGCTAGCGAATT
GTTTCACCAAATGCCTGAAAAGAATCCTGTCTCATGGACAGCTCTGATATCAGGCTATGCGAGAAACAGTTTGGGGCATGAAGCACTTGATTATTTTACAAAAATGATGA
CGTTTCGTATTAATCCCGACCAATATACATTTAGTAGTTGTCTTTGTGCTTGCGCCAGCATTGCTGCACTAAAGCATGGTAAACAGGTACATGCCTGTTTGATAAGAACC
AATTTCAGATGCAACACAATAGTCGTGAGCTCTCTCATTGACATGTATTCAAAGTGTGGCATGTTAGAAGCTGGCTGCCACGTTTTCTACCTTATGGGAAACAAGCAGGA
TGTTGTTTTGTGGAATACAATGATATCTGCCCTGGCACAACATGGTCATGGGGAACAGGCTATGCAGATGTTCAATGACATGGTTAAATCGGGTCTAAAGCCTGATAGGA
TCACTTTTATCGTGATTCTTAGTGCGTGTAGTCATTCAGGTCTCGTGCAAGAAGGACTCCGGTTTTTCAAGGCCATGATGTATGATCATGGTGTTCTCCCTGATCAAGAA
CACTATGCATGCTTAATTGACCTCTTAGGTCGAGCTGGATGGTTTATCGAGTTGGTAAACGAACTAGAGAAGATGTGTTGTAAACCCGATGATCGGGTATGGAATGCCTT
ACTTGGCGTCTGTAGGATACACGGTAATATAGAGCTTGGAAGAAAAGTGGCTGAACATGTAATTGAGCTGGAGCCTCAATCTTCTGCAGCTTATGTTTCTCTTGCAAGTT
TGTATGCTTTTCTTGGGAAATGGGAGTCAGTAGAGAAGGTCAGGGAACTAATGGAAGAGAGATTTGTGAGGAAGGAGCGTGCAATTAGTTGGATTGACACTGGAAATAAG
GTCCATTCTTTCATTGCATCTGATAGATCACATCCATTGAAAGAAGAAATATACTCGCTACTGGAGCAATTAGCCAGCCAT
Protein sequenceShow/hide protein sequence
LLRLCAKVKSFKGGKWIHFYLKHTGLKRPTTIVANHLIGMYFECGNDIEARKVFDKMSARNLYSWNHMLAGYAKLGNINQAKKLFDSMTEKDVVSWNTVVLAYAKKGCLN
EAIGRLFDEMPVKDILSWTTMVSGYAKWGDMNLASELFHQMPEKNPVSWTALISGYARNSLGHEALDYFTKMMTFRINPDQYTFSSCLCACASIAALKHGKQVHACLIRT
NFRCNTIVVSSLIDMYSKCGMLEAGCHVFYLMGNKQDVVLWNTMISALAQHGHGEQAMQMFNDMVKSGLKPDRITFIVILSACSHSGLVQEGLRFFKAMMYDHGVLPDQE
HYACLIDLLGRAGWFIELVNELEKMCCKPDDRVWNALLGVCRIHGNIELGRKVAEHVIELEPQSSAAYVSLASLYAFLGKWESVEKVRELMEERFVRKERAISWIDTGNK
VHSFIASDRSHPLKEEIYSLLEQLASH