; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS026690 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS026690
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat
Genome locationscaffold2:761677..764466
RNA-Seq ExpressionMS026690
SyntenyMS026690
Gene Ontology termsGO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573719.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]4.4e-22783.89Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        + LR+ SR KNVAKRS  KYLEE LYVRLFKDGSSEKS+R QLN F+K  KRVFKWEVGDTLKKLR RKLY PALKLSETMAKR MNKT+SDQAIHLDL+
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
        AKARGIAAAES+FVSLPESSKNHLCYGSLLNCYCKELMT++AEA+ EKMKELNLPVTSM YNSLMTLY+KTG PEKV AIIQEMKAA VMFD+YTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAALNDISGVERVIDEMK DG+ VGDWTTYSNLASIYVDAH+F+KA  ALK+LEKRNA R+LSAFQF+ITL+G+MGNLLEVYRVWRSLRLAFPKTANI
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR
        SYLNMIQTL KLKDLPGAEKCFKEW+SGCSTYDIRIAN LIGAYA+EGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYL+NGEFK A DC AKAVS GR
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR

Query:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
          GGKW+PSPE+++T MSH+E+EKDVDGAE F+ETVKK+VD+LE EVFE+LIRTYSAAGR+S MM RRLKMENV+  E
Subjt:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

XP_022151897.1 pentatricopeptide repeat-containing protein At1g60770 [Momordica charantia]2.8e-26698.32Show/hide
Query:  MALRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVA
        MALRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVA
Subjt:  MALRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVA

Query:  KARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMR
        KARGIAAAESYFV+LPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNS+MTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMR
Subjt:  KARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMR

Query:  ALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANIS
        ALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKAL+DLEKRN+HRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANIS
Subjt:  ALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANIS

Query:  YLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRG
        YLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDY+LRNGEFKPAVDCVAKAVSIGRG
Subjt:  YLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRG

Query:  GKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
        GKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDR LKMENVE  E
Subjt:  GKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

XP_022945244.1 pentatricopeptide repeat-containing protein At1g60770 [Cucurbita moschata]2.0e-22784.1Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        + LR+ SR KNVAKRS  KYLEE LYVRLFKDGSSEKS+R QLN F+K  KRVFKWEVGDTLKKLR RKLY PALKLSETMAKR MNKT+SDQAIHLDL+
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
        AKARGIAAAES+FVSLPESSKNHLCYGSLLNCYCKELMT++AEA+ EKMKELNLPVTSM YNSLMTLY+KTG PEKV AIIQEMKAA VMFD+YTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAALNDISGVERVIDEMK DG+ VGDWTTYSNLASIYVDAH+F+KA  ALK+LEKRNA R+LSAFQF+ITL+G+MGNLLEVYRVWRSLRLAFPKTANI
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR
        SYLNMIQTL KLKDLPGAEKCFKEW+SGCSTYDIRIAN LIGAYA+EGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYL+NGEFK A DC AKAVS GR
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR

Query:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
          GGKW+PSPE+++T MSH+E+EKDVDGAE F+ETVKK+VD+LE EVFE+LIRTYSAAGR+S MM RRLKMENVE  E
Subjt:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

XP_022966912.1 pentatricopeptide repeat-containing protein At1g60770 [Cucurbita maxima]3.0e-22884.31Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        + LRQ SR KNVAKRS  KYLEE LYVRLFKDG SEKS+R QLN F+K  KRVFKWEVGDTLKKLR RKLY PALKLSETMAKR MNKT+SDQA HLDL+
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
         KARGIAAAES+FVSLPESSKNHLCYGSLLNCYCKELMT++AEA+LEKMKELNL VTSM YNSLMTLYTKTGQPEKVRAIIQEMKAANV+FD+YTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAA NDISGVERVIDEMKRDG+ VGDWTTYSNLASIYVDAH+F+KA  ALK+LEKRNA R+LSAFQF+ITL+G+MGNLLEVYRVWRSLRLAFP TANI
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR
        SYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDIRIAN LIGAYA+EGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYL+NGEFK A DCVAKAVS GR
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR

Query:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
           GKW+PSPE+++T MSH+E+EKDVDGAE F+ETVKK+VD+LE EVFE+LIRTYSAAGR+S MM RRLKMENVE  E
Subjt:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

XP_038893310.1 pentatricopeptide repeat-containing protein At1g60770 isoform X1 [Benincasa hispida]1.4e-23386.82Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        MALRQFSR KN+AKRS  KYLEEALYVRLFKDG SEKSIR QLN FIK HKRVFKWEVGDTLKKLR RKLYNPALKLSETM KRGMNKT+SDQAIHLDLV
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
        AKARG+AAAESYFVSLPESSKNHLCYGSLLNCYCKEL+T++AEAL EKMKELNLPV SM YNSLMTLYTKTGQP+KVR+IIQEMKAANVMFDSYTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAALNDISGVERV+DEMKRDG VVGDWTTYSNLASIYVDA+LFEKA KALK+LEKRNA R LSAFQF+ITLYG++GNL EVYRVWRSLRLAF KTANI
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVS--I
        SYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDIRIAN LIGAY +EGLLEKAMELKERAR +GAKPN KTWE+FLDYYL+NG+FK A DCVAKAVS   
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVS--I

Query:  GRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
        G GGKWMPSPEI+K+ MSHFE+EKDVDGAE FLE VKK VDTLE EVFE+LIRTYSAAGR+SS M+ RLKMENVE  E
Subjt:  GRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

TrEMBL top hitse value%identityAlignment
A0A6A1UQC6 Uncharacterized protein2.6e-21779.33Show/hide
Query:  MALR--QFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDL
        MAL+  QF R K+VAKRS K+LEEALY RLF++G SE S+RHQLN F+K HKRV+KWEVGDTLKKLR RKLY PALKLSETM KRGMNKT+SDQA+HLDL
Subjt:  MALR--QFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDL

Query:  VAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVW
        V+K RGIAAAE+YFV LPESSKNHLCYG+LLNCYCKELMT++AEAL+EKMKELNLP+TSM YNSLMTLY K GQPEK+ AIIQEMKA+N+M DSYTYNVW
Subjt:  VAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVW

Query:  MRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTAN
        +RALAA+NDI+ VERV+DEMKRDG+V GDWTTYSNLASIYVDA   EKA+KALK+LEKRNA+++LSA+QF+ITLYGR GNLLEVYRVWRSLRLAFPKTAN
Subjt:  MRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTAN

Query:  ISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIG
         SYLNMIQ L  LKDLPGAEKCF+EWESGCSTYDIRIAN LIGAYA+EGLLEKA ELKERARRRGAKPNAKTWEIFL+YYL++GEFK AVDCVA AVSIG
Subjt:  ISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIG

Query:  R--GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
        R  GGKW+P   +V +LM HFE+EKDVD AE FLE +KKAVD +  EVFE+LIRTY+AAGR S  + RRLKMENVE  E
Subjt:  R--GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

A0A6J1DEQ3 pentatricopeptide repeat-containing protein At1g607701.4e-26698.32Show/hide
Query:  MALRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVA
        MALRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVA
Subjt:  MALRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVA

Query:  KARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMR
        KARGIAAAESYFV+LPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNS+MTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMR
Subjt:  KARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMR

Query:  ALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANIS
        ALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKAL+DLEKRN+HRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANIS
Subjt:  ALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANIS

Query:  YLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRG
        YLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDY+LRNGEFKPAVDCVAKAVSIGRG
Subjt:  YLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRG

Query:  GKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
        GKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDR LKMENVE  E
Subjt:  GKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

A0A6J1G0G4 pentatricopeptide repeat-containing protein At1g607709.6e-22884.1Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        + LR+ SR KNVAKRS  KYLEE LYVRLFKDGSSEKS+R QLN F+K  KRVFKWEVGDTLKKLR RKLY PALKLSETMAKR MNKT+SDQAIHLDL+
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
        AKARGIAAAES+FVSLPESSKNHLCYGSLLNCYCKELMT++AEA+ EKMKELNLPVTSM YNSLMTLY+KTG PEKV AIIQEMKAA VMFD+YTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAALNDISGVERVIDEMK DG+ VGDWTTYSNLASIYVDAH+F+KA  ALK+LEKRNA R+LSAFQF+ITL+G+MGNLLEVYRVWRSLRLAFPKTANI
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR
        SYLNMIQTL KLKDLPGAEKCFKEW+SGCSTYDIRIAN LIGAYA+EGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYL+NGEFK A DC AKAVS GR
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR

Query:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
          GGKW+PSPE+++T MSH+E+EKDVDGAE F+ETVKK+VD+LE EVFE+LIRTYSAAGR+S MM RRLKMENVE  E
Subjt:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

A0A6J1HP97 pentatricopeptide repeat-containing protein At1g607701.5e-22884.31Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        + LRQ SR KNVAKRS  KYLEE LYVRLFKDG SEKS+R QLN F+K  KRVFKWEVGDTLKKLR RKLY PALKLSETMAKR MNKT+SDQA HLDL+
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
         KARGIAAAES+FVSLPESSKNHLCYGSLLNCYCKELMT++AEA+LEKMKELNL VTSM YNSLMTLYTKTGQPEKVRAIIQEMKAANV+FD+YTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAA NDISGVERVIDEMKRDG+ VGDWTTYSNLASIYVDAH+F+KA  ALK+LEKRNA R+LSAFQF+ITL+G+MGNLLEVYRVWRSLRLAFP TANI
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR
        SYLNMIQTL KLKDLPGAEKCFKEWESGCSTYDIRIAN LIGAYA+EGLLEKA+ELK RAR+RGAKPNAKTWEIF+DYYL+NGEFK A DCVAKAVS GR
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR

Query:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
           GKW+PSPE+++T MSH+E+EKDVDGAE F+ETVKK+VD+LE EVFE+LIRTYSAAGR+S MM RRLKMENVE  E
Subjt:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

A0A7N2LAW4 Uncharacterized protein2.6e-21779.58Show/hide
Query:  LRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKA
        L+   R K+VAKRS KYLEEALY RLFK+G SE S+RHQLN FIK HKRV+KWEVGDTL+KLR RKLY PALKLSETMAKRGMNKT+SDQAIHLDLVAK 
Subjt:  LRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKA

Query:  RGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRAL
        RGI+AAE+YF+ LPESSKNHLCYG+LLNCYCKELMT++AEAL+EKMKELNLP++SM YNSLMTLYTKT QPEK+ AIIQEMKA+++MFD+YTYNVWMRAL
Subjt:  RGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRAL

Query:  AALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYL
        AA+NDI+GVERVIDEMKRDG+V GDWTTYSNLASIYVDA LFEKA+KALK+LEKRNA ++LSA+QF+ITLYGR GNLLEVYRVWRSLRLAFPKTANISYL
Subjt:  AALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYL

Query:  NMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGG-
        NMIQ L  +KDLPGAEKCF+EWESGCS YDIRIAN LIGAYA+EGLLEKA E+KERARRRGA PNAKTWEIFLDYY++  E K AVDCVA A+SIGRG  
Subjt:  NMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGG-

Query:  -KWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
         KW+PS  +V +LM HFE+EKDVDGAE FLE +K+AVD L  EVFE+LIRTY+AAGR S ++ RRLKMENVE  E
Subjt:  -KWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

SwissProt top hitse value%identityAlignment
O22714 Pentatricopeptide repeat-containing protein At1g607701.4e-19169.25Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        MA+R  SR ++V KRS  KY+EE LY RLFKDG +E  +R QLN F+KG K VFKWEVGDT+KKLR R LY PALKLSE M +RGMNKT+SDQAIHLDLV
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
        AKAR I A E+YFV LPE+SK  L YGSLLNCYCKEL+T++AE LL KMKELN+  +SMSYNSLMTLYTKTG+ EKV A+IQE+KA NVM DSYTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAA NDISGVERVI+EM RDG+V  DWTTYSN+ASIYVDA L +KA+KAL++LE +N  R+ +A+QF+ITLYGR+G L EVYR+WRSLRLA PKT+N+
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR
        +YLNMIQ L KL DLPGAE  FKEW++ CSTYDIRI NVLIGAYA+EGL++KA ELKE+A RRG K NAKTWEIF+DYY+++G+   A++C++KAVSIG+
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR

Query:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
          GGKW+PSPE V+ LMS+FE++KDV+GAE+ LE +K   D +  E+FE LIRTY+AAG+    M RRLKMENVE  E
Subjt:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

Q8LPS6 Pentatricopeptide repeat-containing protein At1g021501.3e-6934.27Show/hide
Query:  ALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRG--MNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK
        A+Y ++      E      LN + K  +++ KWE+   +K+LR  K  N AL++ + M  RG     + SD AI LDL+ K RGI  AE +F+ LPE+ K
Subjt:  ALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRG--MNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK

Query:  NHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKR
        +   YGSLLN Y +    ++AEALL  M++    +  + +N +MTLY    + +KV A++ EMK  ++  D Y+YN+W+ +  +L  +  +E V  +MK 
Subjt:  NHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKR

Query:  DGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKC
        D  +  +WTT+S +A++Y+     EKA+ AL+ +E R   R    + ++++LYG +GN  E+YRVW   +   P   N+ Y  ++ +L ++ D+ GAEK 
Subjt:  DGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKC

Query:  FKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEE
        ++EW    S+YD RI N+L+ AY +   LE A  L +     G KP++ TWEI    + R      A+ C+  A S      W P   ++       EEE
Subjt:  FKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEE

Query:  KDVDGAEDFLETVKKAVDTLEPEVFETLI
         DV   E  LE ++++ D LE + +  LI
Subjt:  KDVDGAEDFLETVKKAVDTLEPEVFETLI

Q93WC5 Pentatricopeptide repeat-containing protein At4g01990, mitochondrial1.3e-8038.84Show/hide
Query:  LNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEA
        LN F+     V K ++    K LR  +    AL++ E M ++ +  T SD AI L+L+AK++G+ AAE+YF SL +S KN   YGSLLNCYC E    +A
Subjt:  LNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEA

Query:  EALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDA
        +A  E M +LN    S+ +N+LM +Y   GQPEKV A++  MK  ++     TY++W+++  +L D+ GVE+V+DEMK +G+ +  W T++NLA+IY+  
Subjt:  EALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDA

Query:  HLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIG
         L+ KA++ALK LE          + F+I LY  + N  EVYRVW  L+  +P   N SYL M++ L+KL D+ G +K F EWES C TYD+R+ANV I 
Subjt:  HLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIG

Query:  AYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLE
        +Y ++ + E+A  +   A ++     +K  ++ + + L+N +   A+     AV + +   W  S E++ +   HFEE KDVDGAE+F +T+ K    L 
Subjt:  AYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLE

Query:  PEVFETLIRTYSAAGRKSSMMDRRLKMENV
         E +  L++TY AAG+    M +RL+ + +
Subjt:  PEVFETLIRTYSAAGRKSSMMDRRLKMENV

Q9FZ24 Pentatricopeptide repeat-containing protein At1g02370, mitochondrial1.9e-8438.26Show/hide
Query:  EEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK
        +  LY +L     +  ++   LN FI     V K ++    K LR  +    A ++ + M KR M  ++SD AI LDL+ K +G+ AAE+YF +L  S+K
Subjt:  EEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK

Query:  NH-LCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMK
        NH   YG+L+NCYC EL  ++A+A  E M ELN    S+ +N++M++Y +  QPEKV  ++  MK   +     TY++WM++  +LND+ G+E++IDEM 
Subjt:  NH-LCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMK

Query:  RDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEK
        +D +    W T+SNLA+IY  A L+EKAD ALK +E++       +  F+++LY  +    EVYRVW SL+ A P+  N+SYL M+Q ++KL DL G +K
Subjt:  RDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEK

Query:  CFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGK--WMPSPEIVKTLMSHF
         F EWES C  YD+R+AN+ I  Y +  + E+A ++ + A ++   P +K  ++ + + L N +   A+  +  AVS     K  W  S E+V     HF
Subjt:  CFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGK--WMPSPEIVKTLMSHF

Query:  EEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYERI
        E+ KDVDGAEDF + +      L+ E    LI+TY+AA + S  M  RL  + +E  E I
Subjt:  EEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYERI

Q9SY07 Pentatricopeptide repeat-containing protein At4g02820, mitochondrial1.4e-6332.79Show/hide
Query:  KGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETM-AKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALL
        +GH  V K+E+   +++LR  K Y  AL++ E M  +  +     D A+HLDL++K RG+ +AE +F  +P+  + H    SLL+ Y +  ++D+AEAL 
Subjt:  KGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETM-AKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALL

Query:  EKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFE
        EKM E     + + YN ++++Y   GQ EKV  +I+E+K      D  TYN+W+ A A+ ND+ G E+V  + K + ++  DW TYS L ++Y      E
Subjt:  EKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFE

Query:  KADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAE
        KA  ALK++EK  + +   A+  +I+L+  +G+   V   W+ ++ +F K  +  YL+MI  + KL +   A+  + EWES   T D RI N+++  Y  
Subjt:  KADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAE

Query:  EGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVF
           +    +  ER   +G  P+  TWEI    YL+  + +  +DC  KA+   +  KW  +  +VK      EE+ +V GAE  +  ++KA   +  +++
Subjt:  EGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVF

Query:  ETLIRTYSAAGRKSSMMDRRLKMENVEAYE
         +L+RTY+ AG  + +++ R+  +NVE  E
Subjt:  ETLIRTYSAAGRKSSMMDRRLKMENVEAYE

Arabidopsis top hitse value%identityAlignment
AT1G02150.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.6e-7134.27Show/hide
Query:  ALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRG--MNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK
        A+Y ++      E      LN + K  +++ KWE+   +K+LR  K  N AL++ + M  RG     + SD AI LDL+ K RGI  AE +F+ LPE+ K
Subjt:  ALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRG--MNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK

Query:  NHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKR
        +   YGSLLN Y +    ++AEALL  M++    +  + +N +MTLY    + +KV A++ EMK  ++  D Y+YN+W+ +  +L  +  +E V  +MK 
Subjt:  NHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKR

Query:  DGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKC
        D  +  +WTT+S +A++Y+     EKA+ AL+ +E R   R    + ++++LYG +GN  E+YRVW   +   P   N+ Y  ++ +L ++ D+ GAEK 
Subjt:  DGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKC

Query:  FKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEE
        ++EW    S+YD RI N+L+ AY +   LE A  L +     G KP++ TWEI    + R      A+ C+  A S      W P   ++       EEE
Subjt:  FKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEE

Query:  KDVDGAEDFLETVKKAVDTLEPEVFETLI
         DV   E  LE ++++ D LE + +  LI
Subjt:  KDVDGAEDFLETVKKAVDTLEPEVFETLI

AT1G02370.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.4e-8538.26Show/hide
Query:  EEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK
        +  LY +L     +  ++   LN FI     V K ++    K LR  +    A ++ + M KR M  ++SD AI LDL+ K +G+ AAE+YF +L  S+K
Subjt:  EEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSK

Query:  NH-LCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMK
        NH   YG+L+NCYC EL  ++A+A  E M ELN    S+ +N++M++Y +  QPEKV  ++  MK   +     TY++WM++  +LND+ G+E++IDEM 
Subjt:  NH-LCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMK

Query:  RDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEK
        +D +    W T+SNLA+IY  A L+EKAD ALK +E++       +  F+++LY  +    EVYRVW SL+ A P+  N+SYL M+Q ++KL DL G +K
Subjt:  RDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEK

Query:  CFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGK--WMPSPEIVKTLMSHF
         F EWES C  YD+R+AN+ I  Y +  + E+A ++ + A ++   P +K  ++ + + L N +   A+  +  AVS     K  W  S E+V     HF
Subjt:  CFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGK--WMPSPEIVKTLMSHF

Query:  EEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYERI
        E+ KDVDGAEDF + +      L+ E    LI+TY+AA + S  M  RL  + +E  E I
Subjt:  EEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYERI

AT1G60770.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.9e-19369.25Show/hide
Query:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV
        MA+R  SR ++V KRS  KY+EE LY RLFKDG +E  +R QLN F+KG K VFKWEVGDT+KKLR R LY PALKLSE M +RGMNKT+SDQAIHLDLV
Subjt:  MALRQFSRPKNVAKRS-NKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLV

Query:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM
        AKAR I A E+YFV LPE+SK  L YGSLLNCYCKEL+T++AE LL KMKELN+  +SMSYNSLMTLYTKTG+ EKV A+IQE+KA NVM DSYTYNVWM
Subjt:  AKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWM

Query:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI
        RALAA NDISGVERVI+EM RDG+V  DWTTYSN+ASIYVDA L +KA+KAL++LE +N  R+ +A+QF+ITLYGR+G L EVYR+WRSLRLA PKT+N+
Subjt:  RALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANI

Query:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR
        +YLNMIQ L KL DLPGAE  FKEW++ CSTYDIRI NVLIGAYA+EGL++KA ELKE+A RRG K NAKTWEIF+DYY+++G+   A++C++KAVSIG+
Subjt:  SYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGR

Query:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE
          GGKW+PSPE V+ LMS+FE++KDV+GAE+ LE +K   D +  E+FE LIRTY+AAG+    M RRLKMENVE  E
Subjt:  --GGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYE

AT4G01990.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.2e-8238.84Show/hide
Query:  LNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEA
        LN F+     V K ++    K LR  +    AL++ E M ++ +  T SD AI L+L+AK++G+ AAE+YF SL +S KN   YGSLLNCYC E    +A
Subjt:  LNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEA

Query:  EALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDA
        +A  E M +LN    S+ +N+LM +Y   GQPEKV A++  MK  ++     TY++W+++  +L D+ GVE+V+DEMK +G+ +  W T++NLA+IY+  
Subjt:  EALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDA

Query:  HLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIG
         L+ KA++ALK LE          + F+I LY  + N  EVYRVW  L+  +P   N SYL M++ L+KL D+ G +K F EWES C TYD+R+ANV I 
Subjt:  HLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIG

Query:  AYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLE
        +Y ++ + E+A  +   A ++     +K  ++ + + L+N +   A+     AV + +   W  S E++ +   HFEE KDVDGAE+F +T+ K    L 
Subjt:  AYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLE

Query:  PEVFETLIRTYSAAGRKSSMMDRRLKMENV
         E +  L++TY AAG+    M +RL+ + +
Subjt:  PEVFETLIRTYSAAGRKSSMMDRRLKMENV

AT4G02820.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-6432.79Show/hide
Query:  KGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETM-AKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALL
        +GH  V K+E+   +++LR  K Y  AL++ E M  +  +     D A+HLDL++K RG+ +AE +F  +P+  + H    SLL+ Y +  ++D+AEAL 
Subjt:  KGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETM-AKRGMNKTISDQAIHLDLVAKARGIAAAESYFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALL

Query:  EKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFE
        EKM E     + + YN ++++Y   GQ EKV  +I+E+K      D  TYN+W+ A A+ ND+ G E+V  + K + ++  DW TYS L ++Y      E
Subjt:  EKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKRDGQVVGDWTTYSNLASIYVDAHLFE

Query:  KADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAE
        KA  ALK++EK  + +   A+  +I+L+  +G+   V   W+ ++ +F K  +  YL+MI  + KL +   A+  + EWES   T D RI N+++  Y  
Subjt:  KADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCSTYDIRIANVLIGAYAE

Query:  EGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVF
           +    +  ER   +G  P+  TWEI    YL+  + +  +DC  KA+   +  KW  +  +VK      EE+ +V GAE  +  ++KA   +  +++
Subjt:  EGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTLEPEVF

Query:  ETLIRTYSAAGRKSSMMDRRLKMENVEAYE
         +L+RTY+ AG  + +++ R+  +NVE  E
Subjt:  ETLIRTYSAAGRKSSMMDRRLKMENVEAYE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTTACGGCAGTTCAGTCGCCCCAAGAATGTGGCGAAGAGGTCGAACAAGTATCTGGAGGAAGCGCTGTATGTGAGGCTCTTTAAAGATGGTAGCTCAGAGAAGAG
CATTCGGCATCAGTTGAATGCTTTCATCAAGGGTCACAAACGAGTTTTCAAATGGGAGGTGGGAGATACGCTCAAAAAGCTTCGCGGCAGAAAGCTGTATAATCCTGCTC
TCAAGCTTTCAGAAACTATGGCCAAAAGGGGCATGAACAAGACAATTAGTGATCAAGCTATACACCTTGATTTAGTAGCCAAGGCTCGAGGAATTGCTGCTGCTGAGAGT
TACTTTGTTAGTCTTCCTGAATCATCAAAGAATCATCTTTGCTATGGCTCTCTTCTCAACTGTTACTGCAAGGAATTAATGACTGACGAAGCTGAAGCTCTCCTTGAAAA
GATGAAGGAACTCAACTTACCTGTGACCTCCATGTCATATAATAGTCTTATGACGCTATACACAAAGACCGGGCAGCCAGAAAAAGTTCGTGCAATCATACAGGAAATGA
AGGCTGCCAATGTTATGTTTGATTCCTATACATACAATGTGTGGATGAGGGCACTAGCTGCTTTAAATGACATTTCTGGTGTGGAGAGGGTTATTGACGAGATGAAGAGG
GATGGCCAAGTTGTTGGTGATTGGACAACATATAGCAATTTAGCCTCAATTTATGTTGATGCCCACTTGTTCGAAAAGGCAGACAAGGCACTTAAGGACTTGGAGAAGAG
AAATGCTCATCGAGAACTTTCTGCTTTCCAGTTCATGATTACATTATATGGACGAATGGGTAACCTGCTTGAAGTTTATAGAGTCTGGCGCTCATTAAGGTTGGCCTTCC
CAAAAACTGCAAATATAAGCTATCTCAACATGATCCAAACTCTGACAAAGCTGAAAGATTTACCTGGTGCAGAGAAATGTTTCAAGGAGTGGGAATCAGGATGCTCAACT
TATGATATTAGGATAGCGAATGTTCTTATAGGAGCTTATGCTGAAGAGGGTCTGCTAGAGAAGGCTATGGAGCTCAAGGAACGAGCCAGAAGAAGAGGCGCTAAACCTAA
TGCAAAAACTTGGGAAATTTTTCTGGATTATTATCTCAGAAATGGAGAATTTAAACCGGCAGTTGATTGTGTTGCTAAAGCAGTATCTATTGGTAGAGGAGGGAAATGGA
TGCCATCACCCGAGATTGTTAAAACGTTGATGAGCCATTTTGAGGAAGAAAAAGATGTAGATGGGGCAGAGGATTTTCTTGAAACTGTGAAGAAGGCTGTTGACACTTTA
GAGCCTGAGGTGTTTGAGACATTGATAAGAACATATTCAGCGGCCGGAAGGAAAAGTTCTATGATGGATCGCCGGTTGAAAATGGAGAACGTGGAGGCCTATGAACGGAT
TGGGGGAAAAGGCAATGCAAATAATACCAAGATCCCTCCAGATGGAAGATCTGGTTCTGTTCTGATTCGTTACGACAGTGACAACAACGGGGATCTGGTTCTCTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGTTACGGCAGTTCAGTCGCCCCAAGAATGTGGCGAAGAGGTCGAACAAGTATCTGGAGGAAGCGCTGTATGTGAGGCTCTTTAAAGATGGTAGCTCAGAGAAGAG
CATTCGGCATCAGTTGAATGCTTTCATCAAGGGTCACAAACGAGTTTTCAAATGGGAGGTGGGAGATACGCTCAAAAAGCTTCGCGGCAGAAAGCTGTATAATCCTGCTC
TCAAGCTTTCAGAAACTATGGCCAAAAGGGGCATGAACAAGACAATTAGTGATCAAGCTATACACCTTGATTTAGTAGCCAAGGCTCGAGGAATTGCTGCTGCTGAGAGT
TACTTTGTTAGTCTTCCTGAATCATCAAAGAATCATCTTTGCTATGGCTCTCTTCTCAACTGTTACTGCAAGGAATTAATGACTGACGAAGCTGAAGCTCTCCTTGAAAA
GATGAAGGAACTCAACTTACCTGTGACCTCCATGTCATATAATAGTCTTATGACGCTATACACAAAGACCGGGCAGCCAGAAAAAGTTCGTGCAATCATACAGGAAATGA
AGGCTGCCAATGTTATGTTTGATTCCTATACATACAATGTGTGGATGAGGGCACTAGCTGCTTTAAATGACATTTCTGGTGTGGAGAGGGTTATTGACGAGATGAAGAGG
GATGGCCAAGTTGTTGGTGATTGGACAACATATAGCAATTTAGCCTCAATTTATGTTGATGCCCACTTGTTCGAAAAGGCAGACAAGGCACTTAAGGACTTGGAGAAGAG
AAATGCTCATCGAGAACTTTCTGCTTTCCAGTTCATGATTACATTATATGGACGAATGGGTAACCTGCTTGAAGTTTATAGAGTCTGGCGCTCATTAAGGTTGGCCTTCC
CAAAAACTGCAAATATAAGCTATCTCAACATGATCCAAACTCTGACAAAGCTGAAAGATTTACCTGGTGCAGAGAAATGTTTCAAGGAGTGGGAATCAGGATGCTCAACT
TATGATATTAGGATAGCGAATGTTCTTATAGGAGCTTATGCTGAAGAGGGTCTGCTAGAGAAGGCTATGGAGCTCAAGGAACGAGCCAGAAGAAGAGGCGCTAAACCTAA
TGCAAAAACTTGGGAAATTTTTCTGGATTATTATCTCAGAAATGGAGAATTTAAACCGGCAGTTGATTGTGTTGCTAAAGCAGTATCTATTGGTAGAGGAGGGAAATGGA
TGCCATCACCCGAGATTGTTAAAACGTTGATGAGCCATTTTGAGGAAGAAAAAGATGTAGATGGGGCAGAGGATTTTCTTGAAACTGTGAAGAAGGCTGTTGACACTTTA
GAGCCTGAGGTGTTTGAGACATTGATAAGAACATATTCAGCGGCCGGAAGGAAAAGTTCTATGATGGATCGCCGGTTGAAAATGGAGAACGTGGAGGCCTATGAACGGAT
TGGGGGAAAAGGCAATGCAAATAATACCAAGATCCCTCCAGATGGAAGATCTGGTTCTGTTCTGATTCGTTACGACAGTGACAACAACGGGGATCTGGTTCTCTTCTGA
Protein sequenceShow/hide protein sequence
MALRQFSRPKNVAKRSNKYLEEALYVRLFKDGSSEKSIRHQLNAFIKGHKRVFKWEVGDTLKKLRGRKLYNPALKLSETMAKRGMNKTISDQAIHLDLVAKARGIAAAES
YFVSLPESSKNHLCYGSLLNCYCKELMTDEAEALLEKMKELNLPVTSMSYNSLMTLYTKTGQPEKVRAIIQEMKAANVMFDSYTYNVWMRALAALNDISGVERVIDEMKR
DGQVVGDWTTYSNLASIYVDAHLFEKADKALKDLEKRNAHRELSAFQFMITLYGRMGNLLEVYRVWRSLRLAFPKTANISYLNMIQTLTKLKDLPGAEKCFKEWESGCST
YDIRIANVLIGAYAEEGLLEKAMELKERARRRGAKPNAKTWEIFLDYYLRNGEFKPAVDCVAKAVSIGRGGKWMPSPEIVKTLMSHFEEEKDVDGAEDFLETVKKAVDTL
EPEVFETLIRTYSAAGRKSSMMDRRLKMENVEAYERIGGKGNANNTKIPPDGRSGSVLIRYDSDNNGDLVLF