; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008433 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008433
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold4:1387492..1394470
RNA-Seq ExpressionMS008433
SyntenyMS008433
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000644 - CBS domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044781 - Pentatricopeptide repeat-containing protein At5g10690-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466304.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X4 [Cucumis melo]1.0e-29686.62Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSIN------AGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL
        H L IDRTAYTAMID+LVNCGSIN      AGALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLL
Subjt:  HDLFIDRTAYTAMIDSLVNCGSIN------AGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL

Query:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV
        MEAALNDNQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+
Subjt:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV

Query:  DEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR
        D+WGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN PR
Subjt:  DEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR

XP_008466305.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X5 [Cucumis melo]8.0e-29787.33Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        H L IDRTAYTAMID+LVNCGSIN GALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
        DNQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+D+WGRC
Subjt:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC

Query:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR
         GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN PR
Subjt:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR

XP_016903485.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X2 [Cucumis melo]1.8e-29686.33Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        H L IDRTAYTAMID+LVNCGSINAGALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQ--------IDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVP
        DNQ        IDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVP
Subjt:  DNQ--------IDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVP

Query:  IVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR
        I+D+WGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN PR
Subjt:  IVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR

XP_022139945.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Momordica charantia]0.0e+0099.33Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGE RKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        HD FIDRTAYTAMIDSLVNCGSIN GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
        DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
Subjt:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC

Query:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
        TGLLHREDCTELDAPLWK+MRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
Subjt:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR

XP_022140025.1 pentatricopeptide repeat-containing protein At5g10690 isoform X2 [Momordica charantia]0.0e+0095.29Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGE RKVDEAFQLLESVEE                        GDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        HD FIDRTAYTAMIDSLVNCGSIN GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
        DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
Subjt:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC

Query:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
        TGLLHREDCTELDAPLWK+MRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
Subjt:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR

TrEMBL top hitse value%identityAlignment
A0A1S3CQX2 pentatricopeptide repeat-containing protein At5g10690 isoform X45.0e-29786.62Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSIN------AGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL
        H L IDRTAYTAMID+LVNCGSIN      AGALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLL
Subjt:  HDLFIDRTAYTAMIDSLVNCGSIN------AGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL

Query:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV
        MEAALNDNQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+
Subjt:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV

Query:  DEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR
        D+WGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN PR
Subjt:  DEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR

A0A1S3CQY3 pentatricopeptide repeat-containing protein At5g10690 isoform X53.9e-29787.33Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        H L IDRTAYTAMID+LVNCGSIN GALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
        DNQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+D+WGRC
Subjt:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC

Query:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR
         GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN PR
Subjt:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR

A0A1S4E685 pentatricopeptide repeat-containing protein At5g10690 isoform X28.6e-29786.33Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        H L IDRTAYTAMID+LVNCGSINAGALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQ--------IDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVP
        DNQ        IDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVP
Subjt:  DNQ--------IDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVP

Query:  IVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR
        I+D+WGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN PR
Subjt:  IVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPR

A0A6J1CDP2 pentatricopeptide repeat-containing protein At5g10690 isoform X10.0e+0099.33Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGE RKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        HD FIDRTAYTAMIDSLVNCGSIN GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
        DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
Subjt:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC

Query:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
        TGLLHREDCTELDAPLWK+MRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
Subjt:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR

A0A6J1CDX0 pentatricopeptide repeat-containing protein At5g10690 isoform X20.0e+0095.29Show/hide
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGE RKVDEAFQLLESVEE                        GDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
        HD FIDRTAYTAMIDSLVNCGSIN GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN
Subjt:  HDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALN

Query:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
        DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC
Subjt:  DNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRC

Query:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
        TGLLHREDCTELDAPLWK+MRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR
Subjt:  TGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR

SwissProt top hitse value%identityAlignment
Q8VYD6 Pentatricopeptide repeat-containing protein At5g106901.8e-18259.09Show/hide
Query:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG
        + +R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+E AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F+EM++P   GVD++SY T+LKG
Subjt:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG

Query:  LGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ
        LG+AR++DEAFQ+LE++E GTA G+P LS+ LIYGLL+ALI AGD+RRANGL+ARY  +L + G  S  +YNLLMKGY++S  PQAAI +  EML L L+
Subjt:  LGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ

Query:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAG
        PDRLTYNTLI AC+K   LDAAM FF +MKE+A++Y  + + PD VTYTTL+KGFG   D   +  I LEMK C ++FIDRTA+TA++D+++ CGS  +G
Subjt:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAG

Query:  ALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSR
        AL +FGE+LK SG N   RPKPHLYLS+MR  + +GDY MV+ L+ R+W DSSG+I    Q+EAD+LLMEAALND Q+D A+  L +I++ WK I WT+ 
Subjt:  ALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSR

Query:  GGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPS
        GG  A+R+E LLGF+KS   P +  +V PS PIES+M+ F+A  PL G++QLK V M FF + VVPIVD+ G C GLLHREDC  LDAPL  MMRSPP  
Subjt:  GGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPS

Query:  VTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY
        V+TTTSIG V +L+L+K+ KMVIVV    FS   G S +AVG FT  +LY
Subjt:  VTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY

Q940A6 Pentatricopeptide repeat-containing protein At4g19440, chloroplastic9.9e-1628.86Show/hide
Query:  LNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANG
        ++T   NA+L      G +D A R   E+     C +D VSY TL+ G    +K+DEAF  L E V+ G    + T S  LI GL N       M +   
Subjt:  LNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANG

Query:  LIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTL
         I  +    R G       Y++++ G   +   +     + EM++  +QP+ + YN LI A  +  +L  A+   E+MK +        I P++ TYT+L
Subjt:  LIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTL

Query:  LKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSI
        +KG  I+       ++  EM+    L  +   YTA+ID     G +
Subjt:  LKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSI

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397104.3e-1931.53Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVE-EGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGL
        N    N ++      G+ID+AL T ++  +   C  + V+Y TL+ G  + RK+D+ F+LL S+  +G     P L +  +  ++N L   G M+  + +
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVE-EGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGL

Query:  IARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLL
        +       R G +L    YN L+KGY   G    A+ M++EML  GL P  +TY +LI +  K   ++ AM F ++M+ R        + P+  TYTTL+
Subjt:  IARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLL

Query:  KGF
         GF
Subjt:  KGF

Q9LPX2 Pentatricopeptide repeat-containing protein At1g12775, mitochondrial1.5e-1626.54Show/hide
Query:  KLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRAN
        KL+ +  + +++     G +D A   F EM +      D ++Y TL+ G   A + D+  +LL + ++   +    T S      L+++ ++ G +R A+
Subjt:  KLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRAN

Query:  GLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTT
         L+     +++ G   +T  YN L+ G+      + AI M   M++ G  PD +T+N LI+   K N++D  +  F EM  R        +  + VTY T
Subjt:  GLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTT

Query:  LLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLK
        L++GF       V   +  EM S   +  D  +Y  ++D L + G +   AL +FG++ K
Subjt:  LLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLK

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic2.4e-1727.69Show/hide
Query:  NAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL +A  VD+A  L   +       SPT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG

Query:  FILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+  +G   AA A++  M+  G++PD  TY+ L+     + ++D  +H+F+E+KE         + PD V Y  ++ G G 
Subjt:  FILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGI

Query:  LRDARVVHMIVL--EMKSCHDLFIDRTAYTAMIDSLVNCGSI
         +  R+   +VL  EMK+   +  D   Y ++I +L   G +
Subjt:  LRDARVVHMIVL--EMKSCHDLFIDRTAYTAMIDSLVNCGSI

Arabidopsis top hitse value%identityAlignment
AT1G12775.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-1726.54Show/hide
Query:  KLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRAN
        KL+ +  + +++     G +D A   F EM +      D ++Y TL+ G   A + D+  +LL + ++   +    T S      L+++ ++ G +R A+
Subjt:  KLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRAN

Query:  GLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTT
         L+     +++ G   +T  YN L+ G+      + AI M   M++ G  PD +T+N LI+   K N++D  +  F EM  R        +  + VTY T
Subjt:  GLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTT

Query:  LLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLK
        L++GF       V   +  EM S   +  D  +Y  ++D L + G +   AL +FG++ K
Subjt:  LLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAGALSLFGELLK

AT4G19440.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-1728.86Show/hide
Query:  LNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANG
        ++T   NA+L      G +D A R   E+     C +D VSY TL+ G    +K+DEAF  L E V+ G    + T S  LI GL N       M +   
Subjt:  LNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLL-ESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANG

Query:  LIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTL
         I  +    R G       Y++++ G   +   +     + EM++  +QP+ + YN LI A  +  +L  A+   E+MK +        I P++ TYT+L
Subjt:  LIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTL

Query:  LKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSI
        +KG  I+       ++  EM+    L  +   YTA+ID     G +
Subjt:  LKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSI

AT4G31850.1 proton gradient regulation 31.7e-1827.69Show/hide
Query:  NAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL +A  VD+A  L   +       SPT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG

Query:  FILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+  +G   AA A++  M+  G++PD  TY+ L+     + ++D  +H+F+E+KE         + PD V Y  ++ G G 
Subjt:  FILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGI

Query:  LRDARVVHMIVL--EMKSCHDLFIDRTAYTAMIDSLVNCGSI
         +  R+   +VL  EMK+   +  D   Y ++I +L   G +
Subjt:  LRDARVVHMIVL--EMKSCHDLFIDRTAYTAMIDSLVNCGSI

AT5G10690.1 pentatricopeptide (PPR) repeat-containing protein / CBS domain-containing protein1.3e-18359.09Show/hide
Query:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG
        + +R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+E AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F+EM++P   GVD++SY T+LKG
Subjt:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG

Query:  LGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ
        LG+AR++DEAFQ+LE++E GTA G+P LS+ LIYGLL+ALI AGD+RRANGL+ARY  +L + G  S  +YNLLMKGY++S  PQAAI +  EML L L+
Subjt:  LGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ

Query:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAG
        PDRLTYNTLI AC+K   LDAAM FF +MKE+A++Y  + + PD VTYTTL+KGFG   D   +  I LEMK C ++FIDRTA+TA++D+++ CGS  +G
Subjt:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAG

Query:  ALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSR
        AL +FGE+LK SG N   RPKPHLYLS+MR  + +GDY MV+ L+ R+W DSSG+I    Q+EAD+LLMEAALND Q+D A+  L +I++ WK I WT+ 
Subjt:  ALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSR

Query:  GGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPS
        GG  A+R+E LLGF+KS   P +  +V PS PIES+M+ F+A  PL G++QLK V M FF + VVPIVD+ G C GLLHREDC  LDAPL  MMRSPP  
Subjt:  GGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPS

Query:  VTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY
        V+TTTSIG V +L+L+K+ KMVIVV    FS   G S +AVG FT  +LY
Subjt:  VTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-2031.53Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVE-EGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGL
        N    N ++      G+ID+AL T ++  +   C  + V+Y TL+ G  + RK+D+ F+LL S+  +G     P L +  +  ++N L   G M+  + +
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVE-EGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGL

Query:  IARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLL
        +       R G +L    YN L+KGY   G    A+ M++EML  GL P  +TY +LI +  K   ++ AM F ++M+ R        + P+  TYTTL+
Subjt:  IARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLL

Query:  KGF
         GF
Subjt:  KGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTACGGATCGCCTCATTTCCCTCTCGACCATTGGCATTAAGTTCATCCGACTCCCTGAATGTATTTTCTTGTCCAATTCATTCTCGTACGGCTTCCCGCCGGCGTCG
GAAAGTATCTCCTCGGAATCCTAATCTCAAGCGGCTGACCTCTCGTGTCGTGCGACTCACTCGCCGGAAGCAGCTTCACCAGGTATTTGAGGAAATTGAAATTGCCAAGA
GACGTTATGGAAAGTTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGCGTGCACTGCGGTGACATTGATTTAGCTCTGAGGACTTTTTATGAAATGTCAAAGCCA
GATAATTGTGGCGTAGACAATGTCAGTTATGGCACACTATTAAAGGGTTTGGGTGAAGCTCGAAAAGTCGATGAAGCATTTCAATTGCTTGAATCTGTGGAAGAAGGTAC
CGCTATTGGAAGTCCAACATTGTCAGCACCCCTTATTTATGGTCTTCTAAATGCTTTAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATTGCACGATATGGGT
TCATACTTCGTGAAGGAGGCAATCTCTCTACTTCAGTTTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAGGCTGCTATAGCTATGTATAGTGAGATG
CTAAATCTCGGGTTACAACCTGATAGGCTCACTTACAATACGTTAATCTCTGCATGTGTGAAGATTAACAAATTGGACGCAGCAATGCATTTCTTTGAGGAAATGAAGGA
ACGAGCTGACAAATATGACCAGGAAGATATTTTTCCTGATGCTGTGACGTACACTACATTACTTAAGGGGTTTGGGATTTTGAGAGATGCCCGTGTAGTTCACATGATTG
TGCTGGAAATGAAATCTTGTCATGATTTGTTTATTGATCGAACAGCATACACCGCAATGATTGATTCTTTGGTCAATTGTGGCTCAATAAACGCAGGTGCTCTTTCATTA
TTTGGGGAATTATTGAAGCTTTCTGGATGGAATTCAGACTTTCGGCCAAAGCCACATCTTTATCTCTCTCTTATGAGGGTTCTTTCTAGTAGAGGCGATTATCGGATGGT
CAAATGTTTGCATAGACGCATGTGGCTGGACTCTTCCGGAACTATTTTTCCTGGATTTCAAGAAGAAGCAGACCATCTTCTCATGGAGGCAGCTTTAAACGACAATCAGA
TTGACGTGGCAATAGAGAAACTGTCAACAATTATTAAGATATGGAAGGGAATCTCATGGACAAGTCGAGGAGGCAGTGTTGCTCTTCGCATAGAAGCATTGCTAGGATTC
ACCAAATCATTCTTCAGTCCTTGCATATTTCCTCGGGTAAATCCTAGTGCACCTATTGAGAGTGTTATGATGCCATTTAAAGCAGTTGAGCCTTTAAATGGCAGCATGCA
GTTGAAGGAAGTGGTGATGCATTTCTTTGACAAATCAGTTGTGCCCATCGTAGACGAATGGGGCAGATGCACTGGATTATTGCACCGCGAAGACTGTACTGAGTTGGATG
CTCCCCTTTGGAAGATGATGAGAAGCCCTCCTCCTAGTGTAACTACTACAACATCCATTGGACACGTCGCGAATCTAATTCTACAAAAGAGGTACAAAATGGTTATTGTT
GTAAGACATAGCAAGTTTAGTACATATGATGGCTCGAGTTTGAGGGCTGTCGGCGTTTTTACTGCTGAGAAATTGTATGGGTTTGTTTCTCCCCTTCCCTTGCCACCCCA
GCCAAACAACCCACGTAACAGG
mRNA sequenceShow/hide mRNA sequence
ATGTTACGGATCGCCTCATTTCCCTCTCGACCATTGGCATTAAGTTCATCCGACTCCCTGAATGTATTTTCTTGTCCAATTCATTCTCGTACGGCTTCCCGCCGGCGTCG
GAAAGTATCTCCTCGGAATCCTAATCTCAAGCGGCTGACCTCTCGTGTCGTGCGACTCACTCGCCGGAAGCAGCTTCACCAGGTATTTGAGGAAATTGAAATTGCCAAGA
GACGTTATGGAAAGTTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGCGTGCACTGCGGTGACATTGATTTAGCTCTGAGGACTTTTTATGAAATGTCAAAGCCA
GATAATTGTGGCGTAGACAATGTCAGTTATGGCACACTATTAAAGGGTTTGGGTGAAGCTCGAAAAGTCGATGAAGCATTTCAATTGCTTGAATCTGTGGAAGAAGGTAC
CGCTATTGGAAGTCCAACATTGTCAGCACCCCTTATTTATGGTCTTCTAAATGCTTTAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATTGCACGATATGGGT
TCATACTTCGTGAAGGAGGCAATCTCTCTACTTCAGTTTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAGGCTGCTATAGCTATGTATAGTGAGATG
CTAAATCTCGGGTTACAACCTGATAGGCTCACTTACAATACGTTAATCTCTGCATGTGTGAAGATTAACAAATTGGACGCAGCAATGCATTTCTTTGAGGAAATGAAGGA
ACGAGCTGACAAATATGACCAGGAAGATATTTTTCCTGATGCTGTGACGTACACTACATTACTTAAGGGGTTTGGGATTTTGAGAGATGCCCGTGTAGTTCACATGATTG
TGCTGGAAATGAAATCTTGTCATGATTTGTTTATTGATCGAACAGCATACACCGCAATGATTGATTCTTTGGTCAATTGTGGCTCAATAAACGCAGGTGCTCTTTCATTA
TTTGGGGAATTATTGAAGCTTTCTGGATGGAATTCAGACTTTCGGCCAAAGCCACATCTTTATCTCTCTCTTATGAGGGTTCTTTCTAGTAGAGGCGATTATCGGATGGT
CAAATGTTTGCATAGACGCATGTGGCTGGACTCTTCCGGAACTATTTTTCCTGGATTTCAAGAAGAAGCAGACCATCTTCTCATGGAGGCAGCTTTAAACGACAATCAGA
TTGACGTGGCAATAGAGAAACTGTCAACAATTATTAAGATATGGAAGGGAATCTCATGGACAAGTCGAGGAGGCAGTGTTGCTCTTCGCATAGAAGCATTGCTAGGATTC
ACCAAATCATTCTTCAGTCCTTGCATATTTCCTCGGGTAAATCCTAGTGCACCTATTGAGAGTGTTATGATGCCATTTAAAGCAGTTGAGCCTTTAAATGGCAGCATGCA
GTTGAAGGAAGTGGTGATGCATTTCTTTGACAAATCAGTTGTGCCCATCGTAGACGAATGGGGCAGATGCACTGGATTATTGCACCGCGAAGACTGTACTGAGTTGGATG
CTCCCCTTTGGAAGATGATGAGAAGCCCTCCTCCTAGTGTAACTACTACAACATCCATTGGACACGTCGCGAATCTAATTCTACAAAAGAGGTACAAAATGGTTATTGTT
GTAAGACATAGCAAGTTTAGTACATATGATGGCTCGAGTTTGAGGGCTGTCGGCGTTTTTACTGCTGAGAAATTGTATGGGTTTGTTTCTCCCCTTCCCTTGCCACCCCA
GCCAAACAACCCACGTAACAGG
Protein sequenceShow/hide protein sequence
MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKP
DNCGVDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEM
LNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDLFIDRTAYTAMIDSLVNCGSINAGALSL
FGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGF
TKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMVIV
VRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPRNR