CuGenDBv2

Moc11g06080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc11g06080
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: pentatricopeptide repeat-containing protein isoform X1
Genome location: chr11:4059081..4074773
RNA-Seq Expression: Moc11g06080
Synteny: Moc11g06080
Gene Ontology terms:
  GO:0003729 - mRNA binding (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR000644 - CBS domain
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR044781 - Pentatricopeptide repeat-containing protein At5g10690-like


Homology
GenBank top hits (e value, % identity, alignment)
XP_008466304.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X4 [Cucumis melo] (e value: 1.6e-291; % identity: 84.51)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGE RK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSIN-------GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL
        H   IDRTAYTAMID+LVNCGSIN       GALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLL
Subjt:  HDFFIDRTAYTAMIDSLVNCGSIN-------GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL

Query:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV
        MEAALNDNQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+
Subjt:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV

Query:  DEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPL
        D+WGRC GLLHREDCTE          LDAPLWK+MRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+
Subjt:  DEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPL

Query:  PPQPNNP
        P +PN P
Subjt:  PPQPNNP

XP_008466305.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X5 [Cucumis melo] (e value: 1.3e-293; % identity: 85.5)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGE RK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        H   IDRTAYTAMID+LVNCGSINGALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLLMEAALND
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
        NQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+D+WGRC 
Subjt:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT

Query:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
        GLLHREDCTE          LDAPLWK+MRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN P
Subjt:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP

XP_022139945.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Momordica charantia] (e value: 0.0e+00; % identity: 98.17)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
        NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
Subjt:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT

Query:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
        GLLHREDCTE          LDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
Subjt:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP

Query:  LN
         N
Subjt:  LN

XP_022140025.1 pentatricopeptide repeat-containing protein At5g10690 isoform X2 [Momordica charantia] (e value: 0.0e+00; % identity: 94.19)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEE                        GDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
        NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
Subjt:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT

Query:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
        GLLHREDCTE          LDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
Subjt:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP

Query:  LN
         N
Subjt:  LN

XP_023535821.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 4.3e-292; % identity: 84.33)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LA + SDS+NVFSC + SRTA  RRR  SPR+PNLKRLTSRVVRLTRRKQLHQ+FEEIEIAK+RYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPDNCG+DNVSYGTLLKGLGE RK+DEAFQLLESVEEGTAIGSPTLSAPLIYG+LNAL EAGDMRRANGLIARYGF+L EGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+A+Y+EMLNL L+PD+LTYNTLISACVKINKLDAAM+FFEEMKERA KY+QEDIFPD VTYTTLLKGFGIL+D R+VH IVLEMKS 
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        HD  IDRTAYTAMID+LVNCGSINGALSLFGELLKLSGWN D RPKPHLYL+ MR  SSRGDYRMVKCLHRRMWLDSSG+I PGFQEEADHLLMEAAL+D
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
        NQIDVA EKLSTIIK WKGISW+SRGGSVALRIEALLG TKSFFSPCIFPRVNP APIESVMMPFKAV+PLNG++QLKEVVM FFDKSVVPI+D+WGRC 
Subjt:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT

Query:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
        GLLHREDC+E          L++PLWK+MRSPPP VTTTTSIGHV NLIL+KRYKM+I+VRHSKFSTYD SS RAVGVFT E+LYGF+SP+P+  QPN P
Subjt:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CQX2 pentatricopeptide repeat-containing protein At5g10690 isoform X4 (e value: 8.0e-292; % identity: 84.51)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGE RK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSIN-------GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL
        H   IDRTAYTAMID+LVNCGSIN       GALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLL
Subjt:  HDFFIDRTAYTAMIDSLVNCGSIN-------GALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLL

Query:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV
        MEAALNDNQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+
Subjt:  MEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIV

Query:  DEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPL
        D+WGRC GLLHREDCTE          LDAPLWK+MRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+
Subjt:  DEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPL

Query:  PPQPNNP
        P +PN P
Subjt:  PPQPNNP

A0A1S3CQY3 pentatricopeptide repeat-containing protein At5g10690 isoform X5 (e value: 6.5e-294; % identity: 85.5)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGE RK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        H   IDRTAYTAMID+LVNCGSINGALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLLMEAALND
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
        NQIDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI+D+WGRC 
Subjt:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT

Query:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
        GLLHREDCTE          LDAPLWK+MRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P+P +PN P
Subjt:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP

A0A1S3CS90 pentatricopeptide repeat-containing protein At5g10690 isoform X3 (e value: 1.0e-291; % identity: 84.38)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASF SR LAL +S+S NVFSC + SRTA+ RR + SPR+PNLKRLTSRVVRLTRRK+LHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTF EMSKPD+CG+DNVSYGTLLKGLGE RK+DEAFQLLESVEEGTAIG PTLSAPLIYGLLNALIEAGDMRRANGLIARYG++LREGGNLS SVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAA+AMY+EMLNL L+PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQED+FPD VTYTTLLK FGIL+D  +VH IVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        H   IDRTAYTAMID+LVNCGSINGALSLFGELLKLSGWN + RPKPHLYL+LMRV SSRGDYRMVKCLHRRMWLDSSGTI  G+QEEADHLLMEAALND
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQ--------IDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPI
        NQ        IDVAIEKLSTIIK WKGISWTSRGGSVALRIEALLG TKSFFSPCIFPRVN  APIESVMMPFKAV+PLNGS+ LKEVVM FFDKSVVPI
Subjt:  NQ--------IDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPI

Query:  VDEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLP
        +D+WGRC GLLHREDCTE          LDAPLWK+MRSPPP VTTT  IGHVANLILQKRYKMV+VVRHSKFS Y GSSLRA+GVFT E+LYGF+SP+P
Subjt:  VDEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLP

Query:  LPPQPNNP
        +P +PN P
Subjt:  LPPQPNNP

A0A6J1CDP2 pentatricopeptide repeat-containing protein At5g10690 isoform X1 (e value: 0.0e+00; % identity: 98.17)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
        NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
Subjt:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT

Query:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
        GLLHREDCTE          LDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
Subjt:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP

Query:  LN
         N
Subjt:  LN

A0A6J1CDX0 pentatricopeptide repeat-containing protein At5g10690 isoform X2 (e value: 0.0e+00; % identity: 94.19)
Query:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL
        LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEE                        GDMRRANGLIARYGFILREGGNLSTSVYNLL
Subjt:  LRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLL

Query:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
        MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC
Subjt:  MKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSC

Query:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
        HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND
Subjt:  HDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALND

Query:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
        NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT
Subjt:  NQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCT

Query:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
        GLLHREDCTE          LDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP
Subjt:  GLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNP

Query:  LN
         N
Subjt:  LN

SwissProt top hits (e value, % identity, alignment)
O49500 E3 ubiquitin-protein ligase MBR2 (e value: 1.9e-16; % identity: 35.08)
Query:  SGSSHTSSSLQSRSESHSRHHMTYP-------EHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLALERIEQEEELTYEQAILLETSL
        +G S +S+   + S S SR H +         E ++    L  +G++ AA+          N  R R +  I +VL A+ R    E L +E  ++ +  +
Subjt:  SGSSHTSSSLQSRSESHSRHHMTYP-------EHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLALERIEQEEELTYEQAILLETSL

Query:  FLSSLNIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKCCICQEEYVSGDEVGRL
        +     +HD+HRDMRLD+DNM+YEELL L ER+G VST LSEE + + + +    S   G     S +D+    CC+CQEEY  GD++G L
Subjt:  FLSSLNIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKCCICQEEYVSGDEVGRL

Q7XTV7 Probable E3 ubiquitin-protein ligase HIP1 (e value: 3.4e-21; % identity: 49.61)
Query:  SFRCRNLDGIAEVLLALERIEQEEELTYEQAILLETSLFLSSLNIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGE
        S R R    + E+  ALE I + E + +E       S+F   ++IHD+HRDMRLDIDNM+YEELL LEER+G VST LSEE +T+ L +  F S      
Subjt:  SFRCRNLDGIAEVLLALERIEQEEELTYEQAILLETSLFLSSLNIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGE

Query:  LTSSVEDLSDVKCCICQEEYVSGDEVGRL
        L +SVE   +  CCICQEEYV GD++G L
Subjt:  LTSSVEDLSDVKCCICQEEYVSGDEVGRL

Q8VYD6 Pentatricopeptide repeat-containing protein At5g10690 (e value: 6.1e-180; % identity: 57.78)
Query:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG
        + +R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+E AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F+EM++P   GVD++SY T+LKG
Subjt:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG

Query:  LGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ
        LG+ R++DEAFQ+LE++E GTA G+P LS+ LIYGLL+ALI AGD+RRANGL+ARY  +L + G  S  +YNLLMKGY++S  PQAAI +  EML L L+
Subjt:  LGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ

Query:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDFFIDRTAYTAMIDSLVNCGSINGA
        PDRLTYNTLI AC+K   LDAAM FF +MKE+A++Y  + + PD VTYTTL+KGFG   D   +  I LEMK C + FIDRTA+TA++D+++ CGS +GA
Subjt:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDFFIDRTAYTAMIDSLVNCGSINGA

Query:  LSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSRG
        L +FGE+LK SG N   RPKPHLYLS+MR  + +GDY MV+ L+ R+W DSSG+I    Q+EAD+LLMEAALND Q+D A+  L +I++ WK I WT+ G
Subjt:  LSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSRG

Query:  GSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLW
        G  A+R+E LLGF+KS   P +  +V PS PIES+M+ F+A  PL G++QLK V M FF + VVPIVD+ G C GLLHREDC            LDAPL 
Subjt:  GSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLW

Query:  KIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY
         +MRSPP  V+TTTSIG V +L+L+K+ KMVIVV    FS   G S +AVG FT  +LY
Subjt:  KIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic (e value: 7.8e-18; % identity: 26.07)
Query:  NAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL +   VD+A  L   +       SPT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG

Query:  FILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+  +G   AA A++  M+  G++PD  TY+ L+     + ++D  +H+F+E+KE         + PD V Y  ++ G G 
Subjt:  FILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGI

Query:  LRDARVVHMIVL--EMKSCHDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRG
         +  R+   +VL  EMK+      D   Y ++I +L   G +  A  ++ E+ +     +   P    + +L+R  S  G
Subjt:  LRDARVVHMIVL--EMKSCHDFFIDRTAYTAMIDSLVNCGSINGALSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRG

Q9ZQF9 E3 ubiquitin-protein ligase MBR1 (e value: 5.1e-17; % identity: 35.94)
Query:  SGSSHTSSSLQSRSESHSRHHMT-------YPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLALERIEQEEELTYEQAILLETSL
        +G S +S+ +   S S+SR H +         E ++    L  IG++ AA+G          + R + +  I +VL A+ R    E L  E  ++ +  +
Subjt:  SGSSHTSSSLQSRSESHSRHHMT-------YPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLALERIEQEEELTYEQAILLETSL

Query:  FLSSLNIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNR-SIFQSKPQGGELTSSVEDLSDVKCCICQEEYVSGDEVGRL
        +    ++HD+HR+MRLD+DNM+YEELL L ER+G VST LSEE + + + +     S P   EL  ++E      CCICQEEYV GD +G L
Subjt:  FLSSLNIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNR-SIFQSKPQGGELTSSVEDLSDVKCCICQEEYVSGDEVGRL

Arabidopsis top hits (e value, % identity, alignment)
AT5G10650.1 RING/U-box superfamily protein (e value: 4.5e-53; % identity: 36.01)
Query:  IVSRKGPGVVLRDNMKNRDHSN--HGSRLGCTGRICSSSGAEVGC---SSKASSKRPFRSSSGKEIVGSSSRSSSIRNFRKSFPDPFGKL----------
        +V RK  G+ LR+NM   D  N    SR+GCT ++ S+  + +G    ++K        + + KEIVGSSSR+          P  FG L          
Subjt:  IVSRKGPGVVLRDNMKNRDHSN--HGSRLGCTGRICSSSGAEVGC---SSKASSKRPFRSSSGKEIVGSSSRSSSIRNFRKSFPDPFGKL----------

Query:  --SSKLETDSSEDGSIQDELEELEFISPPGMFHTV-LHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSRY
          SS L+T+SSE   I D+    E   P      V ++V  +S+V  + ++ + GSSS  ++  S   S           S S  S +    + GG SR+
Subjt:  --SSKLETDSSEDGSIQDELEELEFISPPGMFHTV-LHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSRY

Query:  VSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVS---GTSCGQNCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACDG
          RNL CNSVS+++P +S+S       + +VTKK   DGE+S SS+G K S        Q   +  G+++SD R+ R       V+ S+   S   +   
Subjt:  VSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVS---GTSCGQNCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACDG

Query:  ASHSYQGISGDRSLQDSLMTS-WMPQSNISGSSHTSSSL--QSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVL
            Y G S       S  TS  MP        + S S    +   S  R H   P            G  T A+   SS+  N +     N++GIAEVL
Subjt:  ASHSYQGISGDRSLQDSLMTS-WMPQSNISGSSHTSSSL--QSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVL

Query:  LALERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKC
        LALERIE +EELTYEQ   +ET+LF S +   +DQHRDMRLDIDNM+YEELL L ++MGTVSTALSEEAL+  L +SI+Q   + G ++   +D  D+KC
Subjt:  LALERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKC

Query:  CICQEEYVSGDEVGRLHLEQ----AIVLQYSSLETW
         ICQEEYV GDE+G +  +     + V Q+  ++ W
Subjt:  CICQEEYVSGDEVGRLHLEQ----AIVLQYSSLETW

AT5G10650.2 RING/U-box superfamily protein (e value: 4.5e-53; % identity: 36.01)
Query:  IVSRKGPGVVLRDNMKNRDHSN--HGSRLGCTGRICSSSGAEVGC---SSKASSKRPFRSSSGKEIVGSSSRSSSIRNFRKSFPDPFGKL----------
        +V RK  G+ LR+NM   D  N    SR+GCT ++ S+  + +G    ++K        + + KEIVGSSSR+          P  FG L          
Subjt:  IVSRKGPGVVLRDNMKNRDHSN--HGSRLGCTGRICSSSGAEVGC---SSKASSKRPFRSSSGKEIVGSSSRSSSIRNFRKSFPDPFGKL----------

Query:  --SSKLETDSSEDGSIQDELEELEFISPPGMFHTV-LHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSRY
          SS L+T+SSE   I D+    E   P      V ++V  +S+V  + ++ + GSSS  ++  S   S           S S  S +    + GG SR+
Subjt:  --SSKLETDSSEDGSIQDELEELEFISPPGMFHTV-LHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSRY

Query:  VSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVS---GTSCGQNCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACDG
          RNL CNSVS+++P +S+S       + +VTKK   DGE+S SS+G K S        Q   +  G+++SD R+ R       V+ S+   S   +   
Subjt:  VSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVS---GTSCGQNCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACDG

Query:  ASHSYQGISGDRSLQDSLMTS-WMPQSNISGSSHTSSSL--QSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVL
            Y G S       S  TS  MP        + S S    +   S  R H   P            G  T A+   SS+  N +     N++GIAEVL
Subjt:  ASHSYQGISGDRSLQDSLMTS-WMPQSNISGSSHTSSSL--QSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVL

Query:  LALERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKC
        LALERIE +EELTYEQ   +ET+LF S +   +DQHRDMRLDIDNM+YEELL L ++MGTVSTALSEEAL+  L +SI+Q   + G ++   +D  D+KC
Subjt:  LALERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKC

Query:  CICQEEYVSGDEVGRLHLEQ----AIVLQYSSLETW
         ICQEEYV GDE+G +  +     + V Q+  ++ W
Subjt:  CICQEEYVSGDEVGRLHLEQ----AIVLQYSSLETW

AT5G10690.1 pentatricopeptide (PPR) repeat-containing protein / CBS domain-containing protein (e value: 4.3e-181; % identity: 57.78)
Query:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG
        + +R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+E AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F+EM++P   GVD++SY T+LKG
Subjt:  IHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKPDNCGVDNVSYGTLLKG

Query:  LGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ
        LG+ R++DEAFQ+LE++E GTA G+P LS+ LIYGLL+ALI AGD+RRANGL+ARY  +L + G  S  +YNLLMKGY++S  PQAAI +  EML L L+
Subjt:  LGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEMLNLGLQ

Query:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDFFIDRTAYTAMIDSLVNCGSINGA
        PDRLTYNTLI AC+K   LDAAM FF +MKE+A++Y  + + PD VTYTTL+KGFG   D   +  I LEMK C + FIDRTA+TA++D+++ CGS +GA
Subjt:  PDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDFFIDRTAYTAMIDSLVNCGSINGA

Query:  LSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSRG
        L +FGE+LK SG N   RPKPHLYLS+MR  + +GDY MV+ L+ R+W DSSG+I    Q+EAD+LLMEAALND Q+D A+  L +I++ WK I WT+ G
Subjt:  LSLFGELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSRG

Query:  GSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLW
        G  A+R+E LLGF+KS   P +  +V PS PIES+M+ F+A  PL G++QLK V M FF + VVPIVD+ G C GLLHREDC            LDAPL 
Subjt:  GSVALRIEALLGFTKSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLW

Query:  KIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY
         +MRSPP  V+TTTSIG V +L+L+K+ KMVIVV    FS   G S +AVG FT  +LY
Subjt:  KIMRSPPPSVTTTTSIGHVANLILQKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLY

AT5G24870.1 RING/U-box superfamily protein | e value: 3.7e-63 | %identity: 39.3
Query:  MDEYSSRRVHVGIVSRKGPGVVLRDNMKNRDHSN---HGSRLGCTGRICSSSGAEVGCSSKASSKRPFRSS-SGKEIVGSSSRSSSIRNFRKSFPDPFGK
        MD +  +R    I+ RK  G+VL +NMK +D  +     SR+GC+ R+ S+ G  +   +KA+    FRS  SGKE VGSSSRS S     K      G+
Subjt:  MDEYSSRRVHVGIVSRKGPGVVLRDNMKNRDHSN---HGSRLGCTGRICSSSGAEVGCSSKASSKRPFRSS-SGKEIVGSSSRSSSIRNFRKSFPDPFGK

Query:  --LSSKLETDSSEDGSIQDELEELEFISPPGMF-HTVLHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSR
          LSS L+ DSSE  S+ ++    E   P G    + + V SESSV  + ++ E GSSS  +     R+  QR  + ++D   S   ++   ++N    +
Subjt:  --LSSKLETDSSEDGSIQDELEELEFISPPGMF-HTVLHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSR

Query:  YVSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVSGTSCGQ---NCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACD
           R+L   S S+++P +S+       R+ N+ +K   DGE+SSSSRG K  G+  G    +     GI++S+ R+ RN       L S+   S+ S+  
Subjt:  YVSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVSGTSCGQ---NCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACD

Query:  GASHSYQGISGDRSLQDSLMTSWMPQSNISGSSHTSSSLQSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLA
          S  Y G +G      +L     P       S ++ + +S   S+SR   +        +SL+  G  + +E G+S +  N ++FR  N++G+AEVLLA
Subjt:  GASHSYQGISGDRSLQDSLMTSWMPQSNISGSSHTSSSLQSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLA

Query:  LERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKCCI
        LERIEQ+EELTYEQ  +LET+LFL+ + + HDQHRDMRLDIDNM+YEELL LEE+MGTVSTALSEEAL + L  SI++   +  ++  + +D  DVKC I
Subjt:  LERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKCCI

Query:  CQEEYVSGDEVGRL
        CQEEYV GDEVG L
Subjt:  CQEEYVSGDEVGRL

AT5G24870.2 RING/U-box superfamily protein | e value: 3.7e-63 | %identity: 39.3
Query:  MDEYSSRRVHVGIVSRKGPGVVLRDNMKNRDHSN---HGSRLGCTGRICSSSGAEVGCSSKASSKRPFRSS-SGKEIVGSSSRSSSIRNFRKSFPDPFGK
        MD +  +R    I+ RK  G+VL +NMK +D  +     SR+GC+ R+ S+ G  +   +KA+    FRS  SGKE VGSSSRS S     K      G+
Subjt:  MDEYSSRRVHVGIVSRKGPGVVLRDNMKNRDHSN---HGSRLGCTGRICSSSGAEVGCSSKASSKRPFRSS-SGKEIVGSSSRSSSIRNFRKSFPDPFGK

Query:  --LSSKLETDSSEDGSIQDELEELEFISPPGMF-HTVLHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSR
          LSS L+ DSSE  S+ ++    E   P G    + + V SESSV  + ++ E GSSS  +     R+  QR  + ++D   S   ++   ++N    +
Subjt:  --LSSKLETDSSEDGSIQDELEELEFISPPGMF-HTVLHVKSESSVPADAILMERGSSSTDSNTKSRRNSIQRYGVDNQDTSTSMRSRSLCQALNGGTSR

Query:  YVSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVSGTSCGQ---NCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACD
           R+L   S S+++P +S+       R+ N+ +K   DGE+SSSSRG K  G+  G    +     GI++S+ R+ RN       L S+   S+ S+  
Subjt:  YVSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVSGTSCGQ---NCGYRRGISISDQRQGRNMFHRQNVLSSLGTRSLASACD

Query:  GASHSYQGISGDRSLQDSLMTSWMPQSNISGSSHTSSSLQSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLA
          S  Y G +G      +L     P       S ++ + +S   S+SR   +        +SL+  G  + +E G+S +  N ++FR  N++G+AEVLLA
Subjt:  GASHSYQGISGDRSLQDSLMTSWMPQSNISGSSHTSSSLQSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGIAEVLLA

Query:  LERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKCCI
        LERIEQ+EELTYEQ  +LET+LFL+ + + HDQHRDMRLDIDNM+YEELL LEE+MGTVSTALSEEAL + L  SI++   +  ++  + +D  DVKC I
Subjt:  LERIEQEEELTYEQAILLETSLFLSSL-NIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKCCI

Query:  CQEEYVSGDEVGRL
        CQEEYV GDEVG L
Subjt:  CQEEYVSGDEVGRL
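
The alignment blocks above carry a middle line in which an identical residue is repeated as a letter, a conservative substitution is shown as `+`, and a mismatch or gap is left blank. As a minimal sketch of how a per-block identity could be derived from that middle line, the Python below counts matching positions. The function name `midline_identity` is hypothetical and not part of CuGenDB or any BLAST tool, and the value it returns covers only the block passed in, so it will generally differ from the whole-hit %identity listed for each hit.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, based on its middle line.

    Assumes `query` and `midline` are the aligned strings of a single block
    with the "Query:" label and leading padding already removed, so that
    both strings have the same length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    if not query:
        raise ValueError("empty alignment block")
    # A position is an identity when the midline repeats the query residue.
    # '+' (conservative substitution), ' ' (mismatch) and '-' (gap) are not.
    identical = sum(
        1 for q, m in zip(query, midline) if m == q and q != "-"
    )
    return 100.0 * identical / len(query)


# Last block of the AT5G24870.2 alignment shown above.
block_identity = midline_identity("CQEEYVSGDEVGRL", "CQEEYV GDEVG L")
print(f"{block_identity:.2f}")  # prints 85.71
```

Summing the identical positions and block lengths over all blocks of a hit, rather than averaging the per-block percentages, would be the natural way to extend this to a full alignment.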


Sequences
CDS sequence
ATGTTACGGATCGCCTCATTTCCCTCTCGACCATTGGCATTAAGTTCATCCGACTCCCTGAATGTATTTTCTTGTCCAATTCATTCTCGTACGGCTTCCCGCCGGCGTCG
GAAAGTATCTCCTCGGAATCCTAATCTCAAGCGGCTGACCTCTCGTGTCGTGCGACTCACTCGCCGGAAGCAGCTTCACCAGGTATTTGAGGAAATTGAAATTGCCAAGA
GACGTTATGGAAAGTTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGCGTGCACTGCGGTGACATTGATTTAGCTCTGAGGACTTTTTATGAAATGTCAAAGCCA
GATAATTGTGGCGTAGACAATGTCAGTTATGGCACACTATTAAAGGGTTTGGGTGAAGGTCGAAAAGTCGATGAAGCATTTCAATTGCTTGAATCTGTGGAAGAAGGTAC
CGCTATTGGAAGTCCAACATTGTCAGCACCCCTTATTTATGGTCTTCTAAATGCTTTAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATTGCACGATATGGGT
TCATACTTCGTGAAGGAGGCAATCTCTCTACTTCAGTTTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAGGCTGCTATAGCTATGTATAGTGAGATG
CTAAATCTCGGGTTACAACCTGATAGGCTCACTTACAATACGTTAATCTCTGCATGTGTGAAGATTAACAAATTGGACGCAGCAATGCATTTCTTTGAGGAAATGAAGGA
ACGAGCTGACAAATATGACCAGGAAGATATTTTTCCTGATGCTGTGACGTACACTACATTACTTAAGGGGTTTGGGATTTTGAGAGATGCCCGTGTAGTTCACATGATTG
TGCTGGAAATGAAATCTTGTCATGATTTTTTTATTGATCGAACAGCATACACCGCAATGATTGATTCTTTGGTTAATTGTGGCTCAATAAACGGTGCTCTTTCATTATTT
GGGGAATTATTGAAGCTTTCTGGATGGAATTCAGACTTTCGGCCAAAGCCACATCTTTATCTCTCTCTTATGAGGGTTCTTTCTAGTAGAGGCGATTATCGGATGGTCAA
ATGTTTGCATAGACGCATGTGGCTGGACTCTTCCGGAACTATTTTTCCTGGATTTCAAGAAGAAGCAGACCATCTTCTCATGGAGGCAGCTTTAAACGACAATCAGATTG
ACGTGGCAATAGAGAAACTGTCAACAATTATTAAGATATGGAAGGGAATCTCATGGACAAGTCGAGGAGGCAGTGTTGCTCTTCGCATAGAAGCATTGCTAGGATTCACC
AAATCATTCTTCAGTCCTTGCATATTTCCTCGGGTAAATCCTAGTGCACCTATTGAGAGTGTTATGATGCCATTTAAAGCAGTTGAGCCTTTAAATGGCAGCATGCAGTT
GAAGGAAGTGGTGATGCATTTCTTTGACAAATCAGTTGTGCCCATCGTAGACGAATGGGGCAGATGCACTGGATTATTGCACCGTGAAGACTGTACTGAGAAAAAGTGTA
AATATTTGTTTATGGCGCAGTTGGATGCTCCCCTTTGGAAGATTATGAGAAGCCCTCCTCCTAGTGTAACTACTACAACATCCATTGGACACGTCGCGAATCTAATTCTA
CAAAAGAGGTACAAAATGGTTATTGTTGTAAGACATAGCAAGTTTAGTACATATGATGGCTCGAGTTTGAGGGCTGTCGGCGTTTTTACTGCTGAGAAATTGTATGGGTT
TGTTTCTCCCCTTCCCTTGCCACCCCAGCCAAACAACCCACTAAATCCCATACCACAGGATGGACACCGAAATGTGGCTCATTTGAAAGGAAAAGATCGAGAAGTTGAGA
GGTGCTTCTCATCATCCATGGCCACTGGATTTGCCAATGCTGGCTTTAGATCTCAATCAAAAGCACCGTTTTATCAAAATCTCGTTTCTCTTATTGTCAAGGAAGGTGAA
GATGCGGTGGACTTGAAGCTCGGTGTTGCCGTTCTTCTGTTCTTCTTTCGGTTTCTTCTAGTGTTCAAGCAGGTTGAGTCGACATCGTTGAACAAAAAGGGGAAGAAGGA
AGACAAAGAAGAGCCTCTTCAAAATTACGTGACAAAGCGATACTCACCTCAGTTTTTGATGGATGAGTATTCCAGTAGAAGAGTTCACGTGGGAATTGTTAGTAGAAAGG
GACCTGGTGTAGTTTTGAGGGATAACATGAAGAACAGGGATCATAGTAATCATGGCAGCCGACTTGGCTGTACTGGCAGGATATGCTCCAGTAGTGGTGCAGAAGTTGGC
TGTTCAAGTAAAGCCAGCTCAAAAAGACCATTCCGCAGCTCAAGTGGCAAAGAAATTGTTGGAAGTTCCTCAAGGAGTTCTTCAATCAGAAATTTCAGGAAATCCTTTCC
AGATCCATTTGGAAAGCTTTCGTCTAAGCTTGAAACAGATTCATCTGAAGATGGTAGCATTCAGGATGAGTTGGAAGAACTGGAATTCATCTCACCTCCTGGAATGTTCC
ACACAGTGCTTCATGTAAAATCTGAAAGTTCGGTGCCTGCTGATGCCATATTGATGGAAAGGGGAAGCAGCAGTACGGATTCCAACACTAAATCTAGGAGAAACTCAATT
CAAAGATATGGAGTGGACAATCAAGATACTTCTACATCAATGCGATCTAGAAGTCTTTGCCAGGCATTAAATGGTGGTACCAGTAGATACGTTTCGAGAAACCTCGAATG
TAATTCAGTATCTGAGATAATTCCACCAGATAGTTCTTCGAAAGAGCAAAACCTTGTGAGAAGGAAGAATGTAACAAAGAAGATGATTTATGATGGTGAGAATAGTTCCA
GTTCTAGAGGAAAGAAGGTGAGTGGCACTTCCTGTGGACAGAATTGTGGTTACCGTCGTGGCATCTCTATTTCTGATCAAAGACAAGGTCGAAACATGTTTCACAGACAA
AATGTTCTTTCCTCACTTGGAACTCGAAGTTTGGCAAGTGCATGTGATGGGGCGAGCCATTCTTATCAAGGAATTAGTGGTGATCGGTCGTTGCAGGATTCACTGATGAC
TTCATGGATGCCTCAATCCAACATCTCTGGTTCCTCGCATACTTCTAGCTCACTTCAAAGTCGTTCAGAATCACACTCCAGGCACCACATGACTTACCCTGAACATGAGG
ATTATAGTCAAAGTCTACTTGACATTGGACAAACCACTGCTGCAGAAGGTGGTATTTCGAGCACTTCGACAAATCTGAACAGTTTTCGATGCCGCAACCTTGATGGCATT
GCAGAGGTACTATTGGCGTTGGAGAGAATTGAACAGGAGGAGGAGTTAACTTACGAGCAAGCTATTCTTCTCGAGACAAGTTTGTTCCTTAGTAGCCTAAACATCCATGA
TCAGCATAGAGACATGAGGCTGGACATTGATAACATGACGTATGAGGAACTTCTAGATTTAGAAGAGAGAATGGGGACAGTGAGCACAGCACTGTCTGAAGAAGCATTGA
CAGAATGCCTTAACAGAAGCATCTTCCAGTCCAAACCTCAAGGAGGAGAGCTCACGAGTTCTGTCGAGGATTTGAGCGATGTCAAATGTTGTATTTGTCAGGAAGAATAT
GTGAGTGGAGATGAAGTTGGGAGACTGCATTTGGAACAGGCAATAGTGTTACAGTATTCTAGCTTGGAAACTTGGAGCCTGAACCCATTGGGAGCAATCAAGTGTACCCC
AGAAAGAAGGGCAGAATGGGGAGATACCCCCCAGGTAAACTTTATAGCCCTTTGGATCAGAACAGACTAG
mRNA sequence
ATGTTACGGATCGCCTCATTTCCCTCTCGACCATTGGCATTAAGTTCATCCGACTCCCTGAATGTATTTTCTTGTCCAATTCATTCTCGTACGGCTTCCCGCCGGCGTCG
GAAAGTATCTCCTCGGAATCCTAATCTCAAGCGGCTGACCTCTCGTGTCGTGCGACTCACTCGCCGGAAGCAGCTTCACCAGGTATTTGAGGAAATTGAAATTGCCAAGA
GACGTTATGGAAAGTTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGCGTGCACTGCGGTGACATTGATTTAGCTCTGAGGACTTTTTATGAAATGTCAAAGCCA
GATAATTGTGGCGTAGACAATGTCAGTTATGGCACACTATTAAAGGGTTTGGGTGAAGGTCGAAAAGTCGATGAAGCATTTCAATTGCTTGAATCTGTGGAAGAAGGTAC
CGCTATTGGAAGTCCAACATTGTCAGCACCCCTTATTTATGGTCTTCTAAATGCTTTAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATTGCACGATATGGGT
TCATACTTCGTGAAGGAGGCAATCTCTCTACTTCAGTTTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAGGCTGCTATAGCTATGTATAGTGAGATG
CTAAATCTCGGGTTACAACCTGATAGGCTCACTTACAATACGTTAATCTCTGCATGTGTGAAGATTAACAAATTGGACGCAGCAATGCATTTCTTTGAGGAAATGAAGGA
ACGAGCTGACAAATATGACCAGGAAGATATTTTTCCTGATGCTGTGACGTACACTACATTACTTAAGGGGTTTGGGATTTTGAGAGATGCCCGTGTAGTTCACATGATTG
TGCTGGAAATGAAATCTTGTCATGATTTTTTTATTGATCGAACAGCATACACCGCAATGATTGATTCTTTGGTTAATTGTGGCTCAATAAACGGTGCTCTTTCATTATTT
GGGGAATTATTGAAGCTTTCTGGATGGAATTCAGACTTTCGGCCAAAGCCACATCTTTATCTCTCTCTTATGAGGGTTCTTTCTAGTAGAGGCGATTATCGGATGGTCAA
ATGTTTGCATAGACGCATGTGGCTGGACTCTTCCGGAACTATTTTTCCTGGATTTCAAGAAGAAGCAGACCATCTTCTCATGGAGGCAGCTTTAAACGACAATCAGATTG
ACGTGGCAATAGAGAAACTGTCAACAATTATTAAGATATGGAAGGGAATCTCATGGACAAGTCGAGGAGGCAGTGTTGCTCTTCGCATAGAAGCATTGCTAGGATTCACC
AAATCATTCTTCAGTCCTTGCATATTTCCTCGGGTAAATCCTAGTGCACCTATTGAGAGTGTTATGATGCCATTTAAAGCAGTTGAGCCTTTAAATGGCAGCATGCAGTT
GAAGGAAGTGGTGATGCATTTCTTTGACAAATCAGTTGTGCCCATCGTAGACGAATGGGGCAGATGCACTGGATTATTGCACCGTGAAGACTGTACTGAGAAAAAGTGTA
AATATTTGTTTATGGCGCAGTTGGATGCTCCCCTTTGGAAGATTATGAGAAGCCCTCCTCCTAGTGTAACTACTACAACATCCATTGGACACGTCGCGAATCTAATTCTA
CAAAAGAGGTACAAAATGGTTATTGTTGTAAGACATAGCAAGTTTAGTACATATGATGGCTCGAGTTTGAGGGCTGTCGGCGTTTTTACTGCTGAGAAATTGTATGGGTT
TGTTTCTCCCCTTCCCTTGCCACCCCAGCCAAACAACCCACTAAATCCCATACCACAGGATGGACACCGAAATGTGGCTCATTTGAAAGGAAAAGATCGAGAAGTTGAGA
GGTGCTTCTCATCATCCATGGCCACTGGATTTGCCAATGCTGGCTTTAGATCTCAATCAAAAGCACCGTTTTATCAAAATCTCGTTTCTCTTATTGTCAAGGAAGGTGAA
GATGCGGTGGACTTGAAGCTCGGTGTTGCCGTTCTTCTGTTCTTCTTTCGGTTTCTTCTAGTGTTCAAGCAGGTTGAGTCGACATCGTTGAACAAAAAGGGGAAGAAGGA
AGACAAAGAAGAGCCTCTTCAAAATTACGTGACAAAGCGATACTCACCTCAGTTTTTGATGGATGAGTATTCCAGTAGAAGAGTTCACGTGGGAATTGTTAGTAGAAAGG
GACCTGGTGTAGTTTTGAGGGATAACATGAAGAACAGGGATCATAGTAATCATGGCAGCCGACTTGGCTGTACTGGCAGGATATGCTCCAGTAGTGGTGCAGAAGTTGGC
TGTTCAAGTAAAGCCAGCTCAAAAAGACCATTCCGCAGCTCAAGTGGCAAAGAAATTGTTGGAAGTTCCTCAAGGAGTTCTTCAATCAGAAATTTCAGGAAATCCTTTCC
AGATCCATTTGGAAAGCTTTCGTCTAAGCTTGAAACAGATTCATCTGAAGATGGTAGCATTCAGGATGAGTTGGAAGAACTGGAATTCATCTCACCTCCTGGAATGTTCC
ACACAGTGCTTCATGTAAAATCTGAAAGTTCGGTGCCTGCTGATGCCATATTGATGGAAAGGGGAAGCAGCAGTACGGATTCCAACACTAAATCTAGGAGAAACTCAATT
CAAAGATATGGAGTGGACAATCAAGATACTTCTACATCAATGCGATCTAGAAGTCTTTGCCAGGCATTAAATGGTGGTACCAGTAGATACGTTTCGAGAAACCTCGAATG
TAATTCAGTATCTGAGATAATTCCACCAGATAGTTCTTCGAAAGAGCAAAACCTTGTGAGAAGGAAGAATGTAACAAAGAAGATGATTTATGATGGTGAGAATAGTTCCA
GTTCTAGAGGAAAGAAGGTGAGTGGCACTTCCTGTGGACAGAATTGTGGTTACCGTCGTGGCATCTCTATTTCTGATCAAAGACAAGGTCGAAACATGTTTCACAGACAA
AATGTTCTTTCCTCACTTGGAACTCGAAGTTTGGCAAGTGCATGTGATGGGGCGAGCCATTCTTATCAAGGAATTAGTGGTGATCGGTCGTTGCAGGATTCACTGATGAC
TTCATGGATGCCTCAATCCAACATCTCTGGTTCCTCGCATACTTCTAGCTCACTTCAAAGTCGTTCAGAATCACACTCCAGGCACCACATGACTTACCCTGAACATGAGG
ATTATAGTCAAAGTCTACTTGACATTGGACAAACCACTGCTGCAGAAGGTGGTATTTCGAGCACTTCGACAAATCTGAACAGTTTTCGATGCCGCAACCTTGATGGCATT
GCAGAGGTACTATTGGCGTTGGAGAGAATTGAACAGGAGGAGGAGTTAACTTACGAGCAAGCTATTCTTCTCGAGACAAGTTTGTTCCTTAGTAGCCTAAACATCCATGA
TCAGCATAGAGACATGAGGCTGGACATTGATAACATGACGTATGAGGAACTTCTAGATTTAGAAGAGAGAATGGGGACAGTGAGCACAGCACTGTCTGAAGAAGCATTGA
CAGAATGCCTTAACAGAAGCATCTTCCAGTCCAAACCTCAAGGAGGAGAGCTCACGAGTTCTGTCGAGGATTTGAGCGATGTCAAATGTTGTATTTGTCAGGAAGAATAT
GTGAGTGGAGATGAAGTTGGGAGACTGCATTTGGAACAGGCAATAGTGTTACAGTATTCTAGCTTGGAAACTTGGAGCCTGAACCCATTGGGAGCAATCAAGTGTACCCC
AGAAAGAAGGGCAGAATGGGGAGATACCCCCCAGGTAAACTTTATAGCCCTTTGGATCAGAACAGACTAG
Protein sequence
MLRIASFPSRPLALSSSDSLNVFSCPIHSRTASRRRRKVSPRNPNLKRLTSRVVRLTRRKQLHQVFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFYEMSKP
DNCGVDNVSYGTLLKGLGEGRKVDEAFQLLESVEEGTAIGSPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFILREGGNLSTSVYNLLMKGYISSGVPQAAIAMYSEM
LNLGLQPDRLTYNTLISACVKINKLDAAMHFFEEMKERADKYDQEDIFPDAVTYTTLLKGFGILRDARVVHMIVLEMKSCHDFFIDRTAYTAMIDSLVNCGSINGALSLF
GELLKLSGWNSDFRPKPHLYLSLMRVLSSRGDYRMVKCLHRRMWLDSSGTIFPGFQEEADHLLMEAALNDNQIDVAIEKLSTIIKIWKGISWTSRGGSVALRIEALLGFT
KSFFSPCIFPRVNPSAPIESVMMPFKAVEPLNGSMQLKEVVMHFFDKSVVPIVDEWGRCTGLLHREDCTEKKCKYLFMAQLDAPLWKIMRSPPPSVTTTTSIGHVANLIL
QKRYKMVIVVRHSKFSTYDGSSLRAVGVFTAEKLYGFVSPLPLPPQPNNPLNPIPQDGHRNVAHLKGKDREVERCFSSSMATGFANAGFRSQSKAPFYQNLVSLIVKEGE
DAVDLKLGVAVLLFFFRFLLVFKQVESTSLNKKGKKEDKEEPLQNYVTKRYSPQFLMDEYSSRRVHVGIVSRKGPGVVLRDNMKNRDHSNHGSRLGCTGRICSSSGAEVG
CSSKASSKRPFRSSSGKEIVGSSSRSSSIRNFRKSFPDPFGKLSSKLETDSSEDGSIQDELEELEFISPPGMFHTVLHVKSESSVPADAILMERGSSSTDSNTKSRRNSI
QRYGVDNQDTSTSMRSRSLCQALNGGTSRYVSRNLECNSVSEIIPPDSSSKEQNLVRRKNVTKKMIYDGENSSSSRGKKVSGTSCGQNCGYRRGISISDQRQGRNMFHRQ
NVLSSLGTRSLASACDGASHSYQGISGDRSLQDSLMTSWMPQSNISGSSHTSSSLQSRSESHSRHHMTYPEHEDYSQSLLDIGQTTAAEGGISSTSTNLNSFRCRNLDGI
AEVLLALERIEQEEELTYEQAILLETSLFLSSLNIHDQHRDMRLDIDNMTYEELLDLEERMGTVSTALSEEALTECLNRSIFQSKPQGGELTSSVEDLSDVKCCICQEEY
VSGDEVGRLHLEQAIVLQYSSLETWSLNPLGAIKCTPERRAEWGDTPQVNFIALWIRTD
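
The protein sequence above corresponds to the CDS read codon by codon under the standard genetic code. As a minimal sketch of that relationship, the Python below translates a DNA string in frame 1 and stops at the first stop codon. The function name `translate` and the table variable are illustrative choices and not part of CuGenDB; the sketch assumes an uppercase sequence containing only A, C, G and T, with the line breaks of the listing already removed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the CDS listed above.
print(translate("ATGTTACGGATCGCC"))  # prints MLRIA
# Last three codons of the CDS listed above (ends in the TAG stop codon).
print(translate("ACAGACTAG"))  # prints TD
```

Joining the CDS lines into one string and passing it to `translate` should reproduce the listed protein if the CDS and protein entries are consistent, which makes this a simple sanity check after copying sequences from the page.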