; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10013520 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10013520
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr02:2304091..2315524
RNA-Seq ExpressionHG10013520
SyntenyHG10013520
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000644 - CBS domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044781 - Pentatricopeptide repeat-containing protein At5g10690-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008466303.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X3 [Cucumis melo]3.6e-29788.14Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG
        H L IDRTAYTAMIDALVNCGSIN ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLLMEAALN 
Subjt:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG

Query:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD
        NQ       I     IDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKEVVMRFFD
Subjt:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD

Query:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ
        KSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+SPIPM  +
Subjt:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ

Query:  PSIPHKT
        P++P KT
Subjt:  PSIPHKT

XP_008466304.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X4 [Cucumis melo]2.0e-29586.97Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSIN-------AALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLL
        H L IDRTAYTAMIDALVNCGSIN        ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLL
Subjt:  HELLIDRTAYTAMIDALVNCGSIN-------AALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLL

Query:  MEAALNGNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKE
        MEAALN N             QIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKE
Subjt:  MEAALNGNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKE

Query:  VVMRFFDKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFIS
        VVMRFFDKSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+S
Subjt:  VVMRFFDKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFIS

Query:  PIPMQPQPSIPHKT
        PIPM  +P++P KT
Subjt:  PIPMQPQPSIPHKT

XP_008466305.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X5 [Cucumis melo]1.6e-29787.97Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG
        H L IDRTAYTAMIDALVNCGSIN ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLLMEAALN 
Subjt:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG

Query:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD
        N             QIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKEVVMRFFD
Subjt:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD

Query:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ
        KSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+SPIPM  +
Subjt:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ

Query:  PSIPHKT
        P++P KT
Subjt:  PSIPHKT

XP_016903485.1 PREDICTED: pentatricopeptide repeat-containing protein At5g10690 isoform X2 [Cucumis melo]3.1e-29688.16Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSINA-ALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALN
        H L IDRTAYTAMIDALVNCGSINA ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLLMEAALN
Subjt:  HELLIDRTAYTAMIDALVNCGSINA-ALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALN

Query:  GNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFF
         NQ       I     IDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKEVVMRFF
Subjt:  GNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFF

Query:  DKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQP
        DKSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+SPIPM  
Subjt:  DKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQP

Query:  QPSIPHKT
        +P++P KT
Subjt:  QPSIPHKT

XP_023535821.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita pepo subsp. pepo]3.1e-29687.15Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + S  VFSCSLPSRTA ARRR +S RSPNLKRLTSRVVRLTRRKQLHQ+FEEI IAK+RYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYGFLL EGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALA+YNEMLNLELKPD+LTYNTLI ACVKINKLDAA+ FFEEMKERA KY+QEDIFPDVVTYTTLLKGFGILKDV LVHKIVLEMKS 
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG
        H+LLIDRTAYTAMIDALVNCGSIN ALSLFGELLKLSG NLDLRPKPHLYLT MR FSSRGDYRMVKCLHRRMWLDSSG+ISPGFQEE+DHLLMEAAL+ 
Subjt:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG

Query:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD
        N             QIDVA EKLSTIIK+WKGISW+SRGGSVALRIEALLGLTKSFFS PCIFPRVNPGA IESVMMPFKAVQPLNG+LQLKEVVMRFFD
Subjt:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD

Query:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ
        KSVVPIIDDWGRC GLLHREDC+EL++PLWKMMRSPPP VTTTTSIGHV NLIL+KRYKMII+VRH+KFSTY+  S RAVGVFT EQLYGFISPIPMQ Q
Subjt:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ

Query:  PSIPHKT
        P+IPHKT
Subjt:  PSIPHKT

TrEMBL top hitse value%identityAlignment
A0A1S3CQX2 pentatricopeptide repeat-containing protein At5g10690 isoform X49.6e-29686.97Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSIN-------AALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLL
        H L IDRTAYTAMIDALVNCGSIN        ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLL
Subjt:  HELLIDRTAYTAMIDALVNCGSIN-------AALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLL

Query:  MEAALNGNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKE
        MEAALN N             QIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKE
Subjt:  MEAALNGNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKE

Query:  VVMRFFDKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFIS
        VVMRFFDKSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+S
Subjt:  VVMRFFDKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFIS

Query:  PIPMQPQPSIPHKT
        PIPM  +P++P KT
Subjt:  PIPMQPQPSIPHKT

A0A1S3CQY3 pentatricopeptide repeat-containing protein At5g10690 isoform X57.8e-29887.97Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG
        H L IDRTAYTAMIDALVNCGSIN ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLLMEAALN 
Subjt:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG

Query:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD
        N             QIDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKEVVMRFFD
Subjt:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD

Query:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ
        KSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+SPIPM  +
Subjt:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ

Query:  PSIPHKT
        P++P KT
Subjt:  PSIPHKT

A0A1S3CR48 pentatricopeptide repeat-containing protein At5g10690 isoform X12.1e-29587.13Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSIN-------AALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLL
        H L IDRTAYTAMIDALVNCGSIN        ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLL
Subjt:  HELLIDRTAYTAMIDALVNCGSIN-------AALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLL

Query:  MEAALNGNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKE
        MEAALN NQ       I     IDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKE
Subjt:  MEAALNGNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKE

Query:  VVMRFFDKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFIS
        VVMRFFDKSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+S
Subjt:  VVMRFFDKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFIS

Query:  PIPMQPQPSIPHKT
        PIPM  +P++P KT
Subjt:  PIPMQPQPSIPHKT

A0A1S3CS90 pentatricopeptide repeat-containing protein At5g10690 isoform X31.7e-29788.14Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG
        H L IDRTAYTAMIDALVNCGSIN ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLLMEAALN 
Subjt:  HELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNG

Query:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD
        NQ       I     IDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKEVVMRFFD
Subjt:  NQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFD

Query:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ
        KSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+SPIPM  +
Subjt:  KSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQ

Query:  PSIPHKT
        P++P KT
Subjt:  PSIPHKT

A0A1S4E685 pentatricopeptide repeat-containing protein At5g10690 isoform X21.5e-29688.16Show/hide
Query:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML IASFS        + ST VFSCSLPSRTAT+RR R+S RSPNLKRLTSRVVRLTRRK+LHQVFEEI IAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLRIASFS--------TGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL
        LRTFNEMSKPD+CGLDNVSYGTLLKGLGEARK+DEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYG+LLREGGNLSISVYNLL
Subjt:  LRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLL

Query:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC
        MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLI ACVKINKLDAA+ FFEEMKERA KYDQED+FPDVVTYTTLLK FGILKDVHLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSC

Query:  HELLIDRTAYTAMIDALVNCGSINA-ALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALN
        H L IDRTAYTAMIDALVNCGSINA ALSLFGELLKLSG NL+LRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTIS G+QEE+DHLLMEAALN
Subjt:  HELLIDRTAYTAMIDALVNCGSINA-ALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALN

Query:  GNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFF
         NQ       I     IDVAIEKLSTIIKKWKGISW SRGGSVALRIEALLGLTKSFFS PCIFPRVN GA IESVMMPFKAVQPLNGSL LKEVVMRFF
Subjt:  GNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFF

Query:  DKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQP
        DKSVVPIIDDWGRC GLLHREDCTELDAPLWKMMRSPPP VTTT  IGHVANLILQKRYKM++VVRH+KFS Y G SLRA+GVFT EQLYGF+SPIPM  
Subjt:  DKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQP

Query:  QPSIPHKT
        +P++P KT
Subjt:  QPSIPHKT

SwissProt top hitse value%identityAlignment
Q8VYD6 Pentatricopeptide repeat-containing protein At5g106903.5e-17856.41Show/hide
Query:  MLRIASFSTGST---TVFSCS-LPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTF
        M RI++ ST  T    + SCS +P+R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+  AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F
Subjt:  MLRIASFSTGST---TVFSCS-LPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTF

Query:  NEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGY
        +EM++P   G+D++SY T+LKGLG+AR++DEAFQ+LE++E GTA G P LS+ LIYGLL+ALI AGD+RRANGL+ARY  LL + G  S+ +YNLLMKGY
Subjt:  NEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGY

Query:  ISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSCHELL
        ++S  PQAA+ + +EML L L+PDRLTYNTLI+AC+K   LDAA+ FF +MKE+A +Y  + + PDVVTYTTL+KGFG   D+  + +I LEMK C  + 
Subjt:  ISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSCHELL

Query:  IDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNGNQTF
        IDRTA+TA++DA++ CGS + AL +FGE+LK SG N  LRPKPHLYL++MR F+ +GDY MV+ L+ R+W DSSG+IS   Q+E+D+LLMEAALN     
Subjt:  IDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNGNQTF

Query:  YGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVV
                  Q+D A+  L +I+++WK I W + GG  A+R+E LLG +KS   P  +  +V P   IES+M+ F+A +PL G+LQLK V MRFF + VV
Subjt:  YGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVV

Query:  PIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLY
        PI+DD G C GLLHREDC  LDAPL  MMRSPP  V+TTTSIG V +L+L+K+ KM+IVV    FS  +G S +AVG FT  QLY
Subjt:  PIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLY

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397103.7e-1828.03Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI
        N    N ++      G+ID+AL  F++M +   C  + V+Y TL+ G  + RK+D+ F+LL S+    A+ G   +      ++N L   G M+  + ++
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI

Query:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLK
             + R G +L    YN L+KGY   G    AL M+ EML   L P  +TY +LI++  K   ++ A+ F ++M+ R        + P+  TYTTL+ 
Subjt:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLK

Query:  GFGILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDL
        GF     ++  ++++ EM   +        Y A+I+     G +  A+++  E +K  G + D+
Subjt:  GFGILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDL

Q9LFF1 Pentatricopeptide repeat-containing protein At3g53700, chloroplastic5.3e-1727.24Show/hide
Query:  VLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF
        V++  +  GD+D ALR   +M +   C   NVS   ++ G  +  +V++A   ++  S ++G      T +      L+N L +AG ++ A   I     
Subjt:  VLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF

Query:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGIL
        +L+EG +  +  YN ++ G    G  + A+ + ++M+  +  P+ +TYNTLI    K N++       EE  E A     + I PDV T+ +L++G  + 
Subjt:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGIL

Query:  KDVHLVHKIVLEMKS--CHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGC
        ++  +  ++  EM+S  C     D   Y  +ID+L + G ++ AL++  + ++LSGC
Subjt:  KDVHLVHKIVLEMKS--CHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGC

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic1.3e-1529.39Show/hide
Query:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLI
        KQ+ + F+E+   +R   + + I  N++L  C   G  + A   F+EM+       D  SY TLL  + +  ++D AF++L  +     +      + +I
Subjt:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLI

Query:  YGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERA
         G      +AG    A  L     +L   G  L    YN L+  Y   G  + AL +  EM ++ +K D +TYN L+    K  K D     F EMK   
Subjt:  YGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERA

Query:  AKYDQEDIFPDVVTYTTLLKGF---GILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLK
            +E + P+++TY+TL+ G+   G+ K+     +I  E KS   L  D   Y+A+IDAL   G + +A+SL  E+ K
Subjt:  AKYDQEDIFPDVVTYTTLLKGF---GILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLK

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic3.4e-1626.43Show/hide
Query:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL +A  VD+A  L   +        PT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG

Query:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+  +G   AA A++  M+   ++PD  TY+ L+     + ++D  + +F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGI

Query:  LKDVHLVHKIVL--EMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRG
         K   L   +VL  EMK+   +  D   Y ++I  L   G +  A  ++ E+ +       L P    +  L+R +S  G
Subjt:  LKDVHLVHKIVL--EMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRG

Arabidopsis top hitse value%identityAlignment
AT2G31400.1 genomes uncoupled 19.3e-1729.39Show/hide
Query:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLI
        KQ+ + F+E+   +R   + + I  N++L  C   G  + A   F+EM+       D  SY TLL  + +  ++D AF++L  +     +      + +I
Subjt:  KQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLI

Query:  YGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERA
         G      +AG    A  L     +L   G  L    YN L+  Y   G  + AL +  EM ++ +K D +TYN L+    K  K D     F EMK   
Subjt:  YGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERA

Query:  AKYDQEDIFPDVVTYTTLLKGF---GILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLK
            +E + P+++TY+TL+ G+   G+ K+     +I  E KS   L  D   Y+A+IDAL   G + +A+SL  E+ K
Subjt:  AKYDQEDIFPDVVTYTTLLKGF---GILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLK

AT3G53700.1 Pentatricopeptide repeat (PPR) superfamily protein3.8e-1827.24Show/hide
Query:  VLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF
        V++  +  GD+D ALR   +M +   C   NVS   ++ G  +  +V++A   ++  S ++G      T +      L+N L +AG ++ A   I     
Subjt:  VLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLE--SVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGF

Query:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGIL
        +L+EG +  +  YN ++ G    G  + A+ + ++M+  +  P+ +TYNTLI    K N++       EE  E A     + I PDV T+ +L++G  + 
Subjt:  LLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGIL

Query:  KDVHLVHKIVLEMKS--CHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGC
        ++  +  ++  EM+S  C     D   Y  +ID+L + G ++ AL++  + ++LSGC
Subjt:  KDVHLVHKIVLEMKS--CHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGC

AT4G31850.1 proton gradient regulation 32.4e-1726.43Show/hide
Query:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS  + C  + +++  ++ GL +A  VD+A  L   +        PT      YG L++ L ++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYG-LLNALIEAGDMRRANGLIARYG

Query:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+  +G   AA A++  M+   ++PD  TY+ L+     + ++D  + +F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGI

Query:  LKDVHLVHKIVL--EMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRG
         K   L   +VL  EMK+   +  D   Y ++I  L   G +  A  ++ E+ +       L P    +  L+R +S  G
Subjt:  LKDVHLVHKIVL--EMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRG

AT5G10690.1 pentatricopeptide (PPR) repeat-containing protein / CBS domain-containing protein2.5e-17956.41Show/hide
Query:  MLRIASFSTGST---TVFSCS-LPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTF
        M RI++ ST  T    + SCS +P+R    RR     R  NLK LTSR+V LTRR+QL Q+ EE+  AK+RYG+LNTIVMN+VLEACVHCG+IDLALR F
Subjt:  MLRIASFSTGST---TVFSCS-LPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTF

Query:  NEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGY
        +EM++P   G+D++SY T+LKGLG+AR++DEAFQ+LE++E GTA G P LS+ LIYGLL+ALI AGD+RRANGL+ARY  LL + G  S+ +YNLLMKGY
Subjt:  NEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGY

Query:  ISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSCHELL
        ++S  PQAA+ + +EML L L+PDRLTYNTLI+AC+K   LDAA+ FF +MKE+A +Y  + + PDVVTYTTL+KGFG   D+  + +I LEMK C  + 
Subjt:  ISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSCHELL

Query:  IDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNGNQTF
        IDRTA+TA++DA++ CGS + AL +FGE+LK SG N  LRPKPHLYL++MR F+ +GDY MV+ L+ R+W DSSG+IS   Q+E+D+LLMEAALN     
Subjt:  IDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNGNQTF

Query:  YGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVV
                  Q+D A+  L +I+++WK I W + GG  A+R+E LLG +KS   P  +  +V P   IES+M+ F+A +PL G+LQLK V MRFF + VV
Subjt:  YGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEALLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVV

Query:  PIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLY
        PI+DD G C GLLHREDC  LDAPL  MMRSPP  V+TTTSIG V +L+L+K+ KM+IVV    FS  +G S +AVG FT  QLY
Subjt:  PIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRYKMIIVVRHNKFSTYNGLSLRAVGVFTTEQLY

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-1928.03Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI
        N    N ++      G+ID+AL  F++M +   C  + V+Y TL+ G  + RK+D+ F+LL S+    A+ G   +      ++N L   G M+  + ++
Subjt:  NTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNVSYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLI

Query:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLK
             + R G +L    YN L+KGY   G    AL M+ EML   L P  +TY +LI++  K   ++ A+ F ++M+ R        + P+  TYTTL+ 
Subjt:  ARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPDRLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLK

Query:  GFGILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDL
        GF     ++  ++++ EM   +        Y A+I+     G +  A+++  E +K  G + D+
Subjt:  GFGILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSGCNLDL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTACGGATCGCCTCATTCTCCACTGGTTCAACGACCGTATTTTCTTGTTCGCTTCCTTCTCGCACGGCCACCGCCCGGCGTAGGAGAAACTCTCTTCGGAGTCCTAA
TCTCAAGCGACTGACCTCTCGTGTTGTCAGACTCACCCGTCGCAAGCAACTCCACCAGGTATTTGAGGAAATTGCAATTGCCAAGAGACGTTATGGAAAGCTTAATACAA
TTGTTATGAACGCGGTCTTGGAAGCTTGTGTTCACTGCGGTGATATTGATTTAGCTCTGAGGACTTTTAATGAAATGTCAAAGCCAGATAATTGTGGTTTAGACAATGTC
AGCTATGGCACGCTGTTAAAGGGTTTAGGTGAAGCTCGGAAGGTTGATGAAGCATTTCAATTACTTGAATCTGTGGAAGAAGGTACCGCTATTGGAGGTCCAACATTGTC
AGCACCGCTTATTTATGGTCTTCTAAATGCTTTAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATAGCACGATATGGGTTCTTACTTCGTGAAGGAGGCAATC
TCTCTATATCAGTGTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAAGCTGCTTTAGCCATGTACAATGAGATGCTAAATCTGGAGTTGAAACCTGAT
AGGCTCACATATAATACATTAATCTATGCTTGTGTGAAGATTAACAAACTGGACGCAGCAATACTTTTCTTTGAGGAAATGAAGGAACGAGCTGCTAAGTATGATCAAGA
AGATATTTTTCCTGATGTTGTGACGTACACTACTTTACTTAAGGGTTTTGGGATTCTGAAAGATGTCCATCTTGTTCACAAGATTGTTCTGGAAATGAAATCTTGTCATG
AATTGTTGATTGATCGAACAGCATACACTGCAATGATTGATGCTTTGGTTAACTGTGGCTCTATAAACGCTGCTCTTTCTTTATTTGGGGAATTATTGAAGCTTTCTGGA
TGCAATTTGGACTTACGGCCAAAGCCACATCTTTATCTCACACTTATGAGAGTTTTTTCTAGTAGAGGAGATTATAGGATGGTCAAATGTTTGCATAGACGCATGTGGCT
GGACTCCTCTGGAACTATTTCTCCTGGATTTCAAGAAGAATCAGATCATCTTCTCATGGAGGCAGCTTTAAATGGCAATCAGACATTTTATGGACTTTGTTGTATATGCA
TGTCCGTGCAGATTGATGTGGCAATAGAGAAACTCTCAACAATTATTAAGAAATGGAAGGGAATCTCATGGGCTAGTCGAGGAGGCAGTGTTGCTCTGCGTATTGAAGCA
TTGCTGGGACTCACCAAATCTTTCTTTAGTCCTCCTTGCATATTTCCTCGGGTAAATCCAGGTGCATCTATTGAGAGTGTTATGATGCCATTTAAAGCCGTTCAGCCATT
AAATGGAAGCTTACAGTTGAAGGAAGTGGTTATGCGTTTCTTTGACAAATCAGTTGTGCCTATCATAGACGACTGGGGTAGATGCAATGGACTACTGCACCGAGAAGACT
GTACTGAGTTGGATGCCCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTAGCGTGACAACCACCACATCCATTGGACATGTTGCGAATCTAATTCTACAAAAGAGGTAC
AAAATGATTATTGTTGTAAGACATAACAAGTTTAGTACATATAATGGTTTGAGTTTGAGAGCTGTCGGTGTTTTTACTACCGAACAATTGTATGGCTTTATTTCTCCCAT
TCCGATGCAGCCTCAGCCGAGCATTCCACATAAGACGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTACGGATCGCCTCATTCTCCACTGGTTCAACGACCGTATTTTCTTGTTCGCTTCCTTCTCGCACGGCCACCGCCCGGCGTAGGAGAAACTCTCTTCGGAGTCCTAA
TCTCAAGCGACTGACCTCTCGTGTTGTCAGACTCACCCGTCGCAAGCAACTCCACCAGGTATTTGAGGAAATTGCAATTGCCAAGAGACGTTATGGAAAGCTTAATACAA
TTGTTATGAACGCGGTCTTGGAAGCTTGTGTTCACTGCGGTGATATTGATTTAGCTCTGAGGACTTTTAATGAAATGTCAAAGCCAGATAATTGTGGTTTAGACAATGTC
AGCTATGGCACGCTGTTAAAGGGTTTAGGTGAAGCTCGGAAGGTTGATGAAGCATTTCAATTACTTGAATCTGTGGAAGAAGGTACCGCTATTGGAGGTCCAACATTGTC
AGCACCGCTTATTTATGGTCTTCTAAATGCTTTAATTGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATAGCACGATATGGGTTCTTACTTCGTGAAGGAGGCAATC
TCTCTATATCAGTGTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAAGCTGCTTTAGCCATGTACAATGAGATGCTAAATCTGGAGTTGAAACCTGAT
AGGCTCACATATAATACATTAATCTATGCTTGTGTGAAGATTAACAAACTGGACGCAGCAATACTTTTCTTTGAGGAAATGAAGGAACGAGCTGCTAAGTATGATCAAGA
AGATATTTTTCCTGATGTTGTGACGTACACTACTTTACTTAAGGGTTTTGGGATTCTGAAAGATGTCCATCTTGTTCACAAGATTGTTCTGGAAATGAAATCTTGTCATG
AATTGTTGATTGATCGAACAGCATACACTGCAATGATTGATGCTTTGGTTAACTGTGGCTCTATAAACGCTGCTCTTTCTTTATTTGGGGAATTATTGAAGCTTTCTGGA
TGCAATTTGGACTTACGGCCAAAGCCACATCTTTATCTCACACTTATGAGAGTTTTTTCTAGTAGAGGAGATTATAGGATGGTCAAATGTTTGCATAGACGCATGTGGCT
GGACTCCTCTGGAACTATTTCTCCTGGATTTCAAGAAGAATCAGATCATCTTCTCATGGAGGCAGCTTTAAATGGCAATCAGACATTTTATGGACTTTGTTGTATATGCA
TGTCCGTGCAGATTGATGTGGCAATAGAGAAACTCTCAACAATTATTAAGAAATGGAAGGGAATCTCATGGGCTAGTCGAGGAGGCAGTGTTGCTCTGCGTATTGAAGCA
TTGCTGGGACTCACCAAATCTTTCTTTAGTCCTCCTTGCATATTTCCTCGGGTAAATCCAGGTGCATCTATTGAGAGTGTTATGATGCCATTTAAAGCCGTTCAGCCATT
AAATGGAAGCTTACAGTTGAAGGAAGTGGTTATGCGTTTCTTTGACAAATCAGTTGTGCCTATCATAGACGACTGGGGTAGATGCAATGGACTACTGCACCGAGAAGACT
GTACTGAGTTGGATGCCCCCCTTTGGAAAATGATGAGAAGCCCTCCTCCTAGCGTGACAACCACCACATCCATTGGACATGTTGCGAATCTAATTCTACAAAAGAGGTAC
AAAATGATTATTGTTGTAAGACATAACAAGTTTAGTACATATAATGGTTTGAGTTTGAGAGCTGTCGGTGTTTTTACTACCGAACAATTGTATGGCTTTATTTCTCCCAT
TCCGATGCAGCCTCAGCCGAGCATTCCACATAAGACGTAA
Protein sequenceShow/hide protein sequence
MLRIASFSTGSTTVFSCSLPSRTATARRRRNSLRSPNLKRLTSRVVRLTRRKQLHQVFEEIAIAKRRYGKLNTIVMNAVLEACVHCGDIDLALRTFNEMSKPDNCGLDNV
SYGTLLKGLGEARKVDEAFQLLESVEEGTAIGGPTLSAPLIYGLLNALIEAGDMRRANGLIARYGFLLREGGNLSISVYNLLMKGYISSGVPQAALAMYNEMLNLELKPD
RLTYNTLIYACVKINKLDAAILFFEEMKERAAKYDQEDIFPDVVTYTTLLKGFGILKDVHLVHKIVLEMKSCHELLIDRTAYTAMIDALVNCGSINAALSLFGELLKLSG
CNLDLRPKPHLYLTLMRVFSSRGDYRMVKCLHRRMWLDSSGTISPGFQEESDHLLMEAALNGNQTFYGLCCICMSVQIDVAIEKLSTIIKKWKGISWASRGGSVALRIEA
LLGLTKSFFSPPCIFPRVNPGASIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCNGLLHREDCTELDAPLWKMMRSPPPSVTTTTSIGHVANLILQKRY
KMIIVVRHNKFSTYNGLSLRAVGVFTTEQLYGFISPIPMQPQPSIPHKT