; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh09G005280 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh09G005280
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCma_Chr09:2342725..2352844
RNA-Seq ExpressionCmaCh09G005280
SyntenyCmaCh09G005280
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044781 - Pentatricopeptide repeat-containing protein At5g10690-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591628.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0083.52Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHL WNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        L TFNEMSKP NCGLDNVSYGTLLKGLGEARKIDEAF LLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISAC KINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLV KIVLEMK+C
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGW LDLRPKPHLYLTFMRFFSSRGDY MVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNG+LQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRTRRSSTVQ
        GLLHREDCSELEAPLW MMRSPPPGVTTTTSIGHVVNLILRK                                                  +RRSSTVQ
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRTRRSSTVQ

Query:  NTPIRRPISQKPPFSLFLLLQNGNIPESSAELPSIPQYIYTLEVSTCFPFRIFCSSFPSLFHRHSCSEVEEKRLQKDIIRSFVPNFLGRSLEQEHFVENN
        N PIRRPISQKPPFSLFLLLQNG IP SSAELP                                  ++++   +K +IRSFV NF  +   +EHFVENN
Subjt:  NTPIRRPISQKPPFSLFLLLQNGNIPESSAELPSIPQYIYTLEVSTCFPFRIFCSSFPSLFHRHSCSEVEEKRLQKDIIRSFVPNFLGRSLEQEHFVENN

Query:  RAKQYTEIHL
        RAKQYTEIHL
Subjt:  RAKQYTEIHL

XP_022936928.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita moschata]0.0e+0097.64Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHL WNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        L TFNEMSKP NCGLDNVSYGTLLKGLGEARKIDEAF LLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISAC KINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMK+C
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGW LDLRPKPHLYLTFMRFFSSRGDY MVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNG+LQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
        GLLHREDCSELEAPLW MMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVR+SKFSTYDSSSSRAVGVFTIEQLYGFISP P+QLQPNIPH+T
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT

XP_022975759.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita maxima]0.0e+00100Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
        GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT

XP_022975761.1 pentatricopeptide repeat-containing protein At5g10690 isoform X2 [Cucurbita maxima]0.0e+0099.63Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYK
        GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKR++
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYK

XP_023535821.1 pentatricopeptide repeat-containing protein At5g10690 isoform X1 [Cucurbita pepo subsp. pepo]0.0e+0097.98Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWI SFSSRHL WNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAK+RYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        L TFNEMSKP NCGLDNVSYGTLLKGLGEARKIDEAF LLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISAC KINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKS 
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNG+LQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
        GLLHREDCSELE+PLW MMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPH+T
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT

TrEMBL top hitse value%identityAlignment
A0A1S3CQY3 pentatricopeptide repeat-containing protein At5g10690 isoform X51.3e-30288.53Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        ML I SFSSRHL    S+S NVFSCSLPSRTA +RR  DSPRSPNLKRLTSRVVRLTRRK+LHQ+FEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        L TFNEMSKP +CGLDNVSYGTLLKGLGEARKIDEAF LLESVEEGTAIG PTLSAPLIYG+LNAL EAGDMRRANGLIARYG+LL EGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALA+YNEMLNLELKPD+LTYNTLISAC KINKLDAAM+FFEEMKERA KY+QED+FPDVVTYTTLLK FGILKDV LVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        H L IDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNL+LRPKPHLYLT MR FSSRGDYRMVKCLHRRMWLDSSG+IS G+QEEADHLLMEAAL+D
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVA EKLSTIIK+WKGISW+SRGGSVALRIEALLGLTKSFFSPCIFPRVN GAPIESVMMPFKAVQPLNGSL LKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
        GLLHREDC+EL+APLW MMRSPPPGVTTT  IGHV NLIL+KRYKM+++VRHSKFS Y  SS RA+GVFTIEQLYGF+SPIPM  +PN+P +T
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT

A0A6J1F9Q1 pentatricopeptide repeat-containing protein At5g10690 isoform X22.0e-30897.8Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHL WNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        L TFNEMSKP NCGLDNVSYGTLLKGLGEARKIDEAF LLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISAC KINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMK+C
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGW LDLRPKPHLYLTFMRFFSSRGDY MVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNG+LQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYK
        GLLHREDCSELEAPLW MMRSPPPGVTTTTSIGHVVNLILRKR++
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYK

A0A6J1FEL4 pentatricopeptide repeat-containing protein At5g10690 isoform X10.0e+0097.64Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHL WNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        L TFNEMSKP NCGLDNVSYGTLLKGLGEARKIDEAF LLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISAC KINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMK+C
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGW LDLRPKPHLYLTFMRFFSSRGDY MVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNG+LQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
        GLLHREDCSELEAPLW MMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVR+SKFSTYDSSSSRAVGVFTIEQLYGFISP P+QLQPNIPH+T
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT

A0A6J1IF33 pentatricopeptide repeat-containing protein At5g10690 isoform X20.0e+0099.63Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYK
        GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKR++
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYK

A0A6J1ILI3 pentatricopeptide repeat-containing protein At5g10690 isoform X10.0e+00100Show/hide
Query:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
        MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA
Subjt:  MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLA

Query:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
        LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL
Subjt:  LSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLL

Query:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
        MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC
Subjt:  MKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSC

Query:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
        HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD
Subjt:  HDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHD

Query:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
        NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI
Subjt:  NQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCI

Query:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
        GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT
Subjt:  GLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRT

SwissProt top hitse value%identityAlignment
P0C7Q7 Putative pentatricopeptide repeat-containing protein At1g12700, mitochondrial6.3e-1722.45Show/hide
Query:  RRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDE
        R +++   P++    S V  + R        + +   + R  K +    + ++++    G ID A+S F EM   G      V+Y +L++GL +A K ++
Subjt:  RRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDE

Query:  AFLLLESVEEGTAIGSPTLSAPLIYGV-LNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNT
          LLL+ +     +    +   + + V L+   + G ++ AN L   Y  ++  G + +I  YN LM GY        A  + + M+  +  PD +T+ +
Subjt:  AFLLLESVEEGTAIGSPTLSAPLIYGV-LNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNT

Query:  LISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELL
        LI     + ++D  M  F  + +R        +  + VTY+ L++GF     ++L  ++  EM S H +L D   Y  ++D L + G +  AL +F +L 
Subjt:  LISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELL

Query:  KL-----------------------SGWNL-------DLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQID
        K                          WNL        ++P    Y   +     +G       L R+M  D +        +   + L+ A L D  + 
Subjt:  KL-----------------------SGWNL-------DLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQID

Query:  VATEKLSTIIKRWKGISWSSRGGSVALRIEALLG--LTKSF
         + +    +I+  K   +S+   S+ + I+ LL   L KSF
Subjt:  VATEKLSTIIKRWKGISWSSRGGSVALRIEALLG--LTKSF

Q8VYD6 Pentatricopeptide repeat-containing protein At5g106908.7e-18459.67Show/hide
Query:  SCS-LPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGT
        SCS +P+R    RR     R  NLK LTSR+V LTRR+QL QI EE+E AK+RYG+LNTIVMN+VLEACVHCG+IDLAL  F+EM++PG  G+D++SY T
Subjt:  SCS-LPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGT

Query:  LLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLN
        +LKGLG+AR+IDEAF +LE++E GTA G+P LS+ LIYG+L+AL  AGD+RRANGL+ARY  LL + G  S+ +YNLLMKGY++S  PQAA+ L +EML 
Subjt:  LLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLN

Query:  LELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGS
        L L+PD+LTYNTLI AC K   LDAAM FF +MKE+A +Y  + + PDVVTYTTL+KGFG   D+  + +I LEMK C ++ IDRTA+TA++DA++ CGS
Subjt:  LELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGS

Query:  INGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQIDVATEKLSTIIKRWKGISW
         +GAL +FGE+LK SG N  LRPKPHLYL+ MR F+ +GDY MV+ L+ R+W DSSGSIS   Q+EAD+LLMEAAL+D Q+D A   L +I++RWK I W
Subjt:  INGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQIDVATEKLSTIIKRWKGISW

Query:  SSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCIGLLHREDCSELEAPLWTMMRSP
        ++ GG  A+R+E LLG +KS   P +  +V P  PIES+M+ F+A +PL G+LQLK V MRFF + VVPI+DD G CIGLLHREDC+ L+APL +MMRSP
Subjt:  SSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCIGLLHREDCSELEAPLWTMMRSP

Query:  PPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLY
        P  V+TTTSIG VV+L+L K+ KM+I+V    FS     SS+AVG FT  QLY
Subjt:  PPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLY

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397103.3e-1828.63Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVE-EGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGL
        N    N ++      G+ID+AL+ F++M   G C  + V+Y TL+ G  + RKID+ F LL S+  +G     P L +  +  V+N L   G M+  + +
Subjt:  NTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVE-EGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGL

Query:  IARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLL
        +     +   G +L    YN L+KGY   G    AL ++ EML   L P  +TY +LI +  K   ++ AM F ++M+ R        + P+  TYTTL+
Subjt:  IARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLL

Query:  KGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGEL
         GF     +   ++++ EM   +        Y A+I+     G +  A+++  ++
Subjt:  KGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGEL

Q9SIC9 Pentatricopeptide repeat-containing protein At2g31400, chloroplastic7.4e-1829.75Show/hide
Query:  KQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLI
        KQ+ + F+E++   R   + + I  N++L  C   G  + A + F+EM+       D  SY TLL  + +  ++D AF +L  +     + +    + +I
Subjt:  KQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLI

Query:  YGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERA
         G      +AG    A  L   +G + + G  L    YN L+  Y   G  + AL +  EM ++ +K D +TYN L+   GK  K D     F EMK   
Subjt:  YGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERA

Query:  GKYNQEDIFPDVVTYTTLLKGF---GILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLK
            +E + P+++TY+TL+ G+   G+ K+     +I  E KS   L  D   Y+A+IDAL   G +  A+SL  E+ K
Subjt:  GKYNQEDIFPDVVTYTTLLKGF---GILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLK

Q9SZ52 Pentatricopeptide repeat-containing protein At4g31850, chloroplastic1.7e-1725.83Show/hide
Query:  NAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYG-VLNALTEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS    C  + +++  ++ GL +A  +D+A  L   +       SPT      YG +++ L+++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYG-VLNALTEAGDMRRANGLIARYG

Query:  FLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+  +G   AA AL+  M+   ++PD  TY+ L+     + ++D  +++F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGI

Query:  LKDVRLVHKIVL--EMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSI
         K  RL   +VL  EMK+   +  D   Y ++I  L   G +  A  ++ E+ +       L P    +   +R +S  G       +++ M    +G  
Subjt:  LKDVRLVHKIVL--EMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSI

Query:  SP
        SP
Subjt:  SP

Arabidopsis top hitse value%identityAlignment
AT1G12700.1 ATP binding;nucleic acid binding;helicases7.6e-1821.97Show/hide
Query:  RRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDE
        R +++   P++    S V  + R        + +   + R  K +    + ++++    G ID A+S F EM   G      V+Y +L++GL +A K ++
Subjt:  RRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDE

Query:  AFLLLESVEEGTAIGSPTLSAPLIYGV-LNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNT
          LLL+ +     +    +   + + V L+   + G ++ AN L   Y  ++  G + +I  YN LM GY        A  + + M+  +  PD +T+ +
Subjt:  AFLLLESVEEGTAIGSPTLSAPLIYGV-LNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNT

Query:  LISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELL
        LI     + ++D  M  F  + +R        +  + VTY+ L++GF     ++L  ++  EM S H +L D   Y  ++D L + G +  AL +F +L 
Subjt:  LISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELL

Query:  KL-----------------------SGWNL-------DLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQID
        K                          WNL        ++P    Y   +     +G       L R+M  D +        +   + L+ A L D  + 
Subjt:  KL-----------------------SGWNL-------DLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQID

Query:  VATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTK
         + +    +I+  K   +S+   S+ + I+ LL   K
Subjt:  VATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLTK

AT2G31400.1 genomes uncoupled 15.3e-1929.75Show/hide
Query:  KQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLI
        KQ+ + F+E++   R   + + I  N++L  C   G  + A + F+EM+       D  SY TLL  + +  ++D AF +L  +     + +    + +I
Subjt:  KQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLI

Query:  YGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERA
         G      +AG    A  L   +G + + G  L    YN L+  Y   G  + AL +  EM ++ +K D +TYN L+   GK  K D     F EMK   
Subjt:  YGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERA

Query:  GKYNQEDIFPDVVTYTTLLKGF---GILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLK
            +E + P+++TY+TL+ G+   G+ K+     +I  E KS   L  D   Y+A+IDAL   G +  A+SL  E+ K
Subjt:  GKYNQEDIFPDVVTYTTLLKGF---GILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLK

AT4G31850.1 proton gradient regulation 31.2e-1825.83Show/hide
Query:  NAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYG-VLNALTEAGDMRRANGLIARYG
        N +L+A    G ID     + EMS    C  + +++  ++ GL +A  +D+A  L   +       SPT      YG +++ L+++G +  A  L   + 
Subjt:  NAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYG-VLNALTEAGDMRRANGLIARYG

Query:  FLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGI
         +L  G   + ++YN+L+ G+  +G   AA AL+  M+   ++PD  TY+ L+     + ++D  +++F+E+KE         + PDVV Y  ++ G G 
Subjt:  FLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGI

Query:  LKDVRLVHKIVL--EMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSI
         K  RL   +VL  EMK+   +  D   Y ++I  L   G +  A  ++ E+ +       L P    +   +R +S  G       +++ M    +G  
Subjt:  LKDVRLVHKIVL--EMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSI

Query:  SP
        SP
Subjt:  SP

AT5G10690.1 pentatricopeptide (PPR) repeat-containing protein / CBS domain-containing protein6.2e-18559.67Show/hide
Query:  SCS-LPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGT
        SCS +P+R    RR     R  NLK LTSR+V LTRR+QL QI EE+E AK+RYG+LNTIVMN+VLEACVHCG+IDLAL  F+EM++PG  G+D++SY T
Subjt:  SCS-LPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGT

Query:  LLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLN
        +LKGLG+AR+IDEAF +LE++E GTA G+P LS+ LIYG+L+AL  AGD+RRANGL+ARY  LL + G  S+ +YNLLMKGY++S  PQAA+ L +EML 
Subjt:  LLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLN

Query:  LELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGS
        L L+PD+LTYNTLI AC K   LDAAM FF +MKE+A +Y  + + PDVVTYTTL+KGFG   D+  + +I LEMK C ++ IDRTA+TA++DA++ CGS
Subjt:  LELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGS

Query:  INGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQIDVATEKLSTIIKRWKGISW
         +GAL +FGE+LK SG N  LRPKPHLYL+ MR F+ +GDY MV+ L+ R+W DSSGSIS   Q+EAD+LLMEAAL+D Q+D A   L +I++RWK I W
Subjt:  INGALSLFGELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQIDVATEKLSTIIKRWKGISW

Query:  SSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCIGLLHREDCSELEAPLWTMMRSP
        ++ GG  A+R+E LLG +KS   P +  +V P  PIES+M+ F+A +PL G+LQLK V MRFF + VVPI+DD G CIGLLHREDC+ L+APL +MMRSP
Subjt:  SSRGGSVALRIEALLGLTKSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCIGLLHREDCSELEAPLWTMMRSP

Query:  PPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLY
        P  V+TTTSIG VV+L+L K+ KM+I+V    FS     SS+AVG FT  QLY
Subjt:  PPGVTTTTSIGHVVNLILRKRYKMIIIVRHSKFSTYDSSSSRAVGVFTIEQLY

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.4e-1928.63Show/hide
Query:  NTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVE-EGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGL
        N    N ++      G+ID+AL+ F++M   G C  + V+Y TL+ G  + RKID+ F LL S+  +G     P L +  +  V+N L   G M+  + +
Subjt:  NTIVMNAVLEACVHCGDIDLALSTFNEMSKPGNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVE-EGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGL

Query:  IARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLL
        +     +   G +L    YN L+KGY   G    AL ++ EML   L P  +TY +LI +  K   ++ AM F ++M+ R        + P+  TYTTL+
Subjt:  IARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEMLNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLL

Query:  KGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGEL
         GF     +   ++++ EM   +        Y A+I+     G +  A+++  ++
Subjt:  KGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLFGEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTATGGATCGTGTCATTCTCCTCTCGTCATTTGGTATGGAATCCATCCGACTCAATGAACGTATTTTCTTGTTCACTTCCTTCTCGCACGGCCGACGCTCGGCGTCG
GATAGACTCTCCTCGGAGTCCCAATCTCAAACGGCTGACCTCTCGTGTCGTCAGACTCACCCGTCGGAAGCAGCTCCACCAGATATTTGAGGAAATCGAAATTGCCAAGA
GACGTTATGGAAAGCTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGTGTTCACTGCGGTGATATTGATTTAGCTCTGAGTACTTTTAATGAAATGTCAAAGCCA
GGCAATTGTGGTTTAGACAACGTCAGTTATGGCACGCTATTAAAGGGTTTGGGTGAAGCTCGCAAGATTGATGAAGCATTTCTATTACTTGAATCTGTGGAAGAAGGTAC
TGCTATTGGAAGTCCAACATTGTCCGCACCACTTATTTATGGTGTTCTAAATGCTTTAACCGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATAGCACGATATGGAT
TCTTACTTCATGAAGGAGGCAATCTCTCTATATCAGTTTACAACTTATTGATGAAGGGGTACATAAGCTCAGGTGTTCCTCAAGCTGCTTTAGCTCTGTACAATGAGATG
CTAAATCTGGAGTTGAAACCTGATAAGCTCACATATAATACATTAATCTCTGCTTGTGGGAAGATTAACAAACTGGACGCAGCAATGTATTTCTTTGAGGAAATGAAGGA
ACGAGCTGGTAAGTATAATCAGGAAGATATTTTTCCTGATGTTGTGACGTACACTACTTTACTTAAGGGTTTTGGGATTCTGAAAGATGTCCGTCTTGTTCACAAAATTG
TGCTGGAAATGAAGTCTTGTCATGATTTGTTGATTGATCGAACAGCATACACTGCAATGATTGATGCTTTGGTTAACTGTGGCTCTATAAACGGTGCTCTTTCTTTATTT
GGGGAATTATTGAAGCTTTCTGGATGGAATTTGGACTTGCGGCCAAAACCACATCTCTATCTCACTTTTATGAGATTTTTTTCCAGTAGAGGAGATTATAGGATGGTCAA
ATGTTTGCATAGACGCATGTGGCTGGACTCCTCTGGAAGTATTTCTCCTGGATTTCAAGAAGAGGCAGATCATCTTCTCATGGAGGCAGCTTTACATGACAATCAGATTG
ATGTGGCAACAGAGAAATTATCAACAATTATTAAGAGATGGAAGGGAATCTCGTGGTCTAGCAGAGGAGGCAGTGTTGCTCTGCGTATTGAAGCATTGCTTGGACTCACC
AAATCTTTCTTTAGTCCTTGCATATTTCCTCGGGTAAATCCGGGTGCTCCTATTGAGAGTGTCATGATGCCATTTAAAGCCGTTCAGCCATTAAATGGAAGCTTACAGTT
GAAGGAAGTGGTCATGCGTTTCTTTGACAAATCAGTTGTGCCTATCATAGACGACTGGGGTAGATGCATTGGACTATTGCACCGAGAAGACTGCTCTGAGTTGGAGGCTC
CCCTTTGGACAATGATGAGAAGCCCTCCTCCCGGTGTTACGACTACAACATCGATCGGACATGTTGTGAATCTAATTCTACGAAAGAGGTACAAAATGATTATTATTGTA
AGACATAGCAAGTTTAGCACATACGATAGCTCGAGTTCAAGGGCTGTAGGTGTTTTTACTATCGAGCAATTGTATGGCTTTATTTCTCCCATTCCCATGCAGCTTCAGCC
AAACATTCCACATAGGACCCGTCGATCATCAACTGTACAGAACACTCCGATCCGACGGCCGATATCGCAGAAGCCGCCATTTTCTTTGTTCTTGCTGCTGCAGAACGGCA
ACATTCCCGAGTCTTCCGCTGAGCTTCCAAGTATCCCTCAGTATATTTACACACTCGAAGTCTCCACCTGTTTTCCATTTCGGATCTTCTGTTCTTCCTTTCCTTCTCTC
TTTCACCGCCATTCTTGCTCCGAAGTTGAAGAAAAGCGTCTTCAAAAAGATATCATTCGTTCTTTTGTCCCTAATTTTCTCGGAAGAAGCTTGGAACAGGAACATTTCGT
AGAAAACAATCGAGCAAAACAATACACTGAGATCCACCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTAGCTTCTCCTGTATCTTATCGGCACTTACAACCACCATTGGAGAAGATGCTATGGATCGTGTCATTCTCCTCTCGTCATTTGGTATGGAATCCATCCGACTCAA
TGAACGTATTTTCTTGTTCACTTCCTTCTCGCACGGCCGACGCTCGGCGTCGGATAGACTCTCCTCGGAGTCCCAATCTCAAACGGCTGACCTCTCGTGTCGTCAGACTC
ACCCGTCGGAAGCAGCTCCACCAGATATTTGAGGAAATCGAAATTGCCAAGAGACGTTATGGAAAGCTGAATACAATTGTTATGAATGCGGTCCTGGAAGCTTGTGTTCA
CTGCGGTGATATTGATTTAGCTCTGAGTACTTTTAATGAAATGTCAAAGCCAGGCAATTGTGGTTTAGACAACGTCAGTTATGGCACGCTATTAAAGGGTTTGGGTGAAG
CTCGCAAGATTGATGAAGCATTTCTATTACTTGAATCTGTGGAAGAAGGTACTGCTATTGGAAGTCCAACATTGTCCGCACCACTTATTTATGGTGTTCTAAATGCTTTA
ACCGAAGCAGGAGACATGCGCCGTGCCAATGGTCTAATAGCACGATATGGATTCTTACTTCATGAAGGAGGCAATCTCTCTATATCAGTTTACAACTTATTGATGAAGGG
GTACATAAGCTCAGGTGTTCCTCAAGCTGCTTTAGCTCTGTACAATGAGATGCTAAATCTGGAGTTGAAACCTGATAAGCTCACATATAATACATTAATCTCTGCTTGTG
GGAAGATTAACAAACTGGACGCAGCAATGTATTTCTTTGAGGAAATGAAGGAACGAGCTGGTAAGTATAATCAGGAAGATATTTTTCCTGATGTTGTGACGTACACTACT
TTACTTAAGGGTTTTGGGATTCTGAAAGATGTCCGTCTTGTTCACAAAATTGTGCTGGAAATGAAGTCTTGTCATGATTTGTTGATTGATCGAACAGCATACACTGCAAT
GATTGATGCTTTGGTTAACTGTGGCTCTATAAACGGTGCTCTTTCTTTATTTGGGGAATTATTGAAGCTTTCTGGATGGAATTTGGACTTGCGGCCAAAACCACATCTCT
ATCTCACTTTTATGAGATTTTTTTCCAGTAGAGGAGATTATAGGATGGTCAAATGTTTGCATAGACGCATGTGGCTGGACTCCTCTGGAAGTATTTCTCCTGGATTTCAA
GAAGAGGCAGATCATCTTCTCATGGAGGCAGCTTTACATGACAATCAGATTGATGTGGCAACAGAGAAATTATCAACAATTATTAAGAGATGGAAGGGAATCTCGTGGTC
TAGCAGAGGAGGCAGTGTTGCTCTGCGTATTGAAGCATTGCTTGGACTCACCAAATCTTTCTTTAGTCCTTGCATATTTCCTCGGGTAAATCCGGGTGCTCCTATTGAGA
GTGTCATGATGCCATTTAAAGCCGTTCAGCCATTAAATGGAAGCTTACAGTTGAAGGAAGTGGTCATGCGTTTCTTTGACAAATCAGTTGTGCCTATCATAGACGACTGG
GGTAGATGCATTGGACTATTGCACCGAGAAGACTGCTCTGAGTTGGAGGCTCCCCTTTGGACAATGATGAGAAGCCCTCCTCCCGGTGTTACGACTACAACATCGATCGG
ACATGTTGTGAATCTAATTCTACGAAAGAGGTACAAAATGATTATTATTGTAAGACATAGCAAGTTTAGCACATACGATAGCTCGAGTTCAAGGGCTGTAGGTGTTTTTA
CTATCGAGCAATTGTATGGCTTTATTTCTCCCATTCCCATGCAGCTTCAGCCAAACATTCCACATAGGACCCGTCGATCATCAACTGTACAGAACACTCCGATCCGACGG
CCGATATCGCAGAAGCCGCCATTTTCTTTGTTCTTGCTGCTGCAGAACGGCAACATTCCCGAGTCTTCCGCTGAGCTTCCAAGTATCCCTCAGTATATTTACACACTCGA
AGTCTCCACCTGTTTTCCATTTCGGATCTTCTGTTCTTCCTTTCCTTCTCTCTTTCACCGCCATTCTTGCTCCGAAGTTGAAGAAAAGCGTCTTCAAAAAGATATCATTC
GTTCTTTTGTCCCTAATTTTCTCGGAAGAAGCTTGGAACAGGAACATTTCGTAGAAAACAATCGAGCAAAACAATACACTGAGATCCACCTATGA
Protein sequenceShow/hide protein sequence
MLWIVSFSSRHLVWNPSDSMNVFSCSLPSRTADARRRIDSPRSPNLKRLTSRVVRLTRRKQLHQIFEEIEIAKRRYGKLNTIVMNAVLEACVHCGDIDLALSTFNEMSKP
GNCGLDNVSYGTLLKGLGEARKIDEAFLLLESVEEGTAIGSPTLSAPLIYGVLNALTEAGDMRRANGLIARYGFLLHEGGNLSISVYNLLMKGYISSGVPQAALALYNEM
LNLELKPDKLTYNTLISACGKINKLDAAMYFFEEMKERAGKYNQEDIFPDVVTYTTLLKGFGILKDVRLVHKIVLEMKSCHDLLIDRTAYTAMIDALVNCGSINGALSLF
GELLKLSGWNLDLRPKPHLYLTFMRFFSSRGDYRMVKCLHRRMWLDSSGSISPGFQEEADHLLMEAALHDNQIDVATEKLSTIIKRWKGISWSSRGGSVALRIEALLGLT
KSFFSPCIFPRVNPGAPIESVMMPFKAVQPLNGSLQLKEVVMRFFDKSVVPIIDDWGRCIGLLHREDCSELEAPLWTMMRSPPPGVTTTTSIGHVVNLILRKRYKMIIIV
RHSKFSTYDSSSSRAVGVFTIEQLYGFISPIPMQLQPNIPHRTRRSSTVQNTPIRRPISQKPPFSLFLLLQNGNIPESSAELPSIPQYIYTLEVSTCFPFRIFCSSFPSL
FHRHSCSEVEEKRLQKDIIRSFVPNFLGRSLEQEHFVENNRAKQYTEIHL