; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G206080 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G206080
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCla97Chr10:34899823..34901505
RNA-Seq ExpressionCla97C10G206080
SyntenyCla97C10G206080
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044223.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]6.2e-27589.42Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LSS+KELEQAQAF+VK+G  NHIPIM KLIAFSSLSPSGSLP A+ALF+ETSMDDSFICNTMIRAYSNSVFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVLKACA AIKCTE +D+CFGHDIISRKGAEIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTVR+ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELF SMKQ  I ATEVTFISILGACAE+GALE GKKIHES
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHY+IEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CE+ALEMFDSMKAE  DHKPNRVTFIALLIACSHKGLVAEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYV+IKTCPFSSCSVLWRTLLGGCRVHR VELGE+SFR++A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSSQI
        RLR EMI YGVCKKAGSS +
Subjt:  RLRNEMIDYGVCKKAGSSQI

KGN58697.1 hypothetical protein Csa_000806 [Cucumis sativus]1.1e-27188.22Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LS +KELEQAQAF+VK+G  NHIPI+ KLIAFSSLSP GSLPHA ALF+ETSMDDSFICNTMIRAYSN+VFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVL+ACARAIKCTE +D+CFGH IISRKG+EIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTV++ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSII+GYV VKDYKGAL+LF SMKQ  I ATEVTFISILGACAE+GALEIGKKIH+S
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHYRIEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CERALEMFDSMKAE  DHKPNR+TFIALLIACSHKGL+AEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCR+HR+VELGE+SFRK+A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSS
        RLR EMI+YGVCKKAGSS
Subjt:  RLRNEMIDYGVCKKAGSS

XP_004137888.1 pentatricopeptide repeat-containing protein At5g15300 [Cucumis sativus]4.9e-27287.88Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LS +KELEQAQAF+VK+G  NHIPI+ KLIAFSSLSP GSLPHA ALF+ETSMDDSFICNTMIRAYSN+VFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVL+ACARAIKCTE +D+CFGH IISRKG+EIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTV++ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSII+GYV VKDYKGAL+LF SMKQ  I ATEVTFISILGACAE+GALEIGKKIH+S
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHYRIEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CERALEMFDSMKAE  DHKPNR+TFIALLIACSHKGL+AEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCR+HR+VELGE+SFRK+A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSSQI
        RLR EMI+YGVCKKAGSS +
Subjt:  RLRNEMIDYGVCKKAGSSQI

XP_008442325.1 PREDICTED: pentatricopeptide repeat-containing protein At5g15300-like [Cucumis melo]1.6e-27589.62Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LSS+KELEQAQAF+VK+G  NHIPIM KLIAFSSLSPSGSLP A+ALF+ETSMDDSFICNTMIRAYSNSVFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVLKACA AIKCTE +D+CFGHDIISRKGAEIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTVR+ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELF SMKQ  I ATEVTFISILGACAE+GALE GKKIHES
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHY+IEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CE+ALEMFDSMKAE  DHKPNRVTFIALLIACSHKGLVAEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHR VELGE+SFR++A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSSQI
        RLR EMI YGVCKKAGSS +
Subjt:  RLRNEMIDYGVCKKAGSSQI

XP_038904117.1 pentatricopeptide repeat-containing protein At5g15300-like [Benincasa hispida]1.2e-28690.93Show/hide
Query:  MLVKGAISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQR
        M +KGA SY+IPKLH LSSMKELEQAQAF+VK+GLCNHI IM KLIAFSSLSPSGSLPH+HALF+ETSMDDSFICNTMIRAYSNSVFP+KALLIYNHMQR
Subjt:  MLVKGAISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQR

Query:  MDVDSDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRV
        MDV SDHFTYNFVLK CARAIKCTEK+D+CFGHDIISRKGAEIH+ VLKLGLDQDHHVQNSLLLMYS CGLVVFARLIFDEMT+RS VSWNIM+SAYNRV
Subjt:  MDVDSDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRV

Query:  HDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGAL
        HDYKS+DVLLELMP+TNVISWNTVLARYIRLN+LVAARKVFEEMPERDVVSWNSIIAGYVKVKDYK ALELF SMKQS I ATEVTFISILGACAEMGAL
Subjt:  HDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGAL

Query:  EIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHK
        E+GKKIHESLKEKHYRI+GYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CERALEMFDSMKAEHD  KPNR+TFIALLIACSHK
Subjt:  EIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHK

Query:  GLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYA
        GLVAEGRHFF+LM+TKYKI PDLKHYGCMIDLLSRWGFL+EAYVMIKTCPFSSCS+LWRTLLGGCRVHR+VELGE+SFRK+A LEPGKDGDYVLLSN+YA
Subjt:  GLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYA

Query:  EEERWDDVERLRNEMIDYGVCKKAGSSQI
        EEERWDDVERLRNEMIDYGVCKKAGSS +
Subjt:  EEERWDDVERLRNEMIDYGVCKKAGSSQI

TrEMBL top hitse value%identityAlignment
A0A0A0LD95 Uncharacterized protein5.3e-27288.22Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LS +KELEQAQAF+VK+G  NHIPI+ KLIAFSSLSP GSLPHA ALF+ETSMDDSFICNTMIRAYSN+VFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVL+ACARAIKCTE +D+CFGH IISRKG+EIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTV++ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSII+GYV VKDYKGAL+LF SMKQ  I ATEVTFISILGACAE+GALEIGKKIH+S
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHYRIEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CERALEMFDSMKAE  DHKPNR+TFIALLIACSHKGL+AEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCR+HR+VELGE+SFRK+A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSS
        RLR EMI+YGVCKKAGSS
Subjt:  RLRNEMIDYGVCKKAGSS

A0A1S3B5F5 pentatricopeptide repeat-containing protein At5g15300-like7.9e-27689.62Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LSS+KELEQAQAF+VK+G  NHIPIM KLIAFSSLSPSGSLP A+ALF+ETSMDDSFICNTMIRAYSNSVFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVLKACA AIKCTE +D+CFGHDIISRKGAEIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTVR+ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELF SMKQ  I ATEVTFISILGACAE+GALE GKKIHES
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHY+IEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CE+ALEMFDSMKAE  DHKPNRVTFIALLIACSHKGLVAEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHR VELGE+SFR++A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSSQI
        RLR EMI YGVCKKAGSS +
Subjt:  RLRNEMIDYGVCKKAGSSQI

A0A5A7TL32 Pentatricopeptide repeat-containing protein3.0e-27589.42Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LSS+KELEQAQAF+VK+G  NHIPIM KLIAFSSLSPSGSLP A+ALF+ETSMDDSFICNTMIRAYSNSVFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVLKACA AIKCTE +D+CFGHDIISRKGAEIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTVR+ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELF SMKQ  I ATEVTFISILGACAE+GALE GKKIHES
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHY+IEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CE+ALEMFDSMKAE  DHKPNRVTFIALLIACSHKGLVAEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYV+IKTCPFSSCSVLWRTLLGGCRVHR VELGE+SFR++A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSSQI
        RLR EMI YGVCKKAGSS +
Subjt:  RLRNEMIDYGVCKKAGSSQI

A0A5D3DN16 Pentatricopeptide repeat-containing protein7.9e-27689.62Show/hide
Query:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        MIPKLH LSS+KELEQAQAF+VK+G  NHIPIM KLIAFSSLSPSGSLP A+ALF+ETSMDDSFICNTMIRAYSNSVFP+KALLIYN MQRMDVDSDHFT
Subjt:  MIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        YNFVLKACA AIKCTE +D+CFGHDIISRKGAEIH  +LKLG DQDHHVQNSLLL+YSG GLV FARLIF+EMTVR+ VSWNIMMSAYNRVHDYKS+DVL
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES
        LE MP+TN +SWNT+LARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELF SMKQ  I ATEVTFISILGACAE+GALE GKKIHES
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHES

Query:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
        LKEKHY+IEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CE+ALEMFDSMKAE  DHKPNRVTFIALLIACSHKGLVAEGRHF
Subjt:  LKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF

Query:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE
        FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHR VELGE+SFR++A LEPGKDGDYVLLSN+YAEEERWDDVE
Subjt:  FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVE

Query:  RLRNEMIDYGVCKKAGSSQI
        RLR EMI YGVCKKAGSS +
Subjt:  RLRNEMIDYGVCKKAGSSQI

A0A6J1F4H4 pentatricopeptide repeat-containing protein At5g15300-like1.5e-27185.63Show/hide
Query:  MLVKGAISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQR
        M++KGA+SY++PKL  LSSMKELE A AF+VK+GLCNHIP+M KLIAFSSLSPSGSLPHAHALF++ SMDDSFICNTMIRAYSNSVFP+KALLIYNHMQR
Subjt:  MLVKGAISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQR

Query:  MDVDSDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRV
        MDV SDHFTYNFVLKACARAIKCTEK+D+CFGHDIISRKGAEIH+ VLKLGLDQDHHVQNSLLLMYSGCGLVVFAR++F+EMTVRS VSWNIMMSAYNRV
Subjt:  MDVDSDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRV

Query:  HDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGAL
         DYKS+D LL+LMP+TNV SWNTVLARYI LNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYK A+E+F SMKQS I ATEVTFISILGACAE G+L
Subjt:  HDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGAL

Query:  EIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHK
        E+GKKIHESLK +HYRIEGYLGNAIVDMYAKCGEL LALEVFNEMEMKPVSCWNAMIMGLAVHG CERAL+MFDSM+  +DDHKPNRVTF+A+LIACSHK
Subjt:  EIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHK

Query:  GLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYA
        GLVAEGRHF SLM+ KYKIMPDLKHYGCM+DLLSRWGFL+EAY MIK CPFSSC+V+WRTLLGGCRVHR+VELGE++F K+  LE  KDGDYVLLSN+YA
Subjt:  GLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYA

Query:  EEERWDDVERLRNEMIDYGVCKKAGSSQI
        EEERWDDV RLRNEMI+YGVCKKAGSS +
Subjt:  EEERWDDVERLRNEMIDYGVCKKAGSSQI

SwissProt top hitse value%identityAlignment
O49399 Pentatricopeptide repeat-containing protein At4g188405.7e-9036.27Show/hide
Query:  SMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPS-GSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC
        S+ E++QA AF++K+GL +     +KL+AF++ +P   ++ +AH++       + F  N++IRAY+NS  P  AL ++  M    V  D +++ FVLKAC
Subjt:  SMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPS-GSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC

Query:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN
        A      E              G +IH   +K GL  D  V+N+L+ +Y   G    AR + D M VR  VSWN ++SAY        +  L + M E N
Subjt:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN

Query:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATE-VTFISILGACAEMGALEIGKKIHESLKEKHYR
        V SWN +++ Y     +  A++VF+ MP RDVVSWN+++  Y  V  Y   LE+F  M        +  T +S+L ACA +G+L  G+ +H  + +    
Subjt:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATE-VTFISILGACAEMGALEIGKKIHESLKEKHYR

Query:  IEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTK
        IEG+L  A+VDMY+KCG++  ALEVF     + VS WN++I  L+VHG  + ALE+F  M   ++  KPN +TFI +L AC+H G++ + R  F +M + 
Subjt:  IEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTK

Query:  YKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEM
        Y++ P ++HYGCM+DLL R G +EEA  ++   P    S+L  +LLG C+    +E  E+   ++  L       Y  +SN+YA + RW+ V   R  M
Subjt:  YKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEM

Q56X05 Pentatricopeptide repeat-containing protein At1g061431.6e-9236.04Show/hide
Query:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC
        S+ K LE A A ++K+ L     +M + I  ++ +    L  A +   +    + F+ N + + +     PI++L +Y  M R  V    +TY+ ++KA 
Subjt:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC

Query:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN
        + A                SR G  +  H+ K G      +Q +L+  YS  G +  AR +FDEM  R +++W  M+SAY RV D  S++ L   M E N
Subjt:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN

Query:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHESLKEKHYRI
          + N ++  Y+ L NL  A  +F +MP +D++SW ++I GY + K Y+ A+ +F  M +  I   EVT  +++ ACA +G LEIGK++H    +  + +
Subjt:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHESLKEKHYRI

Query:  EGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKY
        + Y+G+A+VDMY+KCG L  AL VF  +  K + CWN++I GLA HG  + AL+MF   K E +  KPN VTF+++  AC+H GLV EGR  +  M+  Y
Subjt:  EGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKY

Query:  KIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMID
         I+ +++HYG M+ L S+ G + EA  +I    F   +V+W  LL GCR+H+N+ + E +F K+  LEP   G Y LL +MYAE+ RW DV  +R  M +
Subjt:  KIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMID

Query:  YGVCK
         G+ K
Subjt:  YGVCK

Q9CA54 Pentatricopeptide repeat-containing protein At1g746304.4e-9035.67Show/hide
Query:  AISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMD-VD
        AI + +  L+   +++ L Q     +K G+        KLI   ++S S +LP+A  L       D+F+ NT++R YS S  P  ++ ++  M R   V 
Subjt:  AISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMD-VD

Query:  SDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYK
         D F++ FV+KA                     R G ++H   LK GL+    V  +L+ MY GCG V FAR +FDEM   + V+WN +++A  R +D  
Subjt:  SDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYK

Query:  SSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGK
         +  + + M   N  SWN +LA YI+   L +A+++F EMP RD VSW+++I G      +  +   F  ++++ ++  EV+   +L AC++ G+ E GK
Subjt:  SSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGK

Query:  KIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEM-EMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLV
         +H  +++  Y     + NA++DMY++CG + +A  VF  M E + +  W +MI GLA+HG  E A+ +F+ M A      P+ ++FI+LL ACSH GL+
Subjt:  KIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEM-EMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLV

Query:  AEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEE
         EG  +FS M   Y I P+++HYGCM+DL  R G L++AY  I   P    +++WRTLLG C  H N+EL E+  +++  L+P   GD VLLSN YA   
Subjt:  AEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEE

Query:  RWDDVERLRNEMI
        +W DV  +R  MI
Subjt:  RWDDVERLRNEMI

Q9LS72 Pentatricopeptide repeat-containing protein At3g292303.3e-9334.92Show/hide
Query:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC
        +++ +++Q  A +++  L   + I  KLI+  SL    +L  A  +F +    +  +CN++IRA++ +  P +A  +++ MQR  + +D+FTY F+LKAC
Subjt:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC

Query:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCG-------LVVF--------------------------ARLIFDEMTV
        +             G   +      +H H+ KLGL  D +V N+L+  YS CG       + +F                          AR +FDEM  
Subjt:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCG-------LVVF--------------------------ARLIFDEMTV

Query:  RSNVSWNIMMSAYNRVHDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEM--PERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITA
        R  +SWN M+  Y R  +   +  L E MPE N +SW+T++  Y +  ++  AR +F++M  P ++VV+W  IIAGY +    K A  L   M  S +  
Subjt:  RSNVSWNIMMSAYNRVHDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEM--PERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITA

Query:  TEVTFISILGACAEMGALEIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDD
             ISIL AC E G L +G +IH  LK  +     Y+ NA++DMYAKCG L+ A +VFN++  K +  WN M+ GL VHG  + A+E+F  M+ E   
Subjt:  TEVTFISILGACAEMGALEIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDD

Query:  HKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIA
         +P++VTFIA+L +C+H GL+ EG  +F  M   Y ++P ++HYGC++DLL R G L+EA  +++T P     V+W  LLG CR+H  V++ ++    + 
Subjt:  HKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIA

Query:  GLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMIDYGVCKKAGSSQI
         L+P   G+Y LLSN+YA  E W+ V  +R++M   GV K +G+S +
Subjt:  GLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMIDYGVCKKAGSSQI

Q9LXF2 Pentatricopeptide repeat-containing protein At5g153002.2e-9736.76Show/hide
Query:  PKL--HCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        PKL  +C  +++ L+Q  A +V +GL +++ ++ +LI  +SLS  G+L +AH LF E    D  ICN ++R  + S+ P K + +Y  M++  V  D +T
Subjt:  PKL--HCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        + FVLKAC++                    G   H  V++ G   + +V+N+L+L ++ CG +  A  +FD+      V+W+ M S Y +      +  L
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIH-E
         + MP  + ++WN ++   ++   + +AR++F+   E+DVV+WN++I+GYV     K AL +F  M+ +      VT +S+L ACA +G LE GK++H  
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIH-E

Query:  SLKEKHYRIEGYLG----NAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVA
         L+        Y+G    NA++DMYAKCG +  A+EVF  ++ + +S WN +I+GLA+H   E ++EMF+ M  +     PN VTFI +++ACSH G V 
Subjt:  SLKEKHYRIEGYLG----NAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVA

Query:  EGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEER
        EGR +FSLM   Y I P++KHYGCM+D+L R G LEEA++ +++      +++WRTLLG C+++ NVELG+ +  K+  +   + GDYVLLSN+YA   +
Subjt:  EGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEER

Query:  WDDVERLRNEMIDYGVCKKAGSSQI
        WD V+++R    D  V K  G S I
Subjt:  WDDVERLRNEMIDYGVCKKAGSSQI

Arabidopsis top hitse value%identityAlignment
AT1G06150.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.1e-9336.04Show/hide
Query:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC
        S+ K LE A A ++K+ L     +M + I  ++ +    L  A +   +    + F+ N + + +     PI++L +Y  M R  V    +TY+ ++KA 
Subjt:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC

Query:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN
        + A                SR G  +  H+ K G      +Q +L+  YS  G +  AR +FDEM  R +++W  M+SAY RV D  S++ L   M E N
Subjt:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN

Query:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHESLKEKHYRI
          + N ++  Y+ L NL  A  +F +MP +D++SW ++I GY + K Y+ A+ +F  M +  I   EVT  +++ ACA +G LEIGK++H    +  + +
Subjt:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIHESLKEKHYRI

Query:  EGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKY
        + Y+G+A+VDMY+KCG L  AL VF  +  K + CWN++I GLA HG  + AL+MF   K E +  KPN VTF+++  AC+H GLV EGR  +  M+  Y
Subjt:  EGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKY

Query:  KIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMID
         I+ +++HYG M+ L S+ G + EA  +I    F   +V+W  LL GCR+H+N+ + E +F K+  LEP   G Y LL +MYAE+ RW DV  +R  M +
Subjt:  KIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMID

Query:  YGVCK
         G+ K
Subjt:  YGVCK

AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-9135.67Show/hide
Query:  AISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMD-VD
        AI + +  L+   +++ L Q     +K G+        KLI   ++S S +LP+A  L       D+F+ NT++R YS S  P  ++ ++  M R   V 
Subjt:  AISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMD-VD

Query:  SDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYK
         D F++ FV+KA                     R G ++H   LK GL+    V  +L+ MY GCG V FAR +FDEM   + V+WN +++A  R +D  
Subjt:  SDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYK

Query:  SSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGK
         +  + + M   N  SWN +LA YI+   L +A+++F EMP RD VSW+++I G      +  +   F  ++++ ++  EV+   +L AC++ G+ E GK
Subjt:  SSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGK

Query:  KIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEM-EMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLV
         +H  +++  Y     + NA++DMY++CG + +A  VF  M E + +  W +MI GLA+HG  E A+ +F+ M A      P+ ++FI+LL ACSH GL+
Subjt:  KIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEM-EMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLV

Query:  AEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEE
         EG  +FS M   Y I P+++HYGCM+DL  R G L++AY  I   P    +++WRTLLG C  H N+EL E+  +++  L+P   GD VLLSN YA   
Subjt:  AEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEE

Query:  RWDDVERLRNEMI
        +W DV  +R  MI
Subjt:  RWDDVERLRNEMI

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.3e-9434.92Show/hide
Query:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC
        +++ +++Q  A +++  L   + I  KLI+  SL    +L  A  +F +    +  +CN++IRA++ +  P +A  +++ MQR  + +D+FTY F+LKAC
Subjt:  SSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC

Query:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCG-------LVVF--------------------------ARLIFDEMTV
        +             G   +      +H H+ KLGL  D +V N+L+  YS CG       + +F                          AR +FDEM  
Subjt:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCG-------LVVF--------------------------ARLIFDEMTV

Query:  RSNVSWNIMMSAYNRVHDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEM--PERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITA
        R  +SWN M+  Y R  +   +  L E MPE N +SW+T++  Y +  ++  AR +F++M  P ++VV+W  IIAGY +    K A  L   M  S +  
Subjt:  RSNVSWNIMMSAYNRVHDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEM--PERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITA

Query:  TEVTFISILGACAEMGALEIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDD
             ISIL AC E G L +G +IH  LK  +     Y+ NA++DMYAKCG L+ A +VFN++  K +  WN M+ GL VHG  + A+E+F  M+ E   
Subjt:  TEVTFISILGACAEMGALEIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDD

Query:  HKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIA
         +P++VTFIA+L +C+H GL+ EG  +F  M   Y ++P ++HYGC++DLL R G L+EA  +++T P     V+W  LLG CR+H  V++ ++    + 
Subjt:  HKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIA

Query:  GLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMIDYGVCKKAGSSQI
         L+P   G+Y LLSN+YA  E W+ V  +R++M   GV K +G+S +
Subjt:  GLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMIDYGVCKKAGSSQI

AT4G18840.1 Pentatricopeptide repeat (PPR-like) superfamily protein4.1e-9136.27Show/hide
Query:  SMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPS-GSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC
        S+ E++QA AF++K+GL +     +KL+AF++ +P   ++ +AH++       + F  N++IRAY+NS  P  AL ++  M    V  D +++ FVLKAC
Subjt:  SMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPS-GSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKAC

Query:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN
        A      E              G +IH   +K GL  D  V+N+L+ +Y   G    AR + D M VR  VSWN ++SAY        +  L + M E N
Subjt:  ARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVLLELMPETN

Query:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATE-VTFISILGACAEMGALEIGKKIHESLKEKHYR
        V SWN +++ Y     +  A++VF+ MP RDVVSWN+++  Y  V  Y   LE+F  M        +  T +S+L ACA +G+L  G+ +H  + +    
Subjt:  VISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATE-VTFISILGACAEMGALEIGKKIHESLKEKHYR

Query:  IEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTK
        IEG+L  A+VDMY+KCG++  ALEVF     + VS WN++I  L+VHG  + ALE+F  M   ++  KPN +TFI +L AC+H G++ + R  F +M + 
Subjt:  IEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHFFSLMVTK

Query:  YKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEM
        Y++ P ++HYGCM+DLL R G +EEA  ++   P    S+L  +LLG C+    +E  E+   ++  L       Y  +SN+YA + RW+ V   R  M
Subjt:  YKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEM

AT5G15300.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-9836.76Show/hide
Query:  PKL--HCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT
        PKL  +C  +++ L+Q  A +V +GL +++ ++ +LI  +SLS  G+L +AH LF E    D  ICN ++R  + S+ P K + +Y  M++  V  D +T
Subjt:  PKL--HCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMIRAYSNSVFPIKALLIYNHMQRMDVDSDHFT

Query:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL
        + FVLKAC++                    G   H  V++ G   + +V+N+L+L ++ CG +  A  +FD+      V+W+ M S Y +      +  L
Subjt:  YNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVSWNIMMSAYNRVHDYKSSDVL

Query:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIH-E
         + MP  + ++WN ++   ++   + +AR++F+   E+DVV+WN++I+GYV     K AL +F  M+ +      VT +S+L ACA +G LE GK++H  
Subjt:  LELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGALEIGKKIH-E

Query:  SLKEKHYRIEGYLG----NAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVA
         L+        Y+G    NA++DMYAKCG +  A+EVF  ++ + +S WN +I+GLA+H   E ++EMF+ M  +     PN VTFI +++ACSH G V 
Subjt:  SLKEKHYRIEGYLG----NAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVA

Query:  EGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEER
        EGR +FSLM   Y I P++KHYGCM+D+L R G LEEA++ +++      +++WRTLLG C+++ NVELG+ +  K+  +   + GDYVLLSN+YA   +
Subjt:  EGRHFFSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEER

Query:  WDDVERLRNEMIDYGVCKKAGSSQI
        WD V+++R    D  V K  G S I
Subjt:  WDDVERLRNEMIDYGVCKKAGSSQI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACAATGCAATCTGGTGGCGTTTCAAATTTGGTTGAGCCCTTGAGGGTCCATACAAAATGCATTTGCCTTCCTTCACGAAGATGGAAAATGCTTGTGAAAGGTGC
CATTTCCTACATGATTCCTAAGCTTCACTGTCTCTCTTCCATGAAAGAACTGGAACAAGCTCAAGCTTTTGTCGTCAAATCTGGTCTCTGTAATCACATTCCTATAATGG
CGAAGTTAATTGCATTCTCATCCCTTTCTCCATCAGGAAGTCTCCCTCATGCTCATGCTCTGTTCCGAGAGACTTCTATGGACGATTCTTTCATTTGTAACACCATGATT
CGAGCCTACTCAAACAGTGTTTTTCCCATTAAAGCCTTGCTTATTTACAACCATATGCAACGAATGGATGTTGATTCTGATCATTTCACCTACAATTTTGTGCTCAAGGC
ATGTGCAAGAGCTATCAAATGCACTGAAAAGAACGACGAATGTTTCGGGCATGACATTATTTCTCGCAAGGGTGCTGAAATTCATACCCACGTCCTGAAATTGGGGCTTG
ATCAAGATCATCACGTCCAGAATTCATTGCTTCTAATGTACTCTGGGTGTGGCTTGGTAGTTTTTGCTCGTTTGATTTTCGATGAAATGACTGTGAGAAGTAATGTTTCC
TGGAATATTATGATGTCGGCTTATAATCGAGTTCATGACTATAAGTCATCGGATGTTCTTCTTGAATTGATGCCTGAGACAAATGTAATTTCTTGGAATACCGTATTGGC
ACGATATATTAGATTGAATAATCTTGTAGCAGCACGAAAAGTGTTCGAAGAAATGCCTGAGAGGGACGTTGTATCTTGGAATTCTATAATTGCGGGTTATGTGAAGGTCA
AAGATTATAAGGGAGCTCTGGAGCTCTTCTGTAGCATGAAACAGTCGGAAATTACAGCAACTGAAGTAACGTTTATTAGTATATTGGGCGCTTGTGCTGAAATGGGTGCA
TTGGAGATAGGGAAGAAAATCCACGAGTCCTTGAAAGAGAAGCATTACAGAATTGAAGGATATCTAGGTAATGCCATAGTAGATATGTACGCTAAATGTGGGGAACTGCG
CTTAGCTTTGGAGGTATTCAATGAAATGGAGATGAAGCCTGTAAGTTGTTGGAATGCGATGATTATGGGTTTGGCAGTTCATGGTGACTGTGAGAGAGCCTTGGAGATGT
TTGATTCCATGAAGGCAGAGCATGATGATCACAAACCCAATCGGGTAACTTTCATTGCTCTTCTGATTGCCTGTAGTCACAAGGGTCTGGTGGCAGAAGGACGCCATTTT
TTTAGTCTGATGGTTACCAAATACAAGATAATGCCGGATTTAAAGCATTATGGTTGTATGATTGACCTTCTTAGTAGATGGGGTTTTTTGGAAGAAGCGTACGTTATGAT
CAAAACTTGCCCTTTCAGTTCATGCTCTGTTCTATGGAGAACTTTGTTGGGTGGTTGTAGGGTACACAGGAATGTAGAATTGGGCGAGAAATCGTTCCGCAAAATTGCGG
GGTTGGAGCCGGGGAAGGATGGGGATTATGTACTTTTATCAAACATGTACGCTGAAGAAGAGCGGTGGGATGATGTGGAGCGACTGAGAAACGAGATGATTGATTATGGA
GTTTGTAAGAAAGCTGGGTCTAGTCAAATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACAATGCAATCTGGTGGCGTTTCAAATTTGGTTGAGCCCTTGAGGGTCCATACAAAATGCATTTGCCTTCCTTCACGAAGATGGAAAATGCTTGTGAAAGGTGC
CATTTCCTACATGATTCCTAAGCTTCACTGTCTCTCTTCCATGAAAGAACTGGAACAAGCTCAAGCTTTTGTCGTCAAATCTGGTCTCTGTAATCACATTCCTATAATGG
CGAAGTTAATTGCATTCTCATCCCTTTCTCCATCAGGAAGTCTCCCTCATGCTCATGCTCTGTTCCGAGAGACTTCTATGGACGATTCTTTCATTTGTAACACCATGATT
CGAGCCTACTCAAACAGTGTTTTTCCCATTAAAGCCTTGCTTATTTACAACCATATGCAACGAATGGATGTTGATTCTGATCATTTCACCTACAATTTTGTGCTCAAGGC
ATGTGCAAGAGCTATCAAATGCACTGAAAAGAACGACGAATGTTTCGGGCATGACATTATTTCTCGCAAGGGTGCTGAAATTCATACCCACGTCCTGAAATTGGGGCTTG
ATCAAGATCATCACGTCCAGAATTCATTGCTTCTAATGTACTCTGGGTGTGGCTTGGTAGTTTTTGCTCGTTTGATTTTCGATGAAATGACTGTGAGAAGTAATGTTTCC
TGGAATATTATGATGTCGGCTTATAATCGAGTTCATGACTATAAGTCATCGGATGTTCTTCTTGAATTGATGCCTGAGACAAATGTAATTTCTTGGAATACCGTATTGGC
ACGATATATTAGATTGAATAATCTTGTAGCAGCACGAAAAGTGTTCGAAGAAATGCCTGAGAGGGACGTTGTATCTTGGAATTCTATAATTGCGGGTTATGTGAAGGTCA
AAGATTATAAGGGAGCTCTGGAGCTCTTCTGTAGCATGAAACAGTCGGAAATTACAGCAACTGAAGTAACGTTTATTAGTATATTGGGCGCTTGTGCTGAAATGGGTGCA
TTGGAGATAGGGAAGAAAATCCACGAGTCCTTGAAAGAGAAGCATTACAGAATTGAAGGATATCTAGGTAATGCCATAGTAGATATGTACGCTAAATGTGGGGAACTGCG
CTTAGCTTTGGAGGTATTCAATGAAATGGAGATGAAGCCTGTAAGTTGTTGGAATGCGATGATTATGGGTTTGGCAGTTCATGGTGACTGTGAGAGAGCCTTGGAGATGT
TTGATTCCATGAAGGCAGAGCATGATGATCACAAACCCAATCGGGTAACTTTCATTGCTCTTCTGATTGCCTGTAGTCACAAGGGTCTGGTGGCAGAAGGACGCCATTTT
TTTAGTCTGATGGTTACCAAATACAAGATAATGCCGGATTTAAAGCATTATGGTTGTATGATTGACCTTCTTAGTAGATGGGGTTTTTTGGAAGAAGCGTACGTTATGAT
CAAAACTTGCCCTTTCAGTTCATGCTCTGTTCTATGGAGAACTTTGTTGGGTGGTTGTAGGGTACACAGGAATGTAGAATTGGGCGAGAAATCGTTCCGCAAAATTGCGG
GGTTGGAGCCGGGGAAGGATGGGGATTATGTACTTTTATCAAACATGTACGCTGAAGAAGAGCGGTGGGATGATGTGGAGCGACTGAGAAACGAGATGATTGATTATGGA
GTTTGTAAGAAAGCTGGGTCTAGTCAAATTTAA
Protein sequenceShow/hide protein sequence
METMQSGGVSNLVEPLRVHTKCICLPSRRWKMLVKGAISYMIPKLHCLSSMKELEQAQAFVVKSGLCNHIPIMAKLIAFSSLSPSGSLPHAHALFRETSMDDSFICNTMI
RAYSNSVFPIKALLIYNHMQRMDVDSDHFTYNFVLKACARAIKCTEKNDECFGHDIISRKGAEIHTHVLKLGLDQDHHVQNSLLLMYSGCGLVVFARLIFDEMTVRSNVS
WNIMMSAYNRVHDYKSSDVLLELMPETNVISWNTVLARYIRLNNLVAARKVFEEMPERDVVSWNSIIAGYVKVKDYKGALELFCSMKQSEITATEVTFISILGACAEMGA
LEIGKKIHESLKEKHYRIEGYLGNAIVDMYAKCGELRLALEVFNEMEMKPVSCWNAMIMGLAVHGDCERALEMFDSMKAEHDDHKPNRVTFIALLIACSHKGLVAEGRHF
FSLMVTKYKIMPDLKHYGCMIDLLSRWGFLEEAYVMIKTCPFSSCSVLWRTLLGGCRVHRNVELGEKSFRKIAGLEPGKDGDYVLLSNMYAEEERWDDVERLRNEMIDYG
VCKKAGSSQI