; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022239 (gene) of Chayote v1 genome

Gene IDSed0022239
OrganismSechium edule (Chayote v1)
DescriptionUPF0503 protein At3g09070, chloroplastic-like
Genome locationLG02:37068514..37070485
RNA-Seq ExpressionSed0022239
SyntenySed0022239
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR008004 - Protein OCTOPUS-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6580491.1 Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia]4.7e-21468.8Show/hide
Query:  MNPSTAAAPQPAPPSVA-------AAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSF
        MNPST   P P PP            +A+CPRHPQE  T FCPLCLCERLSL++SSS ++SSSTRKP S AASALKAIF+P+  +  +S  P LRR+ SF
Subjt:  MNPSTAAAPQPAPPSVA-------AAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSF

Query:  SASKNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEE
        SASKN+ FS+  EPQRKSCD+RVR                       I P TK L+D                   + PNV+D VI+N VQEIVEEEE +
Subjt:  SASKNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEE

Query:  EEI---PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDID
         E+   P++L Q+E K MKDHIDLDSHTKKP+  GSFWSAASV SKKLQKWRD KQK KKQR+   + TLPVEKPIGRHFR+TQSEIADYG+GRRSCDID
Subjt:  EEI---PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDID

Query:  PRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSN
        PRFSLD  RMS DDPR+SFDEPRASWDGYLISRT PRMPTMLSVVEDAP++VFR+DAQIPVEDS+NS +E+EN PGGS QTRDYYS     RRKSLDRSN
Subjt:  PRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSN

Query:  SIRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGN
        SIRKTAAAVVAEIDEMK SVSNAKVSPA T+  H        RDSNS+SL++DCS S +  FND AS     NRKEESKKSR WGKGW IWGLINRRGGN
Subjt:  SIRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGN

Query:  KDEEEERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPT
        KDEEE++E  RPNG+ERSYSGSWPELRG+RN D KGGFNPKMFRSNSSVSWRSSSM+GGSFSS+RKSNA++NGNG     KKK +EPVLERNRSARHSPT
Subjt:  KDEEEERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPT

Query:  NVDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        N+DNGLLRFYL  ++GSRRGGSGK KPNQAQSIARSVLRLY
Subjt:  NVDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

KAG7017244.1 UPF0503 protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]4.0e-21368.64Show/hide
Query:  MNPSTAAAPQPAPPSVA-------AAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSF
        MNPST   P P PP            +A+CPRHPQE  T FCPLCLCERLSL++SSS ++SSSTRKP S AASALKAIF+P+  +  +S  P LRR+ SF
Subjt:  MNPSTAAAPQPAPPSVA-------AAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSF

Query:  SASKNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEE
        SASKN+ FS+  EPQRKSCD+RVR                       I P TK L+D                   + PNV+D VI+N VQEIVEEEE +
Subjt:  SASKNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEE

Query:  EEI---PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDID
         E+   P++L Q+E K MKDHIDLDSHTKKP+  GSFWSAASV SKKLQKWRD KQK KKQR+   + TLPVEKPIGRHFR+TQSEIADYG+GRRSCDID
Subjt:  EEI---PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDID

Query:  PRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSN
        PRFSLD  RMS DDPR+SFDEPRASWDGYLISRT PRMPTMLSVVEDAP++VFR+DAQIPVEDS+NS +E+EN PGGS QTRDYYS     RRKSLDRSN
Subjt:  PRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSN

Query:  SIRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGN
        SIRKTAAAVVAEIDEMK SVSNAKVSPA T+  H        RDSNS+SL++DCS S +  FND AS     NRKEESKKSR WGKGW IWGLINRRGGN
Subjt:  SIRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGN

Query:  KDEEEEREI-RPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPT
        KDEEE++E  RPNG+ERSYSGSWPEL G+RN D KGGFNPKMFRSNSSVSWRSSSM+GGSFSS+RKSNA++NGNG     KKK +EPVLERNRSARHSPT
Subjt:  KDEEEEREI-RPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPT

Query:  NVDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        N+DNGLLRFYL  ++GSRRGGSGK KPNQAQSIARSVLRLY
Subjt:  NVDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

XP_022934120.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata]2.3e-21669.7Show/hide
Query:  MNPST---AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASK
        MNPST    A P P PP     +A+CPRHPQE  T FCPLCLCERLSL++SSS ++SSSTRKP S AASALKAIF+P+  +  +S  P LRR+ SFSASK
Subjt:  MNPST---AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASK

Query:  NEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEEEEI-
        N+ FS+  EPQRKSCD+RVR                       I P TK L+D                   + PNV+D VI+N VQEIVEEEE   E+ 
Subjt:  NEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEEEEI-

Query:  --PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
          P++L Q+E K MKDHIDLDSHTKKP+  GSFWSAASV SKKLQKWRD KQK KKQR+   + TLPVEKPIGRHFR+TQSEIADYG+GRRSCDIDPRFS
Subjt:  --PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS

Query:  LDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSIRK
        LD  RMSFDDPR+SFDEPRASWDGYLISRTFPRMPTMLSVVEDAP++VFR+DAQIPVEDS+NS +E+EN PGGS QTRDYYS     RRKSLDRSNSIRK
Subjt:  LDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSIRK

Query:  TAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNKDEE
        TAAAVVAEIDEMK SVSNAKVSPA T+  H        RDSNS+SL++DCS SF+  FND AS     NRKEESKKSR WGKGW IWGLINRRGGNKDEE
Subjt:  TAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNKDEE

Query:  EERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDN
        E++E  RPNG+ERSYSGSWPELRG+RN D KGGFNPKMFRSNSSVSWRSSSM+GGSFSS+RKSNA++NGNG     +KK +EPVLERNRSARHSPTN+DN
Subjt:  EERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDN

Query:  GLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        GLLRFYL  ++GSRRGGSGK KPNQAQSIARSVLRLY
Subjt:  GLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

XP_023526761.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo]3.1e-21368.75Show/hide
Query:  MNPSTAAAPQPAPPSVA------AAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFS
        MNPST   P P PP           +A+CPRHPQE  T FCPLCLCERLSL++SSS ++SSSTRKP S AASALKAIF+P+  +  +   P LRR+ SFS
Subjt:  MNPSTAAAPQPAPPSVA------AAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFS

Query:  ASKNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEEE
        ASKN+ FS+  EPQRKSCD+RVR                       I   TK L+D                   + PNV+D V +N VQEIVEEEE + 
Subjt:  ASKNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEEE

Query:  EI---PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDP
        E+   P++L Q+E K MKDHIDLDSHTKKP+  GSFWSAASV SKKLQKWRD KQK KKQR+   + TLPVEKPIGRHFR+TQSEIADYG+GRRSCDIDP
Subjt:  EI---PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDP

Query:  RFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNS
        RFSLD  R+SFDDPR+SFDEPRASWDGYLISRTFPRMPTMLSVVEDAP++VFR+DAQIPVEDS+NS +E+EN PGGS QTRDYYS     RRKSLDRSNS
Subjt:  RFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNS

Query:  IRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNK
        IRKTAAAVVAEIDEMK SVSNAKVSPA T+  H        RDSNS+SL+DDCS S +  FND AS     NRKEESKKSR WGKGW IWGLINRRGGNK
Subjt:  IRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNK

Query:  DEEEERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTN
        DEEE++E  RPNG+ERSYSGSWPELRG+RN D KGGFNPKMFRSNSSVSWRSSSM+GGSFSS+RKSNA++NGNG     KKK +EPVLERNRSARHSPTN
Subjt:  DEEEERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTN

Query:  VDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        +DNGLLRFYL  ++GSRRGGSGK KPNQAQSIARSVLRLY
Subjt:  VDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

XP_038903136.1 protein OCTOPUS-like [Benincasa hispida]4.7e-21469.72Show/hide
Query:  MNPST------AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSSA--SSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSF
        MNPST         P P PP     +A CPRHPQEH T FCPLCLCERLS+++SS+SA  SSS+RKP S AASAL+AIF+P+  +  +S  P LRR+ SF
Subjt:  MNPST------AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSSA--SSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSF

Query:  SASKNEPFSSSLEPQRKSCDLRVR------------------------IDPHTKNLDD--------QPG------FEEPPNVLDFVIQNRVQEIVEEEEE
        SASKNE FS+  EPQRKSCD+R+R                        I   +KNL+D        QP           PNV+D VI+N VQEIVEEEEE
Subjt:  SASKNEPFSSSLEPQRKSCDLRVR------------------------IDPHTKNLDD--------QPG------FEEPPNVLDFVIQNRVQEIVEEEEE

Query:  EEEIPLELQ----QQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCD
        E ++ L  +    Q+E K MKDHIDLDSHTKKP   GSFWSAASV SKKLQKWRD KQK KKQR+   + TLPVEKPIGRHFRETQSEIADYGFGRRSCD
Subjt:  EEEIPLELQ----QQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCD

Query:  IDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDR
        IDPRFSLD  RMSFDDPR+SFDEPRASWDGYLISRTFPRMPTMLSVVEDAP+HVFRSD QIPVEDS+NS +EDEN PGGS QTRDYYS     RRKSLDR
Subjt:  IDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDR

Query:  SNSIRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRG
        SNSIRKTAAAVVAEID+MK SVSNAKVSPA T+  H        RDSNSNS+RDDCS S    F+D AS     NRKEESKKS+ WGKGW IWGLINRRG
Subjt:  SNSIRKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRG

Query:  GNKDEEEERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVG-GSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARH
        GNKDEEE+RE  RPNG+ERSYS SWPELRGDRN D KGGFNPKMFRSNSSVSWRSSSM+G G FSS+RKSNA+SNGNG    KKK+E +PVLERNRSARH
Subjt:  GNKDEEEERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVG-GSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARH

Query:  SPTNVDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        SPTNVDNGLLRFYL P++GSRRGGSGK KPNQAQSIARSVLRLY
Subjt:  SPTNVDNGLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

TrEMBL top hitse value%identityAlignment
A0A5A7TLQ7 UPF0503 protein8.2e-21269.97Show/hide
Query:  MNPST--AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESS-----SSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFS
        MNPST     P P PP     +A+CPRHPQEH T FCPLCLCERLSL++SS     SS+SSS+RKP S AASALKAIF+P   +  +S  P LRR+ SFS
Subjt:  MNPST--AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESS-----SSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFS

Query:  ASKNEPFSSSLEPQRKSCDLRVR------------------------IDPHTKNLDDQ-PGFEEP-------------PNVLDFVIQNRVQEIVEEE-EE
        ASKNE FS+  EPQRKSCD+R+R                        I   TKNL+D    + EP             PNV DFVI+N VQEIVEEE + 
Subjt:  ASKNEPFSSSLEPQRKSCDLRVR------------------------IDPHTKNLDDQ-PGFEEP-------------PNVLDFVIQNRVQEIVEEE-EE

Query:  EEEIPLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPR
        E E      Q+E K MKDHIDLDSHTKKP+  GSFWSAASV SKKLQKWRD KQK KKQR+   + TLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPR
Subjt:  EEEIPLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPR

Query:  FSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSI
        FSLD  RMSFDDPR+SFDEPRASWDGYLISRTFPRMPTMLSVVEDAP+HVFRSD QIPVEDS+NS +E+EN PGGS QTR+YYS     RRKSLDRSNSI
Subjt:  FSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSI

Query:  RKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSF-NLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDEEEE
        RKTAAAVVAEID+MK SVSNAKVSPA T+  H        RDSNSNSLRDD S SF +       +NRKEESKKS+ WGKGW IWGLINRRGGNKDEEE+
Subjt:  RKTAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSF-NLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDEEEE

Query:  RE-IRPNGIERSYSGSWPELRGDRNVDAK-GGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNG
        RE  RPNG+ERSYS SWPELRGDRN D K GGFNPKMFRSNSSVSWRS+SM+GGSFSS+RKSNA+SNGNG    KKK+E +PVLERNRSARHSPTNVDNG
Subjt:  RE-IRPNGIERSYSGSWPELRGDRNVDAK-GGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNG

Query:  LLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        LLRFYL P++GSRRGGSGK KP+QAQSIARSVLRLY
Subjt:  LLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

A0A6J1F6S2 UPF0503 protein At3g09070, chloroplastic-like1.1e-21669.7Show/hide
Query:  MNPST---AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASK
        MNPST    A P P PP     +A+CPRHPQE  T FCPLCLCERLSL++SSS ++SSSTRKP S AASALKAIF+P+  +  +S  P LRR+ SFSASK
Subjt:  MNPST---AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSS-SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASK

Query:  NEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEEEEI-
        N+ FS+  EPQRKSCD+RVR                       I P TK L+D                   + PNV+D VI+N VQEIVEEEE   E+ 
Subjt:  NEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDDQPG--------------FEEPPNVLDFVIQNRVQEIVEEEEEEEEI-

Query:  --PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
          P++L Q+E K MKDHIDLDSHTKKP+  GSFWSAASV SKKLQKWRD KQK KKQR+   + TLPVEKPIGRHFR+TQSEIADYG+GRRSCDIDPRFS
Subjt:  --PLELQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS

Query:  LDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSIRK
        LD  RMSFDDPR+SFDEPRASWDGYLISRTFPRMPTMLSVVEDAP++VFR+DAQIPVEDS+NS +E+EN PGGS QTRDYYS     RRKSLDRSNSIRK
Subjt:  LDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSIRK

Query:  TAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNKDEE
        TAAAVVAEIDEMK SVSNAKVSPA T+  H        RDSNS+SL++DCS SF+  FND AS     NRKEESKKSR WGKGW IWGLINRRGGNKDEE
Subjt:  TAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNKDEE

Query:  EERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDN
        E++E  RPNG+ERSYSGSWPELRG+RN D KGGFNPKMFRSNSSVSWRSSSM+GGSFSS+RKSNA++NGNG     +KK +EPVLERNRSARHSPTN+DN
Subjt:  EERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDN

Query:  GLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        GLLRFYL  ++GSRRGGSGK KPNQAQSIARSVLRLY
Subjt:  GLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

A0A6J1FBM4 UPF0503 protein At3g09070, chloroplastic-like8.2e-21268.15Show/hide
Query:  MNPSTAAA-PQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSSASSSTRKPPSAAASALKAIFKPSSASASASN-PPLLRRSHSFSASKNE
        MNPSTAAA P P PP  +A A  CPRHPQEH  PFC LCLCERLSLI+SSSS+SSS+RKPPS AASALK++F+P   +   S+  P LRR+ SFSASKNE
Subjt:  MNPSTAAA-PQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSSASSSTRKPPSAAASALKAIFKPSSASASASN-PPLLRRSHSFSASKNE

Query:  PFSSSLEPQRKSCDLRVR--------------------------IDPHTKNLDDQPG----------------FEEPPNVLDFVIQNRVQEIVEEEEEEE
        PFS+  EP RKSCD+R+R                          +   +KNL+D+P                   E P V+D VI+N VQEIVEEE+ + 
Subjt:  PFSSSLEPQRKSCDLRVR--------------------------IDPHTKNLDDQPG----------------FEEPPNVLDFVIQNRVQEIVEEEEEEE

Query:  EIPLE--LQQQELKPMKDHIDLDSHTKKPT--------GSFWSAASVLSKKLQKWRDSKQKGKKQRSTA---TLPVEKPIGRHFRETQSEIADYGFGRRS
        E   E    Q++ K MKDHIDLDSHTKKP+        GSF SAASV SKKLQKWRD KQKGKKQRS A   TLPVEKPIGRHFRETQSEIADYGFGRRS
Subjt:  EIPLE--LQQQELKPMKDHIDLDSHTKKPT--------GSFWSAASVLSKKLQKWRDSKQKGKKQRSTA---TLPVEKPIGRHFRETQSEIADYGFGRRS

Query:  CDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYY----SGRRKSL
        CDIDPRFSLD  RMSFDDPR+SFDEPRASWDGYLISRTFP+MPTMLSVVEDAP+ VFRSD QIPVEDS++S +E+EN PGG+ QTRDYY    S RRKSL
Subjt:  CDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYY----SGRRKSL

Query:  DRSNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRR
        DRSNSIRK AAAVVAEIDEMKSVSNAKVSPA T+ SH        RDSN+NS+RDDCS +F++GFND AS     NRKEESKKSRRWGKGWSIWGLINRR
Subjt:  DRSNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRR

Query:  GGNKDEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSF-SSARKSNADSNGNGNGNGKKKKEQE-----PVLERN
        GGNKD+EE+   R NG+ERS+SGSWPELRG++++D KGGFNPKMFRSNSSVSWRS+SMVGGSF SS+RKSNAD     NGNGKKKKEQE     PVL RN
Subjt:  GGNKDEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSF-SSARKSNADSNGNGNGNGKKKKEQE-----PVLERN

Query:  RSARHSPTNVDNGLLRFYLAPMKGSRRGGS-GKEKPNQAQSIARSVLRLY
         SARHS TNVDNGLLRFY+ PMK SRRG S GK KPNQA SIARSVLRLY
Subjt:  RSARHSPTNVDNGLLRFYLAPMKGSRRGGS-GKEKPNQAQSIARSVLRLY

A0A6J1I8G1 UPF0503 protein At3g09070, chloroplastic-like2.8e-21268.46Show/hide
Query:  MNPSTAAA-PQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSSASSSTRKPPSAAASALKAIFKPSSASASASN-PPLLRRSHSFSASKNE
        MNPSTAAA P P PP  +A AA CPRHPQEH  PFC LCLCERLSLI+SSSS+SSS+RKPPS AASALK++F+P   +   S+  P LRR+ SFSASKNE
Subjt:  MNPSTAAA-PQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSSASSSTRKPPSAAASALKAIFKPSSASASASN-PPLLRRSHSFSASKNE

Query:  PFSSSLEPQRKSCDLRVR--------------------------IDPHTKNLDDQPG----------------FEEPPNVLDFVIQNRVQEIVEEEEEEE
        PFS+  EP RKSCD+R+R                          I   +KNL+D+P                   E P V+D VI+N VQEIVEEE+ + 
Subjt:  PFSSSLEPQRKSCDLRVR--------------------------IDPHTKNLDDQPG----------------FEEPPNVLDFVIQNRVQEIVEEEEEEE

Query:  EIPLE--LQQQELKPMKDHIDLDSHTKKPT--------GSFWSAASVLSKKLQKWRDSKQKGKKQRSTA---TLPVEKPIGRHFRETQSEIADYGFGRRS
        E   E    Q++ K MKDHIDLDSHTKKP+        GSF SAASV SKKLQKWRD KQKGKKQRS A   TLPVEKPIGRHFRETQSEIADYGFGRRS
Subjt:  EIPLE--LQQQELKPMKDHIDLDSHTKKPT--------GSFWSAASVLSKKLQKWRDSKQKGKKQRSTA---TLPVEKPIGRHFRETQSEIADYGFGRRS

Query:  CDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYY----SGRRKSL
        CDIDPRFSLD  RMSFDDPR+SFDEPRASWDGYLISRTFP+MPTMLSVVEDAP+ VFRSD QIPVEDS++S +E+EN PGG+ QTRDYY    S RRKSL
Subjt:  CDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYY----SGRRKSL

Query:  DRSNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRR
        DRSNSIRK AAAVVAEIDEMKSVSNAKVSPA T+ SH        RDSNSNS+RDDCS +F++GFND AS     NRKEESKKSRRWGKGWSIWGLINRR
Subjt:  DRSNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRR

Query:  GGNKDEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSF-SSARKSNADSNGNGNGNGKKKKEQE-----PVLERN
        GGNKD+EE+   R NG+ERS+SGSWPELRG++++D KGGFNPKMFRSNSSVSWRS+SMVGGSF SS+RKSNAD     NGNGKKKKEQE     PVL RN
Subjt:  GGNKDEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSF-SSARKSNADSNGNGNGNGKKKKEQE-----PVLERN

Query:  RSARHSPTNVDNGLLRFYLAPMKGSRRG-GSGKEKPNQAQSIARSVLRLY
         SARHS TNVDNGLLRFY+ PMK SRRG  +GK KPNQA SIARSVLRLY
Subjt:  RSARHSPTNVDNGLLRFYLAPMKGSRRG-GSGKEKPNQAQSIARSVLRLY

A0A6J1J5H7 UPF0503 protein At3g09070, chloroplastic-like4.3e-21369.39Show/hide
Query:  MNPST---AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSS--SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSAS
        MNPST    A P P PP     +A+CPRHPQE  T FCPLCLCERLSL++SSS  S+SSSTRKP S AASALKAIF+P+  +  +S  P LRR+ SFSAS
Subjt:  MNPST---AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSS--SASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSAS

Query:  KNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDD--------QP------GFEEPPNVLDFVIQNRVQEIVEEEEEEEEI
        KN+ FS+  EPQRKSCD+RVR                       I P TK L+D        QP         E PNV+D VI+N  QEIVEE+E + E+
Subjt:  KNEPFSSSLEPQRKSCDLRVR-----------------------IDPHTKNLDD--------QP------GFEEPPNVLDFVIQNRVQEIVEEEEEEEEI

Query:  -PLELQ-QQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS
         P  +Q Q+E K MKDHIDLDSHTKKP+  GSFWSAASV SKKLQKWRD KQK KKQR+   +  LPVEKPIGRHFR+TQSEIADYG+GRRSCDIDPRFS
Subjt:  -PLELQ-QQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS---TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFS

Query:  LDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSIRK
        LD  RMSFDDPR+SFDEPRASWDGYLISRTFPRMPTMLSVVEDAP++VFR+DAQIPVEDS+NS +E+EN PGGS QTRDYYS     RRKSLDRSNSIRK
Subjt:  LDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG----RRKSLDRSNSIRK

Query:  TAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNKDEE
        TAAAVVAEIDEMK SVSNAKVSPA T+  H        RDSNS+SL++DCS S +  FND AS     NRKEESKKSR WGKGW IWGLINRRGGNKDEE
Subjt:  TAAAVVAEIDEMK-SVSNAKVSPAITESSH--------RDSNSNSLRDDCSNSFNLGFNDNAS-----NRKEESKKSRRWGKGWSIWGLINRRGGNKDEE

Query:  EERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDN
        E++E  RPNG+ERS SGSWPELRG+RN D KGGFNPKMFRSNSSVSWRSSSM+GGSFSS+RKSN ++NGNG     KKK +EPVLER+RSARHSPTN+DN
Subjt:  EERE-IRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDN

Query:  GLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY
        GLLRFYL  ++GSRRGGSGK KPNQAQSIARSVLRLY
Subjt:  GLLRFYLAPMKGSRRGGSGKEKPNQAQSIARSVLRLY

SwissProt top hitse value%identityAlignment
Q9LFB9 Protein OCTOPUS-like7.6e-9043.06Show/hide
Query:  MNPSTAAAP-----QPAPPSVA-AAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASASNP---------
        MN S   AP     + APPS     + SC  HP+E  + FCP CLC+RLS+++ +++   SSS+RKPPS +A +LKA+FKPSS+  + SN          
Subjt:  MNPSTAAAP-----QPAPPSVA-AAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASASNP---------

Query:  PLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRIDPHTKNLDDQPGFEE------PPNVLDFVIQ-NRVQEIVEEEEEEEEIPLEL----------QQQE
        P LRR+ SFSA  NE FS   EPQR+SCD+R+R D     +++    ++        +V + V++     EI E+EE  E+ P E+          +++E
Subjt:  PLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRIDPHTKNLDDQPGFEE------PPNVLDFVIQ-NRVQEIVEEEEEEEEIPLEL----------QQQE

Query:  LKPMKDHIDLDSHTKKPT-----GSFWSAASVLSKKLQKWRDSKQKGKKQRSTATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL-------DI
        LKPMKD++DL S TKKP+     GSF+SAASV SKKLQKW+  KQK KK R+          G      QSEI   G GRRS D DPRFSL       DI
Subjt:  LKPMKDHIDLDSHTKKPT-----GSFWSAASVLSKKLQKWRDSKQKGKKQRSTATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL-------DI

Query:  PRMSFDDPRHSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDEN----TPGGSLQTRDYYSG-----RRKSLDR
         R+S DD R+S DEPRASWDG+LI RT     P  P+MLSVVE+AP++  RSD QIP   S+  I  D +     PGGS QTRDYY+G     RRKSLDR
Subjt:  PRMSFDDPRHSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDEN----TPGGSLQTRDYYSG-----RRKSLDR

Query:  SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSHRDSNSNSLRDDCSNSFNLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDEEEEREIRPN
        SNSIRK    +V E++++KSVSN+        ++  DSNS    ++  N             +   KKSRRWGK WSI G I R+ G  DEEE+R  R N
Subjt:  SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSHRDSNSNSLRDDCSNSFNLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDEEEEREIRPN

Query:  G---IERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFY
            +ERS S SWPE+R     + +GG  PKM RSNS+VSWRSS   GGS                              RN+S+R+S  + +NG+LRFY
Subjt:  G---IERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFY

Query:  LAPMK------------GSRRGGSGKEKP-----NQAQSIARSVLRLY
        L PM+            G   GG G EK      +   SIAR V+RLY
Subjt:  LAPMK------------GSRRGGSGKEKP-----NQAQSIARSVLRLY

Q9SS80 Protein OCTOPUS7.1e-11244.96Show/hide
Query:  MNPST--------AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASAS---------
        MNP+T        A AP P PP     + SC RHP+E  T FCP CLCERLS+++ +++  +SSS++KPP+ +A+ALKA+FKPS  +             
Subjt:  MNPST--------AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASAS---------

Query:  NP---PLLRRSHSFSASK-NEPFSSSLEPQRKSCDLRVR-------------------------IDPHTKNL----------------DDQPGFEEPPNV
         P   P LRR+ SFSASK NE FS   EPQR+SCD+R+R                         ++P   ++                D++   EE  + 
Subjt:  NP---PLLRRSHSFSASK-NEPFSSSLEPQRKSCDLRVR-------------------------IDPHTKNL----------------DDQPGFEEPPNV

Query:  L---DFVIQN--------RVQEIVEEEEEEEEI--PLE-LQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS-------
        +   DF I N        +  EIVE  EE EE   P + L ++ELKP+KD+IDLDS TKKP+   SFWSAASV SKKLQKWR + QK KK+R+       
Subjt:  L---DFVIQN--------RVQEIVEEEEEEEEI--PLE-LQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS-------

Query:  TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL--------------DIPRMSFDDPRHSFDEPRASWDGYLISRTF-------PRMPTMLSVV
        +A LPVEKPIGR  R+TQSEIADYG+GRRSCD DPRFSL              DI R+S DDPR+SFDEPRASWDG LI RT        P  P+MLSVV
Subjt:  TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL--------------DIPRMSFDDPRHSFDEPRASWDGYLISRTF-------PRMPTMLSVV

Query:  EDAPV----HVFRSDAQIPVED---------SVNSIHEDENTPGGSLQTRDYY----SGRRKSLDR-SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITES
        EDAP     HV R+D Q PVE+           N + +    PGGS+QTRDYY    S RRKSLDR S+S+RKTAAAVVA++DE K   ++ +S      
Subjt:  EDAPV----HVFRSDAQIPVED---------SVNSIHEDENTPGGSLQTRDYY----SGRRKSLDR-SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITES

Query:  SHRDSNSNSLRDDCSNSFN---LGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNK-----DEEEEREIRPNG--IERSYSGSWPELRGDRNVDAKG
        S RD+N+ ++    + SF    +   D   N  + +KKSRRWGK WSI GLI R+  NK     +EEE+R  R NG  +ERS S SWPELR        G
Subjt:  SHRDSNSNSLRDDCSNSFN---LGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNK-----DEEEEREIRPNG--IERSYSGSWPELRGDRNVDAKG

Query:  GFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFYLAPMKGSRR---------GGSGKEKP
        G  P+M RSNS+VSWRSS   GG   SARK N                   +  RN+S+R+SP N +NG+L+FYL  MK SRR         GG G    
Subjt:  GFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFYLAPMKGSRR---------GGSGKEKP

Query:  NQAQSIARSVLRLY
        +   SIARSV+RLY
Subjt:  NQAQSIARSVLRLY

Arabidopsis top hitse value%identityAlignment
AT2G38070.1 Protein of unknown function (DUF740)9.2e-10745.12Show/hide
Query:  STAAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIE----SSSSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASKNEP
        + A AP P PP     + SC RHP E  T FCP CL +RLS+++    ++++ +SS++KPPS++A ALKAIFKPSS+S S    P LRR+ SFSASK E 
Subjt:  STAAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIE----SSSSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASKNEP

Query:  FS-SSLEPQRKSCDLRVR---------------------------ID-PHTKNLDDQPGFEEPPNV---------LDFVI----QNRVQEIVEEEEEEEE
        FS  + EPQR+SCD+RVR                           ID     ++   P FEE   +         + F      ++ + EIVEEEEEEE 
Subjt:  FS-SSLEPQRKSCDLRVR---------------------------ID-PHTKNLDDQPGFEEPPNV---------LDFVI----QNRVQEIVEEEEEEEE

Query:  IPLELQQQELKPMKDHIDLDSHTKKPTGSFWSAASVLSKKLQKWRDSKQKGKKQRS------TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDP---
          +E    E  P       +   K+  GSFWSAASV SKKLQKWR  KQK KK R+      ++ LPVEK IGR  R+TQSEIA+YG+GRRSCD DP   
Subjt:  IPLELQQQELKPMKDHIDLDSHTKKPTGSFWSAASVLSKKLQKWRDSKQKGKKQRS------TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDP---

Query:  ----RFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFP--RMPTMLSVVEDAPV--HVFRSDAQIPVEDS--VNSIHEDENTPGGSLQTRDYY-----S
            RFSLD  R+S DDPR+SF+EPRASWDGYLI R     RMP+MLSVVED+PV  HV RSD  IPVE S  V+    DE  PGGS QTR+YY     S
Subjt:  ----RFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFP--RMPTMLSVVEDAPV--HVFRSDAQIPVEDS--VNSIHEDENTPGGSLQTRDYY-----S

Query:  GRRKSLDRSNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSHRDSNSNSLRDDC---SNSFNLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNK
         RRKSLDRS+S RK +A+V+AEIDE+K   + +    +       S+SNSLRDDC    N++ +G  +N    +   K++++    W+I+GL++R+ GNK
Subjt:  GRRKSLDRSNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSHRDSNSNSLRDDC---SNSFNLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNK

Query:  DEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNV
         EEEER    +G++R++SGSW       NV+ + GF+PKM RSNSSVSWRSS   GG     ++++ D    G  +GKKK                 +  
Subjt:  DEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNV

Query:  DNGLLRFYLAPMKGSRRGGSGKEKP
        +NG+L+FYL P KG RRG      P
Subjt:  DNGLLRFYLAPMKGSRRGGSGKEKP

AT3G09070.1 Protein of unknown function (DUF740)5.0e-11344.96Show/hide
Query:  MNPST--------AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASAS---------
        MNP+T        A AP P PP     + SC RHP+E  T FCP CLCERLS+++ +++  +SSS++KPP+ +A+ALKA+FKPS  +             
Subjt:  MNPST--------AAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASAS---------

Query:  NP---PLLRRSHSFSASK-NEPFSSSLEPQRKSCDLRVR-------------------------IDPHTKNL----------------DDQPGFEEPPNV
         P   P LRR+ SFSASK NE FS   EPQR+SCD+R+R                         ++P   ++                D++   EE  + 
Subjt:  NP---PLLRRSHSFSASK-NEPFSSSLEPQRKSCDLRVR-------------------------IDPHTKNL----------------DDQPGFEEPPNV

Query:  L---DFVIQN--------RVQEIVEEEEEEEEI--PLE-LQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS-------
        +   DF I N        +  EIVE  EE EE   P + L ++ELKP+KD+IDLDS TKKP+   SFWSAASV SKKLQKWR + QK KK+R+       
Subjt:  L---DFVIQN--------RVQEIVEEEEEEEEI--PLE-LQQQELKPMKDHIDLDSHTKKPT--GSFWSAASVLSKKLQKWRDSKQKGKKQRS-------

Query:  TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL--------------DIPRMSFDDPRHSFDEPRASWDGYLISRTF-------PRMPTMLSVV
        +A LPVEKPIGR  R+TQSEIADYG+GRRSCD DPRFSL              DI R+S DDPR+SFDEPRASWDG LI RT        P  P+MLSVV
Subjt:  TATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL--------------DIPRMSFDDPRHSFDEPRASWDGYLISRTF-------PRMPTMLSVV

Query:  EDAPV----HVFRSDAQIPVED---------SVNSIHEDENTPGGSLQTRDYY----SGRRKSLDR-SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITES
        EDAP     HV R+D Q PVE+           N + +    PGGS+QTRDYY    S RRKSLDR S+S+RKTAAAVVA++DE K   ++ +S      
Subjt:  EDAPV----HVFRSDAQIPVED---------SVNSIHEDENTPGGSLQTRDYY----SGRRKSLDR-SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITES

Query:  SHRDSNSNSLRDDCSNSFN---LGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNK-----DEEEEREIRPNG--IERSYSGSWPELRGDRNVDAKG
        S RD+N+ ++    + SF    +   D   N  + +KKSRRWGK WSI GLI R+  NK     +EEE+R  R NG  +ERS S SWPELR        G
Subjt:  SHRDSNSNSLRDDCSNSFN---LGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNK-----DEEEEREIRPNG--IERSYSGSWPELRGDRNVDAKG

Query:  GFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFYLAPMKGSRR---------GGSGKEKP
        G  P+M RSNS+VSWRSS   GG   SARK N                   +  RN+S+R+SP N +NG+L+FYL  MK SRR         GG G    
Subjt:  GFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFYLAPMKGSRR---------GGSGKEKP

Query:  NQAQSIARSVLRLY
        +   SIARSV+RLY
Subjt:  NQAQSIARSVLRLY

AT3G46990.1 Protein of unknown function (DUF740)8.2e-3930.73Show/hide
Query:  AAASCPRHPQEHLTP-FCPLCLCERLSLIESSSSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRI
        +++SC RHP    T  FC  CL ERL  IE+ SS                         S +A   P LRR  S+S  +N   S S +P+R+SCD+R   
Subjt:  AAASCPRHPQEHLTP-FCPLCLCERLSLIESSSSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRI

Query:  DPHTKNLDDQPGFEEPPNVLDFVIQNRVQEIVEEEEEEEE----------------IPLELQQQELKPMKDHIDLD--SHTKKPTG-SFWSAASVLSKKL
             +L D    ++   V   + +  V ++ EEEEEEEE                 P ++  +E K MK+ IDLD  +  KK  G      ASVLS++L
Subjt:  DPHTKNLDDQPGFEEPPNVLDFVIQNRVQEIVEEEEEEEE----------------IPLELQQQELKPMKDHIDLD--SHTKKPTG-SFWSAASVLSKKL

Query:  QKWRDSKQKGKKQRSTATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVH
        + +  +K+  +K                   + S  A    GR S D+D       PR+SFD  R SF++PR+SWDG LI +++ ++ T+ +V EDA   
Subjt:  QKWRDSKQKGKKQRSTATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVH

Query:  VFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG--RRKSLDRSNSIRKTAAAVVAEIDEMKSVSNAKVSP-----------AITESSHRDSNSNSLR
             A+  VE+    + E E +PGG++QT++YYS   RR+S DRS SI++     + E+DE++ +SNAKVSP            +TE   RDSN  S++
Subjt:  VFRSDAQIPVEDSVNSIHEDENTPGGSLQTRDYYSG--RRKSLDRSNSIRKTAAAVVAEIDEMKSVSNAKVSP-----------AITESSHRDSNSNSLR

Query:  DDCSNSFNL----------GFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDE---EEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFR
        +    S  L          G      +  E  K  ++W KGW+IWGLI R+   K+E   E+  ++  N +E S + S  +LR     +   G + K+ +
Subjt:  DDCSNSFNL----------GFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDE---EEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFR

Query:  S------NSSVSWRSSSMVGGSFSSARKS------------NADSNG-NGNGNGKKKKEQEPVLERNRS-ARHSPTNVDNGLLRFYLAPMKGSRRGGSGK
        S       S    RS + +   F   R S             A  N  +G  NG + K+   +L+RN +    S  N++  + RFYL+P+K  +   SGK
Subjt:  S------NSSVSWRSSSMVGGSFSSARKS------------NADSNG-NGNGNGKKKKEQEPVLERNRS-ARHSPTNVDNGLLRFYLAPMKGSRRGGSGK

Query:  EK
         +
Subjt:  EK

AT5G01170.1 Protein of unknown function (DUF740)5.4e-9143.06Show/hide
Query:  MNPSTAAAP-----QPAPPSVA-AAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASASNP---------
        MN S   AP     + APPS     + SC  HP+E  + FCP CLC+RLS+++ +++   SSS+RKPPS +A +LKA+FKPSS+  + SN          
Subjt:  MNPSTAAAP-----QPAPPSVA-AAAASCPRHPQEHLTPFCPLCLCERLSLIESSSS--ASSSTRKPPSAAASALKAIFKPSSASASASNP---------

Query:  PLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRIDPHTKNLDDQPGFEE------PPNVLDFVIQ-NRVQEIVEEEEEEEEIPLEL----------QQQE
        P LRR+ SFSA  NE FS   EPQR+SCD+R+R D     +++    ++        +V + V++     EI E+EE  E+ P E+          +++E
Subjt:  PLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRIDPHTKNLDDQPGFEE------PPNVLDFVIQ-NRVQEIVEEEEEEEEIPLEL----------QQQE

Query:  LKPMKDHIDLDSHTKKPT-----GSFWSAASVLSKKLQKWRDSKQKGKKQRSTATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL-------DI
        LKPMKD++DL S TKKP+     GSF+SAASV SKKLQKW+  KQK KK R+          G      QSEI   G GRRS D DPRFSL       DI
Subjt:  LKPMKDHIDLDSHTKKPT-----GSFWSAASVLSKKLQKWRDSKQKGKKQRSTATLPVEKPIGRHFRETQSEIADYGFGRRSCDIDPRFSL-------DI

Query:  PRMSFDDPRHSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDEN----TPGGSLQTRDYYSG-----RRKSLDR
         R+S DD R+S DEPRASWDG+LI RT     P  P+MLSVVE+AP++  RSD QIP   S+  I  D +     PGGS QTRDYY+G     RRKSLDR
Subjt:  PRMSFDDPRHSFDEPRASWDGYLISRT----FPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDEN----TPGGSLQTRDYYSG-----RRKSLDR

Query:  SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSHRDSNSNSLRDDCSNSFNLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDEEEEREIRPN
        SNSIRK    +V E++++KSVSN+        ++  DSNS    ++  N             +   KKSRRWGK WSI G I R+ G  DEEE+R  R N
Subjt:  SNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSHRDSNSNSLRDDCSNSFNLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDEEEEREIRPN

Query:  G---IERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFY
            +ERS S SWPE+R     + +GG  PKM RSNS+VSWRSS   GGS                              RN+S+R+S  + +NG+LRFY
Subjt:  G---IERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFY

Query:  LAPMK------------GSRRGGSGKEKP-----NQAQSIARSVLRLY
        L PM+            G   GG G EK      +   SIAR V+RLY
Subjt:  LAPMK------------GSRRGGSGKEKP-----NQAQSIARSVLRLY

AT5G58930.1 Protein of unknown function (DUF740)9.7e-4031.2Show/hide
Query:  AASCPRHP-QEHLTPFCPLCLCERLSLIES-SSSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRI
        +A C RHP  +  T FC  CL ERLS IE+ SSS S+ST                             LRR  S+S  ++   S   +P+R+SCD+R   
Subjt:  AASCPRHP-QEHLTPFCPLCLCERLSLIES-SSSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASKNEPFSSSLEPQRKSCDLRVRI

Query:  DPHTKNLDDQPGFEEPPNVLDFVIQNRVQEIVEEEEEEEE-----IPLELQQQELKPMKDHIDLDSHTKKPTGSFWSAASVLSKKLQKWRDSKQKGKKQR
             N DD    E   + + F I   V +++E+EEEE++     +  E++  E K MK+ IDL+S  ++   +     SV S+ L+K+           
Subjt:  DPHTKNLDDQPGFEEPPNVLDFVIQNRVQEIVEEEEEEEE-----IPLELQQQELKPMKDHIDLDSHTKKPTGSFWSAASVLSKKLQKWRDSKQKGKKQR

Query:  STATLPVEKPIGRHFRETQSEIADYG--FGRRSCDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVED
                    +H R    +I D G   GRRSCD+DPR SLD  R+       SFDEPRASWDG LI +T+P++  + SV ED            P + 
Subjt:  STATLPVEKPIGRHFRETQSEIADYG--FGRRSCDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVED

Query:  SVNSIHEDE-NTPGGSLQTRDYY--SGRRKSLDRSNSIRKTAAAVVAEIDEMKSVSNAKVSP-----------AITESSHRDSNSNSLRDDCSNSFNLGF
        +   + EDE N PGG+ QTRDYY  S RR+S DRS      +   + E+DE+K++SNAKVSP            +TE   RDSN  S+++    S  LG 
Subjt:  SVNSIHEDE-NTPGGSLQTRDYY--SGRRKSLDRSNSIRKTAAAVVAEIDEMKSVSNAKVSP-----------AITESSHRDSNSNSLRDDCSNSFNLGF

Query:  NDNASNRKEESKK---------SRRWGKGWSIWGLINRR----GGNKDEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSS-
                 E KK          + W KGW+ WGLI R+          E+  ++  N +E S + S  +LR     +  G  + K+ RS  SVS R S 
Subjt:  NDNASNRKEESKK---------SRRWGKGWSIWGLINRR----GGNKDEEEEREIRPNGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSS-

Query:  ---------------------SMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFYLAPMKGSRRGGSGKEK
                              +  GS +           +G  +G + K    +   ++   +SP N+ NG++RFYL P+       SGK +
Subjt:  ---------------------SMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFYLAPMKGSRRGGSGKEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCCTCCACCGCCGCCGCACCACAACCAGCTCCTCCCTCCGTCGCCGCCGCCGCCGCCTCTTGCCCCCGCCACCCACAAGAGCACTTAACCCCCTTCTGCCCCTT
ATGCCTCTGTGAGCGCCTCTCCCTCATCGAATCCTCATCCTCCGCTTCTTCTTCAACCCGGAAACCCCCTTCCGCCGCCGCCTCTGCTCTTAAAGCCATTTTCAAACCTT
CCTCTGCTTCTGCTTCCGCTTCCAATCCTCCTCTGCTTCGTCGCTCTCACTCCTTCTCTGCTTCCAAGAACGAACCCTTCTCTTCCTCTCTCGAACCCCAGAGGAAATCT
TGCGACCTTCGCGTTCGAATCGATCCTCACACCAAGAATTTGGACGACCAGCCCGGTTTTGAAGAACCCCCCAATGTGCTGGATTTTGTGATTCAGAACAGAGTTCAGGA
GATTGTTGAAGAAGAGGAAGAAGAAGAAGAAATTCCGCTTGAATTGCAGCAACAAGAGTTGAAACCCATGAAGGATCATATAGATCTGGATTCCCACACCAAGAAACCCA
CCGGGAGTTTCTGGTCTGCGGCGTCGGTCTTGAGCAAGAAGCTCCAGAAATGGAGAGATAGTAAACAGAAAGGGAAGAAACAGAGATCTACCGCAACATTGCCGGTGGAG
AAGCCAATCGGCCGCCATTTCAGAGAAACCCAGTCAGAGATTGCGGATTACGGATTCGGCCGACGGTCCTGCGACATTGATCCCAGATTCTCCCTCGACATCCCCCGAAT
GTCCTTCGACGATCCCCGCCATTCCTTCGACGAACCCAGAGCTTCTTGGGATGGCTATTTGATTAGCAGAACCTTCCCGAGAATGCCTACGATGCTTTCTGTCGTTGAAG
ATGCTCCCGTTCATGTCTTCCGTTCTGATGCTCAAATTCCTGTGGAAGACTCCGTGAATTCGATCCACGAAGACGAGAATACCCCCGGCGGGTCGTTGCAGACCCGGGAT
TACTATTCGGGTCGGCGGAAGAGCCTCGACCGGTCCAACTCCATCAGGAAGACTGCAGCAGCGGTGGTGGCGGAGATTGATGAGATGAAATCTGTTTCCAATGCTAAAGT
TTCTCCTGCAATTACAGAAAGCAGCCATAGAGATTCAAACTCCAATTCACTTCGAGATGACTGCTCCAACTCCTTCAATTTGGGATTCAACGACAATGCAAGTAATCGAA
AAGAAGAATCGAAAAAGTCCCGCCGGTGGGGGAAGGGTTGGAGCATTTGGGGATTGATTAACCGGCGGGGAGGAAACAAAGATGAGGAGGAAGAGAGAGAGATTCGACCC
AATGGCATCGAGCGCTCGTATTCAGGGTCGTGGCCCGAGCTACGAGGCGATCGGAATGTCGATGCCAAAGGAGGGTTCAATCCCAAAATGTTTAGGAGTAACAGCAGTGT
GAGTTGGAGGAGTTCAAGTATGGTGGGTGGATCTTTCAGTAGTGCAAGGAAAAGCAATGCAGATTCTAATGGCAATGGCAATGGCAATGGGAAGAAGAAGAAGGAGCAGG
AGCCAGTGTTGGAGAGGAATAGGAGTGCTCGACATTCTCCGACGAACGTCGACAATGGACTTCTTCGATTCTACTTGGCGCCGATGAAAGGCAGCCGGAGAGGCGGGTCG
GGGAAGGAGAAGCCTAATCAAGCACAGTCCATTGCTAGAAGTGTTCTTCGACTGTATTGA
mRNA sequenceShow/hide mRNA sequence
AATTCACTCTCCAAATTCTCTATCATGAACTTCTTCTTCTTCTTCTCTCCCATTCTTTCTTAGAATTCTTCTTTTTTGTGTTATGAATCCCTCCACCGCCGCCGCACCAC
AACCAGCTCCTCCCTCCGTCGCCGCCGCCGCCGCCTCTTGCCCCCGCCACCCACAAGAGCACTTAACCCCCTTCTGCCCCTTATGCCTCTGTGAGCGCCTCTCCCTCATC
GAATCCTCATCCTCCGCTTCTTCTTCAACCCGGAAACCCCCTTCCGCCGCCGCCTCTGCTCTTAAAGCCATTTTCAAACCTTCCTCTGCTTCTGCTTCCGCTTCCAATCC
TCCTCTGCTTCGTCGCTCTCACTCCTTCTCTGCTTCCAAGAACGAACCCTTCTCTTCCTCTCTCGAACCCCAGAGGAAATCTTGCGACCTTCGCGTTCGAATCGATCCTC
ACACCAAGAATTTGGACGACCAGCCCGGTTTTGAAGAACCCCCCAATGTGCTGGATTTTGTGATTCAGAACAGAGTTCAGGAGATTGTTGAAGAAGAGGAAGAAGAAGAA
GAAATTCCGCTTGAATTGCAGCAACAAGAGTTGAAACCCATGAAGGATCATATAGATCTGGATTCCCACACCAAGAAACCCACCGGGAGTTTCTGGTCTGCGGCGTCGGT
CTTGAGCAAGAAGCTCCAGAAATGGAGAGATAGTAAACAGAAAGGGAAGAAACAGAGATCTACCGCAACATTGCCGGTGGAGAAGCCAATCGGCCGCCATTTCAGAGAAA
CCCAGTCAGAGATTGCGGATTACGGATTCGGCCGACGGTCCTGCGACATTGATCCCAGATTCTCCCTCGACATCCCCCGAATGTCCTTCGACGATCCCCGCCATTCCTTC
GACGAACCCAGAGCTTCTTGGGATGGCTATTTGATTAGCAGAACCTTCCCGAGAATGCCTACGATGCTTTCTGTCGTTGAAGATGCTCCCGTTCATGTCTTCCGTTCTGA
TGCTCAAATTCCTGTGGAAGACTCCGTGAATTCGATCCACGAAGACGAGAATACCCCCGGCGGGTCGTTGCAGACCCGGGATTACTATTCGGGTCGGCGGAAGAGCCTCG
ACCGGTCCAACTCCATCAGGAAGACTGCAGCAGCGGTGGTGGCGGAGATTGATGAGATGAAATCTGTTTCCAATGCTAAAGTTTCTCCTGCAATTACAGAAAGCAGCCAT
AGAGATTCAAACTCCAATTCACTTCGAGATGACTGCTCCAACTCCTTCAATTTGGGATTCAACGACAATGCAAGTAATCGAAAAGAAGAATCGAAAAAGTCCCGCCGGTG
GGGGAAGGGTTGGAGCATTTGGGGATTGATTAACCGGCGGGGAGGAAACAAAGATGAGGAGGAAGAGAGAGAGATTCGACCCAATGGCATCGAGCGCTCGTATTCAGGGT
CGTGGCCCGAGCTACGAGGCGATCGGAATGTCGATGCCAAAGGAGGGTTCAATCCCAAAATGTTTAGGAGTAACAGCAGTGTGAGTTGGAGGAGTTCAAGTATGGTGGGT
GGATCTTTCAGTAGTGCAAGGAAAAGCAATGCAGATTCTAATGGCAATGGCAATGGCAATGGGAAGAAGAAGAAGGAGCAGGAGCCAGTGTTGGAGAGGAATAGGAGTGC
TCGACATTCTCCGACGAACGTCGACAATGGACTTCTTCGATTCTACTTGGCGCCGATGAAAGGCAGCCGGAGAGGCGGGTCGGGGAAGGAGAAGCCTAATCAAGCACAGT
CCATTGCTAGAAGTGTTCTTCGACTGTATTGAACTGAAATTGGTTGGGGAAAGATTCAATTTAAGACTATGTTTTCTAGTTCAAAGTTGTTGTATTTTATTTTGTTCAAG
CATTATATTTACTGGGCTTATTGGAAATGGATGAGAAGAGGAAGCAGTTTCAAAATTTTTCACTATTTTTCCATTTTAGAATGGAGAATGTTTTGATAGTCA
Protein sequenceShow/hide protein sequence
MNPSTAAAPQPAPPSVAAAAASCPRHPQEHLTPFCPLCLCERLSLIESSSSASSSTRKPPSAAASALKAIFKPSSASASASNPPLLRRSHSFSASKNEPFSSSLEPQRKS
CDLRVRIDPHTKNLDDQPGFEEPPNVLDFVIQNRVQEIVEEEEEEEEIPLELQQQELKPMKDHIDLDSHTKKPTGSFWSAASVLSKKLQKWRDSKQKGKKQRSTATLPVE
KPIGRHFRETQSEIADYGFGRRSCDIDPRFSLDIPRMSFDDPRHSFDEPRASWDGYLISRTFPRMPTMLSVVEDAPVHVFRSDAQIPVEDSVNSIHEDENTPGGSLQTRD
YYSGRRKSLDRSNSIRKTAAAVVAEIDEMKSVSNAKVSPAITESSHRDSNSNSLRDDCSNSFNLGFNDNASNRKEESKKSRRWGKGWSIWGLINRRGGNKDEEEEREIRP
NGIERSYSGSWPELRGDRNVDAKGGFNPKMFRSNSSVSWRSSSMVGGSFSSARKSNADSNGNGNGNGKKKKEQEPVLERNRSARHSPTNVDNGLLRFYLAPMKGSRRGGS
GKEKPNQAQSIARSVLRLY