CuGenDBv2

Sed0014233 (gene) of Chayote v1 genome

Gene ID: Sed0014233
Organism: Sechium edule (Chayote v1)
Description: UPF0503 protein At3g09070, chloroplastic-like
Genome location: LG04:24580658..24582392
RNA-Seq Expression: Sed0014233
Synteny: Sed0014233
Gene Ontology terms: GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR008004 - Protein OCTOPUS-like
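The genome location above follows a `SEQID:START..END` pattern. A minimal Python sketch of how such a string could be split and the gene span computed is shown here. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG04:24580658..24582392")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span works out to 1735 bp.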


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7027911.1 UPF0503 protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] | e-value: 1.5e-150 | identity: 70.87%
Query:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-
        MKDHIDLDS T K        I+GSF S       KLQKWRDKQKGKKQRSGAGST LPV+KP+GRHF ET SE ADYGFGRRSCDIDPRFSLD  GRM 
Subjt:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-

Query:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA
              SFDEPRASWDG LISRTFP+M TML+VVEDAPI+VFRSD+QIPV         EEN+PGG+SQTRDYY D+SSRRRKS+DRSNS+RK    VVA
Subjt:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA

Query:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN
        E+DEMKS     V+PAT +IS  PKL+IP  DSNSNS+RDDCS +F++GFN TAS IAT N+KEESKKS RWGKGWSIWGLINR GGNKD EE   +R N
Subjt:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN

Query:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKE-----QEQPSLERNRSARNSSTNVDNGL
        GVERSFSGSWPEL G+++++VKGGFNPK  RSN S SWRS+SM+ GSF SSSRKSNAD NGN  GKKKK +E     Q+QP L RN SAR+S TNVDNGL
Subjt:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKE-----QEQPSLERNRSARNSSTNVDNGL

Query:  LRFYFTPMRGSRSGGS-GKVKPNQALSIARSVLRLY
        LRFY TPM+ SR G S GKVKPNQA SIARSVLRLY
Subjt:  LRFYFTPMRGSRSGGS-GKVKPNQALSIARSVLRLY

XP_022934120.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata] | e-value: 1.8e-151 | identity: 70.59%
Query:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------
        MKDHIDLDS T K S  GSFWS       KLQKWRDKQK KKQR+G GST LPV+KP+GRHF +T SE ADYG+GRRSCDIDPRFSLD  GRM       
Subjt:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------

Query:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK
        SFDEPRASWDG LISRTFPRM TML+VVEDAPINVFR+D+QIPV         EEN+PGGSSQTRDYY D+SSRRRKS+DRSNS+RKT   VVAE+DEMK
Subjt:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK

Query:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER
        S      V+PAT ++   PKLAIP  DSNS+SL++DCS SF+  FN  AS + T N+KEESKKS  WGKGW IWGLINR GGNKD EE KE +RPNG+ER
Subjt:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER

Query:  SFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRG
        S+SGSWPEL G+ N +VKGGFNPK  RSN S SWRSSSM+ GSFSSSRKSNA++NGN  G+KK     E+P LERNRSAR+S TN+DNGLLRFY T +RG
Subjt:  SFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRG

Query:  SRSGGSGKVKPNQALSIARSVLRLY
        SR GGSGKVKPNQA SIARSVLRLY
Subjt:  SRSGGSGKVKPNQALSIARSVLRLY

XP_022937891.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata] | e-value: 5.1e-151 | identity: 70.53%
Query:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-
        MKDHIDLDS T K        I+GSF S       KLQKWRDKQKGKKQRSGAGST LPV+KP+GRHF ET SE ADYGFGRRSCDIDPRFSLD  GRM 
Subjt:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-

Query:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA
              SFDEPRASWDG LISRTFP+M TML+VVEDAPI+VFRSD+QIPV         EEN+PGG+SQTRDYY D+SSRRRKS+DRSNS+RK    VVA
Subjt:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA

Query:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN
        E+DEMKS     V+PAT +IS  PKL+IP  DSN+NS+RDDCS +F++GFN TAS IAT N+KEESKKS RWGKGWSIWGLINR GGNKD EE   +R N
Subjt:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN

Query:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF
        GVERSFSGSWPEL G+++++VKGGFNPK  RSN S SWRS+SM+ GSF SSSRKSNAD NGN   KK++ ++Q+QP L RN SAR+S TNVDNGLLRFY 
Subjt:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF

Query:  TPMRGSRSGGS-GKVKPNQALSIARSVLRLY
        TPM+ SR G S GKVKPNQA SIARSVLRLY
Subjt:  TPMRGSRSGGS-GKVKPNQALSIARSVLRLY

XP_022971404.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita maxima] | e-value: 5.1e-151 | identity: 70.53%
Query:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-
        MKDHIDLDS T K        I+GSF S       KLQKWRDKQKGKKQRSGAGST LPV+KP+GRHF ET SE ADYGFGRRSCDIDPRFSLD  GRM 
Subjt:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-

Query:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA
              SFDEPRASWDG LISRTFP+M TML+VVEDAPI+VFRSD+QIPV         EEN+PGG+SQTRDYY D+SSRRRKS+DRSNS+RK    VVA
Subjt:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA

Query:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN
        E+DEMKS     V+PAT +IS  PKL+IP  DSNSNS+RDDCS +F++GFN TAS IAT N+KEESKKS RWGKGWSIWGLINR GGNKD EE   +R N
Subjt:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN

Query:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF
        GVERSFSGSWPEL G+++++VKGGFNPK  RSN S SWRS+SM+ GSF SSSRKSNAD NGN   KK++ ++Q+QP L RN SAR+S TNVDNGLLRFY 
Subjt:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF

Query:  TPMRGSRSG-GSGKVKPNQALSIARSVLRLY
        TPM+ SR G  +GKVKPNQA SIARSVLRLY
Subjt:  TPMRGSRSG-GSGKVKPNQALSIARSVLRLY

XP_023526761.1 UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo] | e-value: 1.8e-151 | identity: 70.52%
Query:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI------PGRMS
        MKDHIDLDS T K S  GSFWS       KLQKWRDKQK KKQR+G GST LPV+KP+GRHF +T SE ADYG+GRRSCDIDPRFSLD         R S
Subjt:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI------PGRMS

Query:  FDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMKS
        FDEPRASWDG LISRTFPRM TML+VVEDAPINVFR+D+QIPV         EEN+PGGSSQTRDYY D+SSRRRKS+DRSNS+RKT   VVAE+DEMKS
Subjt:  FDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMKS

Query:  ------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVERS
              V+PAT ++   PKLAIP  DSNS+SL+DDCS S +  FN  AS + T N+KEESKKS  WGKGW IWGLINR GGNKD EE KE +RPNG+ERS
Subjt:  ------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVERS

Query:  FSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGS
        +SGSWPEL G+ N +VKGGFNPK  RSN S SWRSSSM+ GSFSSSRKSNA++NGN  GKKK     E+P LERNRSAR+S TN+DNGLLRFY T +RGS
Subjt:  FSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGS

Query:  RSGGSGKVKPNQALSIARSVLRLY
        R GGSGKVKPNQA SIARSVLRLY
Subjt:  RSGGSGKVKPNQALSIARSVLRLY

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7TLQ7 UPF0503 protein | e-value: 2.0e-148 | identity: 71.13%
Query:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------
        MKDHIDLDS T K S  GSFWS       KLQKWRDKQK KKQR+G GST LPV+KP+GRHF ET SE ADYGFGRRSCDIDPRFSLD  GRM       
Subjt:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------

Query:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK
        SFDEPRASWDG LISRTFPRM TML+VVEDAPI+VFRSD+QIPV         EEN+PGGSSQTR+YY D+SSRRRKS+DRSNS+RKT   VVAE+D+MK
Subjt:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK

Query:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER
        S      V+PAT ++   PKL +P  DSNSNSLRDD S SFE      AS + T N+KEESKKS  WGKGW IWGLINR GGNKD EE +E +RPNGVER
Subjt:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER

Query:  SFSGSWPELEGDENINVK-GGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMR
        S+S SWPEL GD N +VK GGFNPK  RSN S SWRS+SM+ GSFSSSRKSNA+SNGN  GKKK  KE+ QP LERNRSAR+S TNVDNGLLRFY TP+R
Subjt:  SFSGSWPELEGDENINVK-GGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMR

Query:  GSRSGGSGKVKPNQALSIARSVLRLY
        GSR GGSGKVKP+QA SIARSVLRLY
Subjt:  GSRSGGSGKVKPNQALSIARSVLRLY

A0A6J1F6S2 UPF0503 protein At3g09070, chloroplastic-like | e-value: 8.5e-152 | identity: 70.59%
Query:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------
        MKDHIDLDS T K S  GSFWS       KLQKWRDKQK KKQR+G GST LPV+KP+GRHF +T SE ADYG+GRRSCDIDPRFSLD  GRM       
Subjt:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------

Query:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK
        SFDEPRASWDG LISRTFPRM TML+VVEDAPINVFR+D+QIPV         EEN+PGGSSQTRDYY D+SSRRRKS+DRSNS+RKT   VVAE+DEMK
Subjt:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK

Query:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER
        S      V+PAT ++   PKLAIP  DSNS+SL++DCS SF+  FN  AS + T N+KEESKKS  WGKGW IWGLINR GGNKD EE KE +RPNG+ER
Subjt:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER

Query:  SFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRG
        S+SGSWPEL G+ N +VKGGFNPK  RSN S SWRSSSM+ GSFSSSRKSNA++NGN  G+KK     E+P LERNRSAR+S TN+DNGLLRFY T +RG
Subjt:  SFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRG

Query:  SRSGGSGKVKPNQALSIARSVLRLY
        SR GGSGKVKPNQA SIARSVLRLY
Subjt:  SRSGGSGKVKPNQALSIARSVLRLY

A0A6J1FBM4 UPF0503 protein At3g09070, chloroplastic-like | e-value: 2.5e-151 | identity: 70.53%
Query:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-
        MKDHIDLDS T K        I+GSF S       KLQKWRDKQKGKKQRSGAGST LPV+KP+GRHF ET SE ADYGFGRRSCDIDPRFSLD  GRM 
Subjt:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-

Query:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA
              SFDEPRASWDG LISRTFP+M TML+VVEDAPI+VFRSD+QIPV         EEN+PGG+SQTRDYY D+SSRRRKS+DRSNS+RK    VVA
Subjt:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA

Query:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN
        E+DEMKS     V+PAT +IS  PKL+IP  DSN+NS+RDDCS +F++GFN TAS IAT N+KEESKKS RWGKGWSIWGLINR GGNKD EE   +R N
Subjt:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN

Query:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF
        GVERSFSGSWPEL G+++++VKGGFNPK  RSN S SWRS+SM+ GSF SSSRKSNAD NGN   KK++ ++Q+QP L RN SAR+S TNVDNGLLRFY 
Subjt:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF

Query:  TPMRGSRSGGS-GKVKPNQALSIARSVLRLY
        TPM+ SR G S GKVKPNQA SIARSVLRLY
Subjt:  TPMRGSRSGGS-GKVKPNQALSIARSVLRLY

A0A6J1I8G1 UPF0503 protein At3g09070, chloroplastic-like | e-value: 2.5e-151 | identity: 70.53%
Query:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-
        MKDHIDLDS T K        I+GSF S       KLQKWRDKQKGKKQRSGAGST LPV+KP+GRHF ET SE ADYGFGRRSCDIDPRFSLD  GRM 
Subjt:  MKDHIDLDSQTNK--------ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-

Query:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA
              SFDEPRASWDG LISRTFP+M TML+VVEDAPI+VFRSD+QIPV         EEN+PGG+SQTRDYY D+SSRRRKS+DRSNS+RK    VVA
Subjt:  ------SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRK---TVVA

Query:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN
        E+DEMKS     V+PAT +IS  PKL+IP  DSNSNS+RDDCS +F++GFN TAS IAT N+KEESKKS RWGKGWSIWGLINR GGNKD EE   +R N
Subjt:  ELDEMKS-----VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPN

Query:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF
        GVERSFSGSWPEL G+++++VKGGFNPK  RSN S SWRS+SM+ GSF SSSRKSNAD NGN   KK++ ++Q+QP L RN SAR+S TNVDNGLLRFY 
Subjt:  GVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSF-SSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYF

Query:  TPMRGSRSG-GSGKVKPNQALSIARSVLRLY
        TPM+ SR G  +GKVKPNQA SIARSVLRLY
Subjt:  TPMRGSRSG-GSGKVKPNQALSIARSVLRLY

A0A6J1J5H7 UPF0503 protein At3g09070, chloroplastic-like | e-value: 4.7e-150 | identity: 70.12%
Query:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------
        MKDHIDLDS T K S  GSFWS       KLQKWRDKQK KKQR+G GSTALPV+KP+GRHF +T SE ADYG+GRRSCDIDPRFSLD  GRM       
Subjt:  MKDHIDLDSQTNKIS--GSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRM-------

Query:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK
        SFDEPRASWDG LISRTFPRM TML+VVEDAPINVFR+D+QIPV         EEN+PGGSSQTRDYY D+SSRRRKS+DRSNS+RKT   VVAE+DEMK
Subjt:  SFDEPRASWDGCLISRTFPRMATMLAVVEDAPINVFRSDSQIPV---------EENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKT---VVAELDEMK

Query:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER
        S      V+PAT ++   PKLAIP  DSNS+SL++DCS S +  FN  AS + T N+KEESKKS  WGKGW IWGLINR GGNKD EE KE +RPNG+ER
Subjt:  S------VTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE-TRPNGVER

Query:  SFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRG
        S SGSWPEL G+ N ++KGGFNPK  RSN S SWRSSSM+ GSFSSSRKSN ++NGN  GKKK     E+P LER+RSAR+S TN+DNGLLRFY T +RG
Subjt:  SFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRG

Query:  SRSGGSGKVKPNQALSIARSVLRLY
        SR GGSGKVKPNQA SIARSVLRLY
Subjt:  SRSGGSGKVKPNQALSIARSVLRLY

SwissProt top hits (e-value, % identity, alignment)
Q9LFB9 Protein OCTOPUS-like | e-value: 6.2e-51 | identity: 39.96%
Query:  MKDHIDLDSQTNK-----ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI--------
        MKD++DL SQT K      +GSF+S       KLQKW+ KQK KK R+G G          GR         ++ G GRRS D DPRFSLD         
Subjt:  MKDHIDLDSQTNK-----ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI--------

Query:  -----PGRMSFDEPRASWDGCLISRT----FPRMATMLAVVEDAPINVFRSDSQIPVEEN-------------VPGGSSQTRDYYF-DTSSRRRKSVDRS
               R S DEPRASWDG LI RT     P   +ML+VVE+AP+N  RSD QIP   +             +PGGS+QTRDYY    SSRRRKS+DRS
Subjt:  -----PGRMSFDEPRASWDGCLISRT----FPRMATMLAVVEDAPINVFRSDSQIPVEEN-------------VPGGSSQTRDYYF-DTSSRRRKSVDRS

Query:  NSVRKTVVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE
        NS+RK +V EL+++KSV+ +T  I           DSNS    ++                  +  +   KKS RWGK WSI G I R  G  D EE + 
Subjt:  NSVRKTVVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE

Query:  TRPNG---VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNG
        +R N    VERS S SWPE+   E      G  PK  RSN + SWRS                 S G S                RN+S+R SS + +NG
Subjt:  TRPNG---VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNG

Query:  LLRFYFTPMRGS--RSGGSG--------------KVKPN-QALSIARSVLRLY
        +LRFY TPMR S   SGGSG                K N    SIAR V+RLY
Subjt:  LLRFYFTPMRGS--RSGGSG--------------KVKPN-QALSIARSVLRLY

Q9SS80 Protein OCTOPUS | e-value: 1.3e-67 | identity: 43.31%
Query:  MKDHIDLDSQTNK--ISGSFWS-------KLQKWRDKQKGKKQRSGA----GSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI-------
        +KD+IDLDSQT K  +  SFWS       KLQKWR  QK KK+R+G     GS  LPV+KP+GR   +T SE ADYG+GRRSCD DPRFSLD        
Subjt:  MKDHIDLDSQTNK--ISGSFWS-------KLQKWRDKQKGKKQRSGA----GSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI-------

Query:  -------------PGRMSFDEPRASWDGCLISRT-FPRMA------TMLAVVEDAP----INVFRSDSQIPVEEN------------------VPGGSSQ
                       R SFDEPRASWDG LI RT FP  A      +ML+VVEDAP     +V R+D Q PVEE                   +PGGS Q
Subjt:  -------------PGRMSFDEPRASWDGCLISRT-FPRMA------TMLAVVEDAP----INVFRSDSQIPVEEN------------------VPGGSSQ

Query:  TRDYYFDTSSRRRKSVDR-SNSVRKT---VVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFEL----GFNHTASAIATR--NQKEESK
        TRDYY D+SSRRRKS+DR S+S+RKT   VVA++DE K    + I I           D+ S SLRD+ + + E      F   A  I  R  N  + +K
Subjt:  TRDYYFDTSSRRRKSVDR-SNSVRKT---VVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFEL----GFNHTASAIATR--NQKEESK

Query:  KSSRWGKGWSIWGLINRPGGNKDGEEHKE-----TRPNG--VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNG
        KS RWGK WSI GLI R   NK  EE +E      R NG  VERS S SWPEL         GG  P+  RSN + SWRS    SG  S+ + +  D   
Subjt:  KSSRWGKGWSIWGLINRPGGNKDGEEHKE-----TRPNG--VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNG

Query:  NSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGSR---------SGGSGKVKPNQALSIARSVLRLY
                          RN+S+R S  N +NG+L+FY   M+ SR          GG G    +   SIARSV+RLY
Subjt:  NSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGSR---------SGGSGKVKPNQALSIARSVLRLY

Arabidopsis top hits (e-value, % identity, alignment)
AT2G38070.1 Protein of unknown function (DUF740) | e-value: 1.7e-67 | identity: 43.77%
Query:  KISGSFWS-------KLQKWRDKQKGKKQRS---GAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI-------------PGRMSFDEP
        +I+GSFWS       KLQKWR KQK KK R+   GAGS+ALPV+K +GR   +T SE A+YG+GRRSCD DPRFS+D                R SF+EP
Subjt:  KISGSFWS-------KLQKWRDKQKGKKQRS---GAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI-------------PGRMSFDEP

Query:  RASWDGCLISRTFP--RMATMLAVVEDAPI--NVFRSDSQIPVE-----------ENVPGGSSQTRDYYFD-TSSRRRKSVDRSNSVRK---TVVAELDE
        RASWDG LI R     RM +ML+VVED+P+  +V RSD+ IPVE           E VPGGS+QTR+YY D +SSRRRKS+DRS+S RK   +V+AE+DE
Subjt:  RASWDGCLISRTFP--RMATMLAVVEDAPI--NVFRSDSQIPVE-----------ENVPGGSSQTRDYYFD-TSSRRRKSVDRSNSVRK---TVVAELDE

Query:  MKSVTPATIEISQAPKLAIPHGDSNSNSLRDDC---SPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPNGVERSF
        +K       +  +A  L      S+SNSLRDDC     ++E+G       I    ++    K SRW   W+I+GL++R  GNK  EE    R +GV+R+F
Subjt:  MKSVTPATIEISQAPKLAIPHGDSNSNSLRDDC---SPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPNGVERSF

Query:  SGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGSR
        SGSW       N+  + GF+PK  RSN S SWRSS    G     ++++ D  G   GKKK  K                    +NG+L+FY TP +G R
Subjt:  SGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGSR

Query:  SGGSGKVKP
         G      P
Subjt:  SGGSGKVKP

AT3G09070.1 Protein of unknown function (DUF740) | e-value: 8.9e-69 | identity: 43.31%
Query:  MKDHIDLDSQTNK--ISGSFWS-------KLQKWRDKQKGKKQRSGA----GSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI-------
        +KD+IDLDSQT K  +  SFWS       KLQKWR  QK KK+R+G     GS  LPV+KP+GR   +T SE ADYG+GRRSCD DPRFSLD        
Subjt:  MKDHIDLDSQTNK--ISGSFWS-------KLQKWRDKQKGKKQRSGA----GSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI-------

Query:  -------------PGRMSFDEPRASWDGCLISRT-FPRMA------TMLAVVEDAP----INVFRSDSQIPVEEN------------------VPGGSSQ
                       R SFDEPRASWDG LI RT FP  A      +ML+VVEDAP     +V R+D Q PVEE                   +PGGS Q
Subjt:  -------------PGRMSFDEPRASWDGCLISRT-FPRMA------TMLAVVEDAP----INVFRSDSQIPVEEN------------------VPGGSSQ

Query:  TRDYYFDTSSRRRKSVDR-SNSVRKT---VVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFEL----GFNHTASAIATR--NQKEESK
        TRDYY D+SSRRRKS+DR S+S+RKT   VVA++DE K    + I I           D+ S SLRD+ + + E      F   A  I  R  N  + +K
Subjt:  TRDYYFDTSSRRRKSVDR-SNSVRKT---VVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFEL----GFNHTASAIATR--NQKEESK

Query:  KSSRWGKGWSIWGLINRPGGNKDGEEHKE-----TRPNG--VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNG
        KS RWGK WSI GLI R   NK  EE +E      R NG  VERS S SWPEL         GG  P+  RSN + SWRS    SG  S+ + +  D   
Subjt:  KSSRWGKGWSIWGLINRPGGNKDGEEHKE-----TRPNG--VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNG

Query:  NSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGSR---------SGGSGKVKPNQALSIARSVLRLY
                          RN+S+R S  N +NG+L+FY   M+ SR          GG G    +   SIARSV+RLY
Subjt:  NSIGKKKKIKEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGSR---------SGGSGKVKPNQALSIARSVLRLY

AT3G46990.1 Protein of unknown function (DUF740) | e-value: 5.3e-29 | identity: 30.17%
Query:  MKDHIDLDSQTNKISGSFWSKLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRMSFDEPRASWDGCLISR
        MK+ IDLD          W    K ++  K  K+ +   S  L       R+  ++ S  A    GR S D+DPR S D  GR+SF++PR+SWDGCLI +
Subjt:  MKDHIDLDSQTNKISGSFWSKLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRMSFDEPRASWDGCLISR

Query:  TFPRMATMLAVVEDAPINVFRSDSQIPVEENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKTVVAELDEMK-----SVTPATIEISQAPKLAIPH---G
        ++ ++ T+  V EDA       + ++  +E  PGG+ QT++YY D  SRRR+S DRS S+++  + E+DE++      V+P T+ +    KL +      
Subjt:  TFPRMATMLAVVEDAPINVFRSDSQIPVEENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKTVVAELDEMK-----SVTPATIEISQAPKLAIPH---G

Query:  DSNSNSLRDDCSPSFEL-GFNHTASAIATRNQKEES----KKSSRWGKGWSIWGLINRPGGNKD---GEEHKETRPNGVERSFSGSWPEL----EGDENI
        DSN  S+++    S EL        A     +K++S    K   +W KGW+IWGLI R    K+    E+  +   N VE S + S  +L    +G+ N+
Subjt:  DSNSNSLRDDCSPSFEL-GFNHTASAIATRNQKEES----KKSSRWGKGWSIWGLINRPGGNKD---GEEHKETRPNGVERSFSGSWPEL----EGDENI

Query:  NVK---------------GGFNPKFHRSNGSASWRSS--SMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARN-SSTNVDNGLLRFYFTP
         V                 G     +  +G    RSS   +  GS +S        +G   G + K   Q    L+RN +    S  N++  + RFY +P
Subjt:  NVK---------------GGFNPKFHRSNGSASWRSS--SMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARN-SSTNVDNGLLRFYFTP

Query:  MRGSRSGGSGK
        ++  ++  SGK
Subjt:  MRGSRSGGSGK

AT5G01170.1 Protein of unknown function (DUF740) | e-value: 4.4e-52 | identity: 39.96%
Query:  MKDHIDLDSQTNK-----ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI--------
        MKD++DL SQT K      +GSF+S       KLQKW+ KQK KK R+G G          GR         ++ G GRRS D DPRFSLD         
Subjt:  MKDHIDLDSQTNK-----ISGSFWS-------KLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDI--------

Query:  -----PGRMSFDEPRASWDGCLISRT----FPRMATMLAVVEDAPINVFRSDSQIPVEEN-------------VPGGSSQTRDYYF-DTSSRRRKSVDRS
               R S DEPRASWDG LI RT     P   +ML+VVE+AP+N  RSD QIP   +             +PGGS+QTRDYY    SSRRRKS+DRS
Subjt:  -----PGRMSFDEPRASWDGCLISRT----FPRMATMLAVVEDAPINVFRSDSQIPVEEN-------------VPGGSSQTRDYYF-DTSSRRRKSVDRS

Query:  NSVRKTVVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE
        NS+RK +V EL+++KSV+ +T  I           DSNS    ++                  +  +   KKS RWGK WSI G I R  G  D EE + 
Subjt:  NSVRKTVVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKE

Query:  TRPNG---VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNG
        +R N    VERS S SWPE+   E      G  PK  RSN + SWRS                 S G S                RN+S+R SS + +NG
Subjt:  TRPNG---VERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKIKEQEQPSLERNRSARNSSTNVDNG

Query:  LLRFYFTPMRGS--RSGGSG--------------KVKPN-QALSIARSVLRLY
        +LRFY TPMR S   SGGSG                K N    SIAR V+RLY
Subjt:  LLRFYFTPMRGS--RSGGSG--------------KVKPN-QALSIARSVLRLY

AT5G58930.1 Protein of unknown function (DUF740) | e-value: 3.0e-32 | identity: 31.23%
Query:  MKDHIDLDSQTNKISGSFWSKLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYG--FGRRSCDIDPRFSLDIPGRMSFDEPRASWDGCLI
        MK+ IDL+S+  ++             K  GK              + L +   + H +  D G   GRRSCD+DPR SLD  GR+SFDEPRASWDGCLI
Subjt:  MKDHIDLDSQTNKISGSFWSKLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYG--FGRRSCDIDPRFSLDIPGRMSFDEPRASWDGCLI

Query:  SRTFPRMATMLAVVED--APINVFRSDSQIPVEENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKTVVAELDEMKS-----VTPATIEISQAPKLAIPH
         +T+P++  + +V ED  A       +     E+N PGG++QTRDYY D  SRRR+S DRS+   +  + E+DE+K+     V+P T+ +    KL +  
Subjt:  SRTFPRMATMLAVVED--APINVFRSDSQIPVEENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKTVVAELDEMKS-----VTPATIEISQAPKLAIPH

Query:  ---GDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEE----SKKSSRWGKGWSIWGLINR----PGGNKDGEEHKETRPNGVERSFSGSWPELEGDENI
            DSN  S+++    S ELG        A   +K++     K    W KGW+ WGLI R           E+  +   N +E S + S  +L      
Subjt:  ---GDSNSNSLRDDCSPSFELGFNHTASAIATRNQKEE----SKKSSRWGKGWSIWGLINR----PGGNKDGEEHKETRPNGVERSFSGSWPELEGDENI

Query:  NVKGGFNPKFHRSNGSASWRS-SSMLSGS-----FSSSRKS-----NADSNGNSIGKKKKIKEQEQPSLERNRS---------ARNSSTNVDNGLLRFYF
           G  + K  RS   ++ +S   ML G+     F   R S     +    G   G++   ++     +E  R+            S  N+ NG++RFY 
Subjt:  NVKGGFNPKFHRSNGSASWRS-SSMLSGS-----FSSSRKS-----NADSNGNSIGKKKKIKEQEQPSLERNRS---------ARNSSTNVDNGLLRFYF

Query:  TPMRGSRSGGSGK
        TP+    +  SGK
Subjt:  TPMRGSRSGGSGK


Sequences
CDS sequence
ATGAAGGATCACATAGATCTGGATTCGCAAACGAACAAGATCTCCGGAAGCTTCTGGTCGAAGCTGCAGAAATGGAGGGATAAACAGAAGGGGAAGAAGCAGAGGAGCGG
CGCCGGATCTACAGCCTTGCCGGTGGACAAACCACTCGGACGCCATTTCACAGAAACCCACTCGGAAACTGCCGATTACGGCTTCGGCCGCCGATCCTGCGACATTGATC
CCCGATTCTCTCTCGACATTCCCGGCCGAATGTCCTTCGACGAACCTAGAGCTTCTTGGGATGGGTGCTTGATTAGCAGAACGTTTCCCAGAATGGCCACCATGCTTGCC
GTCGTTGAAGATGCTCCCATCAATGTCTTCCGTTCTGACTCCCAAATTCCTGTGGAAGAGAATGTCCCTGGTGGGTCCTCGCAGACCCGGGATTACTATTTCGACACATC
TTCCCGACGGCGCAAGAGTGTCGACCGGTCCAACTCCGTTAGAAAGACGGTGGTGGCGGAGCTTGATGAAATGAAATCTGTTACTCCTGCAACTATAGAGATCAGCCAAG
CCCCAAAACTAGCCATTCCACATGGAGATTCCAACTCCAATTCGCTTCGAGACGACTGCTCCCCGTCCTTCGAATTGGGATTCAACCACACTGCATCTGCGATCGCGACG
AGGAATCAGAAAGAGGAGTCGAAGAAATCCAGCAGGTGGGGGAAGGGATGGAGCATTTGGGGATTGATTAACCGGCCGGGAGGAAACAAAGATGGGGAGGAACACAAAGA
GACTAGACCCAATGGCGTGGAACGATCTTTTTCGGGGTCGTGGCCGGAGCTCGAAGGGGATGAGAATATCAATGTCAAAGGAGGATTCAATCCCAAATTTCATAGGAGTA
ACGGCAGTGCGAGTTGGAGGAGTTCAAGTATGTTAAGTGGATCTTTCAGTAGTTCAAGGAAAAGCAATGCAGATTCTAATGGGAATAGCATTGGGAAGAAGAAGAAGATT
AAAGAGCAGGAGCAGCCATCCTTGGAGAGGAATCGGAGTGCTCGAAACTCCTCGACGAACGTCGACAATGGACTTCTTCGATTCTACTTCACGCCGATGAGGGGAAGCCG
GAGTGGTGGGTCCGGGAAGGTGAAACCAAATCAAGCACTGTCCATTGCTAGAAGTGTTCTTAGACTGTATTAA
mRNA sequence
CACCATGTGTGGACAATTGTTGCAACTGCAAAAATGTCTCCTCTTCTTCTTCATTTGGGTGGAAACCAAAGAGTGAGCAAATAAAATTGCAGTGAGGAACAAGAAAGAAA
GAGGAAAGCTTTGATGGAGTATCTGTTTTTGTAAGGCCCTTCACTTTCCTCATAATTTCCCAAAGTCCCATCATGGCCTTCTCTCCATTTTCTTCATTCTTCTTCTTCTT
CGCCGTTCTCTCTTATTAGCTAGGGTTCTTGCTTCTGCTCCTCAACGAAATTGGGGTTTCTCAAATACCCAATTCGGTGCACTTACGACTTCAAACCCATGAAGGATCAC
ATAGATCTGGATTCGCAAACGAACAAGATCTCCGGAAGCTTCTGGTCGAAGCTGCAGAAATGGAGGGATAAACAGAAGGGGAAGAAGCAGAGGAGCGGCGCCGGATCTAC
AGCCTTGCCGGTGGACAAACCACTCGGACGCCATTTCACAGAAACCCACTCGGAAACTGCCGATTACGGCTTCGGCCGCCGATCCTGCGACATTGATCCCCGATTCTCTC
TCGACATTCCCGGCCGAATGTCCTTCGACGAACCTAGAGCTTCTTGGGATGGGTGCTTGATTAGCAGAACGTTTCCCAGAATGGCCACCATGCTTGCCGTCGTTGAAGAT
GCTCCCATCAATGTCTTCCGTTCTGACTCCCAAATTCCTGTGGAAGAGAATGTCCCTGGTGGGTCCTCGCAGACCCGGGATTACTATTTCGACACATCTTCCCGACGGCG
CAAGAGTGTCGACCGGTCCAACTCCGTTAGAAAGACGGTGGTGGCGGAGCTTGATGAAATGAAATCTGTTACTCCTGCAACTATAGAGATCAGCCAAGCCCCAAAACTAG
CCATTCCACATGGAGATTCCAACTCCAATTCGCTTCGAGACGACTGCTCCCCGTCCTTCGAATTGGGATTCAACCACACTGCATCTGCGATCGCGACGAGGAATCAGAAA
GAGGAGTCGAAGAAATCCAGCAGGTGGGGGAAGGGATGGAGCATTTGGGGATTGATTAACCGGCCGGGAGGAAACAAAGATGGGGAGGAACACAAAGAGACTAGACCCAA
TGGCGTGGAACGATCTTTTTCGGGGTCGTGGCCGGAGCTCGAAGGGGATGAGAATATCAATGTCAAAGGAGGATTCAATCCCAAATTTCATAGGAGTAACGGCAGTGCGA
GTTGGAGGAGTTCAAGTATGTTAAGTGGATCTTTCAGTAGTTCAAGGAAAAGCAATGCAGATTCTAATGGGAATAGCATTGGGAAGAAGAAGAAGATTAAAGAGCAGGAG
CAGCCATCCTTGGAGAGGAATCGGAGTGCTCGAAACTCCTCGACGAACGTCGACAATGGACTTCTTCGATTCTACTTCACGCCGATGAGGGGAAGCCGGAGTGGTGGGTC
CGGGAAGGTGAAACCAAATCAAGCACTGTCCATTGCTAGAAGTGTTCTTAGACTGTATTAAACTGGAACTGGTCGGGGAAGTTTTCAACCAGAAAACATGGTTTTCTGGT
CCATTGCTATGTATAGTTTTTGTTCAAGCTTTTTCTTTACTTGGTTTATTGGGGATTCAATCCCTAAACCAGAATGAAGTATGAAGAATTTCTTTTCATGCGTCTTCTTA
ATTTTTCATATGTACTGTCATTTCATGAGACGAGCCAGTTTAGATGAGACGAGCCAATACCCAAAAGAAAACAAAATGAAGCGAC
Protein sequence
MKDHIDLDSQTNKISGSFWSKLQKWRDKQKGKKQRSGAGSTALPVDKPLGRHFTETHSETADYGFGRRSCDIDPRFSLDIPGRMSFDEPRASWDGCLISRTFPRMATMLA
VVEDAPINVFRSDSQIPVEENVPGGSSQTRDYYFDTSSRRRKSVDRSNSVRKTVVAELDEMKSVTPATIEISQAPKLAIPHGDSNSNSLRDDCSPSFELGFNHTASAIAT
RNQKEESKKSSRWGKGWSIWGLINRPGGNKDGEEHKETRPNGVERSFSGSWPELEGDENINVKGGFNPKFHRSNGSASWRSSSMLSGSFSSSRKSNADSNGNSIGKKKKI
KEQEQPSLERNRSARNSSTNVDNGLLRFYFTPMRGSRSGGSGKVKPNQALSIARSVLRLY
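The protein sequence should be the conceptual translation of the CDS listed above it. A minimal Python sketch of how that correspondence could be checked is shown here. It assumes the standard nuclear genetic code, and the function name `translate` is hypothetical. Only the first 30 and last 9 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAAGGATCACATAGATCTGGATTCGCAA"  # first 30 nt of the CDS
cds_end = "CTGTATTAA"                        # last 9 nt of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MKDHIDLDSQ` and `LY`, which match the start and end of the protein sequence shown above.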