; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007069 (gene) of Chayote v1 genome

Gene IDSed0007069
OrganismSechium edule (Chayote v1)
DescriptionProtein KAKU4
Genome locationLG05:30330219..30340536
RNA-Seq ExpressionSed0007069
SyntenySed0007069
Gene Ontology termsGO:0071763 - nuclear membrane organization (biological process)
GO:0005635 - nuclear envelope (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139984.1 protein KAKU4 isoform X1 [Momordica charantia]1.8e-19972.37Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MA+VP+Y +  GSR+GGKIVRARRA SRK PYERP  SN G G NPSWIS+FIFSPT TIASGAGKLLSSVFVSDS SSSS  DSE+DVE D HDENH F
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETF-SRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKK
        Q +EGVKKNGT E V+LFRKDFPP+ KDSK LIEQLLMQETF SRAERDKLFQIIESRVVE Q I+  AAG LTEISNRAV++D D PAVCSTAILEAKK
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETF-SRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKK

Query:  WLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPAS
        WLN+KRLGLGSS+T + DHGPCTLNSTM  V  +EEMGSPV+VAKSYMRARPPWASPSS+NFEFKSPSPLGLQLFKEET YSI+GNPLSSSK+KRESPAS
Subjt:  WLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPAS

Query:  GSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNT
        GSWNIQEEIRRVRS+ATEEMLRT  SA  DW S +SDYKSNLSS  SD        KMQ A K IDK + WSA NT T NLSESK  +DVSENGA LL+T
Subjt:  GSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNT

Query:  TSITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE---
        T+I  QQDKDL+TNPTTERK SNSSLDE++CST HEVAG+ANGF  VPSSSGE+    KIVEEN SSGHDHEAKG P EE+CELLSE SMEVPNV E   
Subjt:  TSITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE---

Query:  ------NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN
              ND  K L E NG                 K S QS+ +  K+  T Y RRG+RRN
Subjt:  ------NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN

XP_022139992.1 protein KAKU4 isoform X2 [Momordica charantia]7.2e-20172.5Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MA+VP+Y +  GSR+GGKIVRARRA SRK PYERP  SN G G NPSWIS+FIFSPT TIASGAGKLLSSVFVSDS SSSS  DSE+DVE D HDENH F
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKKW
        Q +EGVKKNGT E V+LFRKDFPP+ KDSK LIEQLLMQETFSRAERDKLFQIIESRVVE Q I+  AAG LTEISNRAV++D D PAVCSTAILEAKKW
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGSS+T + DHGPCTLNSTM  V  +EEMGSPV+VAKSYMRARPPWASPSS+NFEFKSPSPLGLQLFKEET YSI+GNPLSSSK+KRESPASG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT
        SWNIQEEIRRVRS+ATEEMLRT  SA  DW S +SDYKSNLSS  SD        KMQ A K IDK + WSA NT T NLSESK  +DVSENGA LL+TT
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT

Query:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE----
        +I  QQDKDL+TNPTTERK SNSSLDE++CST HEVAG+ANGF  VPSSSGE+    KIVEEN SSGHDHEAKG P EE+CELLSE SMEVPNV E    
Subjt:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE----

Query:  -----NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN
             ND  K L E NG                 K S QS+ +  K+  T Y RRG+RRN
Subjt:  -----NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN

XP_022982016.1 protein KAKU4-like isoform X1 [Cucurbita maxima]8.3e-19771.25Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MASVPAY EAGGSRTGGKIVRARRAWSRKTPYERP  SN G G NPSWIS+FIFSPT TIASGAGK LSSVF+++SSSSSS SDSE+DVEDDVHD+N VF
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW
        Q   GVKK+GT E VN FRKDFPPE KDSK LIEQLLMQETFSRAERDKLFQIIESRVVECQ+IEG AAG LTE+SN+AV++DD   AVCSTAILEAK W
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGSS+T+E DHGPCTLNSTM  +V +EE GS V+VAKSYMR RP WASPS +NFEFKSPSP  LQLFKEET +S+SGNPLSSSKIKRESPASG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT
        SWNIQEEIRRVRSKATEEMLR+RSSA  DW  F+SDYK NLSSARSDY       KMQ AV+SIDKSMN  A  + T NLSESK  +DV ENGA LL + 
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT

Query:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE
        SI QQQDKD ETNPT ERKASNSSLDE+ CST HEVAG+ANG P++PSS+GE+A       KIVEE+ SSGHDHEAKG+P +E+CE LSE+SMEVPN N+
Subjt:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE

Query:  ND--TDKALSEGNGV--------------------------KSSSQSNVATRKASGTGYPRRGRRRN
         D   +K  S+GN V                          K  ++SNVA  K S   Y RRGRRRN
Subjt:  ND--TDKALSEGNGV--------------------------KSSSQSNVATRKASGTGYPRRGRRRN

XP_022982019.1 protein KAKU4-like isoform X3 [Cucurbita maxima]1.3e-19772.53Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MASVPAY EAGGSRTGGKIVRARRAWSRKTPYERP  SN G G NPSWIS+FIFSPT TIASGAGK LSSVF+++SSSSSS SDSE+DVEDDVHD+N VF
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW
        Q   GVKK+GT E VN FRKDFPPE KDSK LIEQLLMQETFSRAERDKLFQIIESRVVECQ+IEG AAG LTE+SN+AV++DD   AVCSTAILEAK W
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGSS+T+E DHGPCTLNSTM  +V +EE GS V+VAKSYMR RP WASPS +NFEFKSPSP  LQLFKEET +S+SGNPLSSSKIKRESPASG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT
        SWNIQEEIRRVRSKATEEMLR+RSSA  DW  F+SDYK NLSSARSDY       KMQ AV+SIDKSMN  A  + T NLSESK  +DV ENGA LL + 
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT

Query:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE
        SI QQQDKD ETNPT ERKASNSSLDE+ CST HEVAG+ANG P++PSS+GE+A       KIVEE+ SSGHDHEAKG+P +E+CE LSE+SMEVPN N+
Subjt:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE

Query:  ND--TDKALSEGNG----------------VKSSSQSNVATRKASGTGYPRRGRRRN
         D   +K  SEGN                  K  ++SNVA  K S   Y RRGRRRN
Subjt:  ND--TDKALSEGNG----------------VKSSSQSNVATRKASGTGYPRRGRRRN

XP_038898993.1 protein KAKU4 [Benincasa hispida]1.8e-19672.29Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MAS+PAY EAGGSR+GGKIVRARR  SRKTPYERP  SN G G NPSWISKFIFSPTRTIA+GAGKLLSSVFVSDSSSSSS S+SE+D EDDVHDEN VF
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDDA-PAVCSTAILEAKKW
        Q +EGVKKNGT E VNLFRKDFPP  KDSK LIEQLLMQETFSRAERDKLFQIIESRVVECQ  EG AAG LTEISNRAV+NDD  PAVC TAILEAKKW
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDDA-PAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGS++T++ D GPCTLNSTM  VV  EEMGSPV+VAKSYM+ARPPWASPS++NFEFKSPSPLGLQLFKEET YSISGNPLSSS+IKR+SP SG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSF--SSDYKSNLSSAR--SDYKMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTTSIT
        SWNIQEE+RRVRSKAT+E+LRT  SA  DW SF  +SDYKSNLSS +  S  K+Q AVKSIDKSMNWSA NT T NLSESKT +DVSEN A LL TTSI 
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSF--SSDYKSNLSSAR--SDYKMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTTSIT

Query:  QQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSS------SGEVAKIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNENDT-
         Q+DKDLETNPTTER   NSS D +ECST HE AG+ANGFP +PSS         V  IVEEN SS   HEAK     E+CELLSE+S+EVP++NENDT 
Subjt:  QQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSS------SGEVAKIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNENDT-

Query:  ---------DKALSEGNG-----------------VKSSSQSNVATRK-ASGTGYPRRGRRRN
                  K +SEGNG                  K SS+SN+A  K  SGTG  RRGRRRN
Subjt:  ---------DKALSEGNG-----------------VKSSSQSNVATRK-ASGTGYPRRGRRRN

TrEMBL top hitse value%identityAlignment
A0A5D3E734 Protein KAKU47.6e-19671.3Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MAS PAYGEA GSR+GGKIVRARR  SRKTPYERP  SN G G NPSWISKFIFSPTRTIA+GAGKLLSSVFVSDSSSSSS SDSE+D EDDV DE HVF
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKKW
        Q +EG KKNGT E V+LFRKDFPPE KDSK LIEQLLMQETFSRAERDKL QIIESRVVE Q  EG AA  LTEISNR V++D   PAVCS+AILEAKKW
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGS++T++ D GPCTLNSTM  +V  EEMGSPV+VAKSYM+ARPPWASPS+ NFEFKSPSPLGLQLFKEET YSISGNPLSSS+IKRESP SG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS--SDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLN
        SWNIQEE+RRVRSKATEEMLR+ SS   DW S +  SDYK+NLSS R ++       K+Q AVK IDKSM WSA NT T NLSESKTAEDVSEN A  L 
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS--SDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLN

Query:  TTSITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNV
        TTSI  QQDKDLETNPTT+ K SNSSLDE+ECST HE AG+ANGFP +PSSSGE+         IVEEN SS HDH AK  P EE+CELLSE+SMEVP++
Subjt:  TTSITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNV

Query:  NENDTDKALSEGNGV--------------------------KSSSQSNVATRK-ASGTGYPRRGRRRN
        NE DTDK +S+GN                            K SS+S VA  K  SGT Y RRGRRRN
Subjt:  NENDTDKALSEGNGV--------------------------KSSSQSNVATRK-ASGTGYPRRGRRRN

A0A6J1CEG9 protein KAKU4 isoform X18.6e-20072.37Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MA+VP+Y +  GSR+GGKIVRARRA SRK PYERP  SN G G NPSWIS+FIFSPT TIASGAGKLLSSVFVSDS SSSS  DSE+DVE D HDENH F
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETF-SRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKK
        Q +EGVKKNGT E V+LFRKDFPP+ KDSK LIEQLLMQETF SRAERDKLFQIIESRVVE Q I+  AAG LTEISNRAV++D D PAVCSTAILEAKK
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETF-SRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKK

Query:  WLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPAS
        WLN+KRLGLGSS+T + DHGPCTLNSTM  V  +EEMGSPV+VAKSYMRARPPWASPSS+NFEFKSPSPLGLQLFKEET YSI+GNPLSSSK+KRESPAS
Subjt:  WLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPAS

Query:  GSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNT
        GSWNIQEEIRRVRS+ATEEMLRT  SA  DW S +SDYKSNLSS  SD        KMQ A K IDK + WSA NT T NLSESK  +DVSENGA LL+T
Subjt:  GSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNT

Query:  TSITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE---
        T+I  QQDKDL+TNPTTERK SNSSLDE++CST HEVAG+ANGF  VPSSSGE+    KIVEEN SSGHDHEAKG P EE+CELLSE SMEVPNV E   
Subjt:  TSITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE---

Query:  ------NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN
              ND  K L E NG                 K S QS+ +  K+  T Y RRG+RRN
Subjt:  ------NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN

A0A6J1CFJ1 protein KAKU4 isoform X23.5e-20172.5Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MA+VP+Y +  GSR+GGKIVRARRA SRK PYERP  SN G G NPSWIS+FIFSPT TIASGAGKLLSSVFVSDS SSSS  DSE+DVE D HDENH F
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKKW
        Q +EGVKKNGT E V+LFRKDFPP+ KDSK LIEQLLMQETFSRAERDKLFQIIESRVVE Q I+  AAG LTEISNRAV++D D PAVCSTAILEAKKW
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGSS+T + DHGPCTLNSTM  V  +EEMGSPV+VAKSYMRARPPWASPSS+NFEFKSPSPLGLQLFKEET YSI+GNPLSSSK+KRESPASG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT
        SWNIQEEIRRVRS+ATEEMLRT  SA  DW S +SDYKSNLSS  SD        KMQ A K IDK + WSA NT T NLSESK  +DVSENGA LL+TT
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT

Query:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE----
        +I  QQDKDL+TNPTTERK SNSSLDE++CST HEVAG+ANGF  VPSSSGE+    KIVEEN SSGHDHEAKG P EE+CELLSE SMEVPNV E    
Subjt:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA---KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE----

Query:  -----NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN
             ND  K L E NG                 K S QS+ +  K+  T Y RRG+RRN
Subjt:  -----NDTDKALSEGNGV----------------KSSSQSNVATRKASGTGYPRRGRRRN

A0A6J1IY54 protein KAKU4-like isoform X14.0e-19771.25Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MASVPAY EAGGSRTGGKIVRARRAWSRKTPYERP  SN G G NPSWIS+FIFSPT TIASGAGK LSSVF+++SSSSSS SDSE+DVEDDVHD+N VF
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW
        Q   GVKK+GT E VN FRKDFPPE KDSK LIEQLLMQETFSRAERDKLFQIIESRVVECQ+IEG AAG LTE+SN+AV++DD   AVCSTAILEAK W
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGSS+T+E DHGPCTLNSTM  +V +EE GS V+VAKSYMR RP WASPS +NFEFKSPSP  LQLFKEET +S+SGNPLSSSKIKRESPASG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT
        SWNIQEEIRRVRSKATEEMLR+RSSA  DW  F+SDYK NLSSARSDY       KMQ AV+SIDKSMN  A  + T NLSESK  +DV ENGA LL + 
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT

Query:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE
        SI QQQDKD ETNPT ERKASNSSLDE+ CST HEVAG+ANG P++PSS+GE+A       KIVEE+ SSGHDHEAKG+P +E+CE LSE+SMEVPN N+
Subjt:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE

Query:  ND--TDKALSEGNGV--------------------------KSSSQSNVATRKASGTGYPRRGRRRN
         D   +K  S+GN V                          K  ++SNVA  K S   Y RRGRRRN
Subjt:  ND--TDKALSEGNGV--------------------------KSSSQSNVATRKASGTGYPRRGRRRN

A0A6J1J3F5 protein KAKU4-like isoform X36.2e-19872.53Show/hide
Query:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF
        MASVPAY EAGGSRTGGKIVRARRAWSRKTPYERP  SN G G NPSWIS+FIFSPT TIASGAGK LSSVF+++SSSSSS SDSE+DVEDDVHD+N VF
Subjt:  MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVF

Query:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW
        Q   GVKK+GT E VN FRKDFPPE KDSK LIEQLLMQETFSRAERDKLFQIIESRVVECQ+IEG AAG LTE+SN+AV++DD   AVCSTAILEAK W
Subjt:  QESEGVKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDD-APAVCSTAILEAKKW

Query:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG
        LN+KRLGLGSS+T+E DHGPCTLNSTM  +V +EE GS V+VAKSYMR RP WASPS +NFEFKSPSP  LQLFKEET +S+SGNPLSSSKIKRESPASG
Subjt:  LNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASG

Query:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT
        SWNIQEEIRRVRSKATEEMLR+RSSA  DW  F+SDYK NLSSARSDY       KMQ AV+SIDKSMN  A  + T NLSESK  +DV ENGA LL + 
Subjt:  SWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFSSDYKSNLSSARSDY-------KMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTT

Query:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE
        SI QQQDKD ETNPT ERKASNSSLDE+ CST HEVAG+ANG P++PSS+GE+A       KIVEE+ SSGHDHEAKG+P +E+CE LSE+SMEVPN N+
Subjt:  SITQQQDKDLETNPTTERKASNSSLDEKECSTAHEVAGMANGFPAVPSSSGEVA-------KIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNE

Query:  ND--TDKALSEGNG----------------VKSSSQSNVATRKASGTGYPRRGRRRN
         D   +K  SEGN                  K  ++SNVA  K S   Y RRGRRRN
Subjt:  ND--TDKALSEGNG----------------VKSSSQSNVATRKASGTGYPRRGRRRN

SwissProt top hitse value%identityAlignment
Q949W6 Protein KAKU46.0e-4941.5Show/hide
Query:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD
        M SV  Y   AG  R GGKIVR RR    +TP ERP + S     +NPSWIS+ ++ P   IASGAGK +SSV  SDSSSSS     S S+ D ++DV  
Subjt:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD

Query:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI
         N  F E +            L     P  +   SKR+IEQLL+QETF+R E D+L  II++RVV+  +   V +   T   +  + +D +   + +TA+
Subjt:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI

Query:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR
        +EA+KWL +K+ G                 S+      E+  GSPV+VAKSYMRAR PW SP+++N +F+SPS   +Q     TP   S    SSSK+KR
Subjt:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR

Query:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD
        +S ++ SWNIQ+EIR+VR+KATEEML++ SS     P +S         K N SS  +D
Subjt:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD

Arabidopsis top hitse value%identityAlignment
AT4G31430.1 unknown protein4.3e-5041.5Show/hide
Query:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD
        M SV  Y   AG  R GGKIVR RR    +TP ERP + S     +NPSWIS+ ++ P   IASGAGK +SSV  SDSSSSS     S S+ D ++DV  
Subjt:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD

Query:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI
         N  F E +            L     P  +   SKR+IEQLL+QETF+R E D+L  II++RVV+  +   V +   T   +  + +D +   + +TA+
Subjt:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI

Query:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR
        +EA+KWL +K+ G                 S+      E+  GSPV+VAKSYMRAR PW SP+++N +F+SPS   +Q     TP   S    SSSK+KR
Subjt:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR

Query:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD
        +S ++ SWNIQ+EIR+VR+KATEEML++ SS     P +S         K N SS  +D
Subjt:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD

AT4G31430.2 unknown protein4.3e-5041.5Show/hide
Query:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD
        M SV  Y   AG  R GGKIVR RR    +TP ERP + S     +NPSWIS+ ++ P   IASGAGK +SSV  SDSSSSS     S S+ D ++DV  
Subjt:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD

Query:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI
         N  F E +            L     P  +   SKR+IEQLL+QETF+R E D+L  II++RVV+  +   V +   T   +  + +D +   + +TA+
Subjt:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI

Query:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR
        +EA+KWL +K+ G                 S+      E+  GSPV+VAKSYMRAR PW SP+++N +F+SPS   +Q     TP   S    SSSK+KR
Subjt:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR

Query:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD
        +S ++ SWNIQ+EIR+VR+KATEEML++ SS     P +S         K N SS  +D
Subjt:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD

AT4G31430.3 unknown protein4.3e-5041.5Show/hide
Query:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD
        M SV  Y   AG  R GGKIVR RR    +TP ERP + S     +NPSWIS+ ++ P   IASGAGK +SSV  SDSSSSS     S S+ D ++DV  
Subjt:  MASVPAYGE-AGGSRTGGKIVRARRAWSRKTPYERP-EHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSS---SGSDSEEDVEDDVHD

Query:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI
         N  F E +            L     P  +   SKR+IEQLL+QETF+R E D+L  II++RVV+  +   V +   T   +  + +D +   + +TA+
Subjt:  ENHVFQESEGVKKNGTLEAVNLFRKDFPP-EMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVEND-DAPAVCSTAI

Query:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR
        +EA+KWL +K+ G                 S+      E+  GSPV+VAKSYMRAR PW SP+++N +F+SPS   +Q     TP   S    SSSK+KR
Subjt:  LEAKKWLNDKRLGLGSSTTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKR

Query:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD
        +S ++ SWNIQ+EIR+VR+KATEEML++ SS     P +S         K N SS  +D
Subjt:  ESPASGSWNIQEEIRRVRSKATEEMLRTRSSATRDWPSFS------SDYKSNLSSARSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCCGTTCCTGCATATGGCGAAGCCGGTGGGTCAAGAACCGGCGGGAAAATTGTTCGAGCTAGGCGAGCTTGGAGTCGGAAGACTCCTTATGAACGCCCC
GAGCATTCGAATTCGGGCACCGGCGAAAACCCTAGCTGGATCTCGAAGTTCATCTTCTCGCCTACTCGTACCATTGCTTCCGGCGCCGGAAAGTTGCTCTCGTCG
GTTTTCGTTTCCGACTCTTCATCCTCCTCTTCCGGCAGTGATTCAGAGGAAGATGTTGAAGATGATGTTCATGATGAAAATCATGTCTTCCAGGAATCTGAAGGA
GTGAAGAAGAATGGGACATTGGAGGCGGTCAATTTATTTAGGAAGGACTTTCCACCAGAGATGAAAGATAGCAAGCGTCTAATTGAACAGCTACTGATGCAGGAG
ACCTTCTCTAGGGCAGAGCGTGACAAGTTATTTCAAATAATTGAATCAAGGGTTGTAGAATGTCAAGCCATTGAAGGTGTGGCTGCAGGGACGCTGACTGAGATA
TCAAACAGAGCAGTTGAAAATGATGATGCGCCTGCAGTCTGCAGCACAGCAATTCTTGAGGCAAAAAAATGGTTGAATGACAAACGATTGGGGTTGGGGTCTAGT
ACGACAGTGGAGTTTGATCATGGACCATGCACCTTGAATTCCACGATGCCGATCGTGGTTAAGGAGGAAGAAATGGGTTCACCGGTAAATGTAGCCAAATCATAC
ATGCGAGCACGTCCTCCATGGGCTTCTCCTTCCTCGAGTAATTTTGAGTTTAAATCTCCTTCACCATTAGGGTTACAACTTTTCAAAGAAGAAACACCATATTCA
ATTAGTGGAAATCCACTATCTTCATCTAAGATAAAAAGGGAATCTCCTGCCAGTGGATCATGGAACATCCAGGAGGAAATACGACGTGTGCGATCAAAAGCAACC
GAGGAAATGCTAAGAACTCGCTCGTCTGCCACACGTGACTGGCCTTCATTTTCATCTGACTATAAAAGTAATTTGAGTTCTGCACGTTCTGACTATAAAATGCAA
CTTGCTGTTAAGTCAATAGATAAATCAATGAATTGGTCTGCATACAATACTTTCACTTCTAATTTATCAGAGTCGAAAACCGCAGAAGACGTGTCTGAAAACGGA
GCCTCCCTGCTCAATACAACCAGCATTACTCAGCAACAGGATAAGGATTTGGAAACCAATCCTACAACTGAGAGGAAAGCGTCAAATTCAAGTTTGGATGAGAAA
GAATGCTCTACAGCGCACGAAGTTGCTGGGATGGCCAACGGTTTTCCTGCAGTACCTAGTTCTTCAGGGGAGGTCGCCAAGATAGTGGAAGAAAACACTTCATCC
GGTCACGACCACGAGGCGAAAGGGGTGCCTGCAGAGGAGCAGTGCGAGCTATTAAGCGAATTGTCCATGGAGGTCCCAAATGTGAATGAAAATGATACTGATAAA
GCTCTATCGGAAGGAAATGGCGTCAAGTCAAGTTCACAAAGCAACGTAGCAACGAGAAAGGCTAGCGGAACGGGATATCCGCGGCGAGGAAGACGAAGAAACTGA
mRNA sequenceShow/hide mRNA sequence
GTGGGCTATTTCTCTTTCATTCCATTACTGGAAAAACCTATTCTCGCCTGAACGAACCAAGAATCTTCACTGTGAGCTTCAATGGCGTCCGTTCCTGCATATGGC
GAAGCCGGTGGGTCAAGAACCGGCGGGAAAATTGTTCGAGCTAGGCGAGCTTGGAGTCGGAAGACTCCTTATGAACGCCCCGAGCATTCGAATTCGGGCACCGGC
GAAAACCCTAGCTGGATCTCGAAGTTCATCTTCTCGCCTACTCGTACCATTGCTTCCGGCGCCGGAAAGTTGCTCTCGTCGGTTTTCGTTTCCGACTCTTCATCC
TCCTCTTCCGGCAGTGATTCAGAGGAAGATGTTGAAGATGATGTTCATGATGAAAATCATGTCTTCCAGGAATCTGAAGGAGTGAAGAAGAATGGGACATTGGAG
GCGGTCAATTTATTTAGGAAGGACTTTCCACCAGAGATGAAAGATAGCAAGCGTCTAATTGAACAGCTACTGATGCAGGAGACCTTCTCTAGGGCAGAGCGTGAC
AAGTTATTTCAAATAATTGAATCAAGGGTTGTAGAATGTCAAGCCATTGAAGGTGTGGCTGCAGGGACGCTGACTGAGATATCAAACAGAGCAGTTGAAAATGAT
GATGCGCCTGCAGTCTGCAGCACAGCAATTCTTGAGGCAAAAAAATGGTTGAATGACAAACGATTGGGGTTGGGGTCTAGTACGACAGTGGAGTTTGATCATGGA
CCATGCACCTTGAATTCCACGATGCCGATCGTGGTTAAGGAGGAAGAAATGGGTTCACCGGTAAATGTAGCCAAATCATACATGCGAGCACGTCCTCCATGGGCT
TCTCCTTCCTCGAGTAATTTTGAGTTTAAATCTCCTTCACCATTAGGGTTACAACTTTTCAAAGAAGAAACACCATATTCAATTAGTGGAAATCCACTATCTTCA
TCTAAGATAAAAAGGGAATCTCCTGCCAGTGGATCATGGAACATCCAGGAGGAAATACGACGTGTGCGATCAAAAGCAACCGAGGAAATGCTAAGAACTCGCTCG
TCTGCCACACGTGACTGGCCTTCATTTTCATCTGACTATAAAAGTAATTTGAGTTCTGCACGTTCTGACTATAAAATGCAACTTGCTGTTAAGTCAATAGATAAA
TCAATGAATTGGTCTGCATACAATACTTTCACTTCTAATTTATCAGAGTCGAAAACCGCAGAAGACGTGTCTGAAAACGGAGCCTCCCTGCTCAATACAACCAGC
ATTACTCAGCAACAGGATAAGGATTTGGAAACCAATCCTACAACTGAGAGGAAAGCGTCAAATTCAAGTTTGGATGAGAAAGAATGCTCTACAGCGCACGAAGTT
GCTGGGATGGCCAACGGTTTTCCTGCAGTACCTAGTTCTTCAGGGGAGGTCGCCAAGATAGTGGAAGAAAACACTTCATCCGGTCACGACCACGAGGCGAAAGGG
GTGCCTGCAGAGGAGCAGTGCGAGCTATTAAGCGAATTGTCCATGGAGGTCCCAAATGTGAATGAAAATGATACTGATAAAGCTCTATCGGAAGGAAATGGCGTC
AAGTCAAGTTCACAAAGCAACGTAGCAACGAGAAAGGCTAGCGGAACGGGATATCCGCGGCGAGGAAGACGAAGAAACTGAGGGGTGGGAGGAAACAAGAATTTA
GGTTTGTGTTGGAACCAAATGTTGAAGATGAAAACTTTTGACCATAGCTAAAGCAAGGCTATTGTATCATGAGTAGTTTATTTAGTTGTAAGGATTTGTATAGAG
CTAGATGAATGAATCAGCTTAAGCGACACTGAAGTAGTTTTGTGTATTTTAATGTTGTGGAAAATTTGTGGTAAAAATGGCTATATTTAACCCAGTTCACTAGGA
GTGTGTGTTTATTCACATACTGGCTTTGCAAATAATTTATCGGTTAACTTACTATTTTGGTCTTTAGATTTATTTTGGTCTTTAGATTTT
Protein sequenceShow/hide protein sequence
MASVPAYGEAGGSRTGGKIVRARRAWSRKTPYERPEHSNSGTGENPSWISKFIFSPTRTIASGAGKLLSSVFVSDSSSSSSGSDSEEDVEDDVHDENHVFQESEG
VKKNGTLEAVNLFRKDFPPEMKDSKRLIEQLLMQETFSRAERDKLFQIIESRVVECQAIEGVAAGTLTEISNRAVENDDAPAVCSTAILEAKKWLNDKRLGLGSS
TTVEFDHGPCTLNSTMPIVVKEEEMGSPVNVAKSYMRARPPWASPSSSNFEFKSPSPLGLQLFKEETPYSISGNPLSSSKIKRESPASGSWNIQEEIRRVRSKAT
EEMLRTRSSATRDWPSFSSDYKSNLSSARSDYKMQLAVKSIDKSMNWSAYNTFTSNLSESKTAEDVSENGASLLNTTSITQQQDKDLETNPTTERKASNSSLDEK
ECSTAHEVAGMANGFPAVPSSSGEVAKIVEENTSSGHDHEAKGVPAEEQCELLSELSMEVPNVNENDTDKALSEGNGVKSSSQSNVATRKASGTGYPRRGRRRN