; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012324 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012324
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein O-glucosyltransferase 1-like
Genome locationChr01:20094166..20099983
RNA-Seq ExpressionHG10012324
SyntenyHG10012324
Gene Ontology termsGO:0009086 - methionine biosynthetic process (biological process)
GO:0008705 - methionine synthase activity (molecular function)
InterPro domainsIPR004223 - Vitamin B12-dependent methionine synthase, activation domain
IPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033638.1 O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa]6.1e-15562.11Show/hide
Query:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-RDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA
        IG + ++EL   P+++VEFSPVN T  S  EKWH+  G TI KEEE+   +RQN +TCPEYF+WIHEDL+ WA  GITREMVERG+ KA FRLVIV GR 
Subjt:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-RDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA

Query:  YVEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ER
        YVEKY + Y+ RD FTLWGILQLLRWYP                                           DT D                       + 
Subjt:  YVEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ER

Query:  TKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK
         K    ++KWINRE YAYWKGN  +S+ RY+L KC+ S+Q+DW ARVYMQ W KE+KQGFKNSNLA QC  RYKIYIE IGWSVSLKYILA D +TLMVK
Subjt:  TKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK

Query:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP
          +YDFFTRSLVPMHHYWPIKDD+DMC SIKFAV+WGNAHK++AQAIGK ASK++EEQL+MEK+YDYMFHSLNEYSKLLTFKPTIPPNATE+  +DLACP
Subjt:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP

Query:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
         QGL  KFMMDTLVK+PSFSSPC LLPPFSP  LDYIRTRKE PI+Q+  WEKN
Subjt:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

KAG6576728.1 Protein O-glucosyltransferase 1, partial [Cucurbita argyrosperma subsp. sororia]5.5e-15664.32Show/hide
Query:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGD-TCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEK
        RD ELHIYP R +V+   VN T  SWS K     S I   EE DRDRQN D TCPEYFRWIHEDLR WA  GITREMVE G+ KA FRLVI+DGRAYVEK
Subjt:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGD-TCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEK

Query:  YFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGR
        +  AY+SRD FTLWGILQLLR YP                                           DT D                       E  K  
Subjt:  YFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGR

Query:  KPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYY
          + KW NREAYAYWKGN +VS+ RY+L +CNLS ++DW ARV+MQ W KE K+ FKNSNLA QCV+RYKIY+E +GWSVSLKYILA D VTLMV   YY
Subjt:  KPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYY

Query:  DFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGL
        DFFTRSLVPMHHYWPIKDD+DMCNSIKFAVDWGN H+QK +AIGK ASKF EE+L MEK+YDYMFHSLNEYSKLLTFKPTIPPNATELCLE+LACPAQ L
Subjt:  DFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGL

Query:  TTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF
         TKFM+DTLVK+PSFSSPCSLLPPFSPT LD IR RKE PIKQV+MWEK NMSF
Subjt:  TTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF

XP_022922548.1 O-glucosyltransferase rumi homolog [Cucurbita moschata]1.7e-15764.46Show/hide
Query:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        RD ELHIYP R +V+   VN T  SWSEK     S I   EE DRDRQN DTCPEYFRWIHEDLR W R GITREM+E G+ KA FRLVI+DGRAYVEK+
Subjt:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGRK
          AY+SRD FTLWGILQLLR YP                                           DT D                       E  K   
Subjt:  FKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGRK

Query:  PKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD
         + KW NREA+AYWKGN +VS+ RY+L  CNLS ++DW ARV+MQ W KE +Q FKNSNLA QCV+RYKIY+E +GWSVSLKYILA D VTLMV   YYD
Subjt:  PKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD

Query:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT
        FFTRSLVPMHHYWPIKDD+DMCNSIKFAVDWGN H+QK +AIGK ASKF EEQL MEK+YDYMFHSLNEYSKLLTFKPTIPPNATELCLE+LACPAQ L 
Subjt:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT

Query:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF
        TKFM+DTLVK+PSFSSPCSLLPPFSPT LD IR RKE PIKQV+MWEK NMSF
Subjt:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF

XP_031737709.1 O-glucosyltransferase rumi homolog [Cucumis sativus]1.4e-15160.71Show/hide
Query:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHAS-GSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAY
        +G +R+ EL  YPQ++VEFSP+N T  S SEKW +  G T  +EEEED D +N +TCPEYFRWIHEDL+ WA  GITREMVERG+  A FRLVIV GRAY
Subjt:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHAS-GSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAY

Query:  VEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERT
        VEKY + ++ RD FTLWGILQLLRWYP                                           DT D                       +  
Subjt:  VEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERT

Query:  KGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKS
        K    ++KWI+RE YA+WKGNT +S+ RY+L KC+ S+Q     RVYMQ W++E KQGFKNSNLA QC  RYK+YIE IGWSVSLKYILA D +TLMVK 
Subjt:  KGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKS

Query:  QYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPA
         +YDFFTRSLVPMHHYWPIKDD+DMC SIKFAV+WG  HKQKAQAIGK ASKF+EEQL+M+K+YDYMFH+LNEYSKLLTFKPTIPPNATE+ L DLACP 
Subjt:  QYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPA

Query:  QGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
        +GL  K MMDTL+K+PSFSSPC LLPPFSP ALDYIRTRK+ PIKQ++MWEKN
Subjt:  QGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

XP_038886324.1 protein O-glucosyltransferase 1-like [Benincasa hispida]1.3e-17368.94Show/hide
Query:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        +RD+ELHIYP+ +V+FSPVN T  S SEKWH S  T  K EEEDRD QNGDTCPEYFRWIHEDLR WA+ GITREMVERG+  ADFRLVIVDGR YVEKY
Subjt:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGRK
         +A++SRD+FTLWGILQLLRWYP                                           DT D                       +  K   
Subjt:  FKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGRK

Query:  PKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD
         ++KWI+REAYAYWKGN  VS SRYRLRKCNLS+QYDW  RVYMQ W KE+KQGFKNSNLA QCVYRYKIYIE I WS SLKYILA D VTLMV   YYD
Subjt:  PKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD

Query:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT
        FF+RSLVPMHHYWPIKDDN+MCNSIKFAVDWGNAHKQKAQAIGK ASKFIEEQL+MEK+Y+YMFHSLNEYSKLLTFKPTIPPNATEL LEDLACP QGLT
Subjt:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT

Query:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSFG
        TKFMMDTL+K+PSFSSPC LLPPFSPTAL YI+TRKE  IKQ+EMWEK NMSFG
Subjt:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSFG

TrEMBL top hitse value%identityAlignment
A0A2I4FG82 O-glucosyltransferase rumi homolog2.9e-11848.88Show/hide
Query:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        K+++++     +++E  PV+    S ++   ++   I  + +E RD  +  TCP+YFRWIHEDLR WA  GITREM+ER ++ + FRL+IV+G+ YVEKY
Subjt:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYP---------------------------------------AMDTTD------------------------ERTKGRKP
         +A+++RD FTLWGILQ+LR YP                                       A DT D                        +  +G K 
Subjt:  FKAYESRDTFTLWGILQLLRWYP---------------------------------------AMDTTD------------------------ERTKGRKP

Query:  KEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDF
        K++W++RE YAYWKGN  VSLSR++L KCN+S   DWNAR+Y+Q W++E ++G+K S+LA QC +RYKIY+E I WSVS KYILA D V+L+VK  Y+DF
Subjt:  KEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDF

Query:  FTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTT
        FTR+L+P+HHYWP ++D D C SI FAVDWGN HKQKAQ IGK A+KF++E+L ME +YDYMFH LNEY+KLLTFKP  P NA ELC E +ACPAQGL  
Subjt:  FTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTT

Query:  KFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
        KF M+++VK P++SSPC++ PP+  ++L     RKE+ IKQVE+WEKN
Subjt:  KFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

A0A2I4FGK6 O-glucosyltransferase rumi homolog1.7e-11849.56Show/hide
Query:  KRDMELHIYPQRKVEF---SPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYV
        ++++++ I P +++E     P N  T + S  +  + +T   + EE+ +  +  TCPEYFRWIHEDLR WA  GITR+MVER +  A+FRLVI++GR YV
Subjt:  KRDMELHIYPQRKVEF---SPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYV

Query:  EKYFKAYESRDTFTLWGILQLLRWYP-----------AMDTTDERTK--------GRKP-----------------------------------------
        +KY +A+++RD FTLWGILQLLR YP            +D    RTK        G  P                                         
Subjt:  EKYFKAYESRDTFTLWGILQLLRWYP-----------AMDTTDERTK--------GRKP-----------------------------------------

Query:  --KEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYY
          +++W +RE YAYWKGN  V+++R  L KCN+S+  DWNARVY Q W +E +QG+K S+L+ QC++RYKIYIE   WSVS KYILA D VTL+VK  YY
Subjt:  --KEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYY

Query:  DFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGL
        DFFTR L+P+HHYWP++DD D C SIKFAVDWGN+H+QKAQ IGK AS+FI+E L ME +YDYMFH LNEY+KLLTFKP  P NA ELC E +ACPAQGL
Subjt:  DFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGL

Query:  TTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
          KF M+++VK P++SSPC++ PP+ P +L     RKE  +K+VE+WEKN
Subjt:  TTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

A0A5D3DGW6 O-glucosyltransferase rumi-like protein2.9e-15562.11Show/hide
Query:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-RDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA
        IG + ++EL   P+++VEFSPVN T  S  EKWH+  G TI KEEE+   +RQN +TCPEYF+WIHEDL+ WA  GITREMVERG+ KA FRLVIV GR 
Subjt:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-RDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA

Query:  YVEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ER
        YVEKY + Y+ RD FTLWGILQLLRWYP                                           DT D                       + 
Subjt:  YVEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ER

Query:  TKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK
         K    ++KWINRE YAYWKGN  +S+ RY+L KC+ S+Q+DW ARVYMQ W KE+KQGFKNSNLA QC  RYKIYIE IGWSVSLKYILA D +TLMVK
Subjt:  TKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK

Query:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP
          +YDFFTRSLVPMHHYWPIKDD+DMC SIKFAV+WGNAHK++AQAIGK ASK++EEQL+MEK+YDYMFHSLNEYSKLLTFKPTIPPNATE+  +DLACP
Subjt:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP

Query:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
         QGL  KFMMDTLVK+PSFSSPC LLPPFSP  LDYIRTRKE PI+Q+  WEKN
Subjt:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

A0A6J1E6Y7 O-glucosyltransferase rumi homolog8.3e-15864.46Show/hide
Query:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        RD ELHIYP R +V+   VN T  SWSEK     S I   EE DRDRQN DTCPEYFRWIHEDLR W R GITREM+E G+ KA FRLVI+DGRAYVEK+
Subjt:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGRK
          AY+SRD FTLWGILQLLR YP                                           DT D                       E  K   
Subjt:  FKAYESRDTFTLWGILQLLRWYPAM-----------------------------------------DTTD-----------------------ERTKGRK

Query:  PKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD
         + KW NREA+AYWKGN +VS+ RY+L  CNLS ++DW ARV+MQ W KE +Q FKNSNLA QCV+RYKIY+E +GWSVSLKYILA D VTLMV   YYD
Subjt:  PKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD

Query:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT
        FFTRSLVPMHHYWPIKDD+DMCNSIKFAVDWGN H+QK +AIGK ASKF EEQL MEK+YDYMFHSLNEYSKLLTFKPTIPPNATELCLE+LACPAQ L 
Subjt:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT

Query:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF
        TKFM+DTLVK+PSFSSPCSLLPPFSPT LD IR RKE PIKQV+MWEK NMSF
Subjt:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF

B9R9B3 KDEL motif-containing protein 1, putative1.1e-11754.21Show/hide
Query:  EDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------
        E+ DR +   CPEY+RWI+EDLR WAR GI+R+MVER ++ A+FRLVIV+G+AYVEKY +A+++RD FTLWGILQLLR YP                   
Subjt:  EDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPAM-----------------

Query:  ----------------------DTTD-----------------------ERTKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYM
                              DT D                          K    K +W+ RE YAYWKGN  V+ +R  L KCN+S Q DWNARVY 
Subjt:  ----------------------DTTD-----------------------ERTKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYM

Query:  QGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGK
        Q W KE++QG+K SNLA QC++RYKIYIE   WSVS KYILA D VTL+VK  YYDFFTRSL P+HHYWPIK D D C SIKFAVDWGN HKQKAQAIGK
Subjt:  QGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGK

Query:  TASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVE
         AS+FI+E+L M+ +YDYMFH LNEY+KLLTFKP IP  A ELC E +ACPA G+  +FMM+++V+ P+ ++PC +LPP+ P+AL  I  RKE  I+QVE
Subjt:  TASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVE

Query:  MWEK
        +WEK
Subjt:  MWEK

SwissProt top hitse value%identityAlignment
G3V9D0 Protein O-glucosyltransferase 13.4e-0727.03Show/hide
Query:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV
        W  + + AY++G +R S  R       R     + ++Y  N     Q W+  K+   K   K+ +L   C Y+Y      +  S   K++     +   V
Subjt:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV

Query:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT
          ++ +FF   L P  HY P+K D    + ++  + +  A+   AQ I K  S+FI   L M+ I  Y  + L EYSK L++  T
Subjt:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT

Q5E9Q1 Protein O-glucosyltransferase 15.8e-0726.49Show/hide
Query:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV
        W  + + AY++G +R S  R       R     + ++Y  N     Q W+  K+   K   K+ +L   C Y+Y      +  S   K++     +   V
Subjt:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV

Query:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT
          ++ +FF   L P  HY P+K D    ++++  + +  A+   AQ I +  S+FI   L M+ I  Y  + L EYSK L++  T
Subjt:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT

Q8BYB9 Protein O-glucosyltransferase 13.4e-0727.03Show/hide
Query:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV
        W  + + AY++G +R S  R       R     + ++Y  N     Q W+  K+   K   K+ +L   C YRY      +  S   K++     +   V
Subjt:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV

Query:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT
          ++ +FF   L P  HY P+K D    ++++  + +  A+   AQ I K  S+FI   L M+ I  Y  + L +YSK L++  T
Subjt:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT

Q8NBL1 Protein O-glucosyltransferase 12.0e-0726.49Show/hide
Query:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV
        W  + + AY++G +R S  R       R     + ++Y  N     Q W+  K+   K   K+ +L   C Y+Y      +  S   K++     +   V
Subjt:  WINREAYAYWKGNTRVSLSR------YRLRKCNLSSQYDWNARVYMQGWR--KEI--KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMV

Query:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT
          ++ +FF   L P  HY P+K D    ++++  + +  A+   AQ I +  S+FI   L M+ I  Y  + L+EYSK L++  T
Subjt:  KSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPT

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)1.8e-10448.89Show/hide
Query:  QNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPA-------MDTTDERTKGR-----
        ++  +CP+YF+WIHEDL+ W   GIT+EMVERG++ A FRLVI++G+ +VE Y K+ ++RD FTLWGILQLLR YP        M   D+R   R     
Subjt:  QNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPA-------MDTTDERTKGR-----

Query:  -----------------------------------------------------KPKEKWINREAYAYWKGNTRV-SLSRYRLRKCNLSSQYDWNARVYMQ
                                                             K K+K++ R+AYAYWKGN  V S SR  L  CNLSS +DWNAR+++Q
Subjt:  -----------------------------------------------------KPKEKWINREAYAYWKGNTRV-SLSRYRLRKCNLSSQYDWNARVYMQ

Query:  GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKT
         W  E ++GF+NSN+A+QC YRYKIYIE   WSVS KYILA D VTLMVK  YYDFF+R+L P+ HYWPI+ D D C SIKFAVDW N H QKAQ IG+ 
Subjt:  GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKT

Query:  ASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQ-----GLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPI
        AS+F++  LSME +YDYMFH LNEYSKLL +KP +P N+ ELC E L CP++     G+  KFM+ +LV +P  S PCSL PPF    L+    +K   I
Subjt:  ASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQ-----GLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPI

Query:  KQVEMWE
        +QVE WE
Subjt:  KQVEMWE

AT2G45830.1 downstream target of AGL15 21.1e-9645.43Show/hide
Query:  TCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPA-------MDTTDERTKGR---------
        TCP YFRWIHEDLR W   G+TR M+E+ +  A FR+VI+DGR YV+KY K+ ++RD FTLWGI+QLLRWYP        M   D+R   R         
Subjt:  TCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPA-------MDTTDERTKGR---------

Query:  ---------------------------------KPKEK-------------WINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQ
                                         KP +K             W +R AYAYW+GN  V+ +R  L +CN+S+Q DWN R+Y+Q W +E ++
Subjt:  ---------------------------------KPKEK-------------WINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQ

Query:  GFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQ
        GFKNSNL +QC +RYKIYIE   WSVS KYI+A D +TL V+  +YDF+ R ++P+ HYWPI+ D   C S+KFAV WGN H  +A  IG+  S+FI E+
Subjt:  GFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQ

Query:  LSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWE
        + ME +YDYMFH +NEY+KLL FKP IP  ATE+  + + C A G    FM +++V  PS  SPC +  PF+P  L  I  RK    +QVE WE
Subjt:  LSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWE

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)1.6e-10848.02Show/hide
Query:  EEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPAM----------------
        E + DR    TCP+YFRWIHEDLR W + GITRE +ER  + A FRL I++GR YVEK+ +A+++RD FT+WG +QLLR YP                  
Subjt:  EEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPAM----------------

Query:  -----------------------DTTD-----------------------ERTKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVY
                               +T D                       +  +    + KWI+RE YAYWKGN  V+ +R  L KCNLS  YDW AR+Y
Subjt:  -----------------------DTTD-----------------------ERTKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVY

Query:  MQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIG
         Q W KE K+G+K S+LA QC +RYKIYIE   WSVS KYILA D VTLMVK  YYDFFTR + P HHYWP+K+D D C SIKFAVDWGN H +KAQ IG
Subjt:  MQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIG

Query:  KTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQV
        K AS+F++++L M+ +YDYMFH L +YSKLL FKP IP N+TELC E +ACP  G   KFMM++LVK+P+ + PC++ PP+ P +   +  R+++   ++
Subjt:  KTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQV

Query:  EMWE
        E WE
Subjt:  EMWE

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)1.4e-9644.58Show/hide
Query:  KEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPA-------MDTTDER
        K      +     TCP YFRWIHEDLR W + GITR M+E     A FRLVI +G+AYV++Y K+ ++RD FTLWGILQLLRWYP        M   D+R
Subjt:  KEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPA-------MDTTDER

Query:  TKGR-----------------------------------------KP--------KE-----KWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNAR
           R                                         KP        KE     +W +R AYAYW+GN  V   R  L KCN +   +WN R
Subjt:  TKGR-----------------------------------------KP--------KE-----KWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNAR

Query:  VYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQA
        +Y+Q W KE K+GFKNSNL +QC +RYKIYIE   WSVS KYI+A D +TL VK ++YDF+ R ++P+ HYWPI+DD+  C S+KFAV WGN H+ KA+ 
Subjt:  VYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQA

Query:  IGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIK
        IG+  S+FI E+++M+ +YDYMFH L EY+ LL FKP IP +A E+  + + CPA      F  ++++  PS  SPC +LPP+ P AL  +  RK    +
Subjt:  IGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIK

Query:  QVEMWE
        QVE+WE
Subjt:  QVEMWE

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)2.5e-10647.32Show/hide
Query:  TIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPAM----------
        T    E++D +     TCP+YFRWIHEDLR W+R GITRE +ER +  A FRL IV G+ YVEK+  A+++RD FT+WG LQLLR YP            
Subjt:  TIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPAM----------

Query:  -----------------------------DTTD-----------------------ERTKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYD
                                     +T D                       +  +    + KWINRE YAYWKGN  V+ +R  L KCN+S +++
Subjt:  -----------------------------DTTD-----------------------ERTKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYD

Query:  WNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQ
        WNAR+Y Q W KE K+G+K S+LA QC +RYKIYIE   WSVS KYILA D VTL+VK  YYDFFTR L+P HHYWP++ ++D C SIKFAVDWGN+H Q
Subjt:  WNARVYMQGWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQ

Query:  KAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKE
        KAQ IGK AS FI++ L M+ +YDYM+H L EYSKLL FKP IP NA E+C E +AC   G   KFM ++LVKQP+ S PC++ PP+ P     +  RK+
Subjt:  KAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKE

Query:  APIKQVEMWE
        +   ++  WE
Subjt:  APIKQVEMWE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGTGAGGAGGGATGTGGAATTACATATTTATCCTCAAACGAAAGTCGAATTTTCATTTGTTAATTGTACAAACGAGAAGCGGCGTACGAGCGGTCCCATAATAGT
GAAGGAGGAAGAAGACCGAAACCATAAAAATGGAGACTCGTGTCCGGAGTACTTCCGTTGGATCCACGAGGATCTAATGTCGGATCACGAGGGAGATGGCAGAGAGAGGC
TGACCGGAAGGCGAATTTCTGGCCTAGTGATTTTTGACGGTAGAGCTTACGTGGAGAAGTACTTAGAAGTGTATCAAAGTAGGGATATTTTTACTCTGTGTGGGATTCTA
CAGTTGTTATGGTGGTACCCAGTAGGCCTGGGCCCAATACAATGGGCCTACCTCCTTTGTTCCGATACTGTGAGAATGCTGCTAATAGGCGTGAAAAGGGATATGGAATT
ACATATTTATCCTCAAAGGAAAGTCGAATTTTCACCCGTTAATTATACGACATGTTCATGGAGCGAGAAGTGGCATGCGAGTGGTTCCACAATAGCGAAGGAGGAGGAAG
AAGATCGAGACCGTCAGAATGGCGACACGTGTCCAGAGTACTTCCGTTGGATCCACGAAGATCTAAGGTCGTGGGCTCGGATAGGGATCACGAGAGAGATGGTGGAGAGA
GGCCAATCGAAGGCGGATTTTCGGCTGGTGATTGTTGACGGTAGGGCTTACGTGGAGAAGTACTTTAAAGCATATGAAAGTAGGGATACTTTTACGCTGTGGGGGATCCT
ACAATTGTTGCGGTGGTACCCAGCCATGGATACCACTGATGAAAGAACTAAAGGAAGGAAACCAAAGGAAAAATGGATAAATAGAGAAGCTTATGCTTATTGGAAAGGGA
ATACTCGTGTTTCTTTGTCAAGATATAGACTTCGCAAATGCAATCTCTCTAGCCAATACGATTGGAATGCTCGTGTGTACATGCAGGGTTGGCGTAAAGAAATTAAACAA
GGATTCAAAAATTCCAATCTAGCTCATCAATGTGTTTATAGGTATAAAATATATATTGAGTGGATTGGTTGGTCAGTAAGTCTCAAATATATCCTTGCTCGTGATTTAGT
GACATTAATGGTGAAATCCCAATATTACGATTTTTTCACAAGAAGTTTAGTGCCAATGCATCATTATTGGCCAATCAAAGATGATAATGACATGTGCAACTCTATCAAAT
TTGCTGTTGATTGGGGTAATGCCCACAAACAAAAGGCACAAGCAATTGGGAAGACAGCAAGTAAGTTTATTGAAGAACAACTAAGTATGGAGAAGATTTATGACTACATG
TTCCACAGTCTAAATGAATACTCCAAACTCTTAACCTTCAAACCAACCATCCCACCAAATGCTACTGAACTCTGCTTGGAGGATTTGGCTTGCCCTGCTCAAGGCTTAAC
CACCAAGTTCATGATGGATACCCTCGTAAAACAACCGTCCTTCTCGAGCCCTTGTTCCTTGCTTCCGCCTTTTAGCCCGACCGCTCTCGACTATATTCGAACCAGAAAAG
AGGCTCCAATTAAACAAGTCGAAATGTGGGAGAAAAATAATATGTCCTTTGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGGTGAGGAGGGATGTGGAATTACATATTTATCCTCAAACGAAAGTCGAATTTTCATTTGTTAATTGTACAAACGAGAAGCGGCGTACGAGCGGTCCCATAATAGT
GAAGGAGGAAGAAGACCGAAACCATAAAAATGGAGACTCGTGTCCGGAGTACTTCCGTTGGATCCACGAGGATCTAATGTCGGATCACGAGGGAGATGGCAGAGAGAGGC
TGACCGGAAGGCGAATTTCTGGCCTAGTGATTTTTGACGGTAGAGCTTACGTGGAGAAGTACTTAGAAGTGTATCAAAGTAGGGATATTTTTACTCTGTGTGGGATTCTA
CAGTTGTTATGGTGGTACCCAGTAGGCCTGGGCCCAATACAATGGGCCTACCTCCTTTGTTCCGATACTGTGAGAATGCTGCTAATAGGCGTGAAAAGGGATATGGAATT
ACATATTTATCCTCAAAGGAAAGTCGAATTTTCACCCGTTAATTATACGACATGTTCATGGAGCGAGAAGTGGCATGCGAGTGGTTCCACAATAGCGAAGGAGGAGGAAG
AAGATCGAGACCGTCAGAATGGCGACACGTGTCCAGAGTACTTCCGTTGGATCCACGAAGATCTAAGGTCGTGGGCTCGGATAGGGATCACGAGAGAGATGGTGGAGAGA
GGCCAATCGAAGGCGGATTTTCGGCTGGTGATTGTTGACGGTAGGGCTTACGTGGAGAAGTACTTTAAAGCATATGAAAGTAGGGATACTTTTACGCTGTGGGGGATCCT
ACAATTGTTGCGGTGGTACCCAGCCATGGATACCACTGATGAAAGAACTAAAGGAAGGAAACCAAAGGAAAAATGGATAAATAGAGAAGCTTATGCTTATTGGAAAGGGA
ATACTCGTGTTTCTTTGTCAAGATATAGACTTCGCAAATGCAATCTCTCTAGCCAATACGATTGGAATGCTCGTGTGTACATGCAGGGTTGGCGTAAAGAAATTAAACAA
GGATTCAAAAATTCCAATCTAGCTCATCAATGTGTTTATAGGTATAAAATATATATTGAGTGGATTGGTTGGTCAGTAAGTCTCAAATATATCCTTGCTCGTGATTTAGT
GACATTAATGGTGAAATCCCAATATTACGATTTTTTCACAAGAAGTTTAGTGCCAATGCATCATTATTGGCCAATCAAAGATGATAATGACATGTGCAACTCTATCAAAT
TTGCTGTTGATTGGGGTAATGCCCACAAACAAAAGGCACAAGCAATTGGGAAGACAGCAAGTAAGTTTATTGAAGAACAACTAAGTATGGAGAAGATTTATGACTACATG
TTCCACAGTCTAAATGAATACTCCAAACTCTTAACCTTCAAACCAACCATCCCACCAAATGCTACTGAACTCTGCTTGGAGGATTTGGCTTGCCCTGCTCAAGGCTTAAC
CACCAAGTTCATGATGGATACCCTCGTAAAACAACCGTCCTTCTCGAGCCCTTGTTCCTTGCTTCCGCCTTTTAGCCCGACCGCTCTCGACTATATTCGAACCAGAAAAG
AGGCTCCAATTAAACAAGTCGAAATGTGGGAGAAAAATAATATGTCCTTTGGGTGA
Protein sequenceShow/hide protein sequence
MGVRRDVELHIYPQTKVEFSFVNCTNEKRRTSGPIIVKEEEDRNHKNGDSCPEYFRWIHEDLMSDHEGDGRERLTGRRISGLVIFDGRAYVEKYLEVYQSRDIFTLCGIL
QLLWWYPVGLGPIQWAYLLCSDTVRMLLIGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDRDRQNGDTCPEYFRWIHEDLRSWARIGITREMVER
GQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPAMDTTDERTKGRKPKEKWINREAYAYWKGNTRVSLSRYRLRKCNLSSQYDWNARVYMQGWRKEIKQ
GFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYM
FHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSFG