CuGenDBv2

Lsi05G012330 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G012330
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: O-glucosyltransferase rumi homolog
Genome location: chr05:20194690..20198454
RNA-Seq Expression: Lsi05G012330
Synteny: Lsi05G012330
Gene Ontology terms: NA
InterPro domains: IPR006598 - Glycosyl transferase CAP10 domain
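
As an illustration of how the genome location field can be read, the following minimal sketch parses a chrom:start..end string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state. The function name parse_location is hypothetical and is not part of any CuGenDB interface.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return chrom, start, end, end - start + 1

print(parse_location("chr05:20194690..20198454"))
```

Under that assumption the gene spans 3765 bp on chr05, which is longer than the mRNA sequence listed further down and would be consistent with the presence of introns.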


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033638.1 O-glucosyltransferase rumi-like protein [Cucumis melo var. makuwa] (e value: 1.2e-158; % identity: 63.22)
Query:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-QDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA
        IG + ++EL   P+++VEFSPVN T  S  EKWH+  G TI KEEE+   +RQN +TCPEYF+WIHEDL+ WA  GITREMVERG+ KA FRLVIV GR 
Subjt:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-QDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA

Query:  YVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI------------------------
        YVEKY + Y+ RD FTLWGILQLLRWYP KIPDLDLMF C DQPNIFI NYSGPG N+ A PPLFRYC +DDTLDI                        
Subjt:  YVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI------------------------

Query:  --------------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK
                                                           W KE+KQGFKNSNLA QC  RYKIYIE IGWSVSLKYILA D +TLMVK
Subjt:  --------------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK

Query:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP
          +YDFFTRSLVPMHHYWPIKDD+DMC SIKFAV+WGNAHK++AQAIGK ASK++EEQL+MEK+YDYMFHSLNEYSKLLTFKPTIPPNATE+  +DLACP
Subjt:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP

Query:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
         QGL  KFMMDTLVK+PSFSSPC LLPPFSP  LDYIRTRKE PI+Q+  WEKN
Subjt:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

KAG6576728.1 Protein O-glucosyltransferase 1, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.0e-164; % identity: 63.36)
Query:  VRKPMAKLYFAYFFFYVSLIVAIFFIISSHLFHYVLIGVKRDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGD-TCPEYFRW
        VRKP+A+L F  F   VSL +A   IISS L  +V     RD ELHIYP R +V+   VN T  SWS K     S I   EE D+DRQN D TCPEYFRW
Subjt:  VRKPMAKLYFAYFFFYVSLIVAIFFIISSHLFHYVLIGVKRDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGD-TCPEYFRW

Query:  IHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPL
        IHEDLR WA  GITREMVE G+ KA FRLVI+DGRAYVEK+  AY+SRD FTLWGILQLLR YPGKIPDLDLMF+C D+PNIFI +YSGPG N+TA PPL
Subjt:  IHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPL

Query:  FRYCENDDTLDI--------------------------------------------------------------------------GWRKEIKQGFKNSN
        FRYC +DDTLDI                                                                           W KE K+ FKNSN
Subjt:  FRYCENDDTLDI--------------------------------------------------------------------------GWRKEIKQGFKNSN

Query:  LAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKI
        LA QCV+RYKIY+E +GWSVSLKYILA D VTLMV   YYDFFTRSLVPMHHYWPIKDD+DMCNSIKFAVDWGN H+QK +AIGK ASKF EE+L MEK+
Subjt:  LAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKI

Query:  YDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF
        YDYMFHSLNEYSKLLTFKPTIPPNATELCLE+LACPAQ L TKFM+DTLVK+PSFSSPCSLLPPFSPT LD IR RKE PIKQV+MWEK NMSF
Subjt:  YDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF

XP_022922548.1 O-glucosyltransferase rumi homolog [Cucurbita moschata] (e value: 1.6e-161; % identity: 65.12)
Query:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        RD ELHIYP R +V+   VN T  SWSEK     S I   EE D+DRQN DTCPEYFRWIHEDLR W R GITREM+E G+ KA FRLVI+DGRAYVEK+
Subjt:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------
          AY+SRD FTLWGILQLLR YPGKIPDLDLMF+C D+PNIFI +YSGPG N+TA PP+FRYC +DDTLDI                             
Subjt:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------

Query:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD
                                                      W KE +Q FKNSNLA QCV+RYKIY+E +GWSVSLKYILA D VTLMV   YYD
Subjt:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD

Query:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT
        FFTRSLVPMHHYWPIKDD+DMCNSIKFAVDWGN H+QK +AIGK ASKF EEQL MEK+YDYMFHSLNEYSKLLTFKPTIPPNATELCLE+LACPAQ L 
Subjt:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT

Query:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF
        TKFM+DTLVK+PSFSSPCSLLPPFSPT LD IR RKE PIKQV+MWEK NMSF
Subjt:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF

XP_031737709.1 O-glucosyltransferase rumi homolog [Cucumis sativus] (e value: 1.4e-165; % identity: 61.52)
Query:  RKPMAKLYFAYFFFYVSLIVAIFFIISSHLFHYVLIGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHAS-GSTIAKEEEEDQDRQNGDTCPEYFRWIH
        R P+ K YF YFFFYV L V  +FIISS +     +G +R+ EL  YPQ++VEFSP+N T  S SEKW +  G T  +EEEED D +N +TCPEYFRWIH
Subjt:  RKPMAKLYFAYFFFYVSLIVAIFFIISSHLFHYVLIGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHAS-GSTIAKEEEEDQDRQNGDTCPEYFRWIH

Query:  EDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFR
        EDL+ WA  GITREMVERG+  A FRLVIV GRAYVEKY + ++ RD FTLWGILQLLRWYP +IPDLDLMF C DQP +FI NYSGPG N+TA PPLFR
Subjt:  EDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFR

Query:  YCENDDTLDI------------------------------------------------------------------------GWRKEIKQGFKNSNLAHQ
        YC +DDT DI                                                                         W++E KQGFKNSNLA Q
Subjt:  YCENDDTLDI------------------------------------------------------------------------GWRKEIKQGFKNSNLAHQ

Query:  CVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYM
        C  RYK+YIE IGWSVSLKYILA D +TLMVK  +YDFFTRSLVPMHHYWPIKDD+DMC SIKFAV+WG  HKQKAQAIGK ASKF+EEQL+M+K+YDYM
Subjt:  CVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYM

Query:  FHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
        FH+LNEYSKLLTFKPTIPPNATE+ L DLACP +GL  K MMDTL+K+PSFSSPC LLPPFSP ALDYIRTRK+ PIKQ++MWEKN
Subjt:  FHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

XP_038886324.1 protein O-glucosyltransferase 1-like [Benincasa hispida] (e value: 2.7e-177; % identity: 69.6)
Query:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        +RD+ELHIYP+ +V+FSPVN T  S SEKWH S  T  K EEED+D QNGDTCPEYFRWIHEDLR WA+ GITREMVERG+  ADFRLVIVDGR YVEKY
Subjt:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------
         +A++SRD+FTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFI NYSGP  NTTA PPLFRYC NDDTLDI                             
Subjt:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------

Query:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD
                                                      W KE+KQGFKNSNLA QCVYRYKIYIE I WS SLKYILA D VTLMV   YYD
Subjt:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD

Query:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT
        FF+RSLVPMHHYWPIKDDN+MCNSIKFAVDWGNAHKQKAQAIGK ASKFIEEQL+MEK+Y+YMFHSLNEYSKLLTFKPTIPPNATEL LEDLACP QGLT
Subjt:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT

Query:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSFG
        TKFMMDTL+K+PSFSSPC LLPPFSPTAL YI+TRKE  IKQ+EMWEK NMSFG
Subjt:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSFG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5W3 CAP10 domain-containing protein (e value: 5.7e-117; % identity: 52.33)
Query:  EEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPN
        +E+     +   CP+YFRWIHEDLR WAR GITR  +E GQ  A+FRL+I++G+AYVE Y K++++RDTFT+WGILQLLR YPGK+PDLDLMF C D P 
Subjt:  EEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPN

Query:  IFISNYSGPGRNTTASPPLFRYCENDDTLDI---------------------------------------------------------------------
        I  S++SGP  N    PPLFRYC +D T DI                                                                     
Subjt:  IFISNYSGPGRNTTASPPLFRYCENDDTLDI---------------------------------------------------------------------

Query:  -----GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQ
              W KE ++G+K S+L++QC++RYKIYIE   WSVS KYILA D VTL+VK  YYDFFTR L+P+HHYWP+KDD D C SIKFAVDWGN+HKQKAQ
Subjt:  -----GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQ

Query:  AIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPI
        AIGK AS FI+E+L M+ +YDYMFH L+EYSKLLTFKPT+PPNA ELC E +ACPA+GLT KFM ++LVK+P+ S+PC++ PP+ P +L ++ +RKE  I
Subjt:  AIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPI

Query:  KQVEMWE
        KQVE WE
Subjt:  KQVEMWE

A0A2I4FG82 O-glucosyltransferase rumi homolog (e value: 1.9e-117; % identity: 48.33)
Query:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        K+++++     +++E  PV+    S ++   ++   I  + +E +D  +  TCP+YFRWIHEDLR WA  GITREM+ER ++ + FRL+IV+G+ YVEKY
Subjt:  KRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------
         +A+++RD FTLWGILQ+LR YPGK+P+L+LMF CGDQP I  SNY GP  N T  PPLFRYC  DDTLDI                             
Subjt:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------

Query:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD
                                                      W++E ++G+K S+LA QC +RYKIY+E I WSVS KYILA D V+L+VK  Y+D
Subjt:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD

Query:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT
        FFTR+L+P+HHYWP ++D D C SI FAVDWGN HKQKAQ IGK A+KF++E+L ME +YDYMFH LNEY+KLLTFKP  P NA ELC E +ACPAQGL 
Subjt:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT

Query:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
         KF M+++VK P++SSPC++ PP+  ++L     RKE+ IKQVE+WEKN
Subjt:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

A0A5D3DGW6 O-glucosyltransferase rumi-like protein (e value: 6.0e-159; % identity: 63.22)
Query:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-QDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA
        IG + ++EL   P+++VEFSPVN T  S  EKWH+  G TI KEEE+   +RQN +TCPEYF+WIHEDL+ WA  GITREMVERG+ KA FRLVIV GR 
Subjt:  IGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHA-SGSTIAKEEEED-QDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRA

Query:  YVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI------------------------
        YVEKY + Y+ RD FTLWGILQLLRWYP KIPDLDLMF C DQPNIFI NYSGPG N+ A PPLFRYC +DDTLDI                        
Subjt:  YVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI------------------------

Query:  --------------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK
                                                           W KE+KQGFKNSNLA QC  RYKIYIE IGWSVSLKYILA D +TLMVK
Subjt:  --------------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVK

Query:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP
          +YDFFTRSLVPMHHYWPIKDD+DMC SIKFAV+WGNAHK++AQAIGK ASK++EEQL+MEK+YDYMFHSLNEYSKLLTFKPTIPPNATE+  +DLACP
Subjt:  SQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACP

Query:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN
         QGL  KFMMDTLVK+PSFSSPC LLPPFSP  LDYIRTRKE PI+Q+  WEKN
Subjt:  AQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKN

A0A6J1E6Y7 O-glucosyltransferase rumi homolog (e value: 7.6e-162; % identity: 65.12)
Query:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY
        RD ELHIYP R +V+   VN T  SWSEK     S I   EE D+DRQN DTCPEYFRWIHEDLR W R GITREM+E G+ KA FRLVI+DGRAYVEK+
Subjt:  RDMELHIYPQR-KVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKY

Query:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------
          AY+SRD FTLWGILQLLR YPGKIPDLDLMF+C D+PNIFI +YSGPG N+TA PP+FRYC +DDTLDI                             
Subjt:  FKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------

Query:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD
                                                      W KE +Q FKNSNLA QCV+RYKIY+E +GWSVSLKYILA D VTLMV   YYD
Subjt:  ---------------------------------------------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYD

Query:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT
        FFTRSLVPMHHYWPIKDD+DMCNSIKFAVDWGN H+QK +AIGK ASKF EEQL MEK+YDYMFHSLNEYSKLLTFKPTIPPNATELCLE+LACPAQ L 
Subjt:  FFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLT

Query:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF
        TKFM+DTLVK+PSFSSPCSLLPPFSPT LD IR RKE PIKQV+MWEK NMSF
Subjt:  TKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSF

B9R9B3 KDEL motif-containing protein 1, putative (e value: 3.0e-118; % identity: 54.19)
Query:  EDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIF
        E+ DR +   CPEY+RWI+EDLR WAR GI+R+MVER ++ A+FRLVIV+G+AYVEKY +A+++RD FTLWGILQLLR YPGK+PDL+LMF C D P I 
Subjt:  EDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIF

Query:  ISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------------------------------------------------
         SNYSGP  N  A PPLFRYC +DDTLD+                                                                       
Subjt:  ISNYSGPGRNTTASPPLFRYCENDDTLDI-----------------------------------------------------------------------

Query:  ---GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAI
            W KE++QG+K SNLA QC++RYKIYIE   WSVS KYILA D VTL+VK  YYDFFTRSL P+HHYWPIK D D C SIKFAVDWGN HKQKAQAI
Subjt:  ---GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAI

Query:  GKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQ
        GK AS+FI+E+L M+ +YDYMFH LNEY+KLLTFKP IP  A ELC E +ACPA G+  +FMM+++V+ P+ ++PC +LPP+ P+AL  I  RKE  I+Q
Subjt:  GKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQ

Query:  VEMWEK
        VE+WEK
Subjt:  VEMWEK

SwissProt top hits (e value, % identity, alignment)
G3V9D0 Protein O-glucosyltransferase 1 (e value: 1.6e-07; % identity: 28.91)
Query:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE
        K   K+ +L   C Y+Y      +  S   K++     +   V  ++ +FF   L P  HY P+K D    + ++  + +  A+   AQ I K  S+FI 
Subjt:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE

Query:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT
          L M+ I  Y  + L EYSK L++  T
Subjt:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT

Q5E9Q1 Protein O-glucosyltransferase 1 (e value: 2.7e-07; % identity: 28.12)
Query:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE
        K   K+ +L   C Y+Y      +  S   K++     +   V  ++ +FF   L P  HY P+K D    ++++  + +  A+   AQ I +  S+FI 
Subjt:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE

Query:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT
          L M+ I  Y  + L EYSK L++  T
Subjt:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT

Q8BYB9 Protein O-glucosyltransferase 1 (e value: 1.6e-07; % identity: 28.91)
Query:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE
        K   K+ +L   C YRY      +  S   K++     +   V  ++ +FF   L P  HY P+K D    ++++  + +  A+   AQ I K  S+FI 
Subjt:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE

Query:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT
          L M+ I  Y  + L +YSK L++  T
Subjt:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT

Q8NBL1 Protein O-glucosyltransferase 1 (e value: 9.3e-08; % identity: 28.12)
Query:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE
        K   K+ +L   C Y+Y      +  S   K++     +   V  ++ +FF   L P  HY P+K D    ++++  + +  A+   AQ I +  S+FI 
Subjt:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE

Query:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT
          L M+ I  Y  + L+EYSK L++  T
Subjt:  EQLSMEKIYDYMFHSLNEYSKLLTFKPT

Arabidopsis top hits (e value, % identity, alignment)
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 8.7e-102; % identity: 46.31)
Query:  CSWSEKWHASGS---TIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLR
        CS     + SGS   T+     ++Q   N  +CP+YF+WIHEDL+ W   GIT+EMVERG++ A FRLVI++G+ +VE Y K+ ++RD FTLWGILQLLR
Subjt:  CSWSEKWHASGS---TIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLR

Query:  WYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNT-TASPPLFRYCENDDTLDI--------GWRK------------------------------------
         YPGK+PD+DLMF C D+P I    Y+   R    A PPLFRYC +  T+DI        GW++                                    
Subjt:  WYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNT-TASPPLFRYCENDDTLDI--------GWRK------------------------------------

Query:  -------------------------------EIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDD
                                       E ++GF+NSN+A+QC YRYKIYIE   WSVS KYILA D VTLMVK  YYDFF+R+L P+ HYWPI+ D
Subjt:  -------------------------------EIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDD

Query:  NDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQ-----GLTTKFMMDTLVKQPS
         D C SIKFAVDW N H QKAQ IG+ AS+F++  LSME +YDYMFH LNEYSKLL +KP +P N+ ELC E L CP++     G+  KFM+ +LV +P 
Subjt:  NDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQ-----GLTTKFMMDTLVKQPS

Query:  FSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWE
         S PCSL PPF    L+    +K   I+QVE WE
Subjt:  FSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWE

AT2G45830.1 downstream target of AGL15 2 (e value: 8.7e-94; % identity: 43.43)
Query:  TCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGR
        TCP YFRWIHEDLR W   G+TR M+E+ +  A FR+VI+DGR YV+KY K+ ++RD FTLWGI+QLLRWYPG++PDL+LMF   D+P +   ++   G+
Subjt:  TCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGR

Query:  NTTASPPLFRYCENDDTLDI--------------------------------------------------------------------------GWRKEI
           A PPLFRYC +D +LDI                                                                           W +E 
Subjt:  NTTASPPLFRYCENDDTLDI--------------------------------------------------------------------------GWRKEI

Query:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE
        ++GFKNSNL +QC +RYKIYIE   WSVS KYI+A D +TL V+  +YDF+ R ++P+ HYWPI+ D   C S+KFAV WGN H  +A  IG+  S+FI 
Subjt:  KQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIE

Query:  EQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWE
        E++ ME +YDYMFH +NEY+KLL FKP IP  ATE+  + + C A G    FM +++V  PS  SPC +  PF+P  L  I  RK    +QVE WE
Subjt:  EQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWE

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 3.4e-106; % identity: 47.04)
Query:  EEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNI
        E + DR    TCP+YFRWIHEDLR W + GITRE +ER  + A FRL I++GR YVEK+ +A+++RD FT+WG +QLLR YPGKIPDL+LMF C D P +
Subjt:  EEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNI

Query:  FISNYSGPGRNTTASPPLFRYCENDDTLDI----------------------------------------------------------------------
          + ++G   +    PPLFRYC ND+TLDI                                                                      
Subjt:  FISNYSGPGRNTTASPPLFRYCENDDTLDI----------------------------------------------------------------------

Query:  ----GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQA
             W KE K+G+K S+LA QC +RYKIYIE   WSVS KYILA D VTLMVK  YYDFFTR + P HHYWP+K+D D C SIKFAVDWGN H +KAQ 
Subjt:  ----GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQA

Query:  IGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIK
        IGK AS+F++++L M+ +YDYMFH L +YSKLL FKP IP N+TELC E +ACP  G   KFMM++LVK+P+ + PC++ PP+ P +   +  R+++   
Subjt:  IGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIK

Query:  QVEMWE
        ++E WE
Subjt:  QVEMWE

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 6.0e-95; % identity: 42.54)
Query:  KEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQ
        K      +     TCP YFRWIHEDLR W + GITR M+E     A FRLVI +G+AYV++Y K+ ++RD FTLWGILQLLRWYPGK+PDL+LMF   D+
Subjt:  KEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQ

Query:  PNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-------------------------------------------------------------------
        P +   ++ G  +     PP+FRYC +D +LDI                                                                   
Subjt:  PNIFISNYSGPGRNTTASPPLFRYCENDDTLDI-------------------------------------------------------------------

Query:  -------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQK
                W KE K+GFKNSNL +QC +RYKIYIE   WSVS KYI+A D +TL VK ++YDF+ R ++P+ HYWPI+DD+  C S+KFAV WGN H+ K
Subjt:  -------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQK

Query:  AQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEA
        A+ IG+  S+FI E+++M+ +YDYMFH L EY+ LL FKP IP +A E+  + + CPA      F  ++++  PS  SPC +LPP+ P AL  +  RK  
Subjt:  AQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEA

Query:  PIKQVEMWE
          +QVE+WE
Subjt:  PIKQVEMWE

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821) (e value: 4.2e-104; % identity: 46.6)
Query:  TIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHC
        T    E++D +     TCP+YFRWIHEDLR W+R GITRE +ER +  A FRL IV G+ YVEK+  A+++RD FT+WG LQLLR YPGKIPDL+LMF C
Subjt:  TIAKEEEEDQDRQNGDTCPEYFRWIHEDLRSWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHC

Query:  GDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI----------------------------------------------------------------
         D P +  + ++  G N  + PPLFRYC N++TLDI                                                                
Subjt:  GDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDI----------------------------------------------------------------

Query:  ----------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAH
                   W KE K+G+K S+LA QC +RYKIYIE   WSVS KYILA D VTL+VK  YYDFFTR L+P HHYWP++ ++D C SIKFAVDWGN+H
Subjt:  ----------GWRKEIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAH

Query:  KQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTR
         QKAQ IGK AS FI++ L M+ +YDYM+H L EYSKLL FKP IP NA E+C E +AC   G   KFM ++LVKQP+ S PC++ PP+ P     +  R
Subjt:  KQKAQAIGKTASKFIEEQLSMEKIYDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTR

Query:  KEAPIKQVEMWE
        K++   ++  WE
Subjt:  KEAPIKQVEMWE


Sequences
CDS sequence
ATGAGTGGGTGGCATATGGTTAGAAAGCCAATGGCTAAACTCTATTTCGCCTATTTTTTCTTCTATGTTTCGCTCATTGTTGCGATCTTTTTCATAATCTCTTCACACCT
ATTCCATTATGTTCTAATAGGCGTGAAAAGGGATATGGAATTACATATTTATCCTCAAAGGAAAGTCGAATTTTCACCCGTTAATTATACGACATGTTCATGGAGCGAGA
AGTGGCATGCGAGTGGTTCCACAATAGCGAAGGAGGAGGAAGAAGATCAAGACCGTCAGAATGGCGACACGTGTCCAGAGTACTTCCGTTGGATCCACGAAGATCTAAGG
TCGTGGGCTCGGATAGGGATCACGAGAGAGATGGTGGAGAGAGGCCAATCGAAGGCGGATTTTCGGCTGGTGATTGTTGACGGTAGGGCTTACGTGGAGAAGTACTTTAA
AGCATATGAAAGTAGGGATACTTTTACGCTGTGGGGGATCCTACAATTGTTGCGGTGGTACCCAGGTAAAATTCCTGATTTGGACCTCATGTTCCATTGCGGTGACCAGC
CTAACATTTTTATTAGTAATTATAGTGGACCTGGGCGTAATACAACGGCCTCACCTCCTTTGTTCCGATACTGTGAAAATGATGACACGTTGGACATTGGTTGGCGTAAA
GAAATTAAACAAGGATTCAAAAATTCCAATCTAGCTCATCAATGTGTTTATAGGTATAAAATATATATTGAGTGGATTGGTTGGTCAGTAAGTCTCAAATATATCCTTGC
TCGTGATTTAGTGACATTAATGGTGAAATCCCAATATTACGATTTTTTCACAAGAAGTTTAGTGCCAATGCATCATTATTGGCCAATCAAAGATGATAATGACATGTGCA
ACTCTATCAAATTTGCTGTTGATTGGGGTAATGCCCACAAACAAAAGGCACAAGCAATTGGGAAGACAGCAAGTAAGTTTATTGAAGAACAACTAAGTATGGAGAAGATT
TATGACTACATGTTCCACAGTCTAAATGAATACTCCAAACTCTTAACCTTCAAACCAACCATCCCACCAAATGCTACTGAACTCTGCTTGGAGGATTTGGCTTGCCCTGC
TCAAGGCTTAACCACCAAGTTCATGATGGATACCCTCGTAAAACAACCGTCCTTCTCGAGCCCTTGTTCCTTGCTTCCGCCTTTTAGCCCGACCGCTCTCGACTATATTC
GAACCAGAAAAGAGGCTCCAATTAAACAAGTCGAAATGTGGGAGAAAAATAATATGTCCTTTGGGTGA
mRNA sequence
ATGAGTGGGTGGCATATGGTTAGAAAGCCAATGGCTAAACTCTATTTCGCCTATTTTTTCTTCTATGTTTCGCTCATTGTTGCGATCTTTTTCATAATCTCTTCACACCT
ATTCCATTATGTTCTAATAGGCGTGAAAAGGGATATGGAATTACATATTTATCCTCAAAGGAAAGTCGAATTTTCACCCGTTAATTATACGACATGTTCATGGAGCGAGA
AGTGGCATGCGAGTGGTTCCACAATAGCGAAGGAGGAGGAAGAAGATCAAGACCGTCAGAATGGCGACACGTGTCCAGAGTACTTCCGTTGGATCCACGAAGATCTAAGG
TCGTGGGCTCGGATAGGGATCACGAGAGAGATGGTGGAGAGAGGCCAATCGAAGGCGGATTTTCGGCTGGTGATTGTTGACGGTAGGGCTTACGTGGAGAAGTACTTTAA
AGCATATGAAAGTAGGGATACTTTTACGCTGTGGGGGATCCTACAATTGTTGCGGTGGTACCCAGGTAAAATTCCTGATTTGGACCTCATGTTCCATTGCGGTGACCAGC
CTAACATTTTTATTAGTAATTATAGTGGACCTGGGCGTAATACAACGGCCTCACCTCCTTTGTTCCGATACTGTGAAAATGATGACACGTTGGACATTGGTTGGCGTAAA
GAAATTAAACAAGGATTCAAAAATTCCAATCTAGCTCATCAATGTGTTTATAGGTATAAAATATATATTGAGTGGATTGGTTGGTCAGTAAGTCTCAAATATATCCTTGC
TCGTGATTTAGTGACATTAATGGTGAAATCCCAATATTACGATTTTTTCACAAGAAGTTTAGTGCCAATGCATCATTATTGGCCAATCAAAGATGATAATGACATGTGCA
ACTCTATCAAATTTGCTGTTGATTGGGGTAATGCCCACAAACAAAAGGCACAAGCAATTGGGAAGACAGCAAGTAAGTTTATTGAAGAACAACTAAGTATGGAGAAGATT
TATGACTACATGTTCCACAGTCTAAATGAATACTCCAAACTCTTAACCTTCAAACCAACCATCCCACCAAATGCTACTGAACTCTGCTTGGAGGATTTGGCTTGCCCTGC
TCAAGGCTTAACCACCAAGTTCATGATGGATACCCTCGTAAAACAACCGTCCTTCTCGAGCCCTTGTTCCTTGCTTCCGCCTTTTAGCCCGACCGCTCTCGACTATATTC
GAACCAGAAAAGAGGCTCCAATTAAACAAGTCGAAATGTGGGAGAAAAATAATATGTCCTTTGGGTGACCAATAAAAGACTACTACCAAGATTGATGCAACTTGTTCTGA
AATCATATACGTTAATCTTGCTTTCATAAATACTAC
Protein sequence
MSGWHMVRKPMAKLYFAYFFFYVSLIVAIFFIISSHLFHYVLIGVKRDMELHIYPQRKVEFSPVNYTTCSWSEKWHASGSTIAKEEEEDQDRQNGDTCPEYFRWIHEDLR
SWARIGITREMVERGQSKADFRLVIVDGRAYVEKYFKAYESRDTFTLWGILQLLRWYPGKIPDLDLMFHCGDQPNIFISNYSGPGRNTTASPPLFRYCENDDTLDIGWRK
EIKQGFKNSNLAHQCVYRYKIYIEWIGWSVSLKYILARDLVTLMVKSQYYDFFTRSLVPMHHYWPIKDDNDMCNSIKFAVDWGNAHKQKAQAIGKTASKFIEEQLSMEKI
YDYMFHSLNEYSKLLTFKPTIPPNATELCLEDLACPAQGLTTKFMMDTLVKQPSFSSPCSLLPPFSPTALDYIRTRKEAPIKQVEMWEKNNMSFG
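
The relationship between the CDS and protein sequences listed here can be checked with a short translation routine. The sketch below is a minimal illustration, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame. The names CODON_TABLE and translate are hypothetical and do not come from the database.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Raises KeyError on ambiguous bases such as N; this sketch does not handle them.
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 13 codons of the CDS listed on this page.
print(translate("ATGAGTGGGTGGCATATGGTTAGAAAGCCAATGGCTAAA"))
```

Pasting the full CDS into translate and comparing the result with the protein sequence is a quick sanity check that the two records are consistent.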