; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS023989 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS023989
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein O-glucosyltransferase 1-like
Genome locationscaffold44:1441922..1445665
RNA-Seq ExpressionMS023989
SyntenyMS023989
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141173.1 protein O-glucosyltransferase 1-like [Momordica charantia]8.1e-21078.32Show/hide
Query:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI
        MRGEDSRPKF+KQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI
Subjt:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI

Query:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY
        YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIY+KDYSGPQAPAPPPLFRY
Subjt:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY

Query:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------
        SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNK+MEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARL+RQ                 
Subjt:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------

Query:  -----------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYD
                                                                                     AMAIGEAASKFIQEELNMDYVYD
Subjt:  -----------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYD

Query:  YMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK
        YMFHLLNEYSKLLTFKP VP NATELS ESMASAVR+SVRKWMMKSFVKSPAVS PCAMKPPYDPQSMELWLTTK
Subjt:  YMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK

XP_031737817.1 protein O-glucosyltransferase 1 [Cucumis sativus]1.6e-14156.36Show/hide
Query:  EDSRPKFQKQ-FSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIY
        +  RPKF KQ FS EKLL F+ + PR  VI FFA  +++   LS RLL +  GLKS+V   +P+              + KQDPDGP  ATCPEYFRWI+
Subjt:  EDSRPKFQKQ-FSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIY

Query:  EDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYS
        EDL+PWAG  ITK MLE AQKKAHFR+V+V+GKAYVE Y KAYQSRDNLT+WGV+QLLRRYPGKLPDLDLMF+CDDRPEIYQKDYSG + P+PPPLFRYS
Subjt:  EDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYS

Query:  GDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ------------------
        GDDAT DI FPDWS+WGWPEI IK WE MLKDIKEGNKKM W+KR+PYAYWKGNP+V++ R DLLKCN+T+KQDW+ARL+RQ                  
Subjt:  GDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ------------------

Query:  --------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMF
                                                                                  AMAIG+AASK I+EEL M+Y+YDYMF
Subjt:  --------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMF

Query:  HLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK
        HLLN+YSKLLTFKP VPPNATEL SES+ASA + S+RK MM+S V SPA SGPCA++PPYDPQS++L + +K
Subjt:  HLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK

XP_038875850.1 protein O-glucosyltransferase 1-like isoform X1 [Benincasa hispida]5.0e-14357.57Show/hide
Query:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRW
        +RG+DSR KFQK  SG+KLL F   P R  VI + AV  +VG  LSGRLL +  GLKS++   QP               +TKQDPD P  ATCPEYFRW
Subjt:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRW

Query:  IYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR
        I+ DLRPWAG  ITK MLE AQKKAHFRLV+V+GKAY+E Y KAYQSRDN+TLWGV+QLLRRYPGKLPDLDLMFNCDDRPEIYQKDY+GP+ P+PPPLF 
Subjt:  IYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR

Query:  YSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ----------------
        YSGDDAT DI FPDWS+WGWPEI IKPWE +LKDIKEG KK EW+KREPYAYWKGNPSV++ R DLLKCN+T KQDWNARL+RQ                
Subjt:  YSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ----------------

Query:  ----------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDY
                                                                                    AM IG+AASKFI+EEL M+Y+YDY
Subjt:  ----------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDY

Query:  MFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMEL
        MFHLLN+YSKLLTFKP VPPNATELSSESM SA   S+RK M +S V SPA S PCA++PPYDPQS++L
Subjt:  MFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMEL

XP_038875857.1 protein O-glucosyltransferase 1-like isoform X2 [Benincasa hispida]2.6e-14762.5Show/hide
Query:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRW
        +RG+DSR KFQK  SG+KLL F   P R  VI + AV  +VG  LSGRLL +  GLKS++   QP               +TKQDPD P  ATCPEYFRW
Subjt:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRW

Query:  IYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR
        I+ DLRPWAG  ITK MLE AQKKAHFRLV+V+GKAY+E Y KAYQSRDN+TLWGV+QLLRRYPGKLPDLDLMFNCDDRPEIYQKDY+GP+ P+PPPLF 
Subjt:  IYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR

Query:  YSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ----------------
        YSGDDAT DI FPDWS+WGWPEI IKPWE +LKDIKEG KK EW+KREPYAYWKGNPSV++ R DLLKCN+T KQDWNARL+RQ                
Subjt:  YSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ----------------

Query:  ---------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKS
                                               AM IG+AASKFI+EEL M+Y+YDYMFHLLN+YSKLLTFKP VPPNATELSSESM SA   S
Subjt:  ---------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKS

Query:  VRKWMMKSFVKSPAVSGPCAMKPPYDPQSMEL
        +RK M +S V SPA S PCA++PPYDPQS++L
Subjt:  VRKWMMKSFVKSPAVSGPCAMKPPYDPQSMEL

XP_038875866.1 protein O-glucosyltransferase 1-like isoform X3 [Benincasa hispida]1.5e-14762.79Show/hide
Query:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRW
        +RG+DSR KFQK  SG+KLL F   P R  VI + AV  +VG  LSGRLL +  GLKS++   QP               +TKQDPD P  ATCPEYFRW
Subjt:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRW

Query:  IYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR
        I+ DLRPWAG  ITK MLE AQKKAHFRLV+V+GKAY+E Y KAYQSRDN+TLWGV+QLLRRYPGKLPDLDLMFNCDDRPEIYQKDY+GP+ P+PPPLF 
Subjt:  IYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR

Query:  YSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ----------------
        YSGDDAT DI FPDWS+WGWPEI IKPWE +LKDIKEG KK EW+KREPYAYWKGNPSV++ R DLLKCN+T KQDWNARL+RQ                
Subjt:  YSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ----------------

Query:  -------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVR
                                             AM IG+AASKFI+EEL M+Y+YDYMFHLLN+YSKLLTFKP VPPNATELSSESM SA   S+R
Subjt:  -------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVR

Query:  KWMMKSFVKSPAVSGPCAMKPPYDPQSMEL
        K M +S V SPA S PCA++PPYDPQS++L
Subjt:  KWMMKSFVKSPAVSGPCAMKPPYDPQSMEL

TrEMBL top hitse value%identityAlignment
A0A0A0L5W0 CAP10 domain-containing protein8.9e-13856.24Show/hide
Query:  KLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGM
        KLL F+ + PR  VI FFA  +++   LS RLL +  GLKS+V   +P+              + KQDPDGP  ATCPEYFRWI+EDL+PWAG  ITK M
Subjt:  KLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGM

Query:  LEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSY
        LE AQKKAHFR+V+V+GKAYVE Y KAYQSRDNLT+WGV+QLLRRYPGKLPDLDLMF+CDDRPEIYQKDYSG + P+PPPLFRYSGDDAT DI FPDWS+
Subjt:  LEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSY

Query:  WGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ---------------------------------
        WGWPEI IK WE MLKDIKEGNKKM W+KR+PYAYWKGNP+V++ R DLLKCN+T+KQDW+ARL+RQ                                 
Subjt:  WGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ---------------------------------

Query:  -----------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPM
                                                                   AMAIG+AASK I+EEL M+Y+YDYMFHLLN+YSKLLTFKP 
Subjt:  -----------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPM

Query:  VPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK
        VPPNATEL SES+ASA + S+RK MM+S V SPA SGPCA++PPYDPQS++L + +K
Subjt:  VPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK

A0A1S3AYX9 protein O-glucosyltransferase 1-like1.2e-13755.49Show/hide
Query:  EDSRPKFQKQ--FSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI
        +  R KF KQ  F   KLL F+ +PPR  VI FFA  +++G  LS RL  +   LKS+V   QP+              ++KQ PD P  ATCPEYFRWI
Subjt:  EDSRPKFQKQ--FSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI

Query:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY
        +EDL+PWAG  ITK MLE AQKKAHFRL++V+GKAYVE Y KAYQSRDNLT+WGV+QLLRRYPGK+PDLDLMFNCDDRPEIYQKDYSGP+ PAPPPLFRY
Subjt:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY

Query:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------
        SGDDAT DI FPDWS+WGWPEI IK WE +LKDIKEGNKKMEW+KR+PYAYWKGNP+V++ R DLLKCN+T+KQDW+ARL+RQ                 
Subjt:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------

Query:  ---------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYM
                                                                                   AMAIG+AASK I+EEL M+Y+YDYM
Subjt:  ---------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYM

Query:  FHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKS-VRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK
        FHLLN+YSKLLTFKP VPPNATELSS+S+ASA + S +RK MM+S V SPA S PCA++PPYDPQS++L    K
Subjt:  FHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKS-VRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK

A0A5A7SWV0 Protein O-glucosyltransferase 1-like2.8e-13956.29Show/hide
Query:  EDSRPKFQKQ--FSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI
        +  R KF KQ  F   KLL F+ +PPR  VI FFA  +++G  LSGRL  +   LKS+V   QP+              ++KQ PD P  ATCPEYFRWI
Subjt:  EDSRPKFQKQ--FSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL-ISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI

Query:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY
        +EDL+PWAG  ITK MLE AQKKAHFRL++V+GKAYVE Y KAYQSRDNLT+WGV+QLLRRYPGK+PDLDLMFNCDDRPEIYQKDYSGP+ PAPPPLFRY
Subjt:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY

Query:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------
        SGDDAT DI FPDWS+WGWPEI IK WE +LKDIKEGNKKMEW+KR+PYAYWKGNP+V++ R DLLKCN+T+KQDW+ARL+RQ                 
Subjt:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------

Query:  ----------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLN
                                                                              AMAIG+AASK I+EEL M+Y+YDYMFHLLN
Subjt:  ----------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLN

Query:  EYSKLLTFKPMVPPNATELSSESMASAVRKS-VRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK
        +YSKLLTFKP VPPNATELSS+S+ASA + S +RK MM+S V SPA S PCA++PPYDPQS++L    K
Subjt:  EYSKLLTFKPMVPPNATELSSESMASAVRKS-VRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK

A0A6J1CHL3 protein O-glucosyltransferase 1-like1.1e-11446.17Show/hide
Query:  FQKQFS-------GEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL--------------ISSGLKSDVHPPQP--------RRHVE-QLNGTTFNST
        FQ++FS          L P  KSP R  ++FFF++ L++G  LS RLL              I  G KS  +P           RR VE  L+  +FN+ 
Subjt:  FQKQFS-------GEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLL--------------ISSGLKSDVHPPQP--------RRHVE-QLNGTTFNST

Query:  KT---------------KQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPG
                         ++DPD    ATCPEYFRWI+EDLRPWA T IT+  +EAA++ A+FRLVIVKGKAYVE +EK++Q+RD+ T+WG+LQLLRRYPG
Subjt:  KT---------------KQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPG

Query:  KLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTD
        K+PDL+LMF+C D P I  + +SGP  P PPP+FRY  DDATLDI FPDWS+WGWPEI IKPWE++LKD+KEGNK++ W +REPYAYWKGNP+V+  R D
Subjt:  KLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTD

Query:  LLKCNLTRKQDWNAR-----------------------LHR-----------------------------------------------------------
        LLKCN++ +QDWNAR                       LHR                                                           
Subjt:  LLKCNLTRKQDWNAR-----------------------LHR-----------------------------------------------------------

Query:  ----------QAMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQ
                  +A AIG+AAS FIQEEL MDYVYDYMFHLL+EYSKLL FKPM+P  ATEL SE+MA       RK+MM+S VKSPA + PC M PPYDP 
Subjt:  ----------QAMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQ

Query:  SMELWLTTK
        S+   L+ K
Subjt:  SMELWLTTK

A0A6J1CJQ1 protein O-glucosyltransferase 1-like3.9e-21078.32Show/hide
Query:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI
        MRGEDSRPKF+KQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI
Subjt:  MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWI

Query:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY
        YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIY+KDYSGPQAPAPPPLFRY
Subjt:  YEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRY

Query:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------
        SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNK+MEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARL+RQ                 
Subjt:  SGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ-----------------

Query:  -----------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYD
                                                                                     AMAIGEAASKFIQEELNMDYVYD
Subjt:  -----------------------------------------------------------------------------AMAIGEAASKFIQEELNMDYVYD

Query:  YMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK
        YMFHLLNEYSKLLTFKP VP NATELS ESMASAVR+SVRKWMMKSFVKSPAVS PCAMKPPYDPQSMELWLTTK
Subjt:  YMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK

SwissProt top hitse value%identityAlignment
A0NDG6 O-glucosyltransferase rumi homolog2.6e-0925.77Show/hide
Query:  DLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNL---TLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR
        DL+P+    ITK M+  A++          G  Y  +  K Y+ R+ +      GV   +R     LPD+DL+ NC D P+I++       +    P+  
Subjt:  DLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNL---TLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFR

Query:  YSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLTRKQDWNA
        +S     LDI +P W++W G P I + P     W+   + I + +   +W  +EP A+++G+ +         +S  +  L+    T+ Q W +
Subjt:  YSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLTRKQDWNA

B0X1Q4 O-glucosyltransferase rumi homolog6.2e-1122.98Show/hide
Query:  ATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNL---TLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYS
        + C  +   +  DLRP+  + IT+ ++E A+           G  Y  +  + ++ RD +      GV   +R    KLPD++L+ NC D P+I  + ++
Subjt:  ATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNL---TLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYS

Query:  GPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLTR
          + P   P+  +S  +  LDI +P W +W G P I + P     W++    +++  K   W K+   A+++G+ +         +S  R +L+    T+
Subjt:  GPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLTR

Query:  KQDWNARLHRQAMAIGEAASKFIQEELNMDYVYDY
         Q W  R  +  +    A    +++     Y++++
Subjt:  KQDWNARLHRQAMAIGEAASKFIQEELNMDYVYDY

Q16QY8 O-glucosyltransferase rumi homolog2.3e-1025.24Show/hide
Query:  ATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNL---TLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYS
        A C  +   +  DLRP+ G  I++ M+E A+           G  Y  V  + Y+ +D +      GV   ++     LPD++L+ NC D P+I +    
Subjt:  ATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNL---TLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYS

Query:  GPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLTR
                P+  +S  D  LDI +P W +W G P I + P     W++    IK+     +W K++  A+++G+ +         +S ++ +L+    T+
Subjt:  GPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLTR

Query:  KQDWNA
         Q W +
Subjt:  KQDWNA

Q29AU6 O-glucosyltransferase rumi5.2e-1022.71Show/hide
Query:  ATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTL----WGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDY
        A C  +   I  DL P+  T +++ M+E++ +          G  Y ++YEK     +N        G+   L      LPD+DL+ N  D P+I     
Subjt:  ATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTL----WGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDY

Query:  SGPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLT
        +G Q     P+  +S      DI +P W++W G P  ++ P     W+ M + +++    + W ++    +++G+ +         +S +  +L++   T
Subjt:  SGPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS---------VSHKRTDLLKCNLT

Query:  RKQDWNA
        + Q W +
Subjt:  RKQDWNA

Q8T045 O-glucosyltransferase rumi1.5e-0921.83Show/hide
Query:  RRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPG
        RR +E+ N      +   QD D   HA        +  DL P+  T +T+ M+E++ +         K K Y     +           G+   L     
Subjt:  RRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPG

Query:  KLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS-
         LPD+DL+ N  D P++     +     A  P+F +S      DI +P W++W G P  ++ P     W++M + +++    + W ++    +++G+ + 
Subjt:  KLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYW-GWPEIRIKP-----WEEMLKDIKEGNKKMEWVKREPYAYWKGNPS-

Query:  --------VSHKRTDLLKCNLTRKQDWNA
                +S +  +L++   T+ Q W +
Subjt:  --------VSHKRTDLLKCNLTRKQDWNA

Arabidopsis top hitse value%identityAlignment
AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)3.1e-8241.82Show/hide
Query:  TCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDY---SG
        +CP+YF+WI+EDL+PW  T ITK M+E  +  AHFRLVI+ GK +VE Y+K+ Q+RD  TLWG+LQLLR+YPGKLPD+DLMF+CDDRP I    Y   + 
Subjt:  TCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDY---SG

Query:  PQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSV-SHKRTDLLKCNLTRKQDWNARLH------
            APPPLFRY GD  T+DI FPDWS+WGW EI I+ W ++LK+++EG KK ++++R+ YAYWKGNP V S  R DLL CNL+   DWNAR+       
Subjt:  PQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSV-SHKRTDLLKCNLTRKQDWNARLH------

Query:  --------------------------------------------------------------------------------------RQAMAIGEAASKFI
                                                                                              ++A  IG  AS+F+
Subjt:  --------------------------------------------------------------------------------------RQAMAIGEAASKFI

Query:  QEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESM-----ASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSME
        Q +L+M+ VYDYMFHLLNEYSKLL +KP VP N+ EL +E++        V    +K+M+ S V  P  SGPC++ PP+D   +E
Subjt:  QEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESM-----ASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSME

AT2G45830.1 downstream target of AGL15 21.5e-8941.79Show/hide
Query:  NGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDL
        NG++ N+ K +        +TCP YFRWI+EDLRPW  T +T+GMLE A++ AHFR+VI+ G+ YV+ Y K+ Q+RD  TLWG++QLLR YPG+LPDL+L
Subjt:  NGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDL

Query:  MFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLT
        MF+ DDRP +  KD+ G Q PAPPPLFRY  DDA+LDI FPDWS+WGW E+ IKPW++ L  I+EGNK  +W  R  YAYW+GNP+V+  R DLL+CN++
Subjt:  MFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLT

Query:  RKQDWNARL-----------------------HR------------------------------------------------------------------
         ++DWN RL                       HR                                                                  
Subjt:  RKQDWNARL-----------------------HR------------------------------------------------------------------

Query:  ---QAMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLT
           QA  IGE  S+FI+EE+ M+YVYDYMFHL+NEY+KLL FKP +P  ATE++ + M  +     R +M +S V  P+   PC M  P++P  ++  L 
Subjt:  ---QAMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLT

Query:  TK
         K
Subjt:  TK

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)8.1e-9945.41Show/hide
Query:  TTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMF
        T+F S+  + + D  P ATCP+YFRWI+EDLRPW  T IT+  LE A   A FRL I+ G+ YVE + +A+Q+RD  T+WG +QLLRRYPGK+PDL+LMF
Subjt:  TTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMF

Query:  NCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRK
        +C D P +   +++G   P PPPLFRY  +D TLDI FPDWSYWGW E+ IKPWE +LK+++EGN++ +W+ REPYAYWKGNP+V+  R DL+KCNL+  
Subjt:  NCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRK

Query:  QDWNARLHRQ------------------------------------------------------------------------------------------
         DW ARL++Q                                                                                          
Subjt:  QDWNARLHRQ------------------------------------------------------------------------------------------

Query:  --AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQS
          A  IG+ AS+F+Q+EL MDYVYDYMFHLL +YSKLL FKP +P N+TEL SE+MA     + RK+MM+S VK PA +GPCAM PPYDP S
Subjt:  --AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQS

AT3G61270.1 Arabidopsis thaliana protein of unknown function (DUF821)5.4e-8739.73Show/hide
Query:  VIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPR--RHVEQLNGTT--FNSTKTKQDP-DGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAH
        V+F  A  L + G L         L +    P P     V+  +  T    + K++ +P +    +TCP YFRWI+EDLRPW  T IT+GM+E A + AH
Subjt:  VIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPR--RHVEQLNGTT--FNSTKTKQDP-DGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAH

Query:  FRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIK
        FRLVI  GKAYV+ Y+K+ Q+RD  TLWG+LQLLR YPGKLPDL+LMF+ DDRP +   D+ G Q   PPP+FRY  DDA+LDI FPDWS+WGW E+ +K
Subjt:  FRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIK

Query:  PWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ------------------------------------------
        PW + L+ IKEGN   +W  R  YAYW+GNP V   R DLLKCN T  ++WN RL+ Q                                          
Subjt:  PWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ------------------------------------------

Query:  --------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELS
                                                          A  IGE  S+FI+EE+NM YVYDYMFHLL EY+ LL FKP +P +A E++
Subjt:  --------------------------------------------------AMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELS

Query:  SESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK
         +SM     +  R +  +S + SP+   PC M PPYDP +++  L  K
Subjt:  SESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)2.8e-9946.58Show/hide
Query:  DPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQ
        D + PP ATCP+YFRWI+EDLRPW+ T IT+  LE A+K A FRL IV GK YVE ++ A+Q+RD  T+WG LQLLR+YPGK+PDL+LMF+C D P +  
Subjt:  DPDGPPHATCPEYFRWIYEDLRPWAGTRITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQ

Query:  KDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ
         +++G  AP+PPPLFRY G++ TLDI FPDWS+WGW E+ IKPWE +LK+++EGN++ +W+ REPYAYWKGNP V+  R DL+KCN++ + +WNARL+ Q
Subjt:  KDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWPEIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQ

Query:  --------------------------------------------------------------------------------------------AMAIGEAA
                                                                                                    A  IG+AA
Subjt:  --------------------------------------------------------------------------------------------AMAIGEAA

Query:  SKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDP
        S FIQ++L MDYVYDYM+HLL EYSKLL FKP +P NA E+ SE+MA     + RK+M +S VK PA SGPCAM PPYDP
Subjt:  SKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNATELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGGGGAGGATTCTCGGCCCAAGTTTCAGAAGCAATTTTCCGGCGAGAAACTGCTGCCGTTCGCCAAGTCGCCGCCTCGATTTCCCGTTATCTTCTTCTTCGCCGT
CGCGCTCATCGTCGGCGGGCTTCTCTCCGGGCGACTCCTTATTTCCTCGGGACTGAAATCCGATGTTCACCCTCCACAACCACGACGACATGTCGAGCAACTCAACGGCA
CGACATTCAACTCGACGAAAACGAAACAAGACCCGGATGGCCCGCCGCATGCCACGTGTCCAGAGTATTTCCGTTGGATCTACGAGGACCTACGACCGTGGGCCGGGACG
AGGATAACGAAGGGGATGTTAGAAGCGGCCCAAAAGAAGGCCCATTTCAGGCTAGTGATCGTGAAGGGAAAGGCCTACGTGGAGGTGTACGAAAAGGCATACCAAAGCAG
AGACAATCTTACGCTGTGGGGGGTCCTACAGTTGTTACGGAGATACCCAGGGAAATTGCCCGATCTTGATCTGATGTTTAACTGTGATGACCGGCCAGAGATCTATCAAA
AAGATTACAGTGGGCCCCAGGCGCCGGCCCCACCTCCCTTGTTTCGGTACAGTGGAGATGATGCCACGTTGGACATTGCGTTTCCTGATTGGTCCTATTGGGGTTGGCCT
GAGATAAGAATAAAGCCATGGGAAGAAATGTTGAAGGATATAAAAGAAGGGAACAAGAAGATGGAATGGGTGAAGAGGGAACCATATGCATATTGGAAGGGAAATCCATC
GGTGTCTCACAAAAGGACAGACCTTCTAAAATGCAATCTCACTCGCAAACAAGATTGGAATGCTCGTTTACATAGGCAGGCGATGGCGATCGGAGAAGCAGCAAGCAAGT
TCATCCAAGAAGAGCTAAATATGGATTATGTATACGACTACATGTTTCATCTTCTCAACGAATATTCTAAGCTGTTGACGTTCAAGCCGATGGTCCCGCCGAATGCGACA
GAGCTCTCGTCGGAATCAATGGCTTCCGCTGTGAGAAAGTCGGTGAGAAAGTGGATGATGAAGTCGTTTGTGAAGAGCCCTGCCGTTTCCGGCCCCTGCGCCATGAAGCC
GCCGTACGATCCACAGTCTATGGAACTTTGGCTTACAACAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGAGGGGAGGATTCTCGGCCCAAGTTTCAGAAGCAATTTTCCGGCGAGAAACTGCTGCCGTTCGCCAAGTCGCCGCCTCGATTTCCCGTTATCTTCTTCTTCGCCGT
CGCGCTCATCGTCGGCGGGCTTCTCTCCGGGCGACTCCTTATTTCCTCGGGACTGAAATCCGATGTTCACCCTCCACAACCACGACGACATGTCGAGCAACTCAACGGCA
CGACATTCAACTCGACGAAAACGAAACAAGACCCGGATGGCCCGCCGCATGCCACGTGTCCAGAGTATTTCCGTTGGATCTACGAGGACCTACGACCGTGGGCCGGGACG
AGGATAACGAAGGGGATGTTAGAAGCGGCCCAAAAGAAGGCCCATTTCAGGCTAGTGATCGTGAAGGGAAAGGCCTACGTGGAGGTGTACGAAAAGGCATACCAAAGCAG
AGACAATCTTACGCTGTGGGGGGTCCTACAGTTGTTACGGAGATACCCAGGGAAATTGCCCGATCTTGATCTGATGTTTAACTGTGATGACCGGCCAGAGATCTATCAAA
AAGATTACAGTGGGCCCCAGGCGCCGGCCCCACCTCCCTTGTTTCGGTACAGTGGAGATGATGCCACGTTGGACATTGCGTTTCCTGATTGGTCCTATTGGGGTTGGCCT
GAGATAAGAATAAAGCCATGGGAAGAAATGTTGAAGGATATAAAAGAAGGGAACAAGAAGATGGAATGGGTGAAGAGGGAACCATATGCATATTGGAAGGGAAATCCATC
GGTGTCTCACAAAAGGACAGACCTTCTAAAATGCAATCTCACTCGCAAACAAGATTGGAATGCTCGTTTACATAGGCAGGCGATGGCGATCGGAGAAGCAGCAAGCAAGT
TCATCCAAGAAGAGCTAAATATGGATTATGTATACGACTACATGTTTCATCTTCTCAACGAATATTCTAAGCTGTTGACGTTCAAGCCGATGGTCCCGCCGAATGCGACA
GAGCTCTCGTCGGAATCAATGGCTTCCGCTGTGAGAAAGTCGGTGAGAAAGTGGATGATGAAGTCGTTTGTGAAGAGCCCTGCCGTTTCCGGCCCCTGCGCCATGAAGCC
GCCGTACGATCCACAGTCTATGGAACTTTGGCTTACAACAAAATAG
Protein sequenceShow/hide protein sequence
MRGEDSRPKFQKQFSGEKLLPFAKSPPRFPVIFFFAVALIVGGLLSGRLLISSGLKSDVHPPQPRRHVEQLNGTTFNSTKTKQDPDGPPHATCPEYFRWIYEDLRPWAGT
RITKGMLEAAQKKAHFRLVIVKGKAYVEVYEKAYQSRDNLTLWGVLQLLRRYPGKLPDLDLMFNCDDRPEIYQKDYSGPQAPAPPPLFRYSGDDATLDIAFPDWSYWGWP
EIRIKPWEEMLKDIKEGNKKMEWVKREPYAYWKGNPSVSHKRTDLLKCNLTRKQDWNARLHRQAMAIGEAASKFIQEELNMDYVYDYMFHLLNEYSKLLTFKPMVPPNAT
ELSSESMASAVRKSVRKWMMKSFVKSPAVSGPCAMKPPYDPQSMELWLTTK