; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027249 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027249
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionO-glucosyltransferase rumi homolog
Genome locationchr12:24212618..24216554
RNA-Seq ExpressionPI0027249
SyntenyPI0027249
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR006598 - Glycosyl transferase CAP10 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583363.1 O-glucosyltransferase rumi-like protein, partial [Cucurbita argyrosperma subsp. sororia]5.1e-19087.43Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        M  L+ES+KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD+MFDCMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVN+RSW EEF+DIKK SK+ +W +K PRAYWKGNPDV SP RTELL CNHS  WGAQIMRQDW QEARDG+EQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMCESIKHAVDWGN H  EAEAIG+QGQNFMESLSMDTVY+YMF LITEYSKLLDFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM
        SALEVC +SLLCIADEKQRQFLEKSA S S VPPCSLNRAGSD +YSWLQQ+E RKAM
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM

XP_004145318.2 O-glucosyltransferase rumi homolog [Cucumis sativus]6.7e-20696.02Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVD+MFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVNLRSWREEFEDIKKGSKNLSW NKFPRAYWKGNPDVDSPAR ELLKCNHSRMWGAQIMRQDWAQEA+DGYEQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF+NMCESIKHAVDWGNTHFPEAE IGRQGQ FME+LSMDTVYSYMFHLITEYSKL DFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQK
        SALEVCTDSLLCIADEKQ QFLEKSAASVSSVPPCSLNR GSDIIYSWLQQK
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQK

XP_008457372.1 PREDICTED: O-glucosyltransferase rumi homolog [Cucumis melo]2.2e-20998.02Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD+MFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSR WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAE IG+QGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKE
        SALEVC DSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQ E
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKE

XP_022964856.1 O-glucosyltransferase rumi homolog [Cucurbita moschata]5.1e-19087.43Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        M  L+ES+KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD+MFDCMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVN+RSW EEF+DIKK SK+ +W +K PRAYWKGNPDV SP RTELL CNHS  WGAQIMRQDW QEARDG+EQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMCESIKHAVDWGN H  EAEAIG+QGQNFMESLSMDTVY+YMF LITEYSKLLDFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM
        SALEVC +SLLCIADEKQRQFLEKSA S S VPPCSLNRAGSD +YSWLQQ+E RKAM
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM

XP_038893964.1 O-glucosyltransferase rumi homolog [Benincasa hispida]2.8e-20493.85Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        MT L+E+QKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD+MFDCMDKPSINRTENK MPLPLFRYCTTEAHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVNLRSWREEFEDIKKGSKNLSW +K+PRAYWKGNPDVDSPARTELL CNHSR WGAQIMRQDW QEA+ G+EQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF+NMCESIKHAVDWGNTH PEAE IGRQ QNFMESL+MDTVYSYMFHLITEYSKLLDF+PTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM
        SALEVC DSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM

TrEMBL top hitse value%identityAlignment
A0A0A0LY89 CAP10 domain-containing protein6.5e-20796.59Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQ+LRR+PGMVPDVD+MFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVNLRSWREEFEDIKKGSKNLSW NKFPRAYWKGNPDVDSPAR ELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF+NMCESIKHAVDWGNTHFPEAE IGRQGQ FMESLSMDTVYSYMFHLITEYSKL DFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQK
        SALEVCTDSLLCIADEKQ QFLEKSAASVSSVPPCSLNR GSDIIYSWLQQK
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQK

A0A1S3C5H2 O-glucosyltransferase rumi homolog1.1e-20998.02Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD+MFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSR WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAE IG+QGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKE
        SALEVC DSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQ E
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKE

A0A6J1DBK2 O-glucosyltransferase rumi1.0e-18886.52Show/hide
Query:  QLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGW
        QL E+QKFAAFRVVIVEG+LYVDMYYACVQSRA+FTIWGLVQLL RFPGMVPDVD+MFDCMDKPSINRTE+  MPLPLFRYCTTEAHFDIPFPDWSFWGW
Subjt:  QLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGW

Query:  PEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVSLK
        PEVNLRSW EEFEDIKKGSK  SW +K P AYWKGNPDVDSPARTELLKCN +R WGAQIMRQ+W +EAR G+EQSKLSNQCN+RYKIYAEGFAWSVSLK
Subjt:  PEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVSLK

Query:  YILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSA
        YILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPF+NMC+SIKHAVDWGN+H PE EA+G++GQ+FMESLSMDTVYSYMFHLI EYSKL DFKPTPP SA
Subjt:  YILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSA

Query:  LEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM
        LEVC +SLLCIADE QR FLEKSA S SSVPPCSL+RAGSDI+YSWLQQK+  KAM
Subjt:  LEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM

A0A6J1HK39 O-glucosyltransferase rumi homolog2.5e-19087.43Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        M  L+ES+KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD+MFDCMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVN+RSW EEF+DIKK SK+ +W +K PRAYWKGNPDV SP RTELL CNHS  WGAQIMRQDW QEARDG+EQSKLSNQCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMCESIKHAVDWGN H  EAEAIG+QGQNFMESLSMDTVY+YMF LITEYSKLLDFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM
        SALEVC +SLLCIADEKQRQFLEKSA S S VPPCSLNRAGSD +YSWLQQ+E RKAM
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM

A0A6J1HZ33 O-glucosyltransferase rumi homolog9.4e-19087.15Show/hide
Query:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW
        M  L+ES KFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVD+MFDCMD+P+INRTENK MPLPLFRYCTT+AHFDIPFPDWSFW
Subjt:  MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW

Query:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS
        GWPEVN+RSW EEF+DIKK SK+ +W +K PRAYWKGNPDV SP RTELL CNHS  WGAQIMRQDW QEARDG+EQSKLS QCNHRYKIYAEGFAWSVS
Subjt:  GWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVS

Query:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS
        LKYILSCGSMSLIISP Y+DFFSRGLDPLKNYWPIPF NMCESIKHAVDWGN H  EAEAIG+QGQNFMESLSMDTVY+YMF LITEYSKLLDFKPTPP 
Subjt:  LKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPS

Query:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM
        SALEVC++SLLCIADEKQRQFLEKSA S S VPPCSLNRAGSD +YSWLQQ+E RKAM
Subjt:  SALEVCTDSLLCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKERRKAM

SwissProt top hitse value%identityAlignment
A0NDG6 O-glucosyltransferase rumi homolog2.1e-2125.27Show/hide
Query:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEV-----NLRSWREEFEDIKKGSKNLSWLNKFPRAY
        G+   +R    ++PD+D++ +C D P I+R  +K   +P+  +  T  + DI +P W+FW G P +      L  W    + I K S +  W  K P+A+
Subjt:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEV-----NLRSWREEFEDIKKGSKNLSWLNKFPRAY

Query:  WKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGY-----EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPL
        ++G+   D      LL      +  AQ  +    +  +D        +  L   C +R+     G A S   K++  C S+   +  ++++FF   L P 
Subjt:  WKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGY-----EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPL

Query:  KNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQG-QNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEV
         +Y P+P  +  E ++  + +   H   A AI  +G ++    L M  V  Y   L+  Y KL+ +     S+ +EV
Subjt:  KNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQG-QNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEV

Q16QY8 O-glucosyltransferase rumi homolog1.2e-2124.91Show/hide
Query:  VPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEVN-----LRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPAR
        +PD++++ +C D P INR   K   LP+  +  T+ + DI +P W FW G P ++     L  W +    IKK + +  W  K  +A+++G+   D    
Subjt:  VPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEVN-----LRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPAR

Query:  TELLKCNHSRMWGAQIMRQDWAQEARDGY-----EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMC
          LL      +  AQ  +    +  +D       ++ +L + C ++Y     G A S   K++  C S+   +  ++++FF   L P  +Y P+      
Subjt:  TELLKCNHSRMWGAQIMRQDWAQEARDGY-----EQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMC

Query:  ESIKHAVDWGNTHFPEAEAIGRQG-QNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEV
        E ++  +++   H   A  I  +G ++  + L M  V  Y   L+  Y KL+ ++     S +EV
Subjt:  ESIKHAVDWGNTHFPEAEAIGRQG-QNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEV

Q5E9Q1 Protein O-glucosyltransferase 18.9e-2024.44Show/hide
Query:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWG-----WP--EVNLRSWREEFEDIKKGSKNLSWLNKFPRA
        G+   +    G +PD++++ +  D P + +    A  +P+F +  T  + DI +P W+FW      WP   + L  W    ED+ + +    W  K   A
Subjt:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWG-----WP--EVNLRSWREEFEDIKKGSKNLSWLNKFPRA

Query:  YWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARD--GYEQSK---LSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDP
        Y++G+          LL   + ++  A+  +    +  +D  G   +K   L + C ++Y     G A S   K++  CGS+   +  ++ +FF   L P
Subjt:  YWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARD--GYEQSK---LSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDP

Query:  LKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNF-MESLSMDTVYSYMFHLITEYSKLLDFKPT
          +Y  IP      +++  + +   +   A+ I  +G  F +  L MD +  Y  +L+TEYSK L +  T
Subjt:  LKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNF-MESLSMDTVYSYMFHLITEYSKLLDFKPT

Q8NBL1 Protein O-glucosyltransferase 11.5e-1924.07Show/hide
Query:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWG-----WP--EVNLRSWREEFEDIKKGSKNLSWLNKFPRA
        G+   +    G +PD++++ +  D P + +    A  +P+F +  T  + DI +P W+FW      WP     L  W    ED+ + +    W  K   A
Subjt:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWG-----WP--EVNLRSWREEFEDIKKGSKNLSWLNKFPRA

Query:  YWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARD--GYEQSK---LSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDP
        Y++G+          LL   + ++  A+  +    +  +D  G   +K   L + C ++Y     G A S   K++  CGS+   +  ++ +FF   L P
Subjt:  YWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARD--GYEQSK---LSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDP

Query:  LKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMES-LSMDTVYSYMFHLITEYSKLLDFKPT
          +Y  IP      +++  + +   +   A+ I  +G  F+ + L MD +  Y  +L++EYSK L +  T
Subjt:  LKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMES-LSMDTVYSYMFHLITEYSKLLDFKPT

Q8T045 O-glucosyltransferase rumi1.4e-2022.74Show/hide
Query:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEVNLR-----SWREEFEDIKKGSKNLSWLNKFPRAY
        G+   L      +PD+D++ +  D P +N     A   P+F +  T+ + DI +P W+FW G P   L       W +  E ++K +  + W  K    +
Subjt:  GLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFW-GWPEVNLR-----SWREEFEDIKKGSKNLSWLNKFPRAY

Query:  WKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARD-----GYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPL
        ++G+   D      LL   +  +  AQ  +    +  +D       ++    + C ++Y     G A S  LK++  C S+   +  ++++FF   L P 
Subjt:  WKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARD-----GYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPL

Query:  KNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEV
         +Y P+      +  +H + +   +   A+ I ++G +F+ E L M  +  Y   L+  Y KLL ++  P    + +
Subjt:  KNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEV

Arabidopsis top hitse value%identityAlignment
AT1G07220.1 Arabidopsis thaliana protein of unknown function (DUF821)1.1e-15070.06Show/hide
Query:  AAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSW
        AAFRVVI+ G+LYVD+YYACVQSR +FTIWG++QLL ++PGMVPDVD+MFDCMDKP IN+TE ++ P+PLFRYCT EAH DIPFPDWSFWGW E NLR W
Subjt:  AAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSW

Query:  REEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSM
         EEF DIK+GS+  SW NK PRAYWKGNPDV SP R EL+KCNHSR+WGAQIMRQDWA+EA+ G+EQSKLSNQCNHRYKIYAEG+AWSVSLKYILSCGSM
Subjt:  REEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSM

Query:  SLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEVCTDSL
        +LIISP+YEDFFSRGL P +NYWPI  +++C SIK+AVDWGN++  EAE IG++GQ +MESLSM+ VY YMFHLITEYSKL  FKP  P+SA EVC  SL
Subjt:  SLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEVCTDSL

Query:  LCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKER
        LCIA++K+R+ LE+S    S   PC       + +   +QQK +
Subjt:  LCIADEKQRQFLEKSAASVSSVPPCSLNRAGSDIIYSWLQQKER

AT1G63420.1 Arabidopsis thaliana protein of unknown function (DUF821)9.9e-9948.43Show/hide
Query:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSI--------NRTENKAMPLPLFRYCTTEAHFDIPFP
        +E  +  A FR+VI+ G+++V+ Y   +Q+R  FT+WG++QLLR++PG +PDVD+MFDC D+P I        NRT   A P PLFRYC      DI FP
Subjt:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSI--------NRTENKAMPLPLFRYCTTEAHFDIPFP

Query:  DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAE
        DWSFWGW E+N+R W +  +++++G K   ++ +   AYWKGNP V SP+R +LL CN S +  W A+I  QDW  E + G+E S ++NQC +RYKIY E
Subjt:  DWSFWGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAE

Query:  GFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFME-SLSMDTVYSYMFHLITEYSKLL
        G+AWSVS KYIL+C S++L++ P Y DFFSR L PL++YWPI   + C SIK AVDW N H  +A+ IGR+   FM+  LSM+ VY YMFHL+ EYSKLL
Subjt:  GFAWSVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFME-SLSMDTVYSYMFHLITEYSKLL

Query:  DFKPTPPSSALEVCTDSLLCIADEKQRQFLEKS--AASVSSVP----PCSL
         +KP  P +++E+CT++L+C ++ +    ++K     S+ S P    PCSL
Subjt:  DFKPTPPSSALEVCTDSLLCIADEKQRQFLEKS--AASVSSVP----PCSL

AT2G45830.2 downstream target of AGL15 21.9e-9747.37Show/hide
Query:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSIN----RTENKAMPLPLFRYCTTEAHFDIPFPDWSF
        LE++++ A FRVVI++GR+YV  Y   +Q+R +FT+WG+VQLLR +PG +PD+++MFD  D+P++     + +    P PLFRYC+ +A  DI FPDWSF
Subjt:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSIN----RTENKAMPLPLFRYCTTEAHFDIPFPDWSF

Query:  WGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAW
        WGW EVN++ W +    I++G+K   W ++   AYW+GNP+V +P R +LL+CN S    W  ++  QDW +E+R+G++ S L NQC HRYKIY EG+AW
Subjt:  WGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAW

Query:  SVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKP
        SVS KYI++C SM+L + P + DF+ RG+ PL++YWPI  ++ C S+K AV WGNTH  +A  IG +G  F+ E + M+ VY YMFHL+ EY+KLL FKP
Subjt:  SVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKP

Query:  TPPSSALEVCTDSLLCIADEKQRQFLEKSAASV-SSVPPCSL
          P  A E+  D + C A  + R F+E+S     S   PC +
Subjt:  TPPSSALEVCTDSLLCIADEKQRQFLEKSAASV-SSVPPCSL

AT3G48980.1 Arabidopsis thaliana protein of unknown function (DUF821)1.8e-10046.49Show/hide
Query:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFPDWSF
        LE +   A FR+ I+ GR+YV+ +    Q+R +FTIWG VQLLRR+PG +PD+++MFDC+D P +   E    ++  P PLFRYC  +   DI FPDWS+
Subjt:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFPDWSF

Query:  WGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAW
        WGW EVN++ W    +++++G++   W+++ P AYWKGNP V +  R +L+KCN S +  W A++ +QDW +E+++GY+QS L++QC+HRYKIY EG AW
Subjt:  WGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRM--WGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAW

Query:  SVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKP
        SVS KYIL+C S++L++ P Y DFF+RG+ P  +YWP+   + C SIK AVDWGN H  +A+ IG++   F+ + L MD VY YMFHL+ +YSKLL FKP
Subjt:  SVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKP

Query:  TPPSSALEVCTDSLLCIADEKQRQFLEKSAAS-VSSVPPCSL
          P ++ E+C++++ C  D  +R+F+ +S     +   PC++
Subjt:  TPPSSALEVCTDSLLCIADEKQRQFLEKSAAS-VSSVPPCSL

AT5G23850.1 Arabidopsis thaliana protein of unknown function (DUF821)1.7e-10348.25Show/hide
Query:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFPDWSF
        LE ++K A FR+ IV G++YV+ +    Q+R +FTIWG +QLLR++PG +PD+++MFDC+D P +  TE    N   P PLFRYC  E   DI FPDWSF
Subjt:  LEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTE----NKAMPLPLFRYCTTEAHFDIPFPDWSF

Query:  WGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHS--RMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAW
        WGW EVN++ W    +++++G++   W+N+ P AYWKGNP V +  R +L+KCN S    W A++  QDW +E+++GY+QS L++QC+HRYKIY EG AW
Subjt:  WGWPEVNLRSWREEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHS--RMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAW

Query:  SVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKP
        SVS KYIL+C S++L++ P Y DFF+RGL P  +YWP+   + C SIK AVDWGN+H  +A+ IG+   +F+ + L MD VY YM+HL+TEYSKLL FKP
Subjt:  SVSLKYILSCGSMSLIISPQYEDFFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFM-ESLSMDTVYSYMFHLITEYSKLLDFKP

Query:  TPPSSALEVCTDSLLCIADEKQRQFLEKS-AASVSSVPPCSL
          P +A+E+C++++ C+    +R+F+ +S     +   PC++
Subjt:  TPPSSALEVCTDSLLCIADEKQRQFLEKS-AASVSSVPPCSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCCAGTTGGAAGAATCTCAGAAATTTGCGGCGTTTCGTGTTGTGATCGTGGAAGGTAGGCTTTATGTTGATATGTACTATGCTTGTGTGCAGAGCAGGGCGATTTT
CACGATCTGGGGTTTGGTTCAATTGCTTAGAAGGTTCCCTGGAATGGTGCCGGACGTGGATATAATGTTTGATTGTATGGATAAACCGAGTATCAATCGGACTGAGAATA
AGGCCATGCCGCTGCCTCTGTTTCGGTATTGCACGACCGAGGCTCACTTTGACATTCCTTTTCCTGATTGGTCTTTCTGGGGATGGCCAGAAGTGAACTTAAGATCATGG
AGGGAAGAGTTTGAAGATATAAAGAAAGGCTCGAAAAATTTAAGTTGGTTGAACAAATTTCCTCGAGCTTATTGGAAGGGAAATCCAGATGTCGATTCCCCTGCTCGTAC
AGAGTTGCTGAAATGCAATCACTCAAGAATGTGGGGAGCTCAGATCATGCGTCAGGACTGGGCACAAGAAGCAAGAGATGGTTATGAGCAATCCAAACTATCCAATCAAT
GCAACCACCGGTATAAAATCTATGCCGAAGGGTTTGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTCTGATTATTTCACCTCAATATGAAGAT
TTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTATTGGCCCATCCCCTTCAGTAACATGTGCGAGTCTATTAAGCATGCTGTTGACTGGGGAAATACTCATTTCCCCGA
GGCCGAGGCTATAGGACGACAGGGGCAGAATTTCATGGAGAGCTTGAGCATGGACACAGTCTATTCTTACATGTTTCATCTCATCACAGAGTACTCAAAGCTTCTGGACT
TCAAGCCAACACCGCCCTCATCGGCTTTAGAAGTATGTACTGATTCCTTGCTTTGCATCGCCGACGAGAAGCAGAGGCAGTTCCTTGAGAAGTCAGCTGCCTCGGTTTCA
TCAGTCCCTCCATGCTCACTCAACCGTGCTGGTAGCGACATCATTTACAGTTGGCTGCAGCAAAAAGAGAGGAGGAAGGCGATGTAG
mRNA sequenceShow/hide mRNA sequence
CCGACGCCGAGACCTCCCTCCCACCTCCTCCCCTCCGTCATCGCCATCTCCTTCCTCTCCCTCACTTTCCTCCTTTGTTACAAGGTGGATGATTTTGCTGCTCAAACCAA
AACTGTTGCTGGTCACAACTTGGATCCAACCCCATGGCATTTGTTCCCTCCCAAGACCTTCAGTGATGAGACTCGGCATGCCAGAGCTGTTAAGATCATCCACTGTTCTT
ACCTCGCCTGCCGCTATGCCACCAACAATGCTACTAAATTCCCTTTCCATTCCGCTGTTTCAGCTCCCAAATGTCCTGAATTCTTCCGGTGGATTCATCACGATCTGGAT
CCCTGGGCTCGGACTCGAATCTCGATGACCCAGTTGGAAGAATCTCAGAAATTTGCGGCGTTTCGTGTTGTGATCGTGGAAGGTAGGCTTTATGTTGATATGTACTATGC
TTGTGTGCAGAGCAGGGCGATTTTCACGATCTGGGGTTTGGTTCAATTGCTTAGAAGGTTCCCTGGAATGGTGCCGGACGTGGATATAATGTTTGATTGTATGGATAAAC
CGAGTATCAATCGGACTGAGAATAAGGCCATGCCGCTGCCTCTGTTTCGGTATTGCACGACCGAGGCTCACTTTGACATTCCTTTTCCTGATTGGTCTTTCTGGGGATGG
CCAGAAGTGAACTTAAGATCATGGAGGGAAGAGTTTGAAGATATAAAGAAAGGCTCGAAAAATTTAAGTTGGTTGAACAAATTTCCTCGAGCTTATTGGAAGGGAAATCC
AGATGTCGATTCCCCTGCTCGTACAGAGTTGCTGAAATGCAATCACTCAAGAATGTGGGGAGCTCAGATCATGCGTCAGGACTGGGCACAAGAAGCAAGAGATGGTTATG
AGCAATCCAAACTATCCAATCAATGCAACCACCGGTATAAAATCTATGCCGAAGGGTTTGCTTGGTCTGTGAGCTTGAAGTACATTCTTTCATGTGGTTCAATGTCTCTG
ATTATTTCACCTCAATATGAAGATTTCTTCAGCCGTGGTCTTGATCCTTTGAAGAACTATTGGCCCATCCCCTTCAGTAACATGTGCGAGTCTATTAAGCATGCTGTTGA
CTGGGGAAATACTCATTTCCCCGAGGCCGAGGCTATAGGACGACAGGGGCAGAATTTCATGGAGAGCTTGAGCATGGACACAGTCTATTCTTACATGTTTCATCTCATCA
CAGAGTACTCAAAGCTTCTGGACTTCAAGCCAACACCGCCCTCATCGGCTTTAGAAGTATGTACTGATTCCTTGCTTTGCATCGCCGACGAGAAGCAGAGGCAGTTCCTT
GAGAAGTCAGCTGCCTCGGTTTCATCAGTCCCTCCATGCTCACTCAACCGTGCTGGTAGCGACATCATTTACAGTTGGCTGCAGCAAAAAGAGAGGAGGAAGGCGATGTA
GGAGGAAGAAATGGCTGCATAAAGAGCCTCAAAGTAGAAGTTATGTATGTTTTTTTCTTCCATTTTAGAGCATTGAGGAGGGTAGTGTTT
Protein sequenceShow/hide protein sequence
MTQLEESQKFAAFRVVIVEGRLYVDMYYACVQSRAIFTIWGLVQLLRRFPGMVPDVDIMFDCMDKPSINRTENKAMPLPLFRYCTTEAHFDIPFPDWSFWGWPEVNLRSW
REEFEDIKKGSKNLSWLNKFPRAYWKGNPDVDSPARTELLKCNHSRMWGAQIMRQDWAQEARDGYEQSKLSNQCNHRYKIYAEGFAWSVSLKYILSCGSMSLIISPQYED
FFSRGLDPLKNYWPIPFSNMCESIKHAVDWGNTHFPEAEAIGRQGQNFMESLSMDTVYSYMFHLITEYSKLLDFKPTPPSSALEVCTDSLLCIADEKQRQFLEKSAASVS
SVPPCSLNRAGSDIIYSWLQQKERRKAM