; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005875 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005875
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionGlycosyltransferase
Genome locationscaffold254:2157683..2167546
RNA-Seq ExpressionMS005875
SyntenyMS005875
Gene Ontology termsGO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
IPR035595 - UDP-glycosyltransferase family, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA3476227.1 hydroquinone glucosyltransferase-like [Gossypium australe]4.6e-22455.03Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG
        K+PG +P+ G+D    +QDR  D YKW L+ AKRY +A G+ LNSF +LEPG I ALQL EP    KP VY IGP ++  S+   + +EC KWLD QP G
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG

Query:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW
        SVLFVSFGSGGTLS  Q+NELALGLEMS QRF+WVVR P+   A  +Y+   +   PL FLP+GF++RN  +G VVPSWAPQ +IL H STGGFL+HCGW
Subjt:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW

Query:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ-
        NS LES+ NGVP++AWPLYAEQRMNAV+LTE I VA+R     + GIVE+EEI +VVK LM+G++G  +R +M + K+A  +A+ E+GSST+ L +LV+ 
Subjt:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ-

Query:  -----------------KWKSKICNL--------------------------------------------FKSMAAAGTHNVVALVVDQFGTDAFDVADE
                           K++I  L                                             KS+AA     +V LVVD FGTDAFDV  E
Subjt:  -----------------KWKSKICNL--------------------------------------------FKSMAAAGTHNVVALVVDQFGTDAFDVADE

Query:  FHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFA
        F++  YIF+P  AM LSL  ++P LD+ V+ EYR+LPE +R+PGC PI  KEL DP Q+R NDAY+ LLH+ KRY LAEGI +NSF +L+AGA  ALQ  
Subjt:  FHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFA

Query:  EPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLS
        EP K  PP+YP+GPLV +D++   +     CL WLD+QP GSVL+VSFGSGGTLSS+Q+NELA+GLE S  RF+WVVR+P+DK A+A+FFS  S+ DP  
Subjt:  EPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLS

Query:  FLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKE
        FLP GF+ER +GRG VV SWAPQA++LS  STGGFL+HCGWNSTLES+VNGVPLIAWPLYAEQKMNA MLT++IK ALR + N E+G+V ++EIAK VK 
Subjt:  FLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKE

Query:  LFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK
        L EGEEGK V+ +M +LK AA   L E+GSS+  LSEV  +WR +
Subjt:  LFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK

KAF2582258.1 hypothetical protein F2Q68_00002504 [Brassica cretica]4.7e-20856.53Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG
        ++PGC+P  GKD  DP +DRK D YK  LHNAKRY  A+GI +N+F +LEP A+ ALQ  EPG + KPPVYPIGPLVK + +   E +ECLKWLD+QP G
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG

Query:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW
        SVL+VSFGSGG L+  Q+NELALGLE S QRF+WV+RSPS + A+A++F  HS+ +PL FLP GF+ER + RGF++PSWAPQAQIL H STGGFLSHCGW
Subjt:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW

Query:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQK
        NSTLES+V+GVPLIAWPLYAEQ+MNAV+L E+I VA+RA A GE G+V +EE+ RVVK LMEGEEGK +R KMKE+KE   R L ++GSSTK L      
Subjt:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQK

Query:  WKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGN
                          ++VAL                                                 +L EP+++PGC P   K+ PDP ++R +
Subjt:  WKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGN

Query:  DAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINEL
        D Y+ LLHN KRY  AEGI +N+F  L+  A+ ALQ  EPG   PP+YPIGPLVK +S GN  +   +CL WLD+QP GSVL+VSFGSGG L+  Q+NEL
Subjt:  DAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINEL

Query:  ALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAE
        ALGLE+S QRF+WV+R+PS + A+ASFF  H+E DPL+FLP  F+ER + RGF++ SWAPQA+IL+  STGGFLSHCGWNSTLES+V+GVPLIAWPLYAE
Subjt:  ALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAE

Query:  QKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR
        QKMNA++L E+I+ ALR    EE G+V +EE+A+VVK L EGEEGK V  KM E+K  A RVL +DGSS+  LS V  KW+
Subjt:  QKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR

PPD98686.1 hypothetical protein GOBAR_DD04270 [Gossypium barbadense]3.6e-25363.12Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG
        +IPGCIPI GK+L+DP QDRK+DAYKW LH+ KRY LA+GI +NSF +LE GAI ALQ  EPG   KPPVYP+GPLV ID ++ ++ ++CLKWLD+QPHG
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG

Query:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW
        SVL+VSFGSGGTLS +QI+ELALGLEMS QRF+WVVRSP++  ANATYFSV S+ +P DFLP+GF+ER +GRG VV SWAPQAQ+L H S+GGFL+HCGW
Subjt:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW

Query:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQK
        NSTLESVVNGVPLIAWPL+AEQ+MNA+ML E+IKVA+R +   ENG+V ++EI + VK LMEGEEGK +R +MK LKEA  + LGENG STK L ++  +
Subjt:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQK

Query:  WKSKICNL-----------------------------------------FKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLH
         KS + +L                                         FKSM      N+VALVVD FGTDAFDVA EF+V  YIFFP  AMTLSL L+
Subjt:  WKSKICNL-----------------------------------------FKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLH

Query:  MPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDST
        +P LD+ V  EYRD PE +R+PGC PI  KEL DP Q+R NDAY+ LLH+TKRY LAEGI +NSF  L+AGAI ALQ  EPGK  PP+YP+GPLV ID +
Subjt:  MPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDST

Query:  GNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWA
          D+ G++ CL WLD+QP GSVL+VSFGSGGTLS +QI+ELALGLE S QRF+WVVR+P+D  A+A++FSV SE DP  FLP+GF+ER +GRG VVASWA
Subjt:  GNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWA

Query:  PQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAA
        PQA++LS GS+GGFL+HCGWNSTLESVVNGVPLIAWPL+AEQKMNA+ML E+IK ALRPK N E+G+V ++EIAK VK L EGEEGK V+ +M  LK AA
Subjt:  PQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAA

Query:  KRVLGEDGSSSTTLSEVVQKWRTK
         ++LGE+G S+  LS+V  KWR +
Subjt:  KRVLGEDGSSSTTLSEVVQKWRTK

RYR16256.1 hypothetical protein Ahy_B04g073250 [Arachis hypogaea]5.0e-20248.52Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG
        +IPGC+P+ GKDL+ P+Q+R ++AY + LH + R+ +  GI  NSF ELEP     LQ  EP      PVY +GP+V +D        + L WLDEQP G
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG

Query:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFS-VHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG
        SVLFV FGS  T+SS Q +ELA+GLE S QRF+WVV+ P N   N+T+F+ ++++ N LDFLPEGFVER RG G VVP WAPQAQIL H+S GGFL+HCG
Subjt:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFS-VHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG

Query:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELV-
        WNS LES+ NGVP+IA PL AEQ+MNAV++TE+ KVA R +  GENG+VE EEI  VV+ LMEGEEGKKL  ++KELK+A  +ALGENGSSTK L ELV 
Subjt:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELV-

Query:  ----QKWKSKICNLFKSMAAAG------------------------------------------------------------------------------
             + +S +C+ + +    G                                                                              
Subjt:  ----QKWKSKICNLFKSMAAAG------------------------------------------------------------------------------

Query:  -----------THNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEA
                    H +   ++D FG DA D+A E H+PSY++FP +AM LSL LH+P L +TV G ++DL EP+ +PGC P+  K+L +P QER +++Y+ 
Subjt:  -----------THNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEA

Query:  LLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLE
        +L    R+ + +GI  NSF +L+      LQ  EPG    P+Y +GPLV ++ T       +KCL WLDEQP GSVLFV FGSGGT+SS Q +ELA+GLE
Subjt:  LLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLE

Query:  NSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNA
        NS QRF+WVV+ P      A F ++  + DPL FLPEGFVER RG G VV  WAPQA+IL+  S GGFL+HCGWNS LES+VNGVP+IAWPLYAEQKM A
Subjt:  NSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNA

Query:  IMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT
        +++TE+IK A RP++  E+G+VE EEI +VV++L EGEEGK++  ++ ELK AA + LGE+GSS+  LS++V  W+T
Subjt:  IMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT

XP_017972637.1 PREDICTED: uncharacterized protein LOC18606668 [Theobroma cacao]6.1e-20848.54Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDS-NESEERAECLKWLDEQPH
        K+PGC+P  G D+ DPVQD+K+  Y+  +   +RY LA GI +NSF +LE  A  AL   E G    P VYP+GPL++  S NE  E   CL+WLD QPH
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDS-NESEERAECLKWLDEQPH

Query:  GSVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG
        GSV++V FGSGGTLS  Q+NELALGLEMSGQRF+WV +SP+ K  NATYF V S  +P  FLP+GF+ER +G G VV SWAPQ +ILRH STGGFL+HCG
Subjt:  GSVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG

Query:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ
        WNSTLE++V+GVPLIAWPLYAEQ+MNAV+L +++KVA+R + + ENG+V +E+I + V+ L+EGEEG+ LR KMK+LK+A +  L  +GSSTK L ++ +
Subjt:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ

Query:  KWKSK-----------------------------------------------------------------------------------------------
         WK++                                                                                               
Subjt:  KWKSK-----------------------------------------------------------------------------------------------

Query:  ICNLFKSM--------AAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQ
        I +L +S+          A +  VVA VVD FG DAFDVA EF +  YIF   A M LSL+  +P LD+  + EYRDLPEPI++PGC P    ++PDP Q
Subjt:  ICNLFKSM--------AAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQ

Query:  ERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQ
         R N AY+  L   KRY LA GI +NSF  L+ G ++AL   E G+  P +YP+GPL++  ST   E G+  CL WLDEQP+GSVL+V FGSGGTLS  Q
Subjt:  ERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQ

Query:  INELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWP
        +NELALGLE SGQRF+WVV+ P++  A+A++F + +  +P +FLP+GF+ER +  G VV SWAPQ ++LS GSTGGFL+HCGWNS LES+V+GVPLIAWP
Subjt:  INELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWP

Query:  LYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK
        LYAEQKMNA++L + +K A R ++NE+ G+V +E+IAK VKEL EG+EG+ ++ ++ +LK AAK  L  DG S+ +L+++ + W+ K
Subjt:  LYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK

TrEMBL top hitse value%identityAlignment
A0A3Q7YC63 uncharacterized protein LOC1015048044.7e-18245.95Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG
        KIPGC+PI G+DL+   QDR   +YK+ L   +R+ L DG+ +NSF E+E G I AL     GN   P VYP+GP+++ D+ +  +  ECL WLD+Q   
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG

Query:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKN-NPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG
        SVLFVSFGSGGTLS  QI ELALGLE+S  RF+WV+R+PSN  ANA Y S      +PL FLP GF++R + +G V+P WAPQ QIL H S GGFLSHCG
Subjt:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKN-NPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG

Query:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ
        WNS LESV++GVPLI WPL+AEQRMNAV+L+E +KV +R     ENGIVE+EEI +V+K LMEGEEG  +R +MKELK A   A+ E+GSS K L +L  
Subjt:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ

Query:  KW-------KSKICNL---------------------------------------------------FKSMAAAGTHNV---------------------
        K        K    NL                                                    K++      N+                     
Subjt:  KW-------KSKICNL---------------------------------------------------FKSMAAAGTHNV---------------------

Query:  ---------------------------VALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPD
                                   VALV D F  +A + A +F++ SYI+F  AA TLS   + P LDE  + EYRDLPEPI++PGC P+   +L  
Subjt:  ---------------------------VALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPD

Query:  PFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEK-CLNWLDEQPTGSVLFVSFGSGGTL
        P Q+R + AY+  L ++K    A+G+ +NSF +++ G I AL   E G   P +YPIGP+++  +    ++G  K CL WLD+Q   SVL+VSFGSGGTL
Subjt:  PFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEK-CLNWLDEQPTGSVLFVSFGSGGTL

Query:  SSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSEN-DPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVP
        S  Q  ELALGLE S  RF+WVVR P++  A+A++FS   ++ DPL FLP GF+ER + +G V+ SWAPQ +ILS  S GGFLSHCGWNS LESV++GVP
Subjt:  SSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSEN-DPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVP

Query:  LIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR
        LI WPL+AEQ+MNA +L+E +K  +RP++N E+GIVE+ EI KV+K L E EEG+ +   M ELK AA   L E+G S+ T+ ++  KWR
Subjt:  LIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR

A0A3S3M7C3 UDP-glucosyltransferase family protein5.1e-19248.65Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDS--NESEERAECLKWLDEQP
        K+PGC PI  +DL++  QDR ++AY W LH+  R++ A GI +NSF ELEP  + AL+  EP +   PP+YP+GPLV+  S    +  + + L+WLD+QP
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDS--NESEERAECLKWLDEQP

Query:  HGSVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHC
        HGSVLFV+FGSGG LS  Q+ ELALGLEMS QRF+WVVRSP++KEA ATYF V S  +PLDFLPEGF++R +G G VVP W PQ ++L HESTGGFLSHC
Subjt:  HGSVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHC

Query:  GWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKK-LRTKMKELKEAGERAL-GENGSSTKILCE
        GWNSTLES+V+GVP+IAWPLYAEQR NA+ML E++KVA+R  A  ++G+VE+E I +VVK LM+GE+ +K ++ +M +LKEA  +AL  E GSS + + E
Subjt:  GWNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKK-LRTKMKELKEAGERAL-GENGSSTKILCE

Query:  L-------------------------------------------------VQKWKSKICNLFKSMA----------------------------------
        L                                                 +QK ++ I N   +                                    
Subjt:  L-------------------------------------------------VQKWKSKICNLFKSMA----------------------------------

Query:  ------AAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALL
               A TH +VALVVD FGTDAFDVA E  VP Y+ +    ++LS+ +H+P +D T +       EP+++PGC PIP ++L D  Q+R N AY   L
Subjt:  ------AAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALL

Query:  HNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENS
        H+  R+  A+GI +NS  +L+   +  L+  EP    PPIY +GPLV+  STG  +      L WLD+QP GSVLFVSFGS G+LS  Q  ELALGLE S
Subjt:  HNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENS

Query:  GQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIM
         QRF+WVVR+P+DKEA A      S  +PL FLP+GF+ER +G G VV SW  Q E+LS  STGGFLSHCGWNSTLES+V+GVP+IAWPLYAEQ+ NA++
Subjt:  GQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIM

Query:  LTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEG-EEGKRVKAKMAELKIAA-KRVLGEDGSSSTTLSEVVQKW
        L E++K A+RP   +ESG+V++EEIA+VV++L +G +EGKRV  +M E+K A+ K +  E+GSS   LSEV+ KW
Subjt:  LTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEG-EEGKRVKAKMAELKIAA-KRVLGEDGSSSTTLSEVVQKW

A0A444ZPY6 Uncharacterized protein2.4e-20248.52Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG
        +IPGC+P+ GKDL+ P+Q+R ++AY + LH + R+ +  GI  NSF ELEP     LQ  EP      PVY +GP+V +D        + L WLDEQP G
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG

Query:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFS-VHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG
        SVLFV FGS  T+SS Q +ELA+GLE S QRF+WVV+ P N   N+T+F+ ++++ N LDFLPEGFVER RG G VVP WAPQAQIL H+S GGFL+HCG
Subjt:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFS-VHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCG

Query:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELV-
        WNS LES+ NGVP+IA PL AEQ+MNAV++TE+ KVA R +  GENG+VE EEI  VV+ LMEGEEGKKL  ++KELK+A  +ALGENGSSTK L ELV 
Subjt:  WNSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELV-

Query:  ----QKWKSKICNLFKSMAAAG------------------------------------------------------------------------------
             + +S +C+ + +    G                                                                              
Subjt:  ----QKWKSKICNLFKSMAAAG------------------------------------------------------------------------------

Query:  -----------THNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEA
                    H +   ++D FG DA D+A E H+PSY++FP +AM LSL LH+P L +TV G ++DL EP+ +PGC P+  K+L +P QER +++Y+ 
Subjt:  -----------THNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEA

Query:  LLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLE
        +L    R+ + +GI  NSF +L+      LQ  EPG    P+Y +GPLV ++ T       +KCL WLDEQP GSVLFV FGSGGT+SS Q +ELA+GLE
Subjt:  LLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLE

Query:  NSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNA
        NS QRF+WVV+ P      A F ++  + DPL FLPEGFVER RG G VV  WAPQA+IL+  S GGFL+HCGWNS LES+VNGVP+IAWPLYAEQKM A
Subjt:  NSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNA

Query:  IMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT
        +++TE+IK A RP++  E+G+VE EEI +VV++L EGEEGK++  ++ ELK AA + LGE+GSS+  LS++V  W+T
Subjt:  IMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT

A0A5B6W565 Hydroquinone glucosyltransferase-like2.2e-22455.03Show/hide
Query:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG
        K+PG +P+ G+D    +QDR  D YKW L+ AKRY +A G+ LNSF +LEPG I ALQL EP    KP VY IGP ++  S+   + +EC KWLD QP G
Subjt:  KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHG

Query:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW
        SVLFVSFGSGGTLS  Q+NELALGLEMS QRF+WVVR P+   A  +Y+   +   PL FLP+GF++RN  +G VVPSWAPQ +IL H STGGFL+HCGW
Subjt:  SVLFVSFGSGGTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGW

Query:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ-
        NS LES+ NGVP++AWPLYAEQRMNAV+LTE I VA+R     + GIVE+EEI +VVK LM+G++G  +R +M + K+A  +A+ E+GSST+ L +LV+ 
Subjt:  NSTLESVVNGVPLIAWPLYAEQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQ-

Query:  -----------------KWKSKICNL--------------------------------------------FKSMAAAGTHNVVALVVDQFGTDAFDVADE
                           K++I  L                                             KS+AA     +V LVVD FGTDAFDV  E
Subjt:  -----------------KWKSKICNL--------------------------------------------FKSMAAAGTHNVVALVVDQFGTDAFDVADE

Query:  FHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFA
        F++  YIF+P  AM LSL  ++P LD+ V+ EYR+LPE +R+PGC PI  KEL DP Q+R NDAY+ LLH+ KRY LAEGI +NSF +L+AGA  ALQ  
Subjt:  FHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFA

Query:  EPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLS
        EP K  PP+YP+GPLV +D++   +     CL WLD+QP GSVL+VSFGSGGTLSS+Q+NELA+GLE S  RF+WVVR+P+DK A+A+FFS  S+ DP  
Subjt:  EPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLS

Query:  FLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKE
        FLP GF+ER +GRG VV SWAPQA++LS  STGGFL+HCGWNSTLES+VNGVPLIAWPLYAEQKMNA MLT++IK ALR + N E+G+V ++EIAK VK 
Subjt:  FLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKE

Query:  LFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK
        L EGEEGK V+ +M +LK AA   L E+GSS+  LSEV  +WR +
Subjt:  LFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK

A0A6J1D226 hydroquinone glucosyltransferase-like8.4e-18799.4Show/hide
Query:  MTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIG
        MTLSLLLHMP LDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIG
Subjt:  MTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIG

Query:  PLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGR
        PLVKIDSTGNDELG EKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGR
Subjt:  PLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGR

Query:  GFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAK
        GFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAK
Subjt:  GFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAK

Query:  MAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK
        MAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK
Subjt:  MAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK

SwissProt top hitse value%identityAlignment
Q40287 Anthocyanidin 3-O-glucosyltransferase 51.5e-7942.18Show/hide
Query:  FKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLH
        F++  +A      A++VD FGT++ +VA E  +  Y++    A  L+L +++PILD+ V GE+    EP+++PGC P+  +E+ DP  +R N  Y     
Subjt:  FKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLH

Query:  NTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTP--PIYPIGPLVKIDST--GNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGL
               A+GI +N++  L+     AL+  +   R    P++PIGPL +       N EL     L+WLD+QP  SV++VSFGSGGTLS  Q+ ELA GL
Subjt:  NTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTP--PIYPIGPLVKIDST--GNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGL

Query:  ENSGQRFIWVVRNPSDKEADASFFSVHSENDPLS-FLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKM
        E S QRFIWVVR P+ K  DA+FF+     D +S + PEGF+ R +  G VV  W+PQ  I+S  S G FLSHCGWNS LES+  GVP+IAWP+YAEQ+M
Subjt:  ENSGQRFIWVVRNPSDKEADASFFSVHSENDPLS-FLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKM

Query:  NAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKW
        NA +LTEE+  A+RPK      +V++EEI ++++ +   EEG  ++ ++ ELK + ++ L E GSS   +S +  +W
Subjt:  NAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKW

Q8W4C2 UDP-glycosyltransferase 72B22.0e-11357.5Show/hide
Query:  LVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLN
        LVVD FG DAFDVA +FHV  YIF+   A  LS  LH+P LD+TV+ E+R L EP+++PGC PI  K+  D  Q+R +DAY+ LLHNTKRY  A+GI +N
Subjt:  LVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLN

Query:  SFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKE
        SF  L++ AI ALQ  EP    P +YPIGPLV   S+  +      CL+WLD QP GSVL++SFGSGGTL+  Q NELA+GL  SG+RFIWV+R+PS+  
Subjt:  SFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKE

Query:  ADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNE
          +S+F+ HSE DP SFLP GF++R + +G VV SWAPQ +IL+  ST GFL+HCGWNSTLES+VNGVPLIAWPL+AEQKMN ++L E++ AALR    E
Subjt:  ADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNE

Query:  ESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT
        + GIV +EE+ +VVK L EGEEGK +  K+ ELK    RVLG+DG SS +  EV+ KW+T
Subjt:  ESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT

Q9AR73 Hydroquinone glucosyltransferase3.0e-12560.76Show/hide
Query:  THNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLA
        T  + ALVVD FGTDAFDVA EF V  YIF+P  AM LSL  H+P LD+ V+ EYRD+PEP+++PGC PI  K+  DP Q+R NDAY+ LLH  KRY LA
Subjt:  THNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLA

Query:  EGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVR
        EGI +N+F  L+ G + ALQ  + GK  PP+YPIGPL++ DS  + ++   +CL WLD+QP GSVLF+SFGSGG +S +Q  ELALGLE S QRF+WVVR
Subjt:  EGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVR

Query:  NPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAAL
        +P+DK A+A++FS+ ++ND L++LPEGF+ER +GR  +V SWAPQ EILS GSTGGFL+HCGWNS LESVVNGVPLIAWPLYAEQKMNA+MLTE +K AL
Subjt:  NPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAAL

Query:  RPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK
        RPK   E+G++ + EIA  VK L EGEEGK+ ++ M +LK AA R L +DGSS+  L+E+  KW  K
Subjt:  RPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRTK

Q9LNI1 UDP-glycosyltransferase 72B32.2e-11256.95Show/hide
Query:  LFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALL
        LF S++A      V LVVD FGTDAFDVA EFHV  YIF+   A  L+ LLH+P LDETV+ E+R+L EP+ +PGC PI  K+  DP Q+R +++Y+ LL
Subjt:  LFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALL

Query:  HNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENS
        HN KR+  AEGI +NSF  L+   I  +Q  EP    PP+Y IGPLV   S   D     KCLNWLD QP GSVL+VSFGSGGTL+  Q  ELALGL  S
Subjt:  HNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENS

Query:  GQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIM
        G+RF+WV+R+PS   A +S+F+  S NDP SFLP+GF++R + +G VV SWAPQA+IL+  S GGFL+HCGWNS+LES+VNGVPLIAWPLYAEQKMNA++
Subjt:  GQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIM

Query:  LTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR
        L  ++ AALR ++ E+ G+V +EE+A+VVK L EGEEG  V+ KM ELK  + RVL +DG S+ +L+EV  KW+
Subjt:  LTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR

Q9M156 UDP-glycosyltransferase 72B11.3e-12355.01Show/hide
Query:  EGEEGKKLRTKMKELKEAGERA------LGENGSSTKILCEL---VQKWKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAM
        EG   K  RT +  L  +          L +  SST+I   +   V +   ++  +F S    G     ALVVD FGTDAFDVA EFHVP YIF+P  A 
Subjt:  EGEEGKKLRTKMKELKEAGERA------LGENGSSTKILCEL---VQKWKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAM

Query:  TLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGP
         LS  LH+P LDETV+ E+R+L EP+ +PGC P+  K+  DP Q+R +DAY+ LLHNTKRY  AEGI +N+F +L+  AI ALQ  EPG   PP+YP+GP
Subjt:  TLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGP

Query:  LVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRG
        LV I      +    +CL WLD QP GSVL+VSFGSGGTL+  Q+NELALGL +S QRF+WV+R+PS   A++S+F  HS+ DPL+FLP GF+ER + RG
Subjt:  LVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRG

Query:  FVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKM
        FV+  WAPQA++L+  STGGFL+HCGWNSTLESVV+G+PLIAWPLYAEQKMNA++L+E+I+AALRP+  ++ G+V +EE+A+VVK L EGEEGK V+ KM
Subjt:  FVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKM

Query:  AELKIAAKRVLGEDGSSSTTLSEVVQKWR
         ELK AA RVL +DG+S+  LS V  KW+
Subjt:  AELKIAAKRVLGEDGSSSTTLSEVVQKWR

Arabidopsis top hitse value%identityAlignment
AT1G01390.1 UDP-Glycosyltransferase superfamily protein1.4e-11457.5Show/hide
Query:  LVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLN
        LVVD FG DAFDVA +FHV  YIF+   A  LS  LH+P LD+TV+ E+R L EP+++PGC PI  K+  D  Q+R +DAY+ LLHNTKRY  A+GI +N
Subjt:  LVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLN

Query:  SFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKE
        SF  L++ AI ALQ  EP    P +YPIGPLV   S+  +      CL+WLD QP GSVL++SFGSGGTL+  Q NELA+GL  SG+RFIWV+R+PS+  
Subjt:  SFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKE

Query:  ADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNE
          +S+F+ HSE DP SFLP GF++R + +G VV SWAPQ +IL+  ST GFL+HCGWNSTLES+VNGVPLIAWPL+AEQKMN ++L E++ AALR    E
Subjt:  ADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNE

Query:  ESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT
        + GIV +EE+ +VVK L EGEEGK +  K+ ELK    RVLG+DG SS +  EV+ KW+T
Subjt:  ESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWRT

AT1G01420.1 UDP-glucosyl transferase 72B31.6e-11356.95Show/hide
Query:  LFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALL
        LF S++A      V LVVD FGTDAFDVA EFHV  YIF+   A  L+ LLH+P LDETV+ E+R+L EP+ +PGC PI  K+  DP Q+R +++Y+ LL
Subjt:  LFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALL

Query:  HNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENS
        HN KR+  AEGI +NSF  L+   I  +Q  EP    PP+Y IGPLV   S   D     KCLNWLD QP GSVL+VSFGSGGTL+  Q  ELALGL  S
Subjt:  HNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENS

Query:  GQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIM
        G+RF+WV+R+PS   A +S+F+  S NDP SFLP+GF++R + +G VV SWAPQA+IL+  S GGFL+HCGWNS+LES+VNGVPLIAWPLYAEQKMNA++
Subjt:  GQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIM

Query:  LTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR
        L  ++ AALR ++ E+ G+V +EE+A+VVK L EGEEG  V+ KM ELK  + RVL +DG S+ +L+EV  KW+
Subjt:  LTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSEVVQKWR

AT2G18570.1 UDP-Glycosyltransferase superfamily protein4.8e-7842.78Show/hide
Query:  LVVDQFGTDAFDVADEFHVPS-YIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFL
        ++VD  GT+   VAD+  + + Y++ P  A  L++++++P+LD  V GEY D+ EP+++PGC P+  KEL +   +R    Y+  +       +++G+ +
Subjt:  LVVDQFGTDAFDVADEFHVPS-YIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFL

Query:  NSFPKLDAGAINALQFAEPGKRT--PPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPS
        N++ +L    + AL+  E   R    P+YPIGP+V+   T            WLDEQ   SV+FV  GSGGTL+  Q  ELALGLE SGQRF+WV+R P 
Subjt:  NSFPKLDAGAINALQFAEPGKRT--PPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPS

Query:  DKEADASFFSVHSENDP--LSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALR
             AS+    S +D    + LPEGF++R RG G VV  WAPQ EILS  S GGFLSHCGW+S LES+  GVP+IAWPLYAEQ MNA +LTEEI  A+R
Subjt:  DKEADASFFSVHSENDP--LSFLPEGFVERNRGRGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALR

Query:  PKMNEESGIVEKEEIAKVVKELF--EGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSE
                ++ +EE+A +V+++   E EEG++++AK  E++++++R   +DGSS  +L E
Subjt:  PKMNEESGIVEKEEIAKVVKELF--EGEEGKRVKAKMAELKIAAKRVLGEDGSSSTTLSE

AT4G01070.1 UDP-Glycosyltransferase superfamily protein9.0e-12555.01Show/hide
Query:  EGEEGKKLRTKMKELKEAGERA------LGENGSSTKILCEL---VQKWKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAM
        EG   K  RT +  L  +          L +  SST+I   +   V +   ++  +F S    G     ALVVD FGTDAFDVA EFHVP YIF+P  A 
Subjt:  EGEEGKKLRTKMKELKEAGERA------LGENGSSTKILCEL---VQKWKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAM

Query:  TLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGP
         LS  LH+P LDETV+ E+R+L EP+ +PGC P+  K+  DP Q+R +DAY+ LLHNTKRY  AEGI +N+F +L+  AI ALQ  EPG   PP+YP+GP
Subjt:  TLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGP

Query:  LVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRG
        LV I      +    +CL WLD QP GSVL+VSFGSGGTL+  Q+NELALGL +S QRF+WV+R+PS   A++S+F  HS+ DPL+FLP GF+ER + RG
Subjt:  LVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRG

Query:  FVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKM
        FV+  WAPQA++L+  STGGFL+HCGWNSTLESVV+G+PLIAWPLYAEQKMNA++L+E+I+AALRP+  ++ G+V +EE+A+VVK L EGEEGK V+ KM
Subjt:  FVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKM

Query:  AELKIAAKRVLGEDGSSSTTLSEVVQKWR
         ELK AA RVL +DG+S+  LS V  KW+
Subjt:  AELKIAAKRVLGEDGSSSTTLSEVVQKWR

AT4G01070.2 UDP-Glycosyltransferase superfamily protein2.9e-7550.32Show/hide
Query:  EGEEGKKLRTKMKELKEAGERA------LGENGSSTKILCEL---VQKWKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAM
        EG   K  RT +  L  +          L +  SST+I   +   V +   ++  +F S    G     ALVVD FGTDAFDVA EFHVP YIF+P  A 
Subjt:  EGEEGKKLRTKMKELKEAGERA------LGENGSSTKILCEL---VQKWKSKICNLFKSMAAAGTHNVVALVVDQFGTDAFDVADEFHVPSYIFFPCAAM

Query:  TLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGP
         LS  LH+P LDETV+ E+R+L EP+ +PGC P+  K+  DP Q+R +DAY+ LLHNTKRY  AEGI +N+F +L+  AI ALQ  EPG   PP+YP+GP
Subjt:  TLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEPGKRTPPIYPIGP

Query:  LVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRG
        LV I      +    +CL WLD QP GSVL+VSFGSGGTL+  Q+NELALGL +S QRF+WV+R+PS   A++S+F  HS+ DPL+FLP GF+ER + R 
Subjt:  LVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRGRG

Query:  FVVASWAP
         V A W P
Subjt:  FVVASWAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AAAATCCCGGGGTGCATCCCGATTCGGGGCAAGGATTTGATGGACCCAGTTCAAGATCGGAAAGACGACGCCTATAAATGGACACTCCACAACGCAAAGAGGTATGTATT
AGCCGATGGAATTTTTCTCAATAGCTTCCCGGAATTGGAGCCCGGCGCTATAAACGCCCTGCAATTGACGGAACCCGGGAATAATTACAAGCCTCCGGTTTACCCAATTG
GGCCACTGGTGAAAATCGATTCGAATGAGAGCGAGGAGAGAGCAGAGTGTTTGAAGTGGCTTGACGAACAGCCTCATGGGTCTGTTCTGTTCGTGTCGTTCGGTAGTGGG
GGGACTCTGTCGAGCCATCAAATCAACGAGTTGGCGCTGGGATTGGAAATGAGCGGGCAAAGATTCATCTGGGTCGTTAGAAGTCCGAGCAACAAAGAAGCAAACGCCAC
GTATTTCAGCGTCCACAGCAAGAACAACCCATTGGATTTCTTGCCGGAGGGGTTCGTGGAGAGAAACAGAGGAAGGGGTTTTGTGGTGCCGTCGTGGGCGCCGCAGGCTC
AGATCCTCCGCCACGAGTCCACCGGCGGGTTTCTGAGCCACTGCGGGTGGAATTCGACTCTGGAGAGCGTGGTGAATGGGGTTCCTCTGATCGCGTGGCCGCTGTACGCG
GAGCAGAGGATGAACGCGGTGATGCTGACGGAGGAAATTAAGGTGGCGATGAGAGCAGAGGCGAAAGGGGAAAATGGGATTGTGGAGAAGGAGGAGATTTGCAGAGTTGT
GAAGTCGCTGATGGAAGGGGAAGAGGGGAAGAAACTGAGGACGAAAATGAAAGAATTGAAAGAAGCAGGGGAAAGAGCATTGGGAGAAAATGGATCTTCTACTAAAATCC
TCTGTGAATTGGTTCAGAAATGGAAGAGCAAGATTTGCAATCTCTTCAAATCTATGGCGGCGGCCGGAACCCACAACGTCGTCGCCCTCGTCGTCGACCAATTCGGCACC
GACGCCTTCGACGTCGCCGACGAATTCCACGTCCCCTCCTACATTTTCTTCCCTTGCGCCGCCATGACTCTGTCGCTCCTTCTCCACATGCCCATCCTCGACGAAACCGT
CGCCGGAGAGTACAGAGACCTCCCGGAGCCGATCAGAGTTCCCGGGTGCAGCCCGATTCCGGCGAAAGAACTGCCGGATCCGTTTCAGGAGAGGGGAAACGATGCCTACG
AGGCATTGCTACACAACACGAAGAGGTACGGGTTAGCAGAGGGGATTTTTCTCAACAGCTTCCCGAAATTGGATGCCGGAGCCATAAATGCTCTGCAATTCGCGGAACCC
GGTAAGCGTACGCCTCCGATTTACCCAATTGGGCCGCTGGTGAAAATCGATTCGACTGGGAATGACGAATTAGGGACAGAGAAGTGTTTGAATTGGCTTGATGAACAGCC
AACTGGGTCTGTTCTGTTCGTGAGTTTTGGGAGCGGGGGGACTCTGTCGAGCCATCAAATCAACGAATTGGCACTGGGATTGGAAAATAGTGGGCAGAGATTCATCTGGG
TTGTTCGAAATCCGAGCGATAAGGAAGCCGACGCTTCGTTTTTCAGCGTCCACAGCGAGAACGACCCATTGAGTTTCTTGCCGGAGGGTTTTGTGGAAAGAAACAGAGGA
AGGGGTTTTGTGGTGGCGTCGTGGGCTCCGCAGGCGGAGATTCTGAGCCGCGGTTCCACCGGCGGATTTCTGAGCCACTGCGGGTGGAATTCGACTCTGGAGAGCGTGGT
TAATGGGGTTCCTCTGATTGCTTGGCCGCTGTACGCGGAGCAGAAAATGAACGCCATAATGCTAACAGAGGAGATTAAGGCGGCGCTGAGGCCGAAGATGAACGAGGAAA
GTGGGATTGTGGAGAAGGAGGAGATTGCGAAAGTGGTGAAGGAGTTGTTTGAAGGCGAAGAGGGGAAGAGAGTGAAGGCGAAAATGGCGGAGCTGAAAATTGCAGCGAAA
AGGGTTCTTGGAGAAGATGGATCTTCTTCAACAACACTCTCGGAAGTGGTTCAAAAATGGAGGACCAAG
mRNA sequenceShow/hide mRNA sequence
AAAATCCCGGGGTGCATCCCGATTCGGGGCAAGGATTTGATGGACCCAGTTCAAGATCGGAAAGACGACGCCTATAAATGGACACTCCACAACGCAAAGAGGTATGTATT
AGCCGATGGAATTTTTCTCAATAGCTTCCCGGAATTGGAGCCCGGCGCTATAAACGCCCTGCAATTGACGGAACCCGGGAATAATTACAAGCCTCCGGTTTACCCAATTG
GGCCACTGGTGAAAATCGATTCGAATGAGAGCGAGGAGAGAGCAGAGTGTTTGAAGTGGCTTGACGAACAGCCTCATGGGTCTGTTCTGTTCGTGTCGTTCGGTAGTGGG
GGGACTCTGTCGAGCCATCAAATCAACGAGTTGGCGCTGGGATTGGAAATGAGCGGGCAAAGATTCATCTGGGTCGTTAGAAGTCCGAGCAACAAAGAAGCAAACGCCAC
GTATTTCAGCGTCCACAGCAAGAACAACCCATTGGATTTCTTGCCGGAGGGGTTCGTGGAGAGAAACAGAGGAAGGGGTTTTGTGGTGCCGTCGTGGGCGCCGCAGGCTC
AGATCCTCCGCCACGAGTCCACCGGCGGGTTTCTGAGCCACTGCGGGTGGAATTCGACTCTGGAGAGCGTGGTGAATGGGGTTCCTCTGATCGCGTGGCCGCTGTACGCG
GAGCAGAGGATGAACGCGGTGATGCTGACGGAGGAAATTAAGGTGGCGATGAGAGCAGAGGCGAAAGGGGAAAATGGGATTGTGGAGAAGGAGGAGATTTGCAGAGTTGT
GAAGTCGCTGATGGAAGGGGAAGAGGGGAAGAAACTGAGGACGAAAATGAAAGAATTGAAAGAAGCAGGGGAAAGAGCATTGGGAGAAAATGGATCTTCTACTAAAATCC
TCTGTGAATTGGTTCAGAAATGGAAGAGCAAGATTTGCAATCTCTTCAAATCTATGGCGGCGGCCGGAACCCACAACGTCGTCGCCCTCGTCGTCGACCAATTCGGCACC
GACGCCTTCGACGTCGCCGACGAATTCCACGTCCCCTCCTACATTTTCTTCCCTTGCGCCGCCATGACTCTGTCGCTCCTTCTCCACATGCCCATCCTCGACGAAACCGT
CGCCGGAGAGTACAGAGACCTCCCGGAGCCGATCAGAGTTCCCGGGTGCAGCCCGATTCCGGCGAAAGAACTGCCGGATCCGTTTCAGGAGAGGGGAAACGATGCCTACG
AGGCATTGCTACACAACACGAAGAGGTACGGGTTAGCAGAGGGGATTTTTCTCAACAGCTTCCCGAAATTGGATGCCGGAGCCATAAATGCTCTGCAATTCGCGGAACCC
GGTAAGCGTACGCCTCCGATTTACCCAATTGGGCCGCTGGTGAAAATCGATTCGACTGGGAATGACGAATTAGGGACAGAGAAGTGTTTGAATTGGCTTGATGAACAGCC
AACTGGGTCTGTTCTGTTCGTGAGTTTTGGGAGCGGGGGGACTCTGTCGAGCCATCAAATCAACGAATTGGCACTGGGATTGGAAAATAGTGGGCAGAGATTCATCTGGG
TTGTTCGAAATCCGAGCGATAAGGAAGCCGACGCTTCGTTTTTCAGCGTCCACAGCGAGAACGACCCATTGAGTTTCTTGCCGGAGGGTTTTGTGGAAAGAAACAGAGGA
AGGGGTTTTGTGGTGGCGTCGTGGGCTCCGCAGGCGGAGATTCTGAGCCGCGGTTCCACCGGCGGATTTCTGAGCCACTGCGGGTGGAATTCGACTCTGGAGAGCGTGGT
TAATGGGGTTCCTCTGATTGCTTGGCCGCTGTACGCGGAGCAGAAAATGAACGCCATAATGCTAACAGAGGAGATTAAGGCGGCGCTGAGGCCGAAGATGAACGAGGAAA
GTGGGATTGTGGAGAAGGAGGAGATTGCGAAAGTGGTGAAGGAGTTGTTTGAAGGCGAAGAGGGGAAGAGAGTGAAGGCGAAAATGGCGGAGCTGAAAATTGCAGCGAAA
AGGGTTCTTGGAGAAGATGGATCTTCTTCAACAACACTCTCGGAAGTGGTTCAAAAATGGAGGACCAAG
Protein sequenceShow/hide protein sequence
KIPGCIPIRGKDLMDPVQDRKDDAYKWTLHNAKRYVLADGIFLNSFPELEPGAINALQLTEPGNNYKPPVYPIGPLVKIDSNESEERAECLKWLDEQPHGSVLFVSFGSG
GTLSSHQINELALGLEMSGQRFIWVVRSPSNKEANATYFSVHSKNNPLDFLPEGFVERNRGRGFVVPSWAPQAQILRHESTGGFLSHCGWNSTLESVVNGVPLIAWPLYA
EQRMNAVMLTEEIKVAMRAEAKGENGIVEKEEICRVVKSLMEGEEGKKLRTKMKELKEAGERALGENGSSTKILCELVQKWKSKICNLFKSMAAAGTHNVVALVVDQFGT
DAFDVADEFHVPSYIFFPCAAMTLSLLLHMPILDETVAGEYRDLPEPIRVPGCSPIPAKELPDPFQERGNDAYEALLHNTKRYGLAEGIFLNSFPKLDAGAINALQFAEP
GKRTPPIYPIGPLVKIDSTGNDELGTEKCLNWLDEQPTGSVLFVSFGSGGTLSSHQINELALGLENSGQRFIWVVRNPSDKEADASFFSVHSENDPLSFLPEGFVERNRG
RGFVVASWAPQAEILSRGSTGGFLSHCGWNSTLESVVNGVPLIAWPLYAEQKMNAIMLTEEIKAALRPKMNEESGIVEKEEIAKVVKELFEGEEGKRVKAKMAELKIAAK
RVLGEDGSSSTTLSEVVQKWRTK