; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008593 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008593
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionGlycosyltransferase
Genome locationchr9:26085219..26098300
RNA-Seq ExpressionLag0008593
SyntenyLag0008593
Gene Ontology termsGO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR035595 - UDP-glycosyltransferase family, conserved site
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
MCH80348.1 hypothetical protein [Trifolium medium]1.4e-13733.03Show/hide
Query:  EKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFDRSLLILEFPAANQRS
        +++ A +  ++E  N D +   +LVGK+ +    +  A K TM +AW++R+   I+ + +NL+ FKF ++ + D V  N PW FDR+LLIL   + N++ 
Subjt:  EKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFDRSLLILEFPAANQRS

Query:  SELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEME-NEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITIRYERLPDFCFKCGCI
        SEL  D V+FW+R+++LP+  +++  A K GN +G F EM+  E + +G    +R+RV +D+ K L RG   K +  G   W+  +YERLP+FCF CG I
Subjt:  SELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEME-NEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITIRYERLPDFCFKCGCI

Query:  GHNAKEYSIDQEECESRGNIFE-----FGAWLKFQGFSK---GTRKQSPTTNDD---IPDLNESDHQEEETSADILHNGESLQVLRKVEEAMDVNFEKGP
        GH  ++    ++  E + +  E     FG WL+     K     +K+S ++N      P  + S  Q   T+ ++    E  +V      + D + ++  
Subjt:  GHNAKEYSIDQEECESRGNIFE-----FGAWLKFQGFSK---GTRKQSPTTNDD---IPDLNESDHQEEETSADILHNGESLQVLRKVEEAMDVNFEKGP

Query:  EKGIQPILDHSNVSSFDKAMEEEEI--QSLDSVR-DQEVHTALIIQSQGDGSSGGFVNSARKKMNWKRRAR-MGPI-KKSLI--------LEQVSKKRKE
         KG        + S      +E E+  +SL +V    +       Q+  +G    +V     K   ++ A+ +G I K+SL+        +E +    K+
Subjt:  EKGIQPILDHSNVSSFDKAMEEEEI--QSLDSVR-DQEVHTALIIQSQGDGSSGGFVNSARKKMNWKRRAR-MGPI-KKSLI--------LEQVSKKRKE

Query:  YFDSVDVRKKARLIGESSVVYSLV---RSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRG----ASGGLCLLWKEKDLVSIQSFSLNHIDALIS
            V ++      G    V +L+   R   P ++F+ ET+        IR + GF  C +V   G     +GGL L+W E   V+I SFSLNHI     
Subjt:  YFDSVDVRKKARLIGESSVVYSLV---RSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRG----ASGGLCLLWKEKDLVSIQSFSLNHIDALIS

Query:  WKD--KKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTW-KGNRRGVQV
         ++  + W  TGIYG+PE   K KTW+LIR+L       W+  GDFN+IL   EK GG   S +     R+ + D  L +L F G PFTW  G   G  V
Subjt:  WKD--KKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTW-KGNRRGVQV

Query:  WERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSDLAKNLSSCSEVLYKWGD
          RLDR + N EF + F+   + +L+   SDH  +   ++C   +  +  +R F+FEE WT+  +C ++I+   +W++   S   K L S   +   + D
Subjt:  WERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSDLAKNLSSCSEVLYKWGD

Query:  DVRNKMKSRIKECKSALKEAYDKAPHI-DYNAIHGLEFELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLV
             +K  +   +  LK+    +  I D++    LE E  +LL+ +E  W+QRSR  WL+ G++NT +FH KAS R ++N I  I    G W      V
Subjt:  DVRNKMKSRIKECKSALKEAYDKAPHI-DYNAIHGLEFELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLV

Query:  EDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEA
        E  F+++F+ +F +S P  S + +    V S++S        + ++R ++++A+  M+P KAPGPDG  A+F+Q+YW++VG       L++LN +     
Subjt:  EDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEA

Query:  WNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNR
         NKT +VLI K  NP    DFRPI+LCNV  KI+TK +ANRLK  L D+I   QSAF+QGRLI+DN ++  EC H + + R
Subjt:  WNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNR

XP_022149559.1 soyasapogenol B glucuronide galactosyltransferase-like [Momordica charantia]5.0e-20775.71Show/hide
Query:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG
        MV +NEELRITVLPLFASGHIIPIVDM RLFARHGA VT+ITTESNAR+ QN++ RDFAAG+KIQT  + FPAAEVGLPPGIENFSDV S DL GKIY+ 
Subjt:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG

Query:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF
        FLIL+K IDQ IIP TRPDCILSDLS+ WTTDTA RLGVPRLVFFVSNFMAY AEHSVLQH+PHQ V+SD E FE+PGLPHKIQMTK QQPEFLVQR  F
Subjt:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF

Query:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY--------------
        T+MIE YK+AERRSYG V NTFYELDG+YLEHYK+  GIKAWGLGPVSLA NKD+IGK +RGNKSGMESGELL+WLNSK+PNSVLY              
Subjt:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY--------------

Query:  --------------------NDKNDGKEVAEGLPEGFEERIT-SNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN
                             ++N+G E  EGLPEGFEER+    K+GLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQF+N
Subjt:  --------------------NDKNDGKEVAEGLPEGFEERIT-SNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN

Query:  EKLLTEVLQVGVPVGTRRWWNMRDEMEE--IVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK
        EKLLTEVL+VGVPVG RRWWNM DEMEE  IVGRE+V  AVGFLMG  EEAA IR+RAKELGAAAKRAV  GGSSE NVVS+++ELRS+K
Subjt:  EKLLTEVLQVGVPVGTRRWWNMRDEMEE--IVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK

XP_023886153.1 uncharacterized protein LOC111998282 [Quercus suber]1.6e-13631.77Show/hide
Query:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDG--NLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWL
        EE++N+W++FSL E E       N K  ++ D        L GK ++ R +S  A+  T+   WKTR+ F I+ VG ++  F F  + +   V A EPW+
Subjt:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDG--NLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWL

Query:  FDRSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWIT
        +D+ L+IL +   +     LRF  V FW+++  LP+   N+  A   G  IGD  +     + +G  + +R+RV I++++ L +G  +   D G   W+ 
Subjt:  FDRSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWIT

Query:  IRYERLPDFCFKCGCIGHNAKEY-SIDQEECESRGNIFEFGAWLKFQG---FSKGTRKQSPTTNDDIPDLNESDHQEEETSADILHNGESLQVLRKVEEA
         +YE+LP+FC+ CG + H+AKE+   D      +  +F  G    F G    ++  R+         P    + +  +  S+++  N E    +  V   
Subjt:  IRYERLPDFCFKCGCIGHNAKEY-SIDQEECESRGNIFEFGAWLKFQG---FSKGTRKQSPTTNDDIPDLNESDHQEEETSADILHNGESLQVLRKVEEA

Query:  MDVNFEKGPEKGI---QPILDHSNVSSFDKAMEEEEIQSLD-SVRDQEVHTALI----IQSQGDGSSGGFV-------------NSARKKMN--------
          V+  +     +    P   H+N+++F     EE+IQ +D  +   + HT+ +    I  +   S   +V             N A    N        
Subjt:  MDVNFEKGPEKGI---QPILDHSNVSSFDKAMEEEEIQSLD-SVRDQEVHTALI----IQSQGDGSSGGFV-------------NSARKKMN--------

Query:  ------WKRRARMGPIKKSLILEQVSKKRKEYFDSVD----VRKKARLIGESSV--VYSLVRSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRG
              WKR AR+   ++  +      K     D  D     RKK ++     +  +  L+R+  P ++F++ET +      +I+    F+    V    
Subjt:  ------WKRRARMGPIKKSLILEQVSKKRKEYFDSVD----VRKKARLIGESSV--VYSLVRSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRG

Query:  ASGGLCLLWKEKDLVSIQSFSLNHIDALI-SWKDKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRK
         +GGL L WK    + +++ S NHIDA++   K+  WRFTG YG P   ++ ++WNL+R LN+    PWI  GDFNEI    EK GG   S+S ++ FR 
Subjt:  ASGGLCLLWKEKDLVSIQSFSLNHIDALI-SWKDKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRK

Query:  TLNDCELKELKFSGDPFTWKGN-RRGVQVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWRKVVKRPFKFEELWTRYEECADIIK
         +++C   +L F+G  FTWK +   G  VWERLDR L N E+   FA   + +L+   SDH PI  +    R        +PF FEE+W     C D I+
Subjt:  TLNDCELKELKFSGDPFTWKGN-RRGVQVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWRKVVKRPFKFEELWTRYEECADIIK

Query:  ---KSGDWQNPLSSDLAKNLSSCSEVLYKWG----DDVRNKMKSRIKECKSALKEAYDKAPHIDYNAIHGLEFELEKLLEEEEIFWKQRSREDWLRWGEK
              D  +P  + + K +  C   L KW       VR +++ + KE   A KEA     +     +  L  EL  L ++E   W QRS+  W + G+K
Subjt:  ---KSGDWQNPLSSDLAKNLSSCSEVLYKWG----DDVRNKMKSRIKECKSALKEAYDKAPHIDYNAIHGLEFELEKLLEEEEIFWKQRSREDWLRWGEK

Query:  NTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGP
        N+ +FH +A+ R + N IS +    G W  D + V +T +A+F ++F S+ P    ++   + +   +S  MN +L+  F  +++++A++ M P KAPGP
Subjt:  NTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGP

Query:  DGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISD
        DG   +FYQ YW+++G +     L  LN        N T I LI K+ NP+ V +FRPI+LCNV YKI +K LANRLK +L  II+E+QSAF + RLISD
Subjt:  DGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISD

Query:  NIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK
        NI++  E L+S+ Q  T   +  A+KLD+SK++D+VEW +L+
Subjt:  NIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK

XP_038900859.1 soyasapogenol B glucuronide galactosyltransferase-like [Benincasa hispida]1.8e-20473.62Show/hide
Query:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG
        M+ +N E RITVLPLFASGH+IPI+DM RLFA HGATVT+ITTESNA   QN+IDRDF AGFKIQTH+++FP AEVGL PGIEN+SDVSS  L  KIYQ 
Subjt:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG

Query:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF
        FLILDKLIDQ+IIP T+PDCILSDLSH WTTDTAERLGVPRLV  VSNFMAY AEHSV+QH P Q V+SDTE FEIPGLPH+IQMTK QQPEF +QR AF
Subjt:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF

Query:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY--------------
        T M+E YK+AERRSYG VMNTFYELDG+YLEHYKKI GIKAWG+GPVSLA NKDM GK ERGNKS +ES ELLEWLNSK+PNSVLY              
Subjt:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY--------------

Query:  --------------------NDKNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNE
                              KNDG E  EGLPEGFEERI +  RGLI+RIWAPQLLILEHPSTGGFLTHCGWNSSIEG+S G+PMVTWPVSSEQF+ E
Subjt:  --------------------NDKNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNE

Query:  KLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVKLK
        KLLTEVLQVGVPVG + WWNM +EM+EIV REKV K VGFLMGAT+EA  IRKRA++LGAAA RAV++GGSSE N+VSLMKELR+VK K
Subjt:  KLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVKLK

XP_042990668.1 uncharacterized protein LOC122317666 [Carya illinoinensis]1.2e-13632.09Show/hide
Query:  EELINEWRKFSLKERE-KEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLF
        E++   W+   L E E +E   S       N  G L   L+ K+ + R  +  A + TM++ W T    + K +  N++  +F    D++ V    PW F
Subjt:  EELINEWRKFSLKERE-KEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLF

Query:  DRSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITI
        DR L+ ++        SE+RFD   FW+++ NLP    +K   +  G++IG  LE+E   +  GW   +RI+ +++++KAL+RG  LK+       W++ 
Subjt:  DRSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITI

Query:  RYERLPDFCFKCGCIGHNAKEYSIDQEECESRG----NIFEFGAWL-------KFQGFSK--GTRKQSPT---------TNDDIPDL---------NESD
        +YERLP FCFKCG   H       +Q  C+ RG       ++G WL       KFQ   +  G +++ P+           DD             +E  
Subjt:  RYERLPDFCFKCGCIGHNAKEYSIDQEECESRG----NIFEFGAWL-------KFQGFSK--GTRKQSPT---------TNDDIPDL---------NESD

Query:  HQEEETSADILHNGESLQVLRKVEEAMDVNFEKGPEKG-------------------------IQPILDHSNVSSFDKAMEEEEIQSLDSVRDQE---VH
        +  E T A  ++ G +   L K ++   V      +KG                          Q +  HS  SS+D  M   E     S+R +      
Subjt:  HQEEETSADILHNGESLQVLRKVEEAMDVNFEKGPEKG-------------------------IQPILDHSNVSSFDKAMEEEEIQSLDSVRDQE---VH

Query:  TALIIQSQGDGSSGGFVNSARKKMNWKRRARMGPIKKS---LILEQVS------KKRKEYF----DSVDVRKKARLIGES-SVVYSLVRSVCPDILFISE
        T    ++  + S     NS  K   WKR+AR  P   S    I++Q S      +  ++ F    D    ++K +   ES   ++ LV++  P ++F++E
Subjt:  TALIIQSQGDGSSGGFVNSARKKMNWKRRARMGPIKKS---LILEQVS------KKRKEYF----DSVDVRKKARLIGES-SVVYSLVRSVCPDILFISE

Query:  TKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALIS--WKDKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILG
        TK      ++I++  G++ CFSV S+G SG L LLWK+   V + +++  HI ALI+    + +W+ TG YG P  +++ ++W+L+R L    N PW+  
Subjt:  TKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALIS--WKDKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILG

Query:  GDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRGVQ-VWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRR
        GDFNEI    EK G  +     +  FR +L+ C+L +L F GD FTW  NR G Q   ERLDR   N  +  LFA   + +L+   SDH+ +        
Subjt:  GDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRGVQ-VWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRR

Query:  AAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSD------LAKNLSSCSEVLYKWGDDVRNKMKSRIKECKSALKEAYDKAPHIDYNAIHGLEF
        +  +K   R F+FE  WT+  EC +IIKK   W+  LSS         ++L+ C   L  W  + +   +  +K     LK   ++        I  +  
Subjt:  AAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSD------LAKNLSSCSEVLYKWGDDVRNKMKSRIKECKSALKEAYDKAPHIDYNAIHGLEF

Query:  ELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMN
         +  +++ E +  +QR+++ WL+ G++NT +FH+ +S R++ N I  I + +G  T+DP  +  T + FF ++F SS P  S +D+ L  +   ++  M 
Subjt:  ELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMN

Query:  EKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTL
          L   F+ I++++AV  M P  +PGPDG+ A+F+Q+YW+ VG +  K  L+VLNG +     N+T I LI K  NP  V DFRPI+LCNV YKII KTL
Subjt:  EKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTL

Query:  ANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK
        ANRLK +L  IIS +Q+AF+ GRLI+DNII+  E LH++           A+KLD+SKA+D++EW FL+
Subjt:  ANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK

TrEMBL top hitse value%identityAlignment
A0A2N9G933 Reverse transcriptase domain-containing protein1.8e-14631.9Show/hide
Query:  KFSLKEREKEP-AFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFDRSLLILE
        + SL ++E+ P  F F E ++          L  + ++ R ++  ++  T    W+T H F+++ +G N+  F F  + D + V   EPW +D+ L+  +
Subjt:  KFSLKEREKEP-AFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFDRSLLILE

Query:  FPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITIRYERLPDF
            +   +E+    V+FW+++ NLP+G   +  AV  G  +G+   +   +++ G    +RIRV+IDISK L RG   +    G   WI+ +YERLP F
Subjt:  FPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITIRYERLPDF

Query:  CFKCGCIGHNAKEYSI-DQEECESRGNIFEFGAWLKFQGFSKGTRK---------------QSPTTNDDIPDLN---ESDHQEEET------------SA
        C+ CGC+ H  K+  +  + +   R    ++GAWL+   F K  R+               Q    ++ +   N   E    E++T            S 
Subjt:  CFKCGCIGHNAKEYSI-DQEECESRGNIFEFGAWLKFQGFSKGTRK---------------QSPTTNDDIPDLN---ESDHQEEET------------SA

Query:  DILHNGESLQV------------LRKVEEAMD--------VNFEKGPEKGIQPILDHSNVSSFDKAMEEEE----IQSLDSVRDQEVHTALIIQSQGDGS
        +++    S++V            +R+++EA+          N E    +  +P +  +  S  DKA+EE       Q +   +  E  T L+ Q+   G 
Subjt:  DILHNGESLQV------------LRKVEEAMD--------VNFEKGPEKGIQPILDHSNVSSFDKAMEEEE----IQSLDSVRDQEVHTALIIQSQGDGS

Query:  SGGFVNS---------------ARKKMNWKRRAR------MGPIKKSLILEQVSKKRKEYFDSVDVRKKARLIGESS--------VVYSLVRSVCPDILF
           + +S                  +  W+R +R      +G  K S + ++   + +     V  +KK+R I            V+++LV++  P ILF
Subjt:  SGGFVNS---------------ARKKMNWKRRAR------MGPIKKSLILEQVSKKRKEYFDSVDVRKKARLIGESS--------VVYSLVRSVCPDILF

Query:  ISETKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALISWK-DKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWI
        + ETK        +RI+  F  CF+VPS G SGGL LLW ++  +SIQ+FS+NHIDA + ++ + +WRFTG YG P   ++ ++W L+  L ++   PW+
Subjt:  ISETKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALISWK-DKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWI

Query:  LGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRG-VQVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDC
        L GDFNEIL   E+ G  + S + +  F + LN C L +L + G PFTW+  R     V +RLDR + ++ + S+F    + +L   +SDH PI    D 
Subjt:  LGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRG-VQVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDC

Query:  RRAAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSDL---AKNLSSCSEVLYKWGDDVRNKMKSRIKECKSALKEAY-DKAPHIDYNAIHGLEF
          + + +  +RP KFEE W+ + EC  +IK+  D  NP+ S +    + +  C E L++W   + ++  S+I+E   +L          ++ +AI   + 
Subjt:  RRAAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSDL---AKNLSSCSEVLYKWGDDVRNKMKSRIKECKSALKEAY-DKAPHIDYNAIHGLEF

Query:  ELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMN
        E+ +LL  EE+ W+QRSR  WL  G++NT +FH  A+ R++ N I  +      W +    VED  V +F NIF +S P  + +++ +  VN  VS  +N
Subjt:  ELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMN

Query:  EKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTL
        ++LLLP++  ++  A+  M+P+KAPG DG S+ F+Q+YW++VG S     L VLN  + +   N T I LI K  +P+++ D+RPI+LCNV YKII+K +
Subjt:  EKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTL

Query:  ANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK
        ANRLK VL  IIS+SQSAF+ GRLI+DN+ +  E LH +   R       A+KLD+SKA+D+VEW F++
Subjt:  ANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK

A0A2N9I946 Uncharacterized protein1.6e-14531.24Show/hide
Query:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFD
        EEL    R+ +L ++EK       E+   +      +S++ KL ++R  +  A K T+ E W      ++  +  NLF   F S+     + A  PW FD
Subjt:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFD

Query:  RSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEME-NEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITI
        + L++L     + + + ++F   AFW+R+ NLP+    +      G  +G  L+++   ++ + W   +RIRV+++I+K L+RG +++ +D     W+  
Subjt:  RSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEME-NEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITI

Query:  RYERLPDFCFKCGCIGHNAKEYSIDQEECESRGNIF---EFGAWLKFQG--FSKGTRKQSPTTNDDIPDLNESDHQEEETS-------------------
        RYE LP FC+KCG +GH++ + + ++     + ++F   ++G+WL+     + + TR Q+   N DI + + S+H+ EE S                   
Subjt:  RYERLPDFCFKCGCIGHNAKEYSIDQEECESRGNIF---EFGAWLKFQG--FSKGTRKQSPTTNDDIPDLNESDHQEEETS-------------------

Query:  --------ADILHNG--ESLQVLRKVEEAM---------------DVNFEKGPEKGIQPILDHS-------------------NVSSFDKAMEEEEIQSL
                +D    G  + +Q++   +E +                V F +    G +P  D S                   NV   D    ++++   
Subjt:  --------ADILHNG--ESLQVLRKVEEAM---------------DVNFEKGPEKGIQPILDHS-------------------NVSSFDKAMEEEEIQSL

Query:  DSVRDQEVHTALIIQSQGDG-SSGGFV-----NSARKKMNWKRRARMGP---------------------------------IKKS------LILEQVSK
        D+ R+ E   +  I+++ +     GF+      +  KK  WK+RAR+ P                                 +KK+      L+ E VS 
Subjt:  DSVRDQEVHTALIIQSQGDG-SSGGFV-----NSARKKMNWKRRARMGP---------------------------------IKKS------LILEQVSK

Query:  KRKEYFDSVDVRK---KARLIGESSVVYSL---VRSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDA
        +  E    V ++      R +G    V  L   VR   PD++F+ ET+        +R++ G  GCF V   G  GGL L+WK    V IQSFS NHIDA
Subjt:  KRKEYFDSVDVRK---KARLIGESSVVYSL---VRSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDA

Query:  LISWKD-KKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRR-GV
         +   D  KWR TG YG PE   +  +W L+R L +++N PW++ GDFNE+L   E++G    + S +  FR  L+DC L++L ++G  F+W   R  G 
Subjt:  LISWKD-KKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRR-GV

Query:  QVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSD----LAKNLSSCSEV
         V  RLDR + N E+  LF    + ++ +  SDH  +  +++          K+PF+FE +W R   C D IK +  W  P+S      +A+ + +C   
Subjt:  QVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSD----LAKNLSSCSEV

Query:  LYKWGDDVRNKMKSRIKECKSALKEAYDKAPHIDYNA--IHGLEFELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEW
        L +W    + ++  R+ E K       + +P  +Y++  ++ L  E+  L+E+EEIFW+QRSR  WL+ G++NT ++H  AS R++ N I G+    G W
Subjt:  LYKWGDDVRNKMKSRIKECKSALKEAYDKAPHIDYNA--IHGLEFELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEW

Query:  TEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLN
          +   + +  V +F+ +F SS PD   + +V+  V++ VS  MN+ LL  FS  +I++A+  M P+KAPGPDG +A+F+Q+YW++VG       LD  +
Subjt:  TEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLN

Query:  GDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLD
          R + + N TNIVLI K+ NP+ +  FRPI+LCNV YKI +K L NR+K++L +IIS+SQSAF+ GRLISDNII+  E LH +   R       A KLD
Subjt:  GDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLD

Query:  LSKAFDQVEWPFLK
        +SKA+D+VEW FL+
Subjt:  LSKAFDQVEWPFLK

A0A2N9IPS8 Reverse transcriptase domain-containing protein1.4e-14632.05Show/hide
Query:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFD
        EEL+ EW+KFSL E ++ P F+ ++    N      H L+GKL++ R  +  A+KN M   W      +++ +G NLF F+F +  +RD V +  PWLFD
Subjt:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFD

Query:  RSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMG--GGRWIT
          LL L     +  +S+++F    FW+    +P+ Y  K    + G  +G  L+++  +D LGW  S+R+R+++D +K + RG M+    +G   GR + 
Subjt:  RSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMG--GGRWIT

Query:  I------RYERLPDFCFKCGCIG----HNAKEYSIDQEECESRGNI-FEFGAWLKFQGFSKG-------------------------------TRKQSPT
        +       + R P+F  +    G       ++ ++ + E     N+ FEF     F G S                                  R ++  
Subjt:  I------RYERLPDFCFKCGCIG----HNAKEYSIDQEECESRGNI-FEFGAWLKFQGFSKG-------------------------------TRKQSPT

Query:  TNDDIPDLNESDHQEEET-----SADILHNGE--------SLQVLRKVEEAMDVNFEKGPEKGIQPILDHSNVSSFDKAMEEEEIQSLDSVRDQEVHTAL
         ND+   +N   +QEE          ++  GE        +  +   V E M++  +K PEK +Q   ++SNV       E  +   L +V        L
Subjt:  TNDDIPDLNESDHQEEET-----SADILHNGE--------SLQVLRKVEEAMDVNFEKGPEKGIQPILDHSNVSSFDKAMEEEEIQSLDSVRDQEVHTAL

Query:  IIQSQGDGSSGGFVNSARK-KMNWKRRARMGPIKKSLILEQVSKKRKEYFDSVD---------VRKKARLIGESSVVYS---LVRSVCPDILFISETKSG
         ++  G  +SG      +K K  WKR AR    +K    E   +  K   +S +         +    + +G    V     L++   P +LF+SET+  
Subjt:  IIQSQGDGSSGGFVNSARK-KMNWKRRARMGPIKKSLILEQVSKKRKEYFDSVD---------VRKKARLIGESSVVYS---LVRSVCPDILFISETKSG

Query:  PEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALI--SWKDKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFN
             ++R+   FDG F VP RG  GGL +LW  K  V + ++S NHIDA I    K K +R TG YG PE  ++ ++W L+++L+ + + PW+  GDFN
Subjt:  PEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALI--SWKDKKWRFTGIYGFPEISQKHKTWNLIRNLNTMKNFPWILGGDFN

Query:  EILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRG-VQVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWR
        EIL + E+ G        +  FR+ +  C L +L + G+ +TW+  R G   V  RLDR + ++ + + +  + + +L    SDH PI  L+D       
Subjt:  EILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRG-VQVWERLDRFLYNLEFESLFAFAGMRNLEWMFSDHRPIAALVDCRRAAWR

Query:  KVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSD-----LAKNLSSCSEVLYKWGDDVRNKMKSRIKECKSALKEAYDKAPHIDYNAIHGLEFELEKL
        K  K+ F+FE +W + E+C ++I  +  W + ++       + + +  C   L  W  +    + S IK  +  L+   ++ P      I  L+ +L  L
Subjt:  KVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSD-----LAKNLSSCSEVLYKWGDDVRNKMKSRIKECKSALKEAYDKAPHIDYNAIHGLEFELEKL

Query:  LEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLL
        LE+EEIFW+QRSR  W+  G+KNT +FH + + RR+ N ISG+    G W  + + + +  V +F  IF SS P    +  VL+ + S V+  MN++L  
Subjt:  LEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLL

Query:  PFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLK
         F++ ++  A++ MYPTKAPGPDG SAIFYQ YW++VGP   +  L +L+    +   N T+I LI K+ NP+ + DFRPI+LCNV YKI++K LANRLK
Subjt:  PFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLK

Query:  IVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK
         VL  +ISE+QSAF+ GRLI+DN+++  E +HS++  R       A+KLD+SKA+D+VEW FL+
Subjt:  IVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK

A0A2N9J3U0 Reverse transcriptase domain-containing protein2.2e-14432.18Show/hide
Query:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFD
        E++   W  FSL   E +   + + +   N +G    ++  K L+ R I+  A+  T+   W+  H F+I+ +G N   F F  + D + V  N PW +D
Subjt:  EELINEWRKFSLKEREKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNTMAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFD

Query:  RSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITIR
        RSL+I +   AN   +E+ F    FW+++  +P+    + A+   G  +G         ++ G    +R+RV IDI+K L RG  +  DD    RWI+ +
Subjt:  RSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDSIRIRVKIDISKALLRGFMLKTDDMGGGRWITIR

Query:  YERLPDFCFKCGCIGHNAKEYSIDQEECES-RGNIFEFGAWLKFQGFSKGTRKQS-----------PTT------NDDIPDLNESDH-------------
        YERLP+FC+ CG + H  K+ SI      + R +  +FG W++ Q    G+R+ +           PT+       +D+ D   S H             
Subjt:  YERLPDFCFKCGCIGHNAKEYSIDQEECES-RGNIFEFGAWLKFQGFSKGTRKQS-----------PTT------NDDIPDLNESDH-------------

Query:  ----------QEEETSADILHNGESLQ------VLRKVEEAMDVNF--EKGPEKGIQPILDHSNVSSFDKAMEEEE-------IQSLDSVRDQEV----H
                  QE   ++       S+Q       L +++EA+ +     + P   I+ I   ++V   +K  E+ E         ++    DQ +    +
Subjt:  ----------QEEETSADILHNGESLQ------VLRKVEEAMDVNF--EKGPEKGIQPILDHSNVSSFDKAMEEEE-------IQSLDSVRDQEV----H

Query:  TALIIQSQGDGSS--GGFVNSA----------RKKMNWKRRARMGPIKKSLILEQVSKKRK--------EYFDSVD--VRKKARLIGESSV---------
        TA  I    D SS  GG                K  NWK+RAR      S +      + +        EY D ++    KK  +  +S+V         
Subjt:  TALIIQSQGDGSS--GGFVNSA----------RKKMNWKRRARMGPIKKSLILEQVSKKRK--------EYFDSVD--VRKKARLIGESSV---------

Query:  -----VYSLVRSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALI-SWKDKKWRFTGIYGFPEISQK
             ++ LV+   PD++F+SET+   +  +KIR++ G  G   V   G  GGL LLWKE   V+  S S  HID  I S    +W FTG YG P+ S++
Subjt:  -----VYSLVRSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALI-SWKDKKWRFTGIYGFPEISQK

Query:  HKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRGVQ-VWERLDRFLYNLEFESLFAFAGM
        H +W L+R L    + PW++ GDFNE+L   EK G  +     +E FR+ L+DCELK++ + G+ FTW   R G   V+ERLDR + + ++ SLF  A +
Subjt:  HKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRGVQ-VWERLDRFLYNLEFESLFAFAGM

Query:  RNLEWMFSDHRPIAALVDCRRAAWR-KVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSS----DLAKNLSSCSEVLYKWGDDVRNKMKSRIKECKSAL
        R++ +  SDH   A +VD      R +   + F+FE +W + E C ++++++  WQ P S      +++ + +C   L +W   +    K+R+ + ++A 
Subjt:  RNLEWMFSDHRPIAALVDCRRAAWR-KVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSS----DLAKNLSSCSEVLYKWGDDVRNKMKSRIKECKSAL

Query:  KEA-----YDKAPHIDYNAIHGLEFELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFE
         ++      ++     Y   +    +L ++L +EE +W+QRS   WLR G++NT +FH  AS R++ N I G+  A G      + +     A+F+NIF 
Subjt:  KEA-----YDKAPHIDYNAIHGLEFELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFE

Query:  SSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIA
        +S P  S +  V+  V+  V+QRMN+ LL PF+  +I  A+  M+PTKAPGPDG +A+FYQ++W++VG       L+ L+  + +++ N T+I LI KI 
Subjt:  SSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRIDIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIA

Query:  NPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK
        +P+ +  FRPI+LCNV YKII+K LANRLK VL  IIS++QSAF+ GRLI+DNI++  E LH +   R       A+KLD+SKA+D+VEW FL+
Subjt:  NPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFIQGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLK

A0A6J1D7D4 Glycosyltransferase2.4e-20775.71Show/hide
Query:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG
        MV +NEELRITVLPLFASGHIIPIVDM RLFARHGA VT+ITTESNAR+ QN++ RDFAAG+KIQT  + FPAAEVGLPPGIENFSDV S DL GKIY+ 
Subjt:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG

Query:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF
        FLIL+K IDQ IIP TRPDCILSDLS+ WTTDTA RLGVPRLVFFVSNFMAY AEHSVLQH+PHQ V+SD E FE+PGLPHKIQMTK QQPEFLVQR  F
Subjt:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF

Query:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY--------------
        T+MIE YK+AERRSYG V NTFYELDG+YLEHYK+  GIKAWGLGPVSLA NKD+IGK +RGNKSGMESGELL+WLNSK+PNSVLY              
Subjt:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY--------------

Query:  --------------------NDKNDGKEVAEGLPEGFEERIT-SNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN
                             ++N+G E  EGLPEGFEER+    K+GLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQF+N
Subjt:  --------------------NDKNDGKEVAEGLPEGFEERIT-SNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN

Query:  EKLLTEVLQVGVPVGTRRWWNMRDEMEE--IVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK
        EKLLTEVL+VGVPVG RRWWNM DEMEE  IVGRE+V  AVGFLMG  EEAA IR+RAKELGAAAKRAV  GGSSE NVVS+++ELRS+K
Subjt:  EKLLTEVLQVGVPVGTRRWWNMRDEMEE--IVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK

SwissProt top hitse value%identityAlignment
D4Q9Z4 Soyasapogenol B glucuronide galactosyltransferase3.6e-11543.15Show/hide
Query:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG
        M  K  EL+   LP  ++ HIIP+VDM RLFA H   VT+ITT  NA   Q +ID D + G  I+THV++FPAA+VGLP GIE F+  +  ++  +IY G
Subjt:  MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQG

Query:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF
          +L ++ ++ +    +PD I++D+ HPW+ D A +LG+PR++F  ++++A  A HSV Q++PH     DT+ F +PGLP  ++MT+LQ P++L     +
Subjt:  FLILDKLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAF

Query:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGE-LLEWLNSKDPNSVLYND-----------
        T+++ + K++E++SYG++ N+FY+L+  Y EHYK I G K+WG+GPVSL AN+D   K  RG     E  E  L+WLNSK  +SVLY             
Subjt:  TKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGE-LLEWLNSKDPNSVLYND-----------

Query:  --------------------KNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKL
                            KNDG E  +   E FE+R+  + +G ++  WAPQLLILE+P+ GG +THCGWN+ +E V+ G PM TWP+ +E FFNEKL
Subjt:  --------------------KNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKL

Query:  LTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVKLKRD
        + +VL++GVPVG + W N  +   E+V RE++  A+  LM   EE   +RKRAKEL  AAK A++ GGSS NN+  L++EL+ +KL ++
Subjt:  LTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVKLKRD

Q2V6J9 UDP-glucose flavonoid 3-O-glucosyltransferase 71.2e-9741.82Show/hide
Query:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQGFLILD
        ++L I  LP  A GH IP+ D+ +LF+ HGA  T++TT  NA        R      +I+  +I FP+AE GLP   E+   +++ D+ GK  +   +++
Subjt:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQGFLILD

Query:  KLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAFTKMIE
           ++ I+   RP C+++D    W TD A +  +PRL F  + F A CA  SV+ + PH N+SSD+E F IP LP +I+MT+ Q P F      F KM++
Subjt:  KLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAFTKMIE

Query:  SYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGN--KSGMESGELLEWLNSKDPNSVLYND---------------
        +  + E RSYG ++N+FYEL+  Y  HY+K+ G KAW +GPVS   NK +  K ERG+   S  E  E L+WL+SK P SV+Y                 
Subjt:  SYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGN--KSGMESGELLEWLNSKDPNSVLYND---------------

Query:  ---------------KNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKLLTEVL
                       K + KEV E LPEGFE+R+    +GLI+R WAPQ+LILEH + G F+THCGWNS +E VS G PM+TWPV  EQF+NEKL+TE+ 
Subjt:  ---------------KNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKLLTEVL

Query:  QVGVPVGTRRW----WNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSV
        ++GVPVG+ +W     ++  E E  V RE +E+AV  +M   +EA   R R KELG  A+RAVE GGSS  ++ +L+ EL  +
Subjt:  QVGVPVGTRRW----WNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSV

Q8H0F2 Anthocyanin 3'-O-beta-glucosyltransferase2.5e-9237.84Show/hide
Query:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQGFLILD
        ++L +   P  A+GHI+P +DM +LF+  G   T+ITT +N+      I+R    GF I    I FP+AE GLP G E      S+D+  + ++  ++L 
Subjt:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQGFLILD

Query:  KLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQP---EFLVQRKAFTK
        + + ++++   RP  +++DL   W  D A + G+PRL+F  S+  A  A  SV ++ P++N+SSD++PF +P +P KI +TK Q P   E        T+
Subjt:  KLIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQP---EFLVQRKAFTK

Query:  MIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------------
        M ++  ++E   YG ++N+FYEL+  Y+++ K + G +AW +GP+SL  N +     ERG KS +++ E L WL+SK+P+SV+Y                
Subjt:  MIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------------

Query:  ----------------NDKNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKLLT
                            D ++ ++  P+GFE+R+  N +GLI++ WAPQ+LILEH + G F++HCGWNS++EG+  G  MVTWP+ +EQF+NEKL+T
Subjt:  ----------------NDKNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKLLT

Query:  EVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRS
        ++L+ GV VG+ +W  +      +V RE + KAV  LM A EE   IR RAK L   AK+AVE GGSS +++ +L+ EL S
Subjt:  EVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRS

Q9AT54 Scopoletin glucosyltransferase7.3e-10041.56Show/hide
Query:  ELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQGFLILDK
        +L     P+ A GH+IP +DM +LFA  G   T+ITT  N       I R+   G +I+  +I FPA E GLP   E    + S +     ++   ++ +
Subjt:  ELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQGFLILDK

Query:  LIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEF--LVQRKAFTKMI
         ++Q +I   RPDC++SD+  PWTTDTA +  +PR+VF  ++F A C E+SV  + P +NVSSD+E F +P LPH+I++T+ Q   F    +  A T+MI
Subjt:  LIDQDIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEF--LVQRKAFTKMI

Query:  ESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY------------------
        ++ ++++ +SYG V N+FYEL+  Y+EHY K+ G +AW +GP+S+  N+D+  K ERG KS ++  E L+WL+SK P+SV+Y                  
Subjt:  ESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY------------------

Query:  -NDKNDGKEV----------AEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKLLTEVLQV
           +  G+E            + LPEGFEER  + ++GLI+R WAPQ+LIL+H S G F+THCGWNS++EGVS G PMVTWPV +EQFFNEKL+TEVL+ 
Subjt:  -NDKNDGKEV----------AEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKLLTEVLQV

Query:  GVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKEL
        G  VG+ +W   +    E V RE + KA+  +M  +EEA   R RAK     A++A+E GGSS   + +L++++
Subjt:  GVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKEL

Q9ZQG4 UDP-glycosyltransferase 73B52.1e-9139.14Show/hide
Query:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDL--WGKIYQG
        +E + I   P  A GH+IPI+DM +LF+R GA  T++TT  NA+  +  I+  ++     +I   + +FP  E+GLP G EN   ++S      G ++  
Subjt:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDL--WGKIYQG

Query:  FLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR
        FL   K + Q +   I  T+P  +++D+  PW T++AE+LGVPRLVF  ++F + C  +++  H PH+ V++ + PF IPGLP  I +T+  Q     + 
Subjt:  FLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR

Query:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------N
            K ++  +++E  S+G ++N+FYEL+  Y + Y+     +AW +GP+SL +N+++  K  RG K+ ++  E L+WL+SK P SV+Y          N
Subjt:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------N

Query:  D---------------------KNDGK-EVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN
        D                     KN+ + +  E LPEGF+ER T   +GLI+  WAPQ+LIL+H + GGF+THCGWNS+IEG++ G PMVTWP+ +EQF+N
Subjt:  D---------------------KNDGK-EVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN

Query:  EKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK
        EKLLT+VL++GV VG         +  +++ R +VEKAV  ++G  E+A   R  AK+LG  AK AVE GGSS N+V   M+EL   K
Subjt:  EKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK

Arabidopsis top hitse value%identityAlignment
AT2G15480.1 UDP-glucosyl transferase 73B51.5e-9239.14Show/hide
Query:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDL--WGKIYQG
        +E + I   P  A GH+IPI+DM +LF+R GA  T++TT  NA+  +  I+  ++     +I   + +FP  E+GLP G EN   ++S      G ++  
Subjt:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDL--WGKIYQG

Query:  FLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR
        FL   K + Q +   I  T+P  +++D+  PW T++AE+LGVPRLVF  ++F + C  +++  H PH+ V++ + PF IPGLP  I +T+  Q     + 
Subjt:  FLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR

Query:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------N
            K ++  +++E  S+G ++N+FYEL+  Y + Y+     +AW +GP+SL +N+++  K  RG K+ ++  E L+WL+SK P SV+Y          N
Subjt:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------N

Query:  D---------------------KNDGK-EVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN
        D                     KN+ + +  E LPEGF+ER T   +GLI+  WAPQ+LIL+H + GGF+THCGWNS+IEG++ G PMVTWP+ +EQF+N
Subjt:  D---------------------KNDGK-EVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN

Query:  EKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK
        EKLLT+VL++GV VG         +  +++ R +VEKAV  ++G  E+A   R  AK+LG  AK AVE GGSS N+V   M+EL   K
Subjt:  EKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK

AT2G15480.2 UDP-glucosyl transferase 73B56.8e-9338.83Show/hide
Query:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDL--WGKIYQG
        +E + I   P  A GH+IPI+DM +LF+R GA  T++TT  NA+  +  I+  ++     +I   + +FP  E+GLP G EN   ++S      G ++  
Subjt:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDL--WGKIYQG

Query:  FLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR
        FL   K + Q +   I  T+P  +++D+  PW T++AE+LGVPRLVF  ++F + C  +++  H PH+ V++ + PF IPGLP  I +T+  Q     + 
Subjt:  FLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR

Query:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------N
            K ++  +++E  S+G ++N+FYEL+  Y + Y+     +AW +GP+SL +N+++  K  RG K+ ++  E L+WL+SK P SV+Y          N
Subjt:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLY----------N

Query:  D---------------------KNDGK-EVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN
        D                     KN+ + +  E LPEGF+ER T   +GLI+  WAPQ+LIL+H + GGF+THCGWNS+IEG++ G PMVTWP+ +EQF+N
Subjt:  D---------------------KNDGK-EVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFN

Query:  EKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGAT---------EEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK
        EKLLT+VL++GV VG         +  +++ R +VEKAV  ++G           E+A   R RAKELG  AK AVE GGSS N+V   M+EL   K
Subjt:  EKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGAT---------EEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK

AT2G15490.1 UDP-glycosyltransferase 73B42.6e-9239.48Show/hide
Query:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQT-------HVISFPAAEVGLPPGIEN------FSDVSSLD
        E++ I   P  A GH+IP++DM +LFAR GA  T++TT  NA+ ++  I+      FK+Q         +++FP  E+GLP G EN      +    S D
Subjt:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQT-------HVISFPAAEVGLPPGIEN------FSDVSSLD

Query:  LWGKIYQGFLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQ
        L+ K    FL   K + Q +   I  T+P  +++D+  PW T++AE++GVPRLVF  ++  A C  +++  H PH+ V+S + PF IPGLP  I +T+  
Subjt:  LWGKIYQGFLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQ

Query:  QPEFLVQRKAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLYNDK
        Q     +   F K  +  +++E  S+G ++N+FYEL+  Y + Y+     KAW +GP+SL +N+ +  K  RG K+ ++  E L+WL+SK P SV+Y   
Subjt:  QPEFLVQRKAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLYNDK

Query:  NDGK--------EVAEG---------------------------LPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMV
          G         E+A G                           LP+GFEER  +  +GLI+R WAPQ+LIL+H + GGF+THCGWNS++EG++ G PMV
Subjt:  NDGK--------EVAEG---------------------------LPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMV

Query:  TWPVSSEQFFNEKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK
        TWP+ +EQF+NEKLLT+VL++GV VG         +  +++ R +VEKAV  ++G  E+A   R RAKELG  AK AVE GGSS N+V   M+EL   K
Subjt:  TWPVSSEQFFNEKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK

AT2G15490.3 UDP-glycosyltransferase 73B41.2e-9239.72Show/hide
Query:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQT-------HVISFPAAEVGLPPGIEN------FSDVSSLD
        E++ I   P  A GH+IP++DM +LFAR GA  T++TT  NA+ ++  I+      FK+Q         +++FP  E+GLP G EN      +    S D
Subjt:  EELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQT-------HVISFPAAEVGLPPGIEN------FSDVSSLD

Query:  LWGKIYQGFLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQ
        L+ K    FL   K + Q +   I  T+P  +++D+  PW T++AE++GVPRLVF  ++  A C  +++  H PH+ V+S + PF IPGLP  I +T+  
Subjt:  LWGKIYQGFLILDKLIDQDI---IPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQ

Query:  QPEFLVQRKAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLYNDK
        Q     +   F K  +  +++E  S+G ++N+FYEL+  Y + Y+     KAW +GP+SL +N+ +  K  RG K+ ++  E L+WL+SK P SV+Y   
Subjt:  QPEFLVQRKAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLYNDK

Query:  NDGK--------EVAEG------------------------LPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWP
          G         E+A G                        LP+GFEER  +  +GLI+R WAPQ+LIL+H + GGF+THCGWNS++EG++ G PMVTWP
Subjt:  NDGK--------EVAEG------------------------LPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWP

Query:  VSSEQFFNEKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK
        + +EQF+NEKLLT+VL++GV VG         +  +++ R +VEKAV  ++G  E+A   R RAKELG  AK AVE GGSS N+V   M+EL   K
Subjt:  VSSEQFFNEKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRSVK

AT4G34135.1 UDP-glucosyltransferase 73B22.6e-9240.5Show/hide
Query:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIEN---FSDVSSLDLWGKIYQ
        + +L +   P  A GH+IP +DM +LF+  GA  T++TT  N++ +Q  ID  ++   G +I   + +FP  E+GLP G EN   F+  ++ D    I +
Subjt:  NEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNID--RDFAAGFKIQTHVISFPAAEVGLPPGIEN---FSDVSSLDLWGKIYQ

Query:  GFLILDKLIDQ--DIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR
         F       DQ   ++  TRPDC+++D+  PW T+ A +  VPRLVF  + + + CA + +  H P + V+S +EPF IP LP  I +T+ Q  +     
Subjt:  GFLILDKLIDQ--DIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQR

Query:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLYND-------KN
            K +   +++E +S G V+N+FYEL+  Y + YK     +AW +GP+S+  N+    K ERG K+ ++  E L+WL+SK PNSV+Y         KN
Subjt:  KAFTKMIESYKKAERRSYGAVMNTFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLYND-------KN

Query:  D-------GKEVA----------------EGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEK
        +       G E +                E LPEGFEER+    +G+I+R WAPQ+LIL+H +TGGF+THCGWNS +EGV+ G PMVTWPV +EQF+NEK
Subjt:  D-------GKEVA----------------EGLPEGFEERITSNKRGLIVRIWAPQLLILEHPSTGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEK

Query:  LLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRS
        L+T+VL+ GV VG  +  +M+  M + + REKV+KAV  ++ A E A   R+RAK+L A AK AVE GGSS N++ S M+E  S
Subjt:  LLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSENNVVSLMKELRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTCGAAAAATGAAGAGTTGAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCTATCGTGGACATGGGCAGGCTCTTCGCCCGCCATGGCGCCAC
CGTCACCGTCATCACCACTGAGTCCAATGCCCGTGCTATTCAGAACAACATTGACCGTGACTTCGCTGCCGGCTTCAAGATTCAGACCCACGTCATTTCCTTCCCTGCTG
CAGAGGTCGGCCTCCCGCCCGGCATCGAGAACTTCAGCGACGTTTCTTCTCTCGACCTTTGGGGCAAAATCTACCAAGGCTTTCTCATTCTTGATAAATTAATCGATCAG
GACATCATTCCGGTGACCCGACCAGATTGCATTCTCAGCGATCTTTCACACCCGTGGACGACGGACACGGCAGAGAGGCTGGGAGTGCCGCGACTGGTGTTCTTCGTGTC
GAACTTCATGGCGTACTGCGCGGAGCACTCTGTTCTGCAACATTCCCCTCACCAGAATGTATCCTCCGACACAGAGCCATTCGAAATCCCCGGATTACCTCACAAGATCC
AAATGACCAAATTGCAGCAGCCAGAGTTTCTAGTCCAGCGAAAGGCCTTCACGAAGATGATTGAGAGCTACAAGAAAGCAGAGAGAAGAAGTTATGGAGCTGTAATGAAC
ACATTTTACGAGCTCGATGGGATCTATTTAGAGCACTACAAAAAGATAACCGGAATCAAAGCTTGGGGATTGGGACCAGTTTCGTTGGCAGCGAATAAGGATATGATTGG
AAAATTCGAGAGGGGAAACAAATCGGGAATGGAGAGTGGAGAATTATTGGAATGGTTGAATTCCAAGGACCCGAATTCGGTTTTGTATAATGACAAAAACGACGGCAAAG
AGGTAGCAGAGGGGTTGCCGGAGGGGTTCGAAGAAAGGATTACAAGCAACAAAAGGGGATTGATCGTTCGAATTTGGGCACCGCAGCTTTTGATTTTGGAGCACCCCTCA
ACAGGGGGCTTTTTGACGCATTGTGGGTGGAATTCGTCCATTGAAGGGGTCAGTACCGGGCAGCCGATGGTGACGTGGCCGGTCAGCTCCGAACAATTTTTTAATGAGAA
GTTGTTGACGGAGGTGCTACAGGTGGGGGTTCCAGTAGGGACACGACGGTGGTGGAATATGAGGGATGAGATGGAGGAGATTGTGGGGAGAGAGAAGGTGGAGAAGGCTG
TAGGGTTTTTGATGGGGGCAACCGAGGAGGCGGCGACGATTAGGAAGCGAGCGAAAGAGCTCGGAGCAGCTGCAAAGAGGGCGGTTGAAAACGGTGGCTCGTCGGAGAAT
AATGTGGTGTCACTGATGAAAGAATTGAGATCTGTTAAGTTGAAACGTGATGGCAGTTCAGCGCTTTGCTGCTGGAGTGGTGAATTGTTTTGTGAATGTTTGAGTGTCAT
CTGTGTTGGCTTGTGTTTTGGGGGAATAGCTGCAGTCAAAATGGAGACAGAAGAGTTAATCAACGAATGGAGGAAATTCAGTCTGAAGGAACGTGAAAAAGAACCAGCTT
TTTCCTTTAATGAAAAGGAAAAGATGAATATCGATGGGAATCTTGGCCATAGTCTTGTCGGGAAACTTCTGTCAAGCAGAATCATATCAAGCTTGGCGATAAAGAATACA
ATGGCAGAAGCTTGGAAGACTAGGCATAAATTTAGTATTAAGGTGGTGGGTAGAAATCTTTTTGCGTTCAAATTTTGTAGTCAGGAAGACAGAGATTGGGTGGCTGCAAA
CGAACCTTGGCTTTTTGATCGATCTTTATTGATTCTTGAATTTCCTGCAGCAAATCAAAGGTCATCTGAACTGCGCTTTGATAGGGTGGCTTTTTGGTTACGTCTTTTTA
ATTTGCCAATGGGATATCAAAATAAATGTGCTGCTGTTAAATTTGGGAATCTCATTGGAGATTTCTTGGAGATGGAAAATGAGGATGATGATCTTGGTTGGAGTGACAGC
ATTCGCATTAGAGTCAAGATTGACATATCTAAAGCTCTACTGAGGGGTTTTATGTTAAAGACAGATGATATGGGCGGTGGGAGATGGATTACCATCAGATACGAGCGTCT
TCCTGATTTTTGCTTCAAGTGTGGATGCATTGGTCACAATGCTAAGGAGTATTCAATCGATCAAGAAGAATGTGAATCTCGAGGCAATATCTTCGAATTTGGTGCGTGGC
TGAAGTTCCAGGGTTTCTCTAAAGGAACAAGGAAACAATCCCCTACAACGAATGATGATATCCCAGACTTGAATGAATCAGACCATCAGGAAGAGGAGACTTCTGCGGAT
ATCCTGCATAATGGTGAATCTCTCCAGGTTTTACGTAAAGTAGAGGAGGCTATGGATGTAAATTTTGAGAAAGGTCCTGAGAAAGGGATTCAGCCAATTCTTGATCATTC
CAATGTTTCCTCGTTTGATAAAGCGATGGAGGAGGAGGAGATTCAATCTTTGGATAGTGTCAGGGATCAAGAGGTACACACAGCTCTAATCATTCAGAGCCAGGGAGATG
GTTCCAGCGGAGGTTTTGTTAACTCTGCTAGGAAGAAGATGAACTGGAAGAGACGTGCTAGGATGGGCCCTATTAAAAAATCTCTTATTCTGGAACAAGTATCAAAGAAG
AGGAAAGAGTATTTTGATTCAGTAGATGTGAGGAAGAAGGCCAGATTGATCGGTGAATCATCTGTCGTATATTCCCTAGTTCGGTCTGTTTGCCCTGACATTCTTTTCAT
CTCGGAAACCAAGAGTGGCCCAGAGATTCCCAACAAGATTAGAATCCAATGCGGCTTTGATGGTTGTTTTTCAGTTCCTAGCAGGGGAGCCAGCGGAGGTCTTTGTCTTC
TTTGGAAGGAGAAGGATTTGGTTTCTATCCAGTCCTTTTCCCTAAATCATATTGATGCTTTGATCAGTTGGAAGGATAAAAAGTGGAGATTCACTGGAATCTATGGATTT
CCAGAAATTAGTCAAAAGCACAAGACCTGGAACCTCATTCGGAACTTGAATACTATGAAGAACTTCCCTTGGATTCTTGGTGGTGATTTCAACGAAATCTTGTGGGATTT
TGAGAAGTATGGGGGTCCTAGTTGTTCAAATAGTCTGTTGGAGGGGTTTAGGAAGACGCTGAATGATTGTGAGTTAAAAGAATTAAAGTTTTCCGGAGATCCCTTTACTT
GGAAAGGAAATAGGCGGGGAGTTCAAGTGTGGGAAAGGCTGGACAGATTTCTCTATAACCTGGAATTCGAATCCTTGTTTGCCTTCGCAGGTATGCGTAATCTGGAGTGG
ATGTTTTCGGATCACAGGCCTATTGCAGCCTTGGTGGACTGTCGTAGAGCTGCTTGGAGGAAGGTTGTAAAGCGTCCTTTTAAATTTGAGGAGTTGTGGACCAGGTATGA
GGAATGTGCAGACATTATTAAGAAGTCTGGTGATTGGCAGAATCCTTTGTCCTCTGATCTTGCTAAGAATCTGTCCTCTTGCTCTGAAGTCTTGTATAAATGGGGTGATG
ATGTGCGTAATAAGATGAAGAGTAGGATTAAAGAATGTAAATCTGCTTTAAAAGAAGCCTATGACAAAGCCCCTCACATCGACTACAATGCCATCCATGGACTGGAGTTC
GAACTAGAAAAATTGCTAGAAGAAGAAGAGATTTTTTGGAAGCAAAGATCTCGTGAAGACTGGTTGAGATGGGGCGAGAAAAACACGGGTTGGTTTCACAGAAAGGCTTC
TATACGCAGGCAAATCAATGAGATTTCAGGGATTACTAGTGCTACTGGAGAGTGGACCGAGGATCCTTCTCTTGTTGAGGATACTTTTGTGGCATTTTTTAATAATATTT
TCGAGTCTTCCAAGCCTGATATGAGCATTATGGATGATGTTTTGCGTTGTGTTAATTCCAGAGTTTCCCAAAGAATGAATGAAAAGCTTCTTCTTCCTTTTTCCAGAATT
GATATTGAAAAGGCAGTCCAAGATATGTACCCCACCAAGGCGCCGGGCCCAGATGGTTACTCTGCAATCTTTTACCAGCAGTATTGGAATATGGTTGGTCCTTCAACCAT
TAAGGATTGCTTAGATGTTCTAAATGGTGACAGGAGCGTGGAGGCTTGGAATAAGACTAATATCGTGTTGATTTCGAAGATAGCCAATCCGAAGGAGGTGGGAGATTTCA
GACCCATCAATCTTTGTAATGTAAGTTATAAAATCATTACAAAGACTCTTGCAAACAGGCTAAAAATCGTATTGAGGGATATTATTTCGGAGTCTCAATCAGCCTTCATT
CAGGGAAGGCTAATCTCTGATAACATAATTCTGGGGCATGAATGTCTTCATTCCATAAATCAGAATCGTACGAAGATTATAGATATGGCTGCTATGAAATTGGATCTCAG
TAAGGCCTTTGACCAAGTGGAATGGCCCTTTTTGAAGGAATCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGTCGAAAAATGAAGAGTTGAGAATCACCGTCCTCCCGCTGTTCGCCTCCGGCCACATAATTCCTATCGTGGACATGGGCAGGCTCTTCGCCCGCCATGGCGCCAC
CGTCACCGTCATCACCACTGAGTCCAATGCCCGTGCTATTCAGAACAACATTGACCGTGACTTCGCTGCCGGCTTCAAGATTCAGACCCACGTCATTTCCTTCCCTGCTG
CAGAGGTCGGCCTCCCGCCCGGCATCGAGAACTTCAGCGACGTTTCTTCTCTCGACCTTTGGGGCAAAATCTACCAAGGCTTTCTCATTCTTGATAAATTAATCGATCAG
GACATCATTCCGGTGACCCGACCAGATTGCATTCTCAGCGATCTTTCACACCCGTGGACGACGGACACGGCAGAGAGGCTGGGAGTGCCGCGACTGGTGTTCTTCGTGTC
GAACTTCATGGCGTACTGCGCGGAGCACTCTGTTCTGCAACATTCCCCTCACCAGAATGTATCCTCCGACACAGAGCCATTCGAAATCCCCGGATTACCTCACAAGATCC
AAATGACCAAATTGCAGCAGCCAGAGTTTCTAGTCCAGCGAAAGGCCTTCACGAAGATGATTGAGAGCTACAAGAAAGCAGAGAGAAGAAGTTATGGAGCTGTAATGAAC
ACATTTTACGAGCTCGATGGGATCTATTTAGAGCACTACAAAAAGATAACCGGAATCAAAGCTTGGGGATTGGGACCAGTTTCGTTGGCAGCGAATAAGGATATGATTGG
AAAATTCGAGAGGGGAAACAAATCGGGAATGGAGAGTGGAGAATTATTGGAATGGTTGAATTCCAAGGACCCGAATTCGGTTTTGTATAATGACAAAAACGACGGCAAAG
AGGTAGCAGAGGGGTTGCCGGAGGGGTTCGAAGAAAGGATTACAAGCAACAAAAGGGGATTGATCGTTCGAATTTGGGCACCGCAGCTTTTGATTTTGGAGCACCCCTCA
ACAGGGGGCTTTTTGACGCATTGTGGGTGGAATTCGTCCATTGAAGGGGTCAGTACCGGGCAGCCGATGGTGACGTGGCCGGTCAGCTCCGAACAATTTTTTAATGAGAA
GTTGTTGACGGAGGTGCTACAGGTGGGGGTTCCAGTAGGGACACGACGGTGGTGGAATATGAGGGATGAGATGGAGGAGATTGTGGGGAGAGAGAAGGTGGAGAAGGCTG
TAGGGTTTTTGATGGGGGCAACCGAGGAGGCGGCGACGATTAGGAAGCGAGCGAAAGAGCTCGGAGCAGCTGCAAAGAGGGCGGTTGAAAACGGTGGCTCGTCGGAGAAT
AATGTGGTGTCACTGATGAAAGAATTGAGATCTGTTAAGTTGAAACGTGATGGCAGTTCAGCGCTTTGCTGCTGGAGTGGTGAATTGTTTTGTGAATGTTTGAGTGTCAT
CTGTGTTGGCTTGTGTTTTGGGGGAATAGCTGCAGTCAAAATGGAGACAGAAGAGTTAATCAACGAATGGAGGAAATTCAGTCTGAAGGAACGTGAAAAAGAACCAGCTT
TTTCCTTTAATGAAAAGGAAAAGATGAATATCGATGGGAATCTTGGCCATAGTCTTGTCGGGAAACTTCTGTCAAGCAGAATCATATCAAGCTTGGCGATAAAGAATACA
ATGGCAGAAGCTTGGAAGACTAGGCATAAATTTAGTATTAAGGTGGTGGGTAGAAATCTTTTTGCGTTCAAATTTTGTAGTCAGGAAGACAGAGATTGGGTGGCTGCAAA
CGAACCTTGGCTTTTTGATCGATCTTTATTGATTCTTGAATTTCCTGCAGCAAATCAAAGGTCATCTGAACTGCGCTTTGATAGGGTGGCTTTTTGGTTACGTCTTTTTA
ATTTGCCAATGGGATATCAAAATAAATGTGCTGCTGTTAAATTTGGGAATCTCATTGGAGATTTCTTGGAGATGGAAAATGAGGATGATGATCTTGGTTGGAGTGACAGC
ATTCGCATTAGAGTCAAGATTGACATATCTAAAGCTCTACTGAGGGGTTTTATGTTAAAGACAGATGATATGGGCGGTGGGAGATGGATTACCATCAGATACGAGCGTCT
TCCTGATTTTTGCTTCAAGTGTGGATGCATTGGTCACAATGCTAAGGAGTATTCAATCGATCAAGAAGAATGTGAATCTCGAGGCAATATCTTCGAATTTGGTGCGTGGC
TGAAGTTCCAGGGTTTCTCTAAAGGAACAAGGAAACAATCCCCTACAACGAATGATGATATCCCAGACTTGAATGAATCAGACCATCAGGAAGAGGAGACTTCTGCGGAT
ATCCTGCATAATGGTGAATCTCTCCAGGTTTTACGTAAAGTAGAGGAGGCTATGGATGTAAATTTTGAGAAAGGTCCTGAGAAAGGGATTCAGCCAATTCTTGATCATTC
CAATGTTTCCTCGTTTGATAAAGCGATGGAGGAGGAGGAGATTCAATCTTTGGATAGTGTCAGGGATCAAGAGGTACACACAGCTCTAATCATTCAGAGCCAGGGAGATG
GTTCCAGCGGAGGTTTTGTTAACTCTGCTAGGAAGAAGATGAACTGGAAGAGACGTGCTAGGATGGGCCCTATTAAAAAATCTCTTATTCTGGAACAAGTATCAAAGAAG
AGGAAAGAGTATTTTGATTCAGTAGATGTGAGGAAGAAGGCCAGATTGATCGGTGAATCATCTGTCGTATATTCCCTAGTTCGGTCTGTTTGCCCTGACATTCTTTTCAT
CTCGGAAACCAAGAGTGGCCCAGAGATTCCCAACAAGATTAGAATCCAATGCGGCTTTGATGGTTGTTTTTCAGTTCCTAGCAGGGGAGCCAGCGGAGGTCTTTGTCTTC
TTTGGAAGGAGAAGGATTTGGTTTCTATCCAGTCCTTTTCCCTAAATCATATTGATGCTTTGATCAGTTGGAAGGATAAAAAGTGGAGATTCACTGGAATCTATGGATTT
CCAGAAATTAGTCAAAAGCACAAGACCTGGAACCTCATTCGGAACTTGAATACTATGAAGAACTTCCCTTGGATTCTTGGTGGTGATTTCAACGAAATCTTGTGGGATTT
TGAGAAGTATGGGGGTCCTAGTTGTTCAAATAGTCTGTTGGAGGGGTTTAGGAAGACGCTGAATGATTGTGAGTTAAAAGAATTAAAGTTTTCCGGAGATCCCTTTACTT
GGAAAGGAAATAGGCGGGGAGTTCAAGTGTGGGAAAGGCTGGACAGATTTCTCTATAACCTGGAATTCGAATCCTTGTTTGCCTTCGCAGGTATGCGTAATCTGGAGTGG
ATGTTTTCGGATCACAGGCCTATTGCAGCCTTGGTGGACTGTCGTAGAGCTGCTTGGAGGAAGGTTGTAAAGCGTCCTTTTAAATTTGAGGAGTTGTGGACCAGGTATGA
GGAATGTGCAGACATTATTAAGAAGTCTGGTGATTGGCAGAATCCTTTGTCCTCTGATCTTGCTAAGAATCTGTCCTCTTGCTCTGAAGTCTTGTATAAATGGGGTGATG
ATGTGCGTAATAAGATGAAGAGTAGGATTAAAGAATGTAAATCTGCTTTAAAAGAAGCCTATGACAAAGCCCCTCACATCGACTACAATGCCATCCATGGACTGGAGTTC
GAACTAGAAAAATTGCTAGAAGAAGAAGAGATTTTTTGGAAGCAAAGATCTCGTGAAGACTGGTTGAGATGGGGCGAGAAAAACACGGGTTGGTTTCACAGAAAGGCTTC
TATACGCAGGCAAATCAATGAGATTTCAGGGATTACTAGTGCTACTGGAGAGTGGACCGAGGATCCTTCTCTTGTTGAGGATACTTTTGTGGCATTTTTTAATAATATTT
TCGAGTCTTCCAAGCCTGATATGAGCATTATGGATGATGTTTTGCGTTGTGTTAATTCCAGAGTTTCCCAAAGAATGAATGAAAAGCTTCTTCTTCCTTTTTCCAGAATT
GATATTGAAAAGGCAGTCCAAGATATGTACCCCACCAAGGCGCCGGGCCCAGATGGTTACTCTGCAATCTTTTACCAGCAGTATTGGAATATGGTTGGTCCTTCAACCAT
TAAGGATTGCTTAGATGTTCTAAATGGTGACAGGAGCGTGGAGGCTTGGAATAAGACTAATATCGTGTTGATTTCGAAGATAGCCAATCCGAAGGAGGTGGGAGATTTCA
GACCCATCAATCTTTGTAATGTAAGTTATAAAATCATTACAAAGACTCTTGCAAACAGGCTAAAAATCGTATTGAGGGATATTATTTCGGAGTCTCAATCAGCCTTCATT
CAGGGAAGGCTAATCTCTGATAACATAATTCTGGGGCATGAATGTCTTCATTCCATAAATCAGAATCGTACGAAGATTATAGATATGGCTGCTATGAAATTGGATCTCAG
TAAGGCCTTTGACCAAGTGGAATGGCCCTTTTTGAAGGAATCATGA
Protein sequenceShow/hide protein sequence
MVSKNEELRITVLPLFASGHIIPIVDMGRLFARHGATVTVITTESNARAIQNNIDRDFAAGFKIQTHVISFPAAEVGLPPGIENFSDVSSLDLWGKIYQGFLILDKLIDQ
DIIPVTRPDCILSDLSHPWTTDTAERLGVPRLVFFVSNFMAYCAEHSVLQHSPHQNVSSDTEPFEIPGLPHKIQMTKLQQPEFLVQRKAFTKMIESYKKAERRSYGAVMN
TFYELDGIYLEHYKKITGIKAWGLGPVSLAANKDMIGKFERGNKSGMESGELLEWLNSKDPNSVLYNDKNDGKEVAEGLPEGFEERITSNKRGLIVRIWAPQLLILEHPS
TGGFLTHCGWNSSIEGVSTGQPMVTWPVSSEQFFNEKLLTEVLQVGVPVGTRRWWNMRDEMEEIVGREKVEKAVGFLMGATEEAATIRKRAKELGAAAKRAVENGGSSEN
NVVSLMKELRSVKLKRDGSSALCCWSGELFCECLSVICVGLCFGGIAAVKMETEELINEWRKFSLKEREKEPAFSFNEKEKMNIDGNLGHSLVGKLLSSRIISSLAIKNT
MAEAWKTRHKFSIKVVGRNLFAFKFCSQEDRDWVAANEPWLFDRSLLILEFPAANQRSSELRFDRVAFWLRLFNLPMGYQNKCAAVKFGNLIGDFLEMENEDDDLGWSDS
IRIRVKIDISKALLRGFMLKTDDMGGGRWITIRYERLPDFCFKCGCIGHNAKEYSIDQEECESRGNIFEFGAWLKFQGFSKGTRKQSPTTNDDIPDLNESDHQEEETSAD
ILHNGESLQVLRKVEEAMDVNFEKGPEKGIQPILDHSNVSSFDKAMEEEEIQSLDSVRDQEVHTALIIQSQGDGSSGGFVNSARKKMNWKRRARMGPIKKSLILEQVSKK
RKEYFDSVDVRKKARLIGESSVVYSLVRSVCPDILFISETKSGPEIPNKIRIQCGFDGCFSVPSRGASGGLCLLWKEKDLVSIQSFSLNHIDALISWKDKKWRFTGIYGF
PEISQKHKTWNLIRNLNTMKNFPWILGGDFNEILWDFEKYGGPSCSNSLLEGFRKTLNDCELKELKFSGDPFTWKGNRRGVQVWERLDRFLYNLEFESLFAFAGMRNLEW
MFSDHRPIAALVDCRRAAWRKVVKRPFKFEELWTRYEECADIIKKSGDWQNPLSSDLAKNLSSCSEVLYKWGDDVRNKMKSRIKECKSALKEAYDKAPHIDYNAIHGLEF
ELEKLLEEEEIFWKQRSREDWLRWGEKNTGWFHRKASIRRQINEISGITSATGEWTEDPSLVEDTFVAFFNNIFESSKPDMSIMDDVLRCVNSRVSQRMNEKLLLPFSRI
DIEKAVQDMYPTKAPGPDGYSAIFYQQYWNMVGPSTIKDCLDVLNGDRSVEAWNKTNIVLISKIANPKEVGDFRPINLCNVSYKIITKTLANRLKIVLRDIISESQSAFI
QGRLISDNIILGHECLHSINQNRTKIIDMAAMKLDLSKAFDQVEWPFLKES