; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cp4.1LG04g03680 (gene) of Cucurbita pepo (MU-CU-16) v4.1 genome

Gene IDCp4.1LG04g03680
OrganismCucurbita pepo var. pepo MU-CU-16 (Cucurbita pepo (MU-CU-16) v4.1)
DescriptionProtein DETOXIFICATION
Genome locationCp4.1LG04:7165437..7180306
RNA-Seq ExpressionCp4.1LG04g03680
SyntenyCp4.1LG04g03680
Gene Ontology termsGO:1990961 - xenobiotic detoxification by transmembrane export across the plasma membrane (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0015297 - antiporter activity (molecular function)
GO:0042910 - xenobiotic transmembrane transporter activity (molecular function)
InterPro domainsIPR002528 - Multi antimicrobial extrusion protein
IPR045069 - Multidrug and toxic compound extrusion family, eukaryotic


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH00002.1 MATE efflux family protein, partial [Prunus dulcis]0.057.51Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        S ELE++LSDT  PF+ R++ A WIE KLL YLAAPAV VYMIN  MSMST+IF+GHLGN ELAAASLGN+G+  F+YGLLLGMGSAVETLCGQAYGA K
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        YEMLGIYLQRS+ILLTL   LLT++Y++ +P+L+LLG+SP IAS+AA++VYGLIPQIF++AVNFPIQKFLQ+QS+V PSAYISA+TLV+HLVLSW   YK
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
        +G GL GA+ VLSLSWWI V AQ +YIVKSER K +W G SW AF GL  FFKLS ASA+MLCLETWYFQI VL+AGLL NPELALDSLSIC TI+G  F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA---RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA-----------
        MISVG NAAA   RV NELG G+PKS AFSVVVV  +S L+SI+ A  VL +R++ISY FT+G  VAAAVSDLCPFLA+TLLLNGIQP            
Subjt:  MISVGFNAAA---RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA-----------

Query:  -----------WLWDVGGNLLL----------LWSMCAAITLLACP---------------SEFSSMGSAYEDDLRQALLQPAAAAL----LSSQSLCSN
                   +L  V   +LL          +W    A TL+                   E +   + +E+ L Q+   P         L    L  +
Subjt:  -----------WLWDVGGNLLL----------LWSMCAAITLLACP---------------SEFSSMGSAYEDDLRQALLQPAAAAL----LSSQSLCSN

Query:  KHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM---------------
         H  S+ELER+LSDT+ P ++R   A WIE KLLFYLA+PAV VYMINY M+ STQIF+GHLGNLELAA+SLGNNGIQVFAYG+M               
Subjt:  KHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM---------------

Query:  --------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSW
                      RS +LLTLTGV+LTI YIF +PIL+ LG+S  IASA+  +VYGL+PQI+AYA+NFPIQKFLQAQSIV PSA+ISA TL +H+VLSW
Subjt:  --------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSW

Query:  LAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR--
        +  YKMGLGLLG +LVLSLSWW+ V+ QFVYIVKS  CK TW GF+ QAFSGL GFFKLS ASAVMLCLETWYFQILVLLAGLL NPELALDSLSIC   
Subjt:  LAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR--

Query:  ----------LQRCRQCESKQRTGK------------------------------WESKIRSVFRGGGGCHL-------HHCLHLLCRTCTGSSQRVAVG
                  L            G                               W   I   F  G               L LL          VAVG
Subjt:  ----------LQRCRQCESKQRTGK------------------------------WESKIRSVFRGGGGCHL-------HHCLHLLCRTCTGSSQRVAVG

Query:  CGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAK------------------GIWIGLMGGTFMQTVILVW
        CGWQAFVAYVNVGCYY+VGVP GALLGFYFK GAK                  GIW+G+M GT MQT+IL+W
Subjt:  CGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAK------------------GIWIGLMGGTFMQTVILVW

KAG2306525.1 hypothetical protein Bca52824_026273 [Brassica carinata]0.056.97Show/hide
Query:  EPLVQP--AAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMG
        +PL+QP  +      +GELE++LSD E P   R+R A  IE KLL  LAAPAV VYMIN  MSMST+IFSGHLGN ELAAASLGN+G+Q FAYGL+LGMG
Subjt:  EPLVQP--AAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMG

Query:  SAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAA
        SAVETLCGQA+G  KY+MLG+YLQRS++LLTL   LLT +YVFS+P+LL LGESPEIAS+A++FVYGLIPQIFAYA+NFPIQKFLQ+QS+V PSAYI+ A
Subjt:  SAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAA

Query:  TLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELA
        TL VHL+LSW A YKLG GL GAS VLSLSWWIIV AQ VYIV S+R +E+WRG S  AF GLP FFKLSAASA+MLCLETWYFQILVL+AGLL+NPELA
Subjt:  TLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELA

Query:  LDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGI
        LDSLSICMTI+G  FMISVGFNAA   RV NELG G+PKSAAFSV++V   S +  +++A A++  R+ +SY FT+G  V+AAVSDLCP LAVTL+LNGI
Subjt:  LDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGI

Query:  QPA--------------------------------------------WLWDVGGNLL----LLW--------SMCAAITLLACPSEFSS-----------
        QP                                             W   + G L+    L W           + +   + P +              
Subjt:  QPA--------------------------------------------WLWDVGGNLL----LLW--------SMCAAITLLACPSEFSS-----------

Query:  MGSAYEDDLRQALLQPAAAALLSSQSLCSNKHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAAS
        M S+  D + Q LL P A   +    L  +    + ELER+LSD E P+  R  +AT IE KLLF LAAPAV VYMINY MS STQI SGHLGNLELAA+
Subjt:  MGSAYEDDLRQALLQPAAAALLSSQSLCSNKHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAAS

Query:  SLGNNGIQVFAYGLM----------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPI
        SLGN GIQ+FAYGLM                            RS +LLTLTG++LT+ Y+F KPILLFLGES +IASA+  FVYGL+PQI+AYA+NFPI
Subjt:  SLGNNGIQVFAYGLM----------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPI

Query:  QKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLET
        QKFLQAQSIV PSA+IS  TL VH++LSWLA YK+G+GLLG SLVLSLSWW+IVV QFVYIV S  C  TWRGF+VQAF GL  FFKLSAASAVMLCLET
Subjt:  QKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLET

Query:  WYFQILVLLAGLLDNPELALDSLSIC--------------RLQRCRQCESKQRTGKWESKIRSV----------------------------FRGGG--G
        WYFQILVLLAGLL+NPELALDSLSIC                    +  ++   G  +S   SV                            F  G    
Subjt:  WYFQILVLLAGLLDNPELALDSLSIC--------------RLQRCRQCESKQRTGKWESKIRSV----------------------------FRGGG--G

Query:  CHLHHCLHLLCRTCTGSSQR-----VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSS
          +     LL  T   +  +     VAVGCGWQAFVA VNVGCYYIVG+PLGAL GFYFKF AKGIW G++ GT +QTVIL WVT+RTDW KE+  +
Subjt:  CHLHHCLHLLCRTCTGSSQR-----VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSS

KAG6572298.1 Protein DETOXIFICATION 40, partial [Cucurbita argyrosperma subsp. sororia]0.062.58Show/hide
Query:  AKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQ
        + +H  S ELEKILSDT+L  V R   A W E+KL+ +LAAPA+ V++INN +SMST+I SGHLGN ELAA+SLGN+G+Q FAYGL++GMGSA+ETLCGQ
Subjt:  AKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQ

Query:  AYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLS
        A+GA KY MLGIYLQRS+ILLT+   LLT+ Y+F KP+L+ LGE  +IAS+AA FVYGLIPQIFAYA+NFP QKFLQ+QS+V PS  ISA TLVVH++LS
Subjt:  AYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLS

Query:  WAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMT
        W  AYK+G GLFG S VLS SWWIIV AQ VYI+KS++ K +W G S  AF GLP +FKLS ASA M CLE WYF I+VL+AGLL+NPELALDSLSICMT
Subjt:  WAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMT

Query:  INGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------
        ++   FMISVGF+AA   RV NELG  HPKSAAF VVVV A+  ++S+  A  VL+ R+VISY FT+   VAAAVSDLCPFLA+TLLL GIQP       
Subjt:  INGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------

Query:  --------------------------------------WLWDVGGN----LLLLWSM----------CAAITLLACPSEFS------SMGSAYEDDLRQA
                                              W+  +GG     +LL+W +           AA    A  ++ +       M S  +DD+RQA
Subjt:  --------------------------------------WLWDVGGN----LLLLWSM----------CAAITLLACPSEFS------SMGSAYEDDLRQA

Query:  LLQPAAAALLSSQSLCSNKHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAY
        LLQPAAAALLSSQSLCSNKHE SDELERILSDTEMPVV+RY++ATWIEIKLL YLAAPAVFVYMINYAMSTSTQIF+GHLGNLELAASSLGNNGIQVFAY
Subjt:  LLQPAAAALLSSQSLCSNKHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAY

Query:  GLM----------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFP
        GLM                            RSAILLT TGV+LTIPY+FCKP+L+FLGESKDIASA+E FVYGL+PQI+AYA+NFPIQKFLQAQSIVFP
Subjt:  GLM----------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFP

Query:  SAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGL
        SAFISAGTLVVH++LSWLAAYKMGLGLLGVSLVLSLSWW+IVVGQFVYI+KS +CKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGL
Subjt:  SAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGL

Query:  LDNPELALDSLSICR------------LQRCRQCESKQRTGKWESK------------------------------IRSVFRGGGG--------CHLHHC
        L+NPELALDSLSIC             L            G    K                              I   F GG          C L   
Subjt:  LDNPELALDSLSICR------------LQRCRQCESKQRTGKWESK------------------------------IRSVFRGGGG--------CHLHHC

Query:  LHLLCRTCTGSSQRVAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSS
        L LL          VAVGCGWQ+FVAYVNVGCYY+VGVPLGALLGFYF FGAKGIW+GLMGGT MQT+ILVWVTWRTDWNKE+  +
Subjt:  LHLLCRTCTGSSQRVAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSS

KAG6754564.1 hypothetical protein POTOM_040353 [Populus tomentosa]3.56e-31552.76Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        S ELE+IL+DTE P+  RIR A+W+ELKLL +LAAPAV VY++NN +SMST+IF GHLGN ELAA SLGN+G+Q FAYGL+LGMGSAVETLCGQA+GA +
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        YEMLG+YLQRS+ILL   A  L ++Y+FS+PLL+LLGE   IAS+AAVFVYGLIPQIFAYA NFPIQKFLQ+QS++ PSAYIS   LVVH++LSW A +K
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
          +GL GA  VLSLSWWIIV  Q VYI+ S   +++W+G S  AF GL  FFKLSAASA+MLCLETWY+QILVL+AGLL N E+ALD+LS+CMT++G  F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------
        MISVGFNAAA  RV NELG GHPKSA+FSV+VV + S ++S++ A  VL FR+ ISY+FT+G  VA A SDL PFLA TL+LNGIQP             
Subjt:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------

Query:  ------------------------------------------------WL-------------------WDVGGNLLLLWSMCAAITLLACPSEFSSMGS
                                                        W+                   WD  G  LL+  +C  +T      E  ++  
Subjt:  ------------------------------------------------WL-------------------WDVGGNLLLLWSMCAAITLLACPSEFSSMGS

Query:  AYEDDLRQALLQPAAAALLSSQSLCSNKHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLG
              ++  ++     L    S+       SDELE+IL+D E P  +R   A+W+E+KLLF+LAAPAV VY++N  +S STQIF GHLGNLELAA SLG
Subjt:  AYEDDLRQALLQPAAAALLSSQSLCSNKHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLG

Query:  NNGIQVFAYGLM----------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKF
        N GIQVFAYGLM                            RS +LL  TG+ L + YIF KPIL+ LGE  +IASA+  FV+GL+PQI+AYA NFPIQKF
Subjt:  NNGIQVFAYGLM----------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKF

Query:  LQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYF
        LQAQSI+ PSA+IS G LVVHV+L+WLA +K   GLLG  LVLSLSWW+IVV QFVYIV S  C+KTW+GF+V+AFSGL+ FFKLSAASAVMLCLETWYF
Subjt:  LQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYF

Query:  QILVLLAGLLDNPELALDSLSICR------------LQRCRQCESKQRTGKWESKIRS----------------------VFRGG-------------GG
        QILVL+AGLL+N E+ALDSLS+C                          G    K  S                      +FR                 
Subjt:  QILVLLAGLLDNPELALDSLSICR------------LQRCRQCESKQRTGKWESKIRS----------------------VFRGG-------------GG

Query:  CHLHHCLH--LLCRTCTGSSQRVAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR
          L   L   L+          VAVGCGWQAFVAYVNVGCYY++G+PLG LLGF F  GAKGIW G++GGT +QT+IL+WVTWRTDWNKE+  ++
Subjt:  CHLHHCLH--LLCRTCTGSSQRVAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR

KAG6755717.1 hypothetical protein POTOM_039119 [Populus tomentosa]0.054.23Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        S ELE+IL+DTE P+  RIR A+W+ELKLL +LAAPAV VY++NN +SMST+IF GHLGN ELAA SLGN+G+Q FAYGL+LGMGSAVETLCGQA+GA +
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        YEMLG+YLQRS+ILL   A  L ++Y+FS+PLL+LLGE   IAS+AAVFVYGLIPQIFAYA NFPIQKFLQ+QS++ PSAYIS   LVVH++LSW A +K
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
          +GL GA  VLS SWWIIV  Q VYI+ S   +++W+G S  AF GL  FFKLSAASA+MLCLETWY+QILVL+AGLL N E+ALD+LS+CMT++G  F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------
        MISVGFNAAA  RV NELG GHPKSA+FSV+VV + S ++S++ A  VL FR+ ISY+FT+G  VA A SDL PFLA TL+LNGIQP             
Subjt:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------

Query:  --------------------------------WLWDVGGNLL----LLWSMCAAITLLACPSEFSSMGSAYEDDLRQALLQPAAAALLSSQ--------S
                                        W   +GG +L    L+W            S  + + S   D+  Q LL+  + A    +        S
Subjt:  --------------------------------WLWDVGGNLL----LLWSMCAAITLLACPSEFSSMGSAYEDDLRQALLQPAAAALLSSQ--------S

Query:  LCSNKHEG----SDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM-------
           + H+G    SDELE+IL+DTE P  +R   A+W+E+K+LF+LAAPAV VY++N  +S STQIF GHLGNLELAA SLGN GIQVFAYGLM       
Subjt:  LCSNKHEG----SDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM-------

Query:  ---------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLV
                             RS +LL  TG+ L + YIF KPIL+ LGE  DIASA+  FV+GL+PQI+AYA NFPIQKFLQAQSI+ PSA+IS G LV
Subjt:  ---------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLV

Query:  VHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDS
        VHV+L+W A +K   GLLG  LVLSLSWW+IVV QFVYIV S  C+ TW+GF+V+AFSGL+ FFKLSAASAVMLCLETWYFQILVL+AGLL+N E+ALDS
Subjt:  VHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDS

Query:  LSICR------------LQRCRQCESKQRTGKWESKIRS----------------------VFRGGGGCHLHHCLHLLCRTCTGSS---------QRVAV
        LS+C                          G    K  S                      +FR     ++     ++ +  +  S           VAV
Subjt:  LSICR------------LQRCRQCESKQRTGKWESKIRS----------------------VFRGGGGCHLHHCLHLLCRTCTGSS---------QRVAV

Query:  GCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR
        GCGWQAFVAYVNVGCYY++G+PLG LLGF F  GAKGIW+G++GGT +QT+IL+WVTWRTDWNKE+  ++
Subjt:  GCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR

TrEMBL top hitse value%identityAlignment
A0A067JAE2 Protein DETOXIFICATION1.86e-31554.01Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        S ELE+IL +T+  ++ R R A  +ELK L YLAAPA+ VY++NN +SMST+I  GHLGN ELAAASLGN+G+Q FAYGL+LGMGSAVETLCGQAYGAGK
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        Y+MLG+YLQRS+ILLT     +TL+YVFSKP+L+LLGES EIAS+AAVFVYGLIPQIFAYAVNFPIQKFLQ+QS++ PSAYISAATLVVHL++SW A +K
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
        LG+GL GAS VLSLSWWIIV AQ VYIV S + K +W G +W AF GL  F KLS ASA+MLCLETWY+++L+L+AGLL N E+ LDSLS+CMTI+   F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------
        MISVGFNAAA  RV NELG GHPKSAAFSVV+V   S ++++++A  VL FRN +SY+FT G TVA AV++L P+LA++++LNGIQP             
Subjt:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------

Query:  ----------------------WLWDVGGNLLLLWSMCAAITLLACPSEFSSMGSAYEDDLRQALLQPAAAA------LLSSQSLCSNKHEG---SDELE
                              +++D+G   +  WS     T+L        +   +  D  +   +   +       + ++Q+    KH     S ELE
Subjt:  ----------------------WLWDVGGNLLLLWSMCAAITLLACPSEFSSMGSAYEDDLRQALLQPAAAA------LLSSQSLCSNKHEG---SDELE

Query:  RILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM------------------------
         ILS+ E+   QR   A  +E+K LF LA PAV VY++N  +S STQIF GHLGNLELAA+SLGN GIQVFAYGLM                        
Subjt:  RILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM------------------------

Query:  ----RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGL
            RS ++LT  G+ LT+ Y F K ILL LGESK+IA+ +  FVYGL+PQIYAYA NFPIQKFLQAQSIVFPSA+I+AGTLVVH+V+SWLA YK+G GL
Subjt:  ----RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGL

Query:  LGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR------------
        LG SLVLSLSWW+IV+ QF+YIV S  CK+TW GF  QAFSGL+ F KLS ASA+MLCLETWY+QI+VL+AGLL+N E+ LDSLSIC             
Subjt:  LGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR------------

Query:  LQRCRQCESKQRTGKWESKIRS----------------------VFRG------GGGCHLHHC---------LHLLCRTCTGSSQRVAVGCGWQAFVAYV
                     G    +  S                      +FR         G  +            L +L          VAVGCGWQAFVAYV
Subjt:  LQRCRQCESKQRTGKWESKIRS----------------------VFRG------GGGCHLHHC---------LHLLCRTCTGSSQRVAVGCGWQAFVAYV

Query:  NVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR
        NVGCYY VG+PLG +LGF   FGA+GIW G++GGT +QT IL+WVT+RTDWN E+  ++
Subjt:  NVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR

A0A4Y1R795 Protein DETOXIFICATION (Fragment)0.057.51Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        S ELE++LSDT  PF+ R++ A WIE KLL YLAAPAV VYMIN  MSMST+IF+GHLGN ELAAASLGN+G+  F+YGLLLGMGSAVETLCGQAYGA K
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        YEMLGIYLQRS+ILLTL   LLT++Y++ +P+L+LLG+SP IAS+AA++VYGLIPQIF++AVNFPIQKFLQ+QS+V PSAYISA+TLV+HLVLSW   YK
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
        +G GL GA+ VLSLSWWI V AQ +YIVKSER K +W G SW AF GL  FFKLS ASA+MLCLETWYFQI VL+AGLL NPELALDSLSIC TI+G  F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA---RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA-----------
        MISVG NAAA   RV NELG G+PKS AFSVVVV  +S L+SI+ A  VL +R++ISY FT+G  VAAAVSDLCPFLA+TLLLNGIQP            
Subjt:  MISVGFNAAA---RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA-----------

Query:  -----------WLWDVGGNLLL----------LWSMCAAITLLACP---------------SEFSSMGSAYEDDLRQALLQPAAAAL----LSSQSLCSN
                   +L  V   +LL          +W    A TL+                   E +   + +E+ L Q+   P         L    L  +
Subjt:  -----------WLWDVGGNLLL----------LWSMCAAITLLACP---------------SEFSSMGSAYEDDLRQALLQPAAAAL----LSSQSLCSN

Query:  KHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM---------------
         H  S+ELER+LSDT+ P ++R   A WIE KLLFYLA+PAV VYMINY M+ STQIF+GHLGNLELAA+SLGNNGIQVFAYG+M               
Subjt:  KHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM---------------

Query:  --------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSW
                      RS +LLTLTGV+LTI YIF +PIL+ LG+S  IASA+  +VYGL+PQI+AYA+NFPIQKFLQAQSIV PSA+ISA TL +H+VLSW
Subjt:  --------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSW

Query:  LAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR--
        +  YKMGLGLLG +LVLSLSWW+ V+ QFVYIVKS  CK TW GF+ QAFSGL GFFKLS ASAVMLCLETWYFQILVLLAGLL NPELALDSLSIC   
Subjt:  LAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR--

Query:  ----------LQRCRQCESKQRTGK------------------------------WESKIRSVFRGGGGCHL-------HHCLHLLCRTCTGSSQRVAVG
                  L            G                               W   I   F  G               L LL          VAVG
Subjt:  ----------LQRCRQCESKQRTGK------------------------------WESKIRSVFRGGGGCHL-------HHCLHLLCRTCTGSSQRVAVG

Query:  CGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAK------------------GIWIGLMGGTFMQTVILVW
        CGWQAFVAYVNVGCYY+VGVP GALLGFYFK GAK                  GIW+G+M GT MQT+IL+W
Subjt:  CGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAK------------------GIWIGLMGGTFMQTVILVW

A0A5J9WCB6 Protein DETOXIFICATION9.75e-30655.14Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        SG+LE ILSD  +P+  R+  A+ +E++LL  LAAPAV VYMIN  MSMST+IFSGHLG  ELAAASLGN+G+Q FAYGL+LGMGSAVETLCGQAYGA K
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        Y+MLGIYLQRS+ILL      L +LY FS+P+L+LLGESP IAS+AAVFVYGLIPQIFAYA NFPIQKF+Q+QS++ PSAYISAATL VHLV S+   YK
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
        LG GL GAS +LSLSWW+IV AQ VYIV S R + +W+G SW AF GLP+FFKLS ASA+MLCLETWYFQILVL+AGLL +PELAL SLS+CMTI+G  F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPAWLWDVGGNLLLL
        MISVGFNAAA  RV NELG G+PKSAAFSVVVV  +S ++S+L++  +L FR+ ISY+FT+G  V+ AVS L P LA+TL+LNGIQP       G     
Subjt:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPAWLWDVGGNLLLL

Query:  WSMCAAITLLACP-----------SEFSSMGSA--YEDDLRQALLQPAAAALLSSQSLCSNKHEGSD-------------ELERILSDTEMPVVQRYSQA
        W    A   + C              +  +G+A  +   +   L+Q      ++ ++  + + E S              +LERIL +   P  QR ++A
Subjt:  WSMCAAITLLACP-----------SEFSSMGSA--YEDDLRQALLQPAAAALLSSQSLCSNKHEGSD-------------ELERILSDTEMPVVQRYSQA

Query:  TWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM----------------------------RSAILLTLTGVVL
           E++LL  LAAPAV VYMINYAMS ST+IF G LG LELAA+SLGN GIQVFAYGLM                            RS +LLT  GV L
Subjt:  TWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM----------------------------RSAILLTLTGVVL

Query:  TIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVG
           Y+F K ILL LGE ++IA A+  FV GL+PQI+AYA NFPIQKFLQ+QSIV PSA+ISA TLVVH+VLS+LA YK+ +GLLG SL+LSLSWWVIV  
Subjt:  TIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVG

Query:  QFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR------------LQRCRQCESKQRTGKWE
        QFVYIV S  C++TW GF+ QAFSGL  FF+LSAASAVMLCLETWY QI VL+AGLL +PE+ALDSL++C                          G   
Subjt:  QFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR------------LQRCRQCESKQRTGKWE

Query:  SKIRS------------------------------VFRGGGGCH--LHHCLHLLCRTCTGSSQR-----VAVGCGWQAFVAYVNVGCYYIVGVPLGALLG
         K  S                              VF  G      +     LL  T   +  +     VAVGCGWQAFVAYVNV CYY VG+PLG +LG
Subjt:  SKIRS------------------------------VFRGGGGCH--LHHCLHLLCRTCTGSSQR-----VAVGCGWQAFVAYVNVGCYYIVGVPLGALLG

Query:  FYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSRI
        F+F  GA GIW G++GG  +QT+IL +VT RTDWNKE+  +R+
Subjt:  FYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSRI

A0A5J9WDB0 Protein DETOXIFICATION1.58e-30455.04Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        SG+LE ILSD  +P+  R+  A+ +E++LL  LAAPAV VYMIN  MSMST+IFSGHLG  ELAAASLGN+G+Q FAYGL+LGMGSAVETLCGQAYGA K
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        Y+MLGIYLQRS+ILL      L +LY FS+P+L+LLGESP IAS+AAVFVYGLIPQIFAYA NFPIQKF+Q+QS++ PSAYISAATL VHLV S+   YK
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
        LG GL GAS +LSLSWW+IV AQ VYIV S R + +W+G SW AF GLP+FFKLS ASA+MLCLETWYFQILVL+AGLL +PELAL SLS+CMTI+G  F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPAWLWDVGGNLLLL
        MISVGFNAAA  RV NELG G+PKSAAFSVVVV  +S ++S+L++  +L FR+ ISY+FT+G  V+ AVS L P LA+TL+LNGIQP       G     
Subjt:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPAWLWDVGGNLLLL

Query:  WSMCAAITLLACP-----------SEFSSMGSA--YEDDLRQALLQPAAAALLSSQSLCSNKHEGSD-------------ELERILSDTEMPVVQRYSQA
        W    A   + C              +  +G+A  +   +   L+Q      ++ ++  + + E S              +LERIL D      +R ++A
Subjt:  WSMCAAITLLACP-----------SEFSSMGSA--YEDDLRQALLQPAAAALLSSQSLCSNKHEGSD-------------ELERILSDTEMPVVQRYSQA

Query:  TWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM----------------------------RSAILLTLTGVVL
           E++LL  LAAPAV VYMINYAMS ST+IF G LG LELAA+SLGN GIQVFAYGLM                            RS +LLT  GV L
Subjt:  TWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAASSLGNNGIQVFAYGLM----------------------------RSAILLTLTGVVL

Query:  TIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVG
           Y+F K ILL LGE ++IA A+  FV GL+PQI+AYA NFPIQKFLQ+QSIV PSA+ISA TLVVH+VLS+LA YK+ +GLLG SL+LSLSWWVIV  
Subjt:  TIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVG

Query:  QFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR------------LQRCRQCESKQRTGKWE
        QFVYIV S  C++TW GF+ QAFSGL  FF+LSAASAVMLCLETWY QI VL+AGLL +PE+ALDSL++C                          G   
Subjt:  QFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICR------------LQRCRQCESKQRTGKWE

Query:  SKIRS------------------------------VFRGGGGCH--LHHCLHLLCRTCTGSSQR-----VAVGCGWQAFVAYVNVGCYYIVGVPLGALLG
         K  S                              VF  G      +     LL  T   +  +     VAVGCGWQAFVAYVNV CYY VG+PLG +LG
Subjt:  SKIRS------------------------------VFRGGGGCH--LHHCLHLLCRTCTGSSQR-----VAVGCGWQAFVAYVNVGCYYIVGVPLGALLG

Query:  FYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSRI
        F+F  GA GIW G++GG  +QT+IL +VT RTDWNKE+  +R+
Subjt:  FYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSRI

A0A5N5KTH2 Protein DETOXIFICATION2.05e-31351.61Show/hide
Query:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK
        S ELE+IL DTE P+  RIR A+W+ELKLL +LAAPAV VY++NN +SMST+IF GHLGN ELAA SLGN+G+Q FAYGL+LGMGSAVETLCGQAYGA +
Subjt:  SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGMGSAVETLCGQAYGAGK

Query:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK
        Y MLG+YLQRS+ILL   A  L ++Y+FS+PLL+LLGE   IAS+AAVFVYGLIPQIFAYA NFPIQKFLQ+QS++ PSAYIS   LVVH++LSW A +K
Subjt:  YEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAATLVVHLVLSWAAAYK

Query:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF
          +GL GA  VLS +WWIIV  Q VYI+ S R +++W+G S  AF GL  FFKLSAASA+MLCLETWY+QILVL+AGLL NPE+ALD+LS+CMTI+G  F
Subjt:  LGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTINGLAF

Query:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------
        MISVGFNAAA  RV NELG GHPKSA+FSV+VV + S ++S++ A  VL FR+ ISY+FT+G  VA A SDL PFLA TL+LNGIQP             
Subjt:  MISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPA------------

Query:  --------------------------------WLWDVGGNLL----LLWSMCAAITLLACPSEFSSMGSAYEDDLRQALL----------QPAAAALLSS
                                        W   +GG +L    L+W            S  + + S   D+ RQ LL           P A     +
Subjt:  --------------------------------WLWDVGGNLL----LLWSMCAAITLLACPSEFSSMGSAYEDDLRQALL----------QPAAAALLSS

Query:  QSLCSNKHEG----------------------------SDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLE
        Q + SN  E                             S ELE +LSDT +P  +R  +AT+IE+K+LF LA PAV VY++N  +S STQI  GHLGNLE
Subjt:  QSLCSNKHEG----------------------------SDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLE

Query:  LAASSLGNNGIQVFAYGLM---------------------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVY
        LAASSLGNNGIQ+FAYGLM                                       RSA+LLT TG+ LT+ YIF KP+L+ LGES  IASA+  FVY
Subjt:  LAASSLGNNGIQVFAYGLM---------------------------------------RSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVY

Query:  GLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGF
        GL+PQI+AYA NFPIQKFLQ+QS++FPSA+IS G LVVH++LSW+A YK+G GLLG SLVLSLSWWV+V  QFVYI+ SG+ K TW GF++QAFSGL+ F
Subjt:  GLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGLGLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGF

Query:  FKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICRLQRCRQCESKQRTGKWESKIRSVFRGGGGCHLHHCL----------------------
        FKLS ASAVMLCLETWY+QILVL+AGLL + E ALDSL++C                W   I   F       + + L                      
Subjt:  FKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICRLQRCRQCESKQRTGKWESKIRSVFRGGGGCHLHHCL----------------------

Query:  ------------HLLCRTCTGSS--------------------------QRVAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGT
                    H++    TG S                            VAVGCGWQAFVAYVNVGCYY +G+PLG +LGF   FG KGIW G++GGT
Subjt:  ------------HLLCRTCTGSS--------------------------QRVAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGT

Query:  FMQTVILVWVTWRTDWNKELYSS
         +QT+IL+WVT+RTDWNKE+  +
Subjt:  FMQTVILVWVTWRTDWNKELYSS

SwissProt top hitse value%identityAlignment
F4JKB9 Protein DETOXIFICATION 381.1e-11657.57Show/hide
Query:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM
        ++  PLV    K     G LE +L+++ LP+  R+ L   IELKLL  LA PA+ VY+IN  M +S RIF+GHLG+ +LAAAS+GNS   +  Y L+LGM
Subjt:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM

Query:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA
        GSAVETLCGQAYGA +YEMLGIYLQR++I+L LV   +T+LY FS P+LLLLGE   ++   ++++ GLIPQIFAYAV F  QKFLQ+QSVV PSAYISA
Subjt:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA

Query:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL
        A LV+ + L+W   Y +G GL G + VL++SWW IV AQ  Y++ S RFK++W G SW +  GL  FFKLSA SA+M+CLE WY QILVL+AGLL +P L
Subjt:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL

Query:  ALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG
        +LDSLSICM+I+ L+FM+SVGFNAA   R  NELG G+PKSA FS      +S ++S++ A  V++ R+ +SY+FT  A VA AVSDLCPFLAVT++LNG
Subjt:  ALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG

Query:  IQP
        IQP
Subjt:  IQP

F4JKB9 Protein DETOXIFICATION 386.5e-2465.75Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR
        VAVGCGWQ +VAYVN+GCYYIVG+P+G +LGF F F AKGIW G++GGT MQT+IL++VT++ DW+KE+  +R
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR

O80695 Protein DETOXIFICATION 371.2e-13463.52Show/hide
Query:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM
        ++  PL++  + K      LE +L+D ELP+  RI LA  IE+K L +LAAPA+FVY+INN MS+ TRIF+GH+G+ ELAAASLGNSG   F YGLLLGM
Subjt:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM

Query:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA
        GSAVETLCGQA+GA +YEMLG+YLQRS+++L L    ++ L++FS P+L  LGE  ++A+ A+VFVYG+IP IFAYAVNFPIQKFLQSQS+V PSAYISA
Subjt:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA

Query:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL
        ATLV+HL+LSW A Y+LG+GL   S + S SWWIIV AQ+VYI  S R + +W G SW AF+GL +FF+LSAASA+MLCLE+WY QILVL+AGLL NPEL
Subjt:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL

Query:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG
        ALDSL+ICM+I+ ++FM+SVGFNAAA  RV NELG G+P++AAFS VV   +S L+S+  A  VLS+R+VISY FTD   VA AV+DL PFLA+T++LNG
Subjt:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG

Query:  IQP
        IQP
Subjt:  IQP

O80695 Protein DETOXIFICATION 378.5e-2472.46Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL
        VAVGCGWQAFVAYVN+GCYY+VG+P+G +LGF +  GAKGIW G++GGT MQT+ILV VT RTDW+KE+
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL

Q940N9 Protein DETOXIFICATION 392.7e-11856.37Show/hide
Query:  ENRDIEEPLVQPAAKKHGPSGE--LEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYG
        E  D+  PLV PA  +  P  E  LE +L+++ L +  R+ L A IELK+L  LA PA+ +Y++N+ M +S R+F+GH+G+ ELAAAS+GNS      YG
Subjt:  ENRDIEEPLVQPAAKKHGPSGE--LEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYG

Query:  LLLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPS
        L+LGMGSAVETLCGQAYGA +YEMLGIYLQR++I+L LV   +TLLY FS P+L+LLGE   ++   + ++ GLIPQIFAYAVNF  QKFLQ+QSVV PS
Subjt:  LLLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPS

Query:  AYISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLL
        A+ISAA L++ ++L+W   Y +  G  G + VL++SWW+IV +Q  YI  S +F+ +W GLSW + +GL  FFKLSA SA+M+CLE WY QILVL+AGLL
Subjt:  AYISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLL

Query:  DNPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVT
        +NP  +LDSLSICM+I+ L+FM+SVGFNAA   R  NELG G+PKSA FS      +S ++S+  A AV+ FR+ +SY+FT+ A VA AVSDLCPFLA+T
Subjt:  DNPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVT

Query:  LLLNGIQP
        ++LNGIQP
Subjt:  LLLNGIQP

Q940N9 Protein DETOXIFICATION 391.0e-2468.49Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR
        VAVGCGWQ +VAYVNVGCYY+VG+P+G +LGF F F AKGIW G++GGT MQT+IL++VT+RTDW+KE+  +R
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR

Q9LVD9 Protein DETOXIFICATION 402.6e-15071.08Show/hide
Query:  NRDIEEPLVQPAAKKHGP---SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYG
        N  + +PL+ P      P   +GELE +LSD E P  +R+R A  IE KLL  LAAPAV VYMIN  MSMST+IFSGHLGN ELAAASLGN+G+Q FAYG
Subjt:  NRDIEEPLVQPAAKKHGP---SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYG

Query:  LLLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPS
        L+LGMGSAVETLCGQAYG  KYEMLG+YLQRS++LLTL   LLTL+YVFS+P+LL LGESP IAS+A++FVYGLIPQIFAYA NFPIQKFLQSQS+V PS
Subjt:  LLLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPS

Query:  AYISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLL
        AYIS ATL VHL+LSW A YKLG GL GAS VLSLSWWIIV AQ VYIV SER +E+WRG S  AF GL  FFKLSAASA+MLCLETWYFQILVL+AGLL
Subjt:  AYISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLL

Query:  DNPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVT
        +NPELALDSLSICMTI+G  FMISVGFNAA   RV NELG G+PKSAAFSV++V   S +  +++A  +L+ R+V+SY FT+G  V+ AVSDLCP LAVT
Subjt:  DNPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVT

Query:  LLLNGIQP
        L+LNGIQP
Subjt:  LLLNGIQP

Q9LVD9 Protein DETOXIFICATION 407.0e-2676.81Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL
        VAVGCGWQ FVA VNVGCYYI+G+PLGAL GFYF FGAKGIW G++GGT +QT IL WVT+RTDW KE+
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL

Q9SAB0 Protein DETOXIFICATION 366.6e-13362.53Show/hide
Query:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM
        ++++PL++  + K      +E +L+DT L +  RI LA+ IE+K L +LAAPA+FVY+INN MSM TRIF+G LG+ +LAAASLGNSG   F  GL+LGM
Subjt:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM

Query:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA
        GSAVETLCGQA+GA +Y+MLG+YLQRS+I+L +    +TLL++FSKPLL+ LGE  ++AS A+VFVYG+IP IFAYAVNFPIQKFLQSQS+V PSAYISA
Subjt:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA

Query:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL
        ATLV+HL+LSW + +K G+GL G S V SLSWWIIV AQ++YI  S R + +W G SW AF GL +FF+LSAASA+MLCLE+WY QILVL+AGLL +PEL
Subjt:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL

Query:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG
        ALDSL+ICM+I+ ++FM+SVGFNAAA  RV NELG G+P+SAAFS  V   +S L+S+  A  +LS+R+VISY+FTD   VA AV++L PFLA+T++LNG
Subjt:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG

Query:  IQP
        +QP
Subjt:  IQP

Q9SAB0 Protein DETOXIFICATION 361.1e-2371.01Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL
        VAVGCGWQA+VAYVN+GCYYIVG+P+G +LGF +  GA+GIW G++GGT MQT+ILV VT+RTDW+KE+
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL

Arabidopsis top hitse value%identityAlignment
AT1G11670.1 MATE efflux family protein4.7e-13462.53Show/hide
Query:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM
        ++++PL++  + K      +E +L+DT L +  RI LA+ IE+K L +LAAPA+FVY+INN MSM TRIF+G LG+ +LAAASLGNSG   F  GL+LGM
Subjt:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM

Query:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA
        GSAVETLCGQA+GA +Y+MLG+YLQRS+I+L +    +TLL++FSKPLL+ LGE  ++AS A+VFVYG+IP IFAYAVNFPIQKFLQSQS+V PSAYISA
Subjt:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA

Query:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL
        ATLV+HL+LSW + +K G+GL G S V SLSWWIIV AQ++YI  S R + +W G SW AF GL +FF+LSAASA+MLCLE+WY QILVL+AGLL +PEL
Subjt:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL

Query:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG
        ALDSL+ICM+I+ ++FM+SVGFNAAA  RV NELG G+P+SAAFS  V   +S L+S+  A  +LS+R+VISY+FTD   VA AV++L PFLA+T++LNG
Subjt:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG

Query:  IQP
        +QP
Subjt:  IQP

AT1G11670.1 MATE efflux family protein7.9e-2571.01Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL
        VAVGCGWQA+VAYVN+GCYYIVG+P+G +LGF +  GA+GIW G++GGT MQT+ILV VT+RTDW+KE+
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL

AT1G61890.1 MATE efflux family protein8.5e-13663.52Show/hide
Query:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM
        ++  PL++  + K      LE +L+D ELP+  RI LA  IE+K L +LAAPA+FVY+INN MS+ TRIF+GH+G+ ELAAASLGNSG   F YGLLLGM
Subjt:  DIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGLLLGM

Query:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA
        GSAVETLCGQA+GA +YEMLG+YLQRS+++L L    ++ L++FS P+L  LGE  ++A+ A+VFVYG+IP IFAYAVNFPIQKFLQSQS+V PSAYISA
Subjt:  GSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISA

Query:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL
        ATLV+HL+LSW A Y+LG+GL   S + S SWWIIV AQ+VYI  S R + +W G SW AF+GL +FF+LSAASA+MLCLE+WY QILVL+AGLL NPEL
Subjt:  ATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPEL

Query:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG
        ALDSL+ICM+I+ ++FM+SVGFNAAA  RV NELG G+P++AAFS VV   +S L+S+  A  VLS+R+VISY FTD   VA AV+DL PFLA+T++LNG
Subjt:  ALDSLSICMTINGLAFMISVGFNAAA--RVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNG

Query:  IQP
        IQP
Subjt:  IQP

AT1G61890.1 MATE efflux family protein6.1e-2572.46Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL
        VAVGCGWQAFVAYVN+GCYY+VG+P+G +LGF +  GAKGIW G++GGT MQT+ILV VT RTDW+KE+
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL

AT3G21690.1 MATE efflux family protein1.9e-15171.08Show/hide
Query:  NRDIEEPLVQPAAKKHGP---SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYG
        N  + +PL+ P      P   +GELE +LSD E P  +R+R A  IE KLL  LAAPAV VYMIN  MSMST+IFSGHLGN ELAAASLGN+G+Q FAYG
Subjt:  NRDIEEPLVQPAAKKHGP---SGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYG

Query:  LLLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPS
        L+LGMGSAVETLCGQAYG  KYEMLG+YLQRS++LLTL   LLTL+YVFS+P+LL LGESP IAS+A++FVYGLIPQIFAYA NFPIQKFLQSQS+V PS
Subjt:  LLLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPS

Query:  AYISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLL
        AYIS ATL VHL+LSW A YKLG GL GAS VLSLSWWIIV AQ VYIV SER +E+WRG S  AF GL  FFKLSAASA+MLCLETWYFQILVL+AGLL
Subjt:  AYISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLL

Query:  DNPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVT
        +NPELALDSLSICMTI+G  FMISVGFNAA   RV NELG G+PKSAAFSV++V   S +  +++A  +L+ R+V+SY FT+G  V+ AVSDLCP LAVT
Subjt:  DNPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVT

Query:  LLLNGIQP
        L+LNGIQP
Subjt:  LLLNGIQP

AT3G21690.1 MATE efflux family protein5.0e-2776.81Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL
        VAVGCGWQ FVA VNVGCYYI+G+PLGAL GFYF FGAKGIW G++GGT +QT IL WVT+RTDW KE+
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL

AT4G21910.2 MATE efflux family protein2.9e-12056.76Show/hide
Query:  DIEEPLVQPAAKKHGPSGE----LEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGL
        D+  PLV P   +  P  +    LE +L++  LP+  R+ L A IE+KLL  LA PA+ VY++N+ M +S RIF+GHLG  ELAAAS+GNS   +  YGL
Subjt:  DIEEPLVQPAAKKHGPSGE----LEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGL

Query:  LLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSA
        +LGMGSAVETLCGQAYGA +YEMLGIYLQR++I+L LV   +TLLY FS P+L+LLGE   ++   + ++ GLIPQIFAYAVNF  QKFLQ+QSVV PSA
Subjt:  LLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSA

Query:  YISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLD
        +ISAA L++ ++L+W   Y +  G  G + VL++SWW+IV +Q  YI  S +F+ +W GLSW + +GL  FFKLSA SA+M+CLE WY QILVL+AGLL+
Subjt:  YISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLD

Query:  NPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTL
        NP  +LDSLSICM+I+ L+FM+SVGFNAA   R  NELG G+PKSA FS      +S ++S+  A AV+ FR+ +SY+FT+ A VA AVSDLCPFLA+T+
Subjt:  NPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTL

Query:  LLNGIQP
        +LNGIQP
Subjt:  LLNGIQP

AT4G21910.2 MATE efflux family protein7.2e-2668.49Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR
        VAVGCGWQ +VAYVNVGCYY+VG+P+G +LGF F F AKGIW G++GGT MQT+IL++VT+RTDW+KE+  +R
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSR

AT4G21910.4 MATE efflux family protein2.9e-12056.76Show/hide
Query:  DIEEPLVQPAAKKHGPSGE----LEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGL
        D+  PLV P   +  P  +    LE +L++  LP+  R+ L A IE+KLL  LA PA+ VY++N+ M +S RIF+GHLG  ELAAAS+GNS   +  YGL
Subjt:  DIEEPLVQPAAKKHGPSGE----LEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQTFAYGL

Query:  LLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSA
        +LGMGSAVETLCGQAYGA +YEMLGIYLQR++I+L LV   +TLLY FS P+L+LLGE   ++   + ++ GLIPQIFAYAVNF  QKFLQ+QSVV PSA
Subjt:  LLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSA

Query:  YISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLD
        +ISAA L++ ++L+W   Y +  G  G + VL++SWW+IV +Q  YI  S +F+ +W GLSW + +GL  FFKLSA SA+M+CLE WY QILVL+AGLL+
Subjt:  YISAATLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLD

Query:  NPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTL
        NP  +LDSLSICM+I+ L+FM+SVGFNAA   R  NELG G+PKSA FS      +S ++S+  A AV+ FR+ +SY+FT+ A VA AVSDLCPFLA+T+
Subjt:  NPELALDSLSICMTINGLAFMISVGFNAA--ARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTL

Query:  LLNGIQP
        +LNGIQP
Subjt:  LLNGIQP

AT4G21910.4 MATE efflux family protein3.6e-2571.01Show/hide
Query:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL
        VAVGCGWQ +VAYVNVGCYY+VG+P+G +LGF F F AKGIW G++GGT MQT+IL++VT+RTDW+KE+
Subjt:  VAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGCTAAGACAGAGAGGGAGAGAGAGAAGATGGAATATGGGTATGAGAATAGAGATATTGAGGAGCCATTGGTGCAGCCGGCGGCGAAGAAGCATGGGCCAAGCGG
CGAGCTGGAGAAGATATTATCAGACACAGAATTGCCCTTCGTGGTTCGAATCCGGCTGGCGGCTTGGATCGAGCTGAAGCTGTTGGGTTATTTGGCGGCGCCGGCGGTGT
TCGTGTACATGATCAACAACTTCATGTCTATGTCCACCCGCATCTTCTCCGGCCACCTCGGCAACGCCGAACTCGCCGCCGCTTCTCTCGGCAACAGCGGCGTTCAAACT
TTTGCTTATGGTCTCTTGTTGGGAATGGGGAGCGCCGTGGAGACGCTGTGCGGCCAAGCTTACGGCGCCGGAAAATACGAAATGTTAGGAATTTACTTACAAAGATCCTC
AATTTTGCTAACCTTAGTCGCCTGTCTCTTAACTCTTCTCTACGTCTTCTCCAAACCCCTTTTGCTGCTCCTCGGCGAGTCGCCGGAAATTGCTTCGTCGGCGGCCGTTT
TCGTGTACGGCCTGATCCCTCAAATCTTCGCCTACGCCGTTAACTTCCCCATCCAGAAGTTTCTTCAGTCCCAGAGCGTGGTGCTTCCCAGCGCCTACATATCGGCGGCG
ACGCTGGTGGTGCATTTGGTGCTGAGCTGGGCGGCGGCGTATAAGCTGGGGTTTGGGTTGTTTGGGGCGTCGTCGGTGCTCAGCCTGTCGTGGTGGATCATTGTGACGGC
GCAGGTTGTGTATATTGTGAAGAGTGAGAGGTTTAAGGAGAGTTGGAGAGGGTTGAGTTGGGCGGCGTTTAAGGGGCTGCCGGAATTCTTTAAATTGTCGGCGGCGTCGG
CGATTATGCTGTGCTTGGAAACTTGGTACTTTCAGATTCTGGTTTTGGTTGCTGGGTTGCTTGATAATCCTGAGTTGGCTCTTGATTCCCTCTCCATCTGTATGACCATT
AACGGATTGGCATTCATGATTTCTGTTGGCTTCAATGCAGCTGCTAGAGTGGGAAATGAACTAGGACATGGGCATCCCAAATCCGCAGCCTTTTCAGTGGTGGTGGTGGT
AGCCATTTCCTCCCTCGTTTCCATTCTCGTCGCCTCCGCCGTCCTTTCATTCCGCAACGTCATTAGCTATCTCTTCACCGACGGCGCCACCGTCGCCGCCGCCGTTTCTG
ACCTCTGCCCTTTCCTGGCCGTCACTCTCCTCCTCAACGGAATCCAGCCTGCGTGGCTGTGGGATGTGGGTGGCAATCTTTTGTTGCTTTGGTCAATGTGTGCTGCTATT
ACTTTGTTGGCTTGCCCTTCGGAGTTCTCCTCGATGGGATCTGCTTATGAAGATGATCTCCGACAGGCGCTGCTGCAGCCGGCGGCGGCAGCGTTGTTGTCGTCTCAGTC
GTTGTGTTCGAACAAGCACGAAGGCAGCGACGAACTGGAGAGGATTCTGTCGGACACAGAGATGCCGGTGGTCCAGCGTTATAGTCAAGCCACGTGGATCGAGATTAAGC
TTCTGTTTTACTTGGCAGCGCCGGCGGTTTTTGTGTACATGATTAACTACGCCATGTCGACGTCCACGCAGATCTTTTCCGGCCACCTTGGCAACCTTGAGCTTGCCGCT
TCCTCTCTTGGCAATAATGGAATTCAAGTCTTCGCCTATGGTCTCATGAGATCAGCCATATTACTAACTCTAACAGGCGTAGTCTTAACCATCCCTTACATATTCTGCAA
GCCAATCTTGCTCTTTCTGGGCGAATCCAAGGATATAGCTTCAGCATCCGAGACTTTCGTGTACGGCCTGGTGCCTCAAATCTACGCCTACGCCTTAAACTTCCCCATCC
AGAAGTTTCTCCAGGCGCAGAGCATAGTGTTTCCAAGCGCCTTCATATCGGCCGGGACGCTTGTGGTGCACGTGGTGTTGAGCTGGTTGGCGGCCTATAAAATGGGGCTG
GGCTTGCTGGGAGTGTCGCTGGTGCTGAGCTTGTCGTGGTGGGTGATCGTGGTGGGGCAATTTGTTTACATTGTGAAGAGCGGGAGTTGTAAGAAGACGTGGCGGGGGTT
CAATGTGCAGGCGTTTTCGGGGCTGTTTGGGTTCTTTAAGCTGTCGGCGGCGTCGGCGGTGATGCTGTGCTTGGAGACTTGGTATTTTCAGATTCTGGTTTTGCTTGCTG
GATTGCTTGACAACCCTGAGCTCGCTCTTGACTCCCTCTCCATTTGTCGGCTTCAACGCTGCCGCCAGTGTGAGAGTAAGCAACGAACTGGGAAGTGGGAATCCAAAATC
CGCAGCGTTTTCCGTGGCGGTGGTGGTTGCCATCTCCACCATTGTCTCCATCTCTTGTGCCGTACTTGTACTGGCTCTTCGCAACGTGTAGCGGTTGGATGTGGGTGGCA
AGCATTCGTCGCCTATGTAAACGTCGGCTGCTATTACATCGTCGGCGTGCCCTTGGGTGCCCTCCTCGGCTTCTATTTCAAATTTGGCGCCAAGGGCATATGGATAGGGT
TGATGGGTGGAACTTTCATGCAGACGGTAATTTTGGTATGGGTGACGTGGCGTACAGATTGGAATAAAGAGCTATACTCATCACGTATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGACGGCTAAGACAGAGAGGGAGAGAGAGAAGATGGAATATGGGTATGAGAATAGAGATATTGAGGAGCCATTGGTGCAGCCGGCGGCGAAGAAGCATGGGCCAAGCGG
CGAGCTGGAGAAGATATTATCAGACACAGAATTGCCCTTCGTGGTTCGAATCCGGCTGGCGGCTTGGATCGAGCTGAAGCTGTTGGGTTATTTGGCGGCGCCGGCGGTGT
TCGTGTACATGATCAACAACTTCATGTCTATGTCCACCCGCATCTTCTCCGGCCACCTCGGCAACGCCGAACTCGCCGCCGCTTCTCTCGGCAACAGCGGCGTTCAAACT
TTTGCTTATGGTCTCTTGTTGGGAATGGGGAGCGCCGTGGAGACGCTGTGCGGCCAAGCTTACGGCGCCGGAAAATACGAAATGTTAGGAATTTACTTACAAAGATCCTC
AATTTTGCTAACCTTAGTCGCCTGTCTCTTAACTCTTCTCTACGTCTTCTCCAAACCCCTTTTGCTGCTCCTCGGCGAGTCGCCGGAAATTGCTTCGTCGGCGGCCGTTT
TCGTGTACGGCCTGATCCCTCAAATCTTCGCCTACGCCGTTAACTTCCCCATCCAGAAGTTTCTTCAGTCCCAGAGCGTGGTGCTTCCCAGCGCCTACATATCGGCGGCG
ACGCTGGTGGTGCATTTGGTGCTGAGCTGGGCGGCGGCGTATAAGCTGGGGTTTGGGTTGTTTGGGGCGTCGTCGGTGCTCAGCCTGTCGTGGTGGATCATTGTGACGGC
GCAGGTTGTGTATATTGTGAAGAGTGAGAGGTTTAAGGAGAGTTGGAGAGGGTTGAGTTGGGCGGCGTTTAAGGGGCTGCCGGAATTCTTTAAATTGTCGGCGGCGTCGG
CGATTATGCTGTGCTTGGAAACTTGGTACTTTCAGATTCTGGTTTTGGTTGCTGGGTTGCTTGATAATCCTGAGTTGGCTCTTGATTCCCTCTCCATCTGTATGACCATT
AACGGATTGGCATTCATGATTTCTGTTGGCTTCAATGCAGCTGCTAGAGTGGGAAATGAACTAGGACATGGGCATCCCAAATCCGCAGCCTTTTCAGTGGTGGTGGTGGT
AGCCATTTCCTCCCTCGTTTCCATTCTCGTCGCCTCCGCCGTCCTTTCATTCCGCAACGTCATTAGCTATCTCTTCACCGACGGCGCCACCGTCGCCGCCGCCGTTTCTG
ACCTCTGCCCTTTCCTGGCCGTCACTCTCCTCCTCAACGGAATCCAGCCTGCGTGGCTGTGGGATGTGGGTGGCAATCTTTTGTTGCTTTGGTCAATGTGTGCTGCTATT
ACTTTGTTGGCTTGCCCTTCGGAGTTCTCCTCGATGGGATCTGCTTATGAAGATGATCTCCGACAGGCGCTGCTGCAGCCGGCGGCGGCAGCGTTGTTGTCGTCTCAGTC
GTTGTGTTCGAACAAGCACGAAGGCAGCGACGAACTGGAGAGGATTCTGTCGGACACAGAGATGCCGGTGGTCCAGCGTTATAGTCAAGCCACGTGGATCGAGATTAAGC
TTCTGTTTTACTTGGCAGCGCCGGCGGTTTTTGTGTACATGATTAACTACGCCATGTCGACGTCCACGCAGATCTTTTCCGGCCACCTTGGCAACCTTGAGCTTGCCGCT
TCCTCTCTTGGCAATAATGGAATTCAAGTCTTCGCCTATGGTCTCATGAGATCAGCCATATTACTAACTCTAACAGGCGTAGTCTTAACCATCCCTTACATATTCTGCAA
GCCAATCTTGCTCTTTCTGGGCGAATCCAAGGATATAGCTTCAGCATCCGAGACTTTCGTGTACGGCCTGGTGCCTCAAATCTACGCCTACGCCTTAAACTTCCCCATCC
AGAAGTTTCTCCAGGCGCAGAGCATAGTGTTTCCAAGCGCCTTCATATCGGCCGGGACGCTTGTGGTGCACGTGGTGTTGAGCTGGTTGGCGGCCTATAAAATGGGGCTG
GGCTTGCTGGGAGTGTCGCTGGTGCTGAGCTTGTCGTGGTGGGTGATCGTGGTGGGGCAATTTGTTTACATTGTGAAGAGCGGGAGTTGTAAGAAGACGTGGCGGGGGTT
CAATGTGCAGGCGTTTTCGGGGCTGTTTGGGTTCTTTAAGCTGTCGGCGGCGTCGGCGGTGATGCTGTGCTTGGAGACTTGGTATTTTCAGATTCTGGTTTTGCTTGCTG
GATTGCTTGACAACCCTGAGCTCGCTCTTGACTCCCTCTCCATTTGTCGGCTTCAACGCTGCCGCCAGTGTGAGAGTAAGCAACGAACTGGGAAGTGGGAATCCAAAATC
CGCAGCGTTTTCCGTGGCGGTGGTGGTTGCCATCTCCACCATTGTCTCCATCTCTTGTGCCGTACTTGTACTGGCTCTTCGCAACGTGTAGCGGTTGGATGTGGGTGGCA
AGCATTCGTCGCCTATGTAAACGTCGGCTGCTATTACATCGTCGGCGTGCCCTTGGGTGCCCTCCTCGGCTTCTATTTCAAATTTGGCGCCAAGGGCATATGGATAGGGT
TGATGGGTGGAACTTTCATGCAGACGGTAATTTTGGTATGGGTGACGTGGCGTACAGATTGGAATAAAGAGCTATACTCATCACGTATCTAG
Protein sequenceShow/hide protein sequence
MTAKTEREREKMEYGYENRDIEEPLVQPAAKKHGPSGELEKILSDTELPFVVRIRLAAWIELKLLGYLAAPAVFVYMINNFMSMSTRIFSGHLGNAELAAASLGNSGVQT
FAYGLLLGMGSAVETLCGQAYGAGKYEMLGIYLQRSSILLTLVACLLTLLYVFSKPLLLLLGESPEIASSAAVFVYGLIPQIFAYAVNFPIQKFLQSQSVVLPSAYISAA
TLVVHLVLSWAAAYKLGFGLFGASSVLSLSWWIIVTAQVVYIVKSERFKESWRGLSWAAFKGLPEFFKLSAASAIMLCLETWYFQILVLVAGLLDNPELALDSLSICMTI
NGLAFMISVGFNAAARVGNELGHGHPKSAAFSVVVVVAISSLVSILVASAVLSFRNVISYLFTDGATVAAAVSDLCPFLAVTLLLNGIQPAWLWDVGGNLLLLWSMCAAI
TLLACPSEFSSMGSAYEDDLRQALLQPAAAALLSSQSLCSNKHEGSDELERILSDTEMPVVQRYSQATWIEIKLLFYLAAPAVFVYMINYAMSTSTQIFSGHLGNLELAA
SSLGNNGIQVFAYGLMRSAILLTLTGVVLTIPYIFCKPILLFLGESKDIASASETFVYGLVPQIYAYALNFPIQKFLQAQSIVFPSAFISAGTLVVHVVLSWLAAYKMGL
GLLGVSLVLSLSWWVIVVGQFVYIVKSGSCKKTWRGFNVQAFSGLFGFFKLSAASAVMLCLETWYFQILVLLAGLLDNPELALDSLSICRLQRCRQCESKQRTGKWESKI
RSVFRGGGGCHLHHCLHLLCRTCTGSSQRVAVGCGWQAFVAYVNVGCYYIVGVPLGALLGFYFKFGAKGIWIGLMGGTFMQTVILVWVTWRTDWNKELYSSRI