; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC11G212380 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC11G212380
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionProtein DETOXIFICATION
Genome locationCicolChr11:4282774..4302713
RNA-Seq ExpressionCcUC11G212380
SyntenyCcUC11G212380
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016020 - membrane (cellular component)
GO:0015297 - antiporter activity (molecular function)
GO:0042910 - xenobiotic transmembrane transporter activity (molecular function)
InterPro domainsIPR002528 - Multi antimicrobial extrusion protein
IPR044644 - Multi antimicrobial extrusion protein DinF-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3966159.1 hypothetical protein CMV_009716 [Castanea mollissima]5.3e-25349.07Show/hide
Query:  MPFKILHSPSSSITPQIHNHKFLNPLSHSCP-SFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHEDHEREV-------SNLEIEDQIDDG---VQGNEQ
        M FK L    +S TP   N  F N L    P S P  P    P   S  S+V         + P +     +       +N E  D  ++G   ++    
Subjt:  MPFKILHSPSSSITPQIHNHKFLNPLSHSCP-SFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHEDHEREV-------SNLEIEDQIDDG---VQGNEQ

Query:  LLGTG---IDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHH
        +       + E+GS  +  QMKEIV FTGPA GLWICGPLMSLIDTAVIGQGS+IELAALGP TV+CDY SY FMFLSIATSNMVATALA+QDK EVQHH
Subjt:  LLGTG---IDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHH

Query:  ISVLLFVGLMSGFLMLLVTKLLGLVALTA-----------------------------------------------------------------------
        IS+LLFVGL  G LMLL T+  G   LTA                                                                       
Subjt:  ISVLLFVGLMSGFLMLLVTKLLGLVALTA-----------------------------------------------------------------------

Query:  ----------------YMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQT
                        +MM+E LN+KGY+ +S+SVPSP E +++LGLAAPVF+T+MSK+ F++LL Y A S+GTY+MAAHQVM QTF  C+V GEPL+QT
Subjt:  ----------------YMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQT

Query:  AQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMC
        AQSFMP  I GV+RSL+KARMLLKSL+IIGAI GL+LG +GT VPWLFP +FT +  +I EMH+VLIP+F+AL + P TL LEGTLLAGRDLKF+SLSM 
Subjt:  AQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMC

Query:  GCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFK
        GC S AALLLL                          +V++RGYGL G W  L+GFQWARF  +LRR+LSP G+LYS DL   K               +
Subjt:  GCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFK

Query:  ILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLPLHFSSQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQ
         L    ++   I   N                 S P I   S+S                   EAS            +E E++      ELGS+ +  Q
Subjt:  ILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLPLHFSSQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQ

Query:  MKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTK
        MKEIV FTGPA  LWICGPLMSLIDTAVIGQGS+VELAALGP TV+CDY  Y FMFLSIATSNM        DKNEVQHHIS+LLFVGL  G LMLL TK
Subjt:  MKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTK

Query:  LMGPVALTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMI
          G   LTAF G KNA +IP++N Y+                                                                 QVI+ +MMI
Subjt:  LMGPVALTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMI

Query:  EALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAW
        EALNKKGYN YS+SVPS  E L++LGLAAPVF+TMMSKV FYSLLIYFATSMGT+TMAAHQVMIQTF MCTVWGEPLSQTAQSFMP LI G  RSL+KA 
Subjt:  EALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAW

Query:  MLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGL
        MLL+SL+IIG + GL+LG +GTSVPWLFP +FT +  +IQEMHKVLIPYF+AL +TPPTHCLEGT                        LL+++SRGYGL
Subjt:  MLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGL

Query:  TGCWYALVGFQWARFLSALRRILSPHGILYSNE
         GCW+ LVGFQWARFL +L R+LSP+GILYS +
Subjt:  TGCWYALVGFQWARFLSALRRILSPHGILYSNE

KAF4369626.1 hypothetical protein G4B88_021431 [Cannabis sativa]4.5e-22849.95Show/hide
Query:  PFKILHSPSSSITPQI-HNHKFLNPLSHSCPSFPFTPTIRF--PSSSSPSSVVVCSPITRRFAV----PHEDHEREVSNLE--IEDQIDDGVQGNEQLLG
        P+ +LH+P S    Q+  +  F N  S+S  S P     RF   +S+SP S+       RRF +          RE+S+ E    D+   GV   E    
Subjt:  PFKILHSPSSSITPQI-HNHKFLNPLSHSCPSFPFTPTIRF--PSSSSPSSVVVCSPITRRFAV----PHEDHEREVSNLE--IEDQIDDGVQGNEQLLG

Query:  TGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF
        TG   LG+ G+ +QMKEI  FTGPA GLWICGPLMSLIDTAVIGQ S++ELAALGPATVLCD  SY+FMFLSIATSNMVATALA++DK EVQHHISVLLF
Subjt:  TGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF

Query:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS
        VGL  GF+MLL T+  G  ALT                      AYMMIE LN+KGY+ YS+S+PSP + ++I  LAAPVFITLM+KI FYTLL+Y ATS
Subjt:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS

Query:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL
        +GT T AAHQVM Q  +MC+V GEPLSQTAQSFMP  I+GV R+L+KAR LLKSL+++G +FG+VLGT GT +PWLFPN++TP+  II+EMH VLIPYFL
Subjt:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL

Query:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP
         +L  P T  LEGTLLAGRDLKFIS++M GC    A+LL                          ++VS +G+GL G W  L     +R  N  +  +  
Subjt:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP

Query:  K---GVLYSSDLSH-------YKPNFAPFNS-------QAPKMPFK-ILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLP-LHFS
             ++Y +  S         +      NS        +P  P K ++  H S        N LRP S  S  F +   ++P     +++  LP L F 
Subjt:  K---GVLYSSDLSH-------YKPNFAPFNS-------QAPKMPFK-ILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLP-LHFS

Query:  SQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGLSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV
        S+ R R +    A+  + ++  G    EN  + G G  E       Q+KEI  FTGPA+GLWICGPLMSLIDT VIGQGS++ELAALGP TV CDY SYV
Subjt:  SQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGLSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV

Query:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPSS
        FMFLSIATSNM        DKNEVQHHISVLLFVGL  GFLM   T+  G  +LTAF G  N  I+PA+NTY+QV++ YMM+E LNKKGYN Y+LS+PS 
Subjt:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPSS

Query:  GEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLG
         E ++IL LAAPVF+TM SKV FYSLLIYFATSMGT +MAAHQVMIQTFCMCTVW                                            G
Subjt:  GEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLG

Query:  TIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLSA
        +IGT++PWLFPN+FTP+  +IQEMHKVLIPY LA+V TP  H LEGTLL                        +IS++G+GL  CWY L  FQWARF  A
Subjt:  TIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLSA

Query:  LRRILSPHGILYSNE----NKEIHKAV
        LRR+LSP+G+L+S +    N E  KAV
Subjt:  LRRILSPHGILYSNE----NKEIHKAV

KAF9676073.1 hypothetical protein SADUNF_Sadunf09G0100500 [Salix dunnii]5.0e-23547.09Show/hide
Query:  SNLEIEDQIDDGV----QGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIAT
        +N EIE + D  +    +  E  +    + L S  L +Q+KEIV FTGPA GLW+CGPLMSLIDT VIGQGS IELAALGPATVLCDY SYVFMFLSIAT
Subjt:  SNLEIEDQIDDGV----QGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIAT

Query:  SNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALTA----------------------------------------------------
        SNMVAT +A++DKN+VQH IS+LLFVG+  G LMLL T+  G  ALTA                                                    
Subjt:  SNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALTA----------------------------------------------------

Query:  -----------------------------------YMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAH-
                                           YMMIE LN+KGY+ +++SVP+  E ++++GLAAPVF+T++SK+ FY+L+IY ATS+GT+++AAH 
Subjt:  -----------------------------------YMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAH-

Query:  ---------QVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL
                 QVM Q   MC+V+GEPLSQTAQSFMP  I GVNRSL KAR LLKSL+ IGA  GL+LGTIGT  PWLFPN+FT + K+IQEM+KVL+P+F+
Subjt:  ---------QVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL

Query:  ALLIMPATLCLEGTL---------------------------------------LAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINF
        A+ + P+  CLEGTL                                       LAGRDL+F+S SM GC S  A++L++                    
Subjt:  ALLIMPATLCLEGTL---------------------------------------LAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINF

Query:  MACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK-PNFAPFNSQAPKMPFK---------------ILHYHSSSSPRIHYPN
               S+RGYGL GCW ALVGFQWARF  +LRR+LS  G+L+S DLS Y+        S   K+PF                ++  H +S P    P+
Subjt:  MACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK-PNFAPFNSQAPKMPFK---------------ILHYHSSSSPRIHYPN

Query:  ILRPFSPPSFPFTHQS--LSSPTISFPSSSSPLPLHFSSQIRRRFAVPHEASSLETESEIGVE--VQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAI
          RP S  S P +H S    SP +  P S    P +  ++     +V    SS ++ SE   E  +Q N        E L +Q +  QMKEIV FTGPA 
Subjt:  ILRPFSPPSFPFTHQS--LSSPTISFPSSSSPLPLHFSSQIRRRFAVPHEASSLETESEIGVE--VQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAI

Query:  GLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVG
        GLWICGPLMSLIDTAVIGQGS++ELAALGP TVLCD  SY+FMFLSIATSNM        DKNEVQH +S+LLF+GL  G LM L TK  GP ALT   G
Subjt:  GLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVG

Query:  IKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGYNGYS
          N DIIPA+NTY+                                                                 QV++A+MMI++LNKKGYN Y+
Subjt:  IKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGYNGYS

Query:  LSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAM
        +SVPS+ + + +  LAAP F+ M+SKV F+SL++YF TSM T T+AAHQVMIQ F MCTVWGEPLSQ AQSFMP L+ G+NRSL+KA  +LKSL IIG +
Subjt:  LSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAM

Query:  FGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLLVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKE
         GL LG +GTSVPWLFP++FT + KIIQEMHKVLIPYFLAL +TP    LEGTLL +SSRGYGL G W+ALVGFQWARF  AL+R+LSP G+L+S ++++
Subjt:  FGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLLVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKE

KAG7035062.1 Protein DETOXIFICATION 46, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]3.9e-26453.92Show/hide
Query:  MAELSLSLAPFTSQPPKMPFKILHSPSSSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQIDDG
        MAE SLSLAP   Q PKM F+ LH P SSI  +IH  + L P      SFPF+      +++SP SV V   + RRFAVP ++ ERE SN     +ID+ 
Subjt:  MAELSLSLAPFTSQPPKMPFKILHSPSSSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQIDDG

Query:  VQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEV
        VQ NEQLLG G +ELG  GLL+QMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSA+ELAALGPATVLCDYTS+VFMFLSIATSNMVATALAKQDKNEV
Subjt:  VQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEV

Query:  QHHISVLLFVGLMSGFLMLLVTKLLGLVALTAYMMIEQLN-------------------EKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLL
        QHHIS LLFVGL+SGFLMLL TKLLG VALTA++  +  N                     G+   S S+     +  +  LA    +  +  IV    L
Subjt:  QHHISVLLFVGLMSGFLMLLVTKLLGLVALTAYMMIEQLN-------------------EKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLL

Query:  IYHATSIGTYTMAAHQVMSQTF-----------YMCS--------VLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPW
         Y        TMA+  + +              Y  S        +LGEPLSQTAQ+FMPG I GVNRS DKARMLLKSLLIIGAIFGLVLGTIGT VPW
Subjt:  IYHATSIGTYTMAAHQVMSQTF-----------YMCS--------VLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPW

Query:  LFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGL
        LFPNLFTP+ KIIQEMHKVLIPYFLAL+IMPATL LEG+LLAGRDLKFISLSMCGC S  A+LLL                                   
Subjt:  LFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGL

Query:  VGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFKILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSP
                                                        PKM  K  H  SS++ +IH   ILRP   PS PFT++SLS+  + F S SSP
Subjt:  VGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFKILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSP

Query:  LPLH-FSSQIRRRFAVPH-----EASSLETESEIGVEVQENEQLYGTGGEELGSQG-LSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAAL
          ++   S+IRRRFAVP+     E SSLE ESE+   +Q NEQ        LG+QG L+QMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAAL
Subjt:  LPLH-FSSQIRRRFAVPH-----EASSLETESEIGVEVQENEQLYGTGGEELGSQG-LSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAAL

Query:  GPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALT----------------AFVGIKNADIIPASNT
        GPATVLCDYTSYVFMFLSIATSNM        DKNEVQHHIS+LLFVGL SGF ML+ TKL G VALT                AF G KNAD+IPA+N 
Subjt:  GPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALT----------------AFVGIKNADIIPASNT

Query:  YI-----------------------------------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSI
        YI                                                                 QVI+AYMM+ ALNKKGY+ YS SVPS+GEFLSI
Subjt:  YI-----------------------------------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSI

Query:  LGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSV
        LG+AAP+FLTMMSKVVFYSLLIY+ATSMGTHTMAAHQVM+QTFCMCTVW                                          VLGTIGTSV
Subjt:  LGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSV

Query:  PWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGLTGCWYALVGFQWARFLSALRRILS
        P  FPNLFTPE KIIQEMHKVL PYFLALVITPPT CLEGT                        LLVISSRGYGL GCWYALVGFQWARFLSALRR+LS
Subjt:  PWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGLTGCWYALVGFQWARFLSALRRILS

Query:  PHGILYSNE
        P G+LYS++
Subjt:  PHGILYSNE

TXG64624.1 hypothetical protein EZV62_011618 [Acer yangbiense]1.5e-23952.14Show/hide
Query:  NQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLV
        NQMKEIV FT PA GLWICGPLMSLIDTAVIGQGS+IELAALGPATV+CDY +YVFMFLSIATSNMVAT+LAKQDKNEVQH ISVLLFVGL  G LM L 
Subjt:  NQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLV

Query:  TKLLGLVALT------------------------------------------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPV
        T+  G  ALT                                                       +MMI+ LN+KGY+ ++ +VPSPSE  +I GLA PV
Subjt:  TKLLGLVALT------------------------------------------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPV

Query:  FITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNL
        F+T+M+KI FY+L+IY ATS+GT T+AAHQVM Q F MC+V GEPLSQ AQSFMP  I G NRSL KARMLLKSL IIGA  GLVLG IGT VPWLFP++
Subjt:  FITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNL

Query:  FTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWC
        FTP+  +IQEMHKVL+PYFLAL + P+T  LEGTLLAGRDLKFISLSM GC S  AL+LL                          +VS+RGYGL GCW 
Subjt:  FTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWC

Query:  ALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFKILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLPLHF
        AL GFQWARF  +L RVLS  G+L+S DL+ Y                                  LR F P      H  L     S  + S  +PL  
Subjt:  ALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFKILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLPLHF

Query:  SSQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTS
        S                E E E  VEV+          E+L +Q + +QMKEI+ FTGPA  LWICGPLMSLIDTAV+GQGS++ELAALGP TV CD  S
Subjt:  SSQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTS

Query:  YVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI-------------------------
        Y+FMFLSIATSNM        DKN VQH IS+LLFVGL+ G LMLL TK  G  ALT   G KN  ++PA+NTY+                         
Subjt:  YVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI-------------------------

Query:  ----------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATS
                                                QVI+AYMMI+ LNKKGYN +++SVPS  + L I  +AAP+F+ M+SKV FY+L+IYFATS
Subjt:  ----------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATS

Query:  MGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVL
        MGT T+AAHQ     V+IQTF M TV GEPLSQTAQSFMP  + G NR+L KA MLLKSL+IIG + GL LG +GTSVPWLFPN+FTP+  IIQEMHKVL
Subjt:  MGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVL

Query:  IPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYS
        +P+F+ L +TP    LEGTL                     L++SSRGYGLTG WYALVGF WARF  AL+R+LSP GIL+S
Subjt:  IPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYS

TrEMBL top hitse value%identityAlignment
A0A0A0LLA6 Protein DETOXIFICATION1.4e-20670.17Show/hide
Query:  MAELSLSLAPFTSQPPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSCP--SFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQI
        MA+LSLSL PF+  PPKMPFK LHSPS SSI PQ H  KF NPLSHS P  SF FTPT+ FPS S P  + V SPITR FA+PH+DH REVS+ E   + 
Subjt:  MAELSLSLAPFTSQPPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSCP--SFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQI

Query:  DDGVQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDK
        D+GVQGNEQLL TGI +L S GL+NQMKEIVTFTGPAIGLWICGP+MSLIDTAVIGQGSA+ELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDK
Subjt:  DDGVQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDK

Query:  NEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT------------------------------------------------------------------
        NEVQHHISVLLFVGLMSG LMLLVTKLLG +ALT                                                                  
Subjt:  NEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT------------------------------------------------------------------

Query:  ---------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLG
                             AYMMIEQLN+KGYSGYSLS+PSPSEF+SILGLAAPVFITLMSKIVFYTLLIYHATSIGT+TMAAHQVMSQTFYMCSVLG
Subjt:  ---------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLG

Query:  EPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKF
        EPLSQTAQSFMPGFI+GVNRSLDKARMLLKSLLIIG IFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKF
Subjt:  EPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKF

Query:  ISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK
        ISLSMCGCLSF ALLLL                           V++RGYGL GCWCALVGFQWARF NALRRVLSP GVLYSSD+SHY+
Subjt:  ISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK

A0A1S3CEY5 Protein DETOXIFICATION1.7e-20970.97Show/hide
Query:  MAELSLSLAPFTSQPPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSS-SSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQID
        MA+LSLSL PF+  PPKMPFK LHSPS SS  PQIH  KF NPL HS PSF FTPTI FP+S SSP  V V SPITRRF++PH+DHEREVS++EI  + +
Subjt:  MAELSLSLAPFTSQPPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSS-SSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQID

Query:  DGVQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
        +GVQGNEQLL TGI +L S GLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSA+ELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
Subjt:  DGVQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN

Query:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------
        EVQHHISVLLFVGLMSG LMLLVTKLLG +ALT                                                                   
Subjt:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------

Query:  --------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE
                            AYMMIEQLN+KGYSGYSLSVPSP EF+SILGLAAPVFITLMSKIVFYTLLIYHATS+GTYTMAAHQVMSQTFYMCSVLGE
Subjt:  --------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE

Query:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
        PLSQTAQSFMPGFI+GVNRSLDKARMLLKSLLIIG IFGLVLG IGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
Subjt:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI

Query:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK
        SLSMCGCLSF ALLLL                          +V+NRGYGL GCWCALVGFQWARF +ALRRVLSP GVLYSSD+SHY+
Subjt:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK

A0A5C7I673 Protein DETOXIFICATION7.3e-24052.14Show/hide
Query:  NQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLV
        NQMKEIV FT PA GLWICGPLMSLIDTAVIGQGS+IELAALGPATV+CDY +YVFMFLSIATSNMVAT+LAKQDKNEVQH ISVLLFVGL  G LM L 
Subjt:  NQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLV

Query:  TKLLGLVALT------------------------------------------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPV
        T+  G  ALT                                                       +MMI+ LN+KGY+ ++ +VPSPSE  +I GLA PV
Subjt:  TKLLGLVALT------------------------------------------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPV

Query:  FITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNL
        F+T+M+KI FY+L+IY ATS+GT T+AAHQVM Q F MC+V GEPLSQ AQSFMP  I G NRSL KARMLLKSL IIGA  GLVLG IGT VPWLFP++
Subjt:  FITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNL

Query:  FTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWC
        FTP+  +IQEMHKVL+PYFLAL + P+T  LEGTLLAGRDLKFISLSM GC S  AL+LL                          +VS+RGYGL GCW 
Subjt:  FTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWC

Query:  ALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFKILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLPLHF
        AL GFQWARF  +L RVLS  G+L+S DL+ Y                                  LR F P      H  L     S  + S  +PL  
Subjt:  ALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFKILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLPLHF

Query:  SSQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTS
        S                E E E  VEV+          E+L +Q + +QMKEI+ FTGPA  LWICGPLMSLIDTAV+GQGS++ELAALGP TV CD  S
Subjt:  SSQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTS

Query:  YVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI-------------------------
        Y+FMFLSIATSNM        DKN VQH IS+LLFVGL+ G LMLL TK  G  ALT   G KN  ++PA+NTY+                         
Subjt:  YVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI-------------------------

Query:  ----------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATS
                                                QVI+AYMMI+ LNKKGYN +++SVPS  + L I  +AAP+F+ M+SKV FY+L+IYFATS
Subjt:  ----------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATS

Query:  MGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVL
        MGT T+AAHQ     V+IQTF M TV GEPLSQTAQSFMP  + G NR+L KA MLLKSL+IIG + GL LG +GTSVPWLFPN+FTP+  IIQEMHKVL
Subjt:  MGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVL

Query:  IPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYS
        +P+F+ L +TP    LEGTL                     L++SSRGYGLTG WYALVGF WARF  AL+R+LSP GIL+S
Subjt:  IPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYS

A0A5D3BWW2 Protein DETOXIFICATION1.7e-20970.97Show/hide
Query:  MAELSLSLAPFTSQPPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSS-SSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQID
        MA+LSLSL PF+  PPKMPFK LHSPS SS  PQIH  KF NPL HS PSF FTPTI FP+S SSP  V V SPITRRF++PH+DHEREVS++EI  + +
Subjt:  MAELSLSLAPFTSQPPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSS-SSPSSVVVCSPITRRFAVPHEDHEREVSNLEIEDQID

Query:  DGVQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
        +GVQGNEQLL TGI +L S GLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSA+ELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
Subjt:  DGVQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN

Query:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------
        EVQHHISVLLFVGLMSG LMLLVTKLLG +ALT                                                                   
Subjt:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------

Query:  --------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE
                            AYMMIEQLN+KGYSGYSLSVPSP EF+SILGLAAPVFITLMSKIVFYTLLIYHATS+GTYTMAAHQVMSQTFYMCSVLGE
Subjt:  --------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE

Query:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
        PLSQTAQSFMPGFI+GVNRSLDKARMLLKSLLIIG IFGLVLG IGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
Subjt:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI

Query:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK
        SLSMCGCLSF ALLLL                          +V+NRGYGL GCWCALVGFQWARF +ALRRVLSP GVLYSSD+SHY+
Subjt:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYK

A0A7J6FFZ4 Protein DETOXIFICATION2.2e-22849.95Show/hide
Query:  PFKILHSPSSSITPQI-HNHKFLNPLSHSCPSFPFTPTIRF--PSSSSPSSVVVCSPITRRFAV----PHEDHEREVSNLE--IEDQIDDGVQGNEQLLG
        P+ +LH+P S    Q+  +  F N  S+S  S P     RF   +S+SP S+       RRF +          RE+S+ E    D+   GV   E    
Subjt:  PFKILHSPSSSITPQI-HNHKFLNPLSHSCPSFPFTPTIRF--PSSSSPSSVVVCSPITRRFAV----PHEDHEREVSNLE--IEDQIDDGVQGNEQLLG

Query:  TGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF
        TG   LG+ G+ +QMKEI  FTGPA GLWICGPLMSLIDTAVIGQ S++ELAALGPATVLCD  SY+FMFLSIATSNMVATALA++DK EVQHHISVLLF
Subjt:  TGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF

Query:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS
        VGL  GF+MLL T+  G  ALT                      AYMMIE LN+KGY+ YS+S+PSP + ++I  LAAPVFITLM+KI FYTLL+Y ATS
Subjt:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS

Query:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL
        +GT T AAHQVM Q  +MC+V GEPLSQTAQSFMP  I+GV R+L+KAR LLKSL+++G +FG+VLGT GT +PWLFPN++TP+  II+EMH VLIPYFL
Subjt:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL

Query:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP
         +L  P T  LEGTLLAGRDLKFIS++M GC    A+LL                          ++VS +G+GL G W  L     +R  N  +  +  
Subjt:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP

Query:  K---GVLYSSDLSH-------YKPNFAPFNS-------QAPKMPFK-ILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLP-LHFS
             ++Y +  S         +      NS        +P  P K ++  H S        N LRP S  S  F +   ++P     +++  LP L F 
Subjt:  K---GVLYSSDLSH-------YKPNFAPFNS-------QAPKMPFK-ILHYHSSSSPRIHYPNILRPFSPPSFPFTHQSLSSPTISFPSSSSPLP-LHFS

Query:  SQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGLSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV
        S+ R R +    A+  + ++  G    EN  + G G  E       Q+KEI  FTGPA+GLWICGPLMSLIDT VIGQGS++ELAALGP TV CDY SYV
Subjt:  SQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGLSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV

Query:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPSS
        FMFLSIATSNM        DKNEVQHHISVLLFVGL  GFLM   T+  G  +LTAF G  N  I+PA+NTY+QV++ YMM+E LNKKGYN Y+LS+PS 
Subjt:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPSS

Query:  GEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLG
         E ++IL LAAPVF+TM SKV FYSLLIYFATSMGT +MAAHQVMIQTFCMCTVW                                            G
Subjt:  GEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLG

Query:  TIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLSA
        +IGT++PWLFPN+FTP+  +IQEMHKVLIPY LA+V TP  H LEGTLL                        +IS++G+GL  CWY L  FQWARF  A
Subjt:  TIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLSA

Query:  LRRILSPHGILYSNE----NKEIHKAV
        LRR+LSP+G+L+S +    N E  KAV
Subjt:  LRRILSPHGILYSNE----NKEIHKAV

SwissProt top hitse value%identityAlignment
Q8W4G3 Protein DETOXIFICATION 46, chloroplastic4.3e-12048.25Show/hide
Query:  SPRIHYPNILRPFS-PPSFPFTHQSLSSPTISFPSSSSPLP--LHFSSQIRRRFAVPHEA-----SSLETESEIG--VEVQENEQLYGTGGEELGSQGL-
        +P++ +P+ L   S  PSFP    S  S  +S P SS  L   L   +   +   V  E      S L+ ++  G    V+   ++     ++L +Q + 
Subjt:  SPRIHYPNILRPFS-PPSFPFTHQSLSSPTISFPSSSSPLP--LHFSSQIRRRFAVPHEA-----SSLETESEIG--VEVQENEQLYGTGGEELGSQGL-

Query:  SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLL
         QMKEIV FTGPA GLW+CGPLMSLIDTAVIGQGS++ELAALGPATV+CDY  Y FMFLS+ATSN+        DK+EVQH IS+LLF+GL  G  M++L
Subjt:  SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLL

Query:  TKLMGPVALTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYM
        T+L G  ALTAF G+KNADI+PA+N Y+                                                                 QV++AYM
Subjt:  TKLMGPVALTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYM

Query:  MIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDK
        M++ALNKKGY+ +S  VPS  E L+I GLAAPVF+TMMSKV+FY+LL+YFATSMGT+ +AAHQVM+Q + M TVWGEPLSQTAQSFMP L+ G+NR+L K
Subjt:  MIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDK

Query:  AWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGY
        A +LLKSL+IIGA  G+V+GTIGT+VPWLFP +FT +  +  EMHKV+IPYFLAL ITP TH LEGTLL                        ++S+ G+
Subjt:  AWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGY

Query:  GLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKEIHKAV
        GL GCWYALVGFQWARF  +L R+LS  G+LYS +     + V
Subjt:  GLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKEIHKAV

Q945F0 Protein DETOXIFICATION 47, chloroplastic2.1e-11149.48Show/hide
Query:  IRRRFAVPH-EASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV
        IRRR  +     + +  + EI  E +E E+  G    +L  Q +  QMKEIV FTGPA+G+WICGPLMSLIDT VIGQGS++ELAALGP TVLCD+ SYV
Subjt:  IRRRFAVPH-EASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV

Query:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI---------------------------
        FMFLS+ATSNM        DK E QH ISVLLF+GL+ G +MLLLT+L GP A+TAF   KN +I+PA+N YI                           
Subjt:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI---------------------------

Query:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG
                                              Q++SAYMM+++LNK+GYN YS ++PS  E   I  LAAPVF+++ SK+ FYS +IY ATSMG
Subjt:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG

Query:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL
        TH +AAHQVM QT+ MC VWGEPLSQTAQSFMP ++ G NR+L KA  LLKSL+IIGA  GLVLG IGT+VP LFP ++T +  II EMH++LIP+F+AL
Subjt:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL

Query:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN
           P T  LEGTLL                         ++  GYGL GCW+ LVGFQW RF   LRR+LSP GIL S+
Subjt:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN

Arabidopsis top hitse value%identityAlignment
AT2G21340.1 MATE efflux family protein3.0e-12148.25Show/hide
Query:  SPRIHYPNILRPFS-PPSFPFTHQSLSSPTISFPSSSSPLP--LHFSSQIRRRFAVPHEA-----SSLETESEIG--VEVQENEQLYGTGGEELGSQGL-
        +P++ +P+ L   S  PSFP    S  S  +S P SS  L   L   +   +   V  E      S L+ ++  G    V+   ++     ++L +Q + 
Subjt:  SPRIHYPNILRPFS-PPSFPFTHQSLSSPTISFPSSSSPLP--LHFSSQIRRRFAVPHEA-----SSLETESEIG--VEVQENEQLYGTGGEELGSQGL-

Query:  SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLL
         QMKEIV FTGPA GLW+CGPLMSLIDTAVIGQGS++ELAALGPATV+CDY  Y FMFLS+ATSN+        DK+EVQH IS+LLF+GL  G  M++L
Subjt:  SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLL

Query:  TKLMGPVALTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYM
        T+L G  ALTAF G+KNADI+PA+N Y+                                                                 QV++AYM
Subjt:  TKLMGPVALTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYM

Query:  MIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDK
        M++ALNKKGY+ +S  VPS  E L+I GLAAPVF+TMMSKV+FY+LL+YFATSMGT+ +AAHQVM+Q + M TVWGEPLSQTAQSFMP L+ G+NR+L K
Subjt:  MIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDK

Query:  AWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGY
        A +LLKSL+IIGA  G+V+GTIGT+VPWLFP +FT +  +  EMHKV+IPYFLAL ITP TH LEGTLL                        ++S+ G+
Subjt:  AWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGY

Query:  GLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKEIHKAV
        GL GCWYALVGFQWARF  +L R+LS  G+LYS +     + V
Subjt:  GLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKEIHKAV

AT2G21340.2 MATE efflux family protein8.2e-11946.45Show/hide
Query:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHED--HEREVSNLEIEDQIDDGVQGN-------EQ
        PK+PF       SS+T +  N           PSF   P+ R  + S P S +  +   R  A  +++   + E  N  I +   D   G+        +
Subjt:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHED--HEREVSNLEIEDQIDDGVQGN-------EQ

Query:  LLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV
        +    +D+L +  +  QMKEIV FTGPA GLW+CGPLMSLIDTAVIGQGS++ELAALGPATV+CDY  Y FMFLS+ATSN+VAT+LA+QDK+EVQH IS+
Subjt:  LLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV

Query:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------
        LLF+GL  G  M+++T+L G  ALT                                                                           
Subjt:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------

Query:  ---------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMP
                 AYMM++ LN+KGYS +S  VPSPSE ++I GLAAPVFIT+MSK++FYTLL+Y ATS+GT  +AAHQVM Q + M +V GEPLSQTAQSFMP
Subjt:  ---------AYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMP

Query:  GFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFA
          + G+NR+L KAR+LLKSL+IIGA  G+V+GTIGT VPWLFP +FT +  +  EMHKV+IPYFLAL I P+T  LEGTLLAGRDL++ISLSM GCL+ A
Subjt:  GFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFA

Query:  ALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHY
         LLL+++                          SN G+GL GCW ALVGFQWARF  +L R+LS  GVLYS D S Y
Subjt:  ALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHY

AT4G39030.1 MATE efflux family protein1.5e-11249.48Show/hide
Query:  IRRRFAVPH-EASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV
        IRRR  +     + +  + EI  E +E E+  G    +L  Q +  QMKEIV FTGPA+G+WICGPLMSLIDT VIGQGS++ELAALGP TVLCD+ SYV
Subjt:  IRRRFAVPH-EASSLETESEIGVEVQENEQLYGTGGEELGSQGL-SQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV

Query:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI---------------------------
        FMFLS+ATSNM        DK E QH ISVLLF+GL+ G +MLLLT+L GP A+TAF   KN +I+PA+N YI                           
Subjt:  FMFLSIATSNM--------DKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYI---------------------------

Query:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG
                                              Q++SAYMM+++LNK+GYN YS ++PS  E   I  LAAPVF+++ SK+ FYS +IY ATSMG
Subjt:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG

Query:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL
        TH +AAHQVM QT+ MC VWGEPLSQTAQSFMP ++ G NR+L KA  LLKSL+IIGA  GLVLG IGT+VP LFP ++T +  II EMH++LIP+F+AL
Subjt:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL

Query:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN
           P T  LEGTLL                         ++  GYGL GCW+ LVGFQW RF   LRR+LSP GIL S+
Subjt:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGTGGCTTCACTTCATCTTCGTTCTTCTTCACTCCTTACTCACTACGCTCCCATGGCGGAGCTTTCGCTCTCTCTCGCTCCCTTCACTTCTCAGCCGCCAAAAAT
GCCTTTCAAAATCCTCCATTCTCCTTCTTCTTCAATCACTCCTCAAATCCATAACCACAAATTTCTCAATCCTCTCTCTCACTCTTGCCCTTCCTTTCCCTTCACCCCCA
CCATTCGCTTTCCTTCTTCTTCCTCCCCTTCATCCGTCGTTGTATGCTCGCCGATTACCCGCCGTTTCGCGGTTCCTCACGAGGATCATGAGCGGGAAGTCAGTAACCTT
GAGATTGAGGACCAAATTGACGATGGAGTACAGGGAAATGAGCAGTTATTGGGGACTGGAATAGATGAATTGGGGAGCATAGGGTTGTTGAATCAGATGAAGGAGATTGT
AACGTTTACCGGACCTGCCATTGGGTTGTGGATTTGTGGACCATTGATGAGTCTCATTGACACTGCGGTTATAGGCCAGGGGAGCGCCATTGAGCTTGCTGCTTTAGGCC
CAGCGACAGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTCAGTATCGCAACTTCAAATATGGTAGCCACGGCCCTTGCCAAACAGGATAAAAACGAAGTGCAA
CATCACATATCTGTATTGCTATTTGTTGGGCTGATGTCTGGTTTCTTGATGCTCTTAGTTACCAAACTATTGGGTTTAGTGGCGCTAACTGCTTATATGATGATAGAGCA
ACTAAACGAGAAAGGATATAGTGGATATTCTCTCTCTGTTCCCTCGCCTAGTGAATTTGTGTCAATACTTGGACTTGCTGCTCCTGTATTTATAACACTGATGTCAAAGA
TAGTTTTTTATACTCTCCTCATTTATCATGCTACGTCTATAGGCACATACACCATGGCTGCTCATCAGGTCATGAGTCAAACATTCTACATGTGTAGCGTATTGGGTGAA
CCGCTTTCTCAAACTGCCCAATCATTTATGCCTGGATTCATAAATGGAGTGAATCGTAGTCTGGATAAGGCTCGGATGCTACTCAAGTCACTCTTGATCATAGGAGCTAT
CTTTGGTTTGGTATTAGGGACTATCGGAACGTTAGTTCCTTGGTTGTTCCCCAATCTCTTTACACCTGAAGTGAAGATTATTCAAGAGATGCATAAAGTGTTGATTCCAT
ATTTTTTGGCCTTACTCATAATGCCTGCAACGCTTTGCTTGGAAGGGACGTTATTGGCTGGACGTGACCTAAAATTTATTAGTTTATCAATGTGCGGATGCCTTTCCTTT
GCTGCCCTTCTATTGCTGGTAATGTTTTATCTCCCAACCTTCCTCCAACCTCATTTCTTCTCCTCTCTGTGCATTAACTTCATGGCTTGCTCCCAGGTTGTAAGCAATAG
GGGTTATGGTTTGGTGGGCTGCTGGTGCGCGCTCGTCGGATTTCAATGGGCTCGGTTTGTCAACGCTCTTCGACGTGTCCTCTCTCCCAAAGGAGTGCTTTACTCCAGTG
ATTTAAGCCATTATAAACCCAACTTCGCTCCTTTCAATTCTCAGGCGCCAAAAATGCCTTTCAAAATCCTCCATTATCATTCTTCATCCTCTCCTCGAATCCATTACCCC
AATATTCTCAGACCATTCTCTCCCCCTTCCTTTCCCTTCACCCATCAATCTCTCTCCTCCCCCACCATTTCCTTTCCTTCCTCTTCCTCCCCTTTACCTCTCCATTTCTC
GTCGCAGATTCGCCGCCGATTCGCGGTTCCTCACGAAGCCAGCAGCCTCGAGACTGAGAGCGAAATTGGCGTTGAAGTACAAGAAAATGAACAATTATACGGAACTGGAG
GCGAAGAATTGGGGAGCCAAGGGTTGAGTCAGATGAAGGAGATTGTAACGTTTACTGGACCTGCCATTGGATTGTGGATTTGTGGACCATTGATGAGTCTCATTGACACT
GCGGTTATTGGCCAGGGAAGCGCTGTTGAGCTTGCTGCTTTAGGCCCAGCGACTGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTTAGTATTGCAACTTCAAA
CATGGATAAAAACGAAGTGCAGCATCACATATCTGTATTGCTGTTTGTCGGGTTGATGTCTGGCTTCTTGATGCTCTTACTTACCAAACTAATGGGTCCAGTGGCGCTAA
CTGCTTTTGTCGGAATAAAGAATGCAGACATCATACCTGCATCAAACACGTATATCCAGGTTATTTCTGCTTATATGATGATAGAAGCACTGAATAAGAAAGGATACAAT
GGATATTCTCTATCGGTTCCCTCATCTGGTGAATTTTTGTCAATACTTGGACTGGCTGCTCCTGTATTTCTAACAATGATGTCAAAGGTGGTTTTTTATTCTCTCCTCAT
TTATTTTGCTACATCTATGGGCACACATACCATGGCTGCTCATCAGGTCATGATTCAAACATTTTGTATGTGTACCGTTTGGGGTGAACCTCTTTCTCAAACTGCTCAGT
CATTTATGCCTGGATTGATAAACGGAGTGAATCGTAGTTTGGATAAGGCTTGGATGTTGCTAAAGTCACTCTTGATTATAGGAGCTATGTTTGGCTTGGTATTAGGGACT
ATTGGAACTTCAGTTCCTTGGTTGTTCCCCAATCTCTTCACACCCGAAGGGAAGATTATTCAGGAGATGCATAAAGTGTTGATTCCATATTTCTTGGCACTAGTCATAAC
ACCCCCGACTCATTGCTTAGAAGGGACGTTATTGGTTATTAGCAGTAGGGGTTATGGTTTGACAGGCTGCTGGTATGCCTTGGTTGGATTTCAATGGGCTCGGTTTCTCA
GTGCTCTTCGACGCATCCTCTCGCCTCATGGAATACTTTACAGCAATGAAAACAAGGAAATCCACAAAGCAGTGTTCTCAAAGGAAGATGAAATCAACAACATAGCAAGG
ACTTGTAAATTCTTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACGTGGCTTCACTTCATCTTCGTTCTTCTTCACTCCTTACTCACTACGCTCCCATGGCGGAGCTTTCGCTCTCTCTCGCTCCCTTCACTTCTCAGCCGCCAAAAAT
GCCTTTCAAAATCCTCCATTCTCCTTCTTCTTCAATCACTCCTCAAATCCATAACCACAAATTTCTCAATCCTCTCTCTCACTCTTGCCCTTCCTTTCCCTTCACCCCCA
CCATTCGCTTTCCTTCTTCTTCCTCCCCTTCATCCGTCGTTGTATGCTCGCCGATTACCCGCCGTTTCGCGGTTCCTCACGAGGATCATGAGCGGGAAGTCAGTAACCTT
GAGATTGAGGACCAAATTGACGATGGAGTACAGGGAAATGAGCAGTTATTGGGGACTGGAATAGATGAATTGGGGAGCATAGGGTTGTTGAATCAGATGAAGGAGATTGT
AACGTTTACCGGACCTGCCATTGGGTTGTGGATTTGTGGACCATTGATGAGTCTCATTGACACTGCGGTTATAGGCCAGGGGAGCGCCATTGAGCTTGCTGCTTTAGGCC
CAGCGACAGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTCAGTATCGCAACTTCAAATATGGTAGCCACGGCCCTTGCCAAACAGGATAAAAACGAAGTGCAA
CATCACATATCTGTATTGCTATTTGTTGGGCTGATGTCTGGTTTCTTGATGCTCTTAGTTACCAAACTATTGGGTTTAGTGGCGCTAACTGCTTATATGATGATAGAGCA
ACTAAACGAGAAAGGATATAGTGGATATTCTCTCTCTGTTCCCTCGCCTAGTGAATTTGTGTCAATACTTGGACTTGCTGCTCCTGTATTTATAACACTGATGTCAAAGA
TAGTTTTTTATACTCTCCTCATTTATCATGCTACGTCTATAGGCACATACACCATGGCTGCTCATCAGGTCATGAGTCAAACATTCTACATGTGTAGCGTATTGGGTGAA
CCGCTTTCTCAAACTGCCCAATCATTTATGCCTGGATTCATAAATGGAGTGAATCGTAGTCTGGATAAGGCTCGGATGCTACTCAAGTCACTCTTGATCATAGGAGCTAT
CTTTGGTTTGGTATTAGGGACTATCGGAACGTTAGTTCCTTGGTTGTTCCCCAATCTCTTTACACCTGAAGTGAAGATTATTCAAGAGATGCATAAAGTGTTGATTCCAT
ATTTTTTGGCCTTACTCATAATGCCTGCAACGCTTTGCTTGGAAGGGACGTTATTGGCTGGACGTGACCTAAAATTTATTAGTTTATCAATGTGCGGATGCCTTTCCTTT
GCTGCCCTTCTATTGCTGGTAATGTTTTATCTCCCAACCTTCCTCCAACCTCATTTCTTCTCCTCTCTGTGCATTAACTTCATGGCTTGCTCCCAGGTTGTAAGCAATAG
GGGTTATGGTTTGGTGGGCTGCTGGTGCGCGCTCGTCGGATTTCAATGGGCTCGGTTTGTCAACGCTCTTCGACGTGTCCTCTCTCCCAAAGGAGTGCTTTACTCCAGTG
ATTTAAGCCATTATAAACCCAACTTCGCTCCTTTCAATTCTCAGGCGCCAAAAATGCCTTTCAAAATCCTCCATTATCATTCTTCATCCTCTCCTCGAATCCATTACCCC
AATATTCTCAGACCATTCTCTCCCCCTTCCTTTCCCTTCACCCATCAATCTCTCTCCTCCCCCACCATTTCCTTTCCTTCCTCTTCCTCCCCTTTACCTCTCCATTTCTC
GTCGCAGATTCGCCGCCGATTCGCGGTTCCTCACGAAGCCAGCAGCCTCGAGACTGAGAGCGAAATTGGCGTTGAAGTACAAGAAAATGAACAATTATACGGAACTGGAG
GCGAAGAATTGGGGAGCCAAGGGTTGAGTCAGATGAAGGAGATTGTAACGTTTACTGGACCTGCCATTGGATTGTGGATTTGTGGACCATTGATGAGTCTCATTGACACT
GCGGTTATTGGCCAGGGAAGCGCTGTTGAGCTTGCTGCTTTAGGCCCAGCGACTGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTTAGTATTGCAACTTCAAA
CATGGATAAAAACGAAGTGCAGCATCACATATCTGTATTGCTGTTTGTCGGGTTGATGTCTGGCTTCTTGATGCTCTTACTTACCAAACTAATGGGTCCAGTGGCGCTAA
CTGCTTTTGTCGGAATAAAGAATGCAGACATCATACCTGCATCAAACACGTATATCCAGGTTATTTCTGCTTATATGATGATAGAAGCACTGAATAAGAAAGGATACAAT
GGATATTCTCTATCGGTTCCCTCATCTGGTGAATTTTTGTCAATACTTGGACTGGCTGCTCCTGTATTTCTAACAATGATGTCAAAGGTGGTTTTTTATTCTCTCCTCAT
TTATTTTGCTACATCTATGGGCACACATACCATGGCTGCTCATCAGGTCATGATTCAAACATTTTGTATGTGTACCGTTTGGGGTGAACCTCTTTCTCAAACTGCTCAGT
CATTTATGCCTGGATTGATAAACGGAGTGAATCGTAGTTTGGATAAGGCTTGGATGTTGCTAAAGTCACTCTTGATTATAGGAGCTATGTTTGGCTTGGTATTAGGGACT
ATTGGAACTTCAGTTCCTTGGTTGTTCCCCAATCTCTTCACACCCGAAGGGAAGATTATTCAGGAGATGCATAAAGTGTTGATTCCATATTTCTTGGCACTAGTCATAAC
ACCCCCGACTCATTGCTTAGAAGGGACGTTATTGGTTATTAGCAGTAGGGGTTATGGTTTGACAGGCTGCTGGTATGCCTTGGTTGGATTTCAATGGGCTCGGTTTCTCA
GTGCTCTTCGACGCATCCTCTCGCCTCATGGAATACTTTACAGCAATGAAAACAAGGAAATCCACAAAGCAGTGTTCTCAAAGGAAGATGAAATCAACAACATAGCAAGG
ACTTGTAAATTCTTTAAGTGA
Protein sequenceShow/hide protein sequence
MHVASLHLRSSSLLTHYAPMAELSLSLAPFTSQPPKMPFKILHSPSSSITPQIHNHKFLNPLSHSCPSFPFTPTIRFPSSSSPSSVVVCSPITRRFAVPHEDHEREVSNL
EIEDQIDDGVQGNEQLLGTGIDELGSIGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQ
HHISVLLFVGLMSGFLMLLVTKLLGLVALTAYMMIEQLNEKGYSGYSLSVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE
PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSF
AALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPKGVLYSSDLSHYKPNFAPFNSQAPKMPFKILHYHSSSSPRIHYP
NILRPFSPPSFPFTHQSLSSPTISFPSSSSPLPLHFSSQIRRRFAVPHEASSLETESEIGVEVQENEQLYGTGGEELGSQGLSQMKEIVTFTGPAIGLWICGPLMSLIDT
AVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNMDKNEVQHHISVLLFVGLMSGFLMLLLTKLMGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYN
GYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAMFGLVLGT
IGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLLVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKEIHKAVFSKEDEINNIAR
TCKFFK