; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G04290 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G04290
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein DETOXIFICATION
Genome locationClcChr11:4078131..4097089
RNA-Seq ExpressionClc11G04290
SyntenyClc11G04290
Gene Ontology termsGO:0055085 - transmembrane transport (biological process)
GO:0016020 - membrane (cellular component)
GO:0015297 - antiporter activity (molecular function)
GO:0042910 - xenobiotic transmembrane transporter activity (molecular function)
InterPro domainsIPR002528 - Multi antimicrobial extrusion protein
IPR044644 - Multi antimicrobial extrusion protein DinF-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3966159.1 hypothetical protein CMV_009716 [Castanea mollissima]4.2e-25849.51Show/hide
Query:  MPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQIDDG---VQGNEQLLGTG---
        M FK L    +S TP   N  F N L   +P    TP  RF      S++ + S   + F           +N E  D  ++G   ++    +       
Subjt:  MPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQIDDG---VQGNEQLLGTG---

Query:  IDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVG
        + E+GSQ +  QMKEIV FTGPA GLWICGPLMSLIDTAVIGQGS+IELAALGP TV+CDY SY FMFLSIATSNMVATALA+QDK EVQHHIS+LLFVG
Subjt:  IDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVG

Query:  LMSGFLMLLVTKLLGLVALTA-------------------------------------------------------------------------------
        L  G LMLL T+  G   LTA                                                                               
Subjt:  LMSGFLMLLVTKLLGLVALTA-------------------------------------------------------------------------------

Query:  --------YMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGF
                +MM+E LNKKGY+ +S+ VPSP E +++LGLAAPVF+T+MSK+ F++LL Y A S+GTY+MAAHQVM QTF  C+V GEPL+QTAQSFMP  
Subjt:  --------YMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGF

Query:  INGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAAL
        I GV+RSL+KARMLLKSL+IIGAI GL+LG +GT VPWLFP +FT +  +I EMH+VLIP+F+AL + P TL LEGTLLAGRDLKF+SLSM GC S AAL
Subjt:  INGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAAL

Query:  LLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLN--FAPFNSQAPKMPFKILHYHS
        LLL                          +V++RGYGL G W  L+GFQWARF  +LRR+LSPNG+LYS DL   KL    A +N+       K L  + 
Subjt:  LLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLN--FAPFNSQAPKMPFKILHYHS

Query:  SSSPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLPLHFSSQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVT
        +S P I                                                        +I S  G   +E E++S     ELGS+ +  QMKEIV 
Subjt:  SSSPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLPLHFSSQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVT

Query:  FTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVA
        FTGPA  LWICGPLMSLIDTAVIGQGS+VELAALGP TV+CDY  Y FMFLSIATSNM        DKNEVQHHIS+LLF+GL  G LMLL TK  G   
Subjt:  FTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVA

Query:  LTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKK
        LTAF G KNA +IP++N Y+                                                                 QVI+ +MMIEALNKK
Subjt:  LTAFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKK

Query:  GYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSL
        GYN YS+SVPS  E L++LGLAAPVF+TMMSKV FYSLLIYFATSMGT+TMAAHQVMIQTF MCTVWGEPLSQTAQSFMP LI G  RSL+KA MLL+SL
Subjt:  GYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSL

Query:  LIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGLTGCWYA
        +IIG I GL+LG +GTSVPWLFP +FT +  +IQEMHKVLIPYF+AL +TPPTHCLEGT                        LL+++SRGYGL GCW+ 
Subjt:  LIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGLTGCWYA

Query:  LVGFQWARFLSALRRILSPHGILYSNE
        LVGFQWARFL +L R+LSP+GILYS +
Subjt:  LVGFQWARFLSALRRILSPHGILYSNE

KAF4369626.1 hypothetical protein G4B88_021431 [Cannabis sativa]6.3e-23050.1Show/hide
Query:  PFKILHSPSSSITPQI-HNHKFLNPLSHSRPSFPFTPTIRF--PSSSSPSSIVVCSPITRRFAVPHDDHE----REVSNLE--IEDQIDDGVQGNEQLLG
        P+ +LH+P S    Q+  +  F N  S+S  S P     RF   +S+SP S+       RRF +    +     RE+S+ E    D+   GV   E    
Subjt:  PFKILHSPSSSITPQI-HNHKFLNPLSHSRPSFPFTPTIRF--PSSSSPSSIVVCSPITRRFAVPHDDHE----REVSNLE--IEDQIDDGVQGNEQLLG

Query:  TGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF
        TG   LG+QG+ +QMKEI  FTGPA GLWICGPLMSLIDTAVIGQ S++ELAALGPATVLCD  SY+FMFLSIATSNMVATALA++DK EVQHHISVLLF
Subjt:  TGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF

Query:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS
        VGL  GF+MLL T+  G  ALT                      AYMMIE LNKKGY+ YS+ +PSP + ++I  LAAPVFITLM+KI FYTLL+Y ATS
Subjt:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS

Query:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL
        +GT T AAHQVM Q  +MC+V GEPLSQTAQSFMP  I+GV R+L+KAR LLKSL+++G +FG+VLGT GT +PWLFPN++TP+  II+EMH VLIPYFL
Subjt:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL

Query:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP
         +L  P T  LEGTLLAGRDLKFIS++M GC    A+LL                          ++VS +G+GL G W  L     +R  N  +  +  
Subjt:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP

Query:  N---GVLYSSDLS-----HYKLNFAPF--NS-------QAPKMPFK-ILHYHSSSSPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLP-LHFS
        N    ++Y +  S     H  L       NS        +P  P K ++  H S        N LRP S  S  F +   ++P     +++  LP L F 
Subjt:  N---GVLYSSDLS-----HYKLNFAPF--NS-------QAPKMPFK-ILHYHSSSSPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLP-LHFS

Query:  SQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSY
        S+ R R +    A+  + ++  G    EN  + G        +G+  Q+KEI  FTGPA+GLWICGPLMSLIDT VIGQGS++ELAALGP TV CDY SY
Subjt:  SQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSY

Query:  VFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPS
        VFMFLSIATSNM        DKNEVQHHISVLLF+GL  GFLM   T+  G  +LTAF G  N  I+PA+NTY+QV++ YMM+E LNKKGYN Y+LS+PS
Subjt:  VFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPS

Query:  SGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVL
          E ++IL LAAPVF+TM SKV FYSLLIYFATSMGT +MAAHQVMIQTFCMCTVW                                            
Subjt:  SGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVL

Query:  GTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLS
        G+IGT++PWLFPN+FTP+  +IQEMHKVLIPY LA+V TP  H LEGTLL                        +IS++G+GL  CWY L  FQWARF  
Subjt:  GTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLS

Query:  ALRRILSPHGILYSNE----NKEIHKAV
        ALRR+LSP+G+L+S +    N E  KAV
Subjt:  ALRRILSPHGILYSNE----NKEIHKAV

KAF9676073.1 hypothetical protein SADUNF_Sadunf09G0100500 [Salix dunnii]1.2e-23646.03Show/hide
Query:  FLNP--LSHSRPSFPF--TPTIRFPSSSSPSSIVVCSPITRRFA----VPHDDHEREVSNLEIEDQIDDGV----QGNEQLLGTGIDELGSQGLLNQMKE
        F NP    H +PS      P     S   PS + + +P    +     +     E   +N EIE + D  +    +  E  +    + L SQ L +Q+KE
Subjt:  FLNP--LSHSRPSFPF--TPTIRFPSSSSPSSIVVCSPITRRFA----VPHDDHEREVSNLEIEDQIDDGV----QGNEQLLGTGIDELGSQGLLNQMKE

Query:  IVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLG
        IV FTGPA GLW+CGPLMSLIDT VIGQGS IELAALGPATVLCDY SYVFMFLSIATSNMVAT +A++DKN+VQH IS+LLFVG+  G LMLL T+  G
Subjt:  IVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLG

Query:  LVALTA---------------------------------------------------------------------------------------YMMIEQL
          ALTA                                                                                       YMMIE L
Subjt:  LVALTA---------------------------------------------------------------------------------------YMMIEQL

Query:  NKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAH----------QVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVN
        NKKGY+ +++ VP+  E ++++GLAAPVF+T++SK+ FY+L+IY ATS+GT+++AAH          QVM Q   MC+V+GEPLSQTAQSFMP  I GVN
Subjt:  NKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAH----------QVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVN

Query:  RSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTL---------------------------
        RSL KAR LLKSL+ IGA  GL+LGTIGT  PWLFPN+FT + K+IQEM+KVL+P+F+A+ + P+  CLEGTL                           
Subjt:  RSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTL---------------------------

Query:  ------------LAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGV
                    LAGRDL+F+S SM GC S  A++L++                           S+RGYGL GCW ALVGFQWARF  +LRR+LS +G+
Subjt:  ------------LAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGV

Query:  LYSSDLSHYKL-NFAPFNSQAPKMPFK---------------ILHYHSSSSPRIRYPNILRPFSPPSFPFTHQS--LSSPAISFPSSSSPLPLHFSSQIR
        L+S DLS Y++       S   K+PF                ++  H +S P    P+  RP S  S P +H S    SP +  P S    P +  ++  
Subjt:  LYSSDLSHYKL-NFAPFNSQAPKMPFK---------------ILHYHSSSSPRIRYPNILRPFSPPSFPFTHQS--LSSPAISFPSSSSPLPLHFSSQIR

Query:  RRFAVPHEASSLEIESEIGVE--VQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVF
           +V    SS +  SE   E  +Q N        E L +Q +  QMKEIV FTGPA GLWICGPLMSLIDTAVIGQGS++ELAALGP TVLCD  SY+F
Subjt:  RRFAVPHEASSLEIESEIGVE--VQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVF

Query:  MFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYI----------------------------
        MFLSIATSNM        DKNEVQH +S+LLFIGL  G LM L TK  GP ALT   G  N DIIPA+NTY+                            
Subjt:  MFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYI----------------------------

Query:  -------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGT
                                             QV++A+MMI++LNKKGYN Y++SVPS+ + + +  LAAP F+ M+SKV F+SL++YF TSM T
Subjt:  -------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGT

Query:  HTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALV
         T+AAHQVMIQ F MCTVWGEPLSQ AQSFMP L+ G+NRSL+KA  +LKSL IIG I GL LG +GTSVPWLFP++FT + KIIQEMHKVLIPYFLAL 
Subjt:  HTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALV

Query:  ITPPTHCLEGTLLVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKE
        +TP    LEGTLL +SSRGYGL G W+ALVGFQWARF  AL+R+LSP G+L+S ++++
Subjt:  ITPPTHCLEGTLLVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKE

KAG7035062.1 Protein DETOXIFICATION 46, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]5.5e-26653.76Show/hide
Query:  MAELSLSLASFTSQLPKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQIDDG
        MAE SLSLA    Q PKM F+ LH P SSI  +IH  + L P    R SFPF+      +++SP S+ V   + RRFAVP D+ ERE SN     +ID+ 
Subjt:  MAELSLSLASFTSQLPKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQIDDG

Query:  VQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEV
        VQ NEQLLG G +ELG QGLL+QMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSA+ELAALGPATVLCDYTS+VFMFLSIATSNMVATALAKQDKNEV
Subjt:  VQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEV

Query:  QHHISVLLFVGLMSGFLMLLVTKLLGLVALTAYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFIT----------------------------LM
        QHHIS LLFVGL+SGFLMLL TKLLG VALTA++  +  N          +P+ + ++ I GLA P  +T                             +
Subjt:  QHHISVLLFVGLMSGFLMLLVTKLLGLVALTAYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFIT----------------------------LM

Query:  SKIVFYTLLIYHATSIGTYTMAAHQVMSQTF-----------YMCS--------VLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVL
          IV    L Y        TMA+  + +              Y  S        +LGEPLSQTAQ+FMPG I GVNRS DKARMLLKSLLIIGAIFGLVL
Subjt:  SKIVFYTLLIYHATSIGTYTMAAHQVMSQTF-----------YMCS--------VLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVL

Query:  GTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQ
        GTIGT VPWLFPNLFTP+ KIIQEMHKVLIPYFLAL+IMPATL LEG+LLAGRDLKFISLSMCGC S  A+LLL                          
Subjt:  GTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQ

Query:  VVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLNFAPFNSQAPKMPFKILHYHSSSSPRIRYPNILRPFSPPSFPFTHQSLSSPA
                                                                 PKM  K  H  SS++ +I    ILRP   PS PFT++SLS+ A
Subjt:  VVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLNFAPFNSQAPKMPFKILHYHSSSSPRIRYPNILRPFSPPSFPFTHQSLSSPA

Query:  ISFPSSSSPLPLH-FSSQIRRRFAVPH-----EASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQ
        + F S SSP  ++   S+IRRRFAVP+     E SSLEIESE+   +Q NEQ        LG+QGL +QMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQ
Subjt:  ISFPSSSSPLPLH-FSSQIRRRFAVPH-----EASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQ

Query:  GSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALT----------------AFVGIKN
        GSAVELAALGPATVLCDYTSYVFMFLSIATSNM        DKNEVQHHIS+LLF+GL SGF ML+ TKL G VALT                AF G KN
Subjt:  GSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALT----------------AFVGIKN

Query:  ADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGYNGYSLSV
        AD+IPA+N YI                                                                 QVI+AYMM+ ALNKKGY+ YS SV
Subjt:  ADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGYNGYSLSV

Query:  PSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGL
        PS+GEFLSILG+AAP+FLTMMSKVVFYSLLIY+ATSMGTHTMAAHQVM+QTFCMCTVW                                          
Subjt:  PSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGL

Query:  VLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGLTGCWYALVGFQWARF
        VLGTIGTSVP  FPNLFTPE KIIQEMHKVL PYFLALVITPPT CLEGT                        LLVISSRGYGL GCWYALVGFQWARF
Subjt:  VLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGT------------------------LLVISSRGYGLTGCWYALVGFQWARF

Query:  LSALRRILSPHGILYSNE
        LSALRR+LSP G+LYS++
Subjt:  LSALRRILSPHGILYSNE

TXG64624.1 hypothetical protein EZV62_011618 [Acer yangbiense]1.2e-24150.73Show/hide
Query:  VPHDDHEREVSNLEIEDQIDDGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFM
        + +D     +S  + E++ ++   G E       + L +Q + NQMKEIV FT PA GLWICGPLMSLIDTAVIGQGS+IELAALGPATV+CDY +YVFM
Subjt:  VPHDDHEREVSNLEIEDQIDDGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFM

Query:  FLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-----------------------------------------------
        FLSIATSNMVAT+LAKQDKNEVQH ISVLLFVGL  G LM L T+  G  ALT                                               
Subjt:  FLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-----------------------------------------------

Query:  -------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGF
                +MMI+ LNKKGY+ ++  VPSPSE  +I GLA PVF+T+M+KI FY+L+IY ATS+GT T+AAHQVM Q F MC+V GEPLSQ AQSFMP  
Subjt:  -------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGF

Query:  INGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAAL
        I G NRSL KARMLLKSL IIGA  GLVLG IGT VPWLFP++FTP+  +IQEMHKVL+PYFLAL + P+T  LEGTLLAGRDLKFISLSM GC S  AL
Subjt:  INGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAAL

Query:  LLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLNFAPFNSQAPKMPFKILHYHSSS
        +LL                          +VS+RGYGL GCW AL GFQWARF  +L RVLS +G+L+S DL+ Y                         
Subjt:  LLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLNFAPFNSQAPKMPFKILHYHSSS

Query:  SPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLPLHFSSQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFT
            +    LR F P      H  L     S  + S  +PL  S                E E E  VEV+          E+L +Q + +QMKEI+ FT
Subjt:  SPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLPLHFSSQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFT

Query:  GPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALT
        GPA  LWICGPLMSLIDTAV+GQGS++ELAALGP TV CD  SY+FMFLSIATSNM        DKN VQH IS+LLF+GL+ G LMLL TK  G  ALT
Subjt:  GPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALT

Query:  AFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGY
           G KN  ++PA+NTY+                                                                 QVI+AYMMI+ LNKKGY
Subjt:  AFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGY

Query:  NGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLL
        N +++SVPS  + L I  +AAP+F+ M+SKV FY+L+IYFATSMGT T+AAHQ     V+IQTF M TV GEPLSQTAQSFMP  + G NR+L KA MLL
Subjt:  NGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLL

Query:  KSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYA
        KSL+IIG I GL LG +GTSVPWLFPN+FTP+  IIQEMHKVL+P+F+ L +TP    LEGTL                     L++SSRGYGLTG WYA
Subjt:  KSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYA

Query:  LVGFQWARFLSALRRILSPHGILYS
        LVGF WARF  AL+R+LSP GIL+S
Subjt:  LVGFQWARFLSALRRILSPHGILYS

TrEMBL top hitse value%identityAlignment
A0A0A0LLA6 Protein DETOXIFICATION1.3e-20770.39Show/hide
Query:  MAELSLSLASFTSQLPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSRP--SFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQI
        MA+LSLSL  F+   PKMPFK LHSPS SSI PQ H  KF NPLSHSRP  SF FTPT+ FPS S P  + V SPITR FA+PHDDH REVS+ E   + 
Subjt:  MAELSLSLASFTSQLPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSRP--SFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQI

Query:  DDGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDK
        D+GVQGNEQLL TGI +L SQGL+NQMKEIVTFTGPAIGLWICGP+MSLIDTAVIGQGSA+ELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDK
Subjt:  DDGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDK

Query:  NEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT------------------------------------------------------------------
        NEVQHHISVLLFVGLMSG LMLLVTKLLG +ALT                                                                  
Subjt:  NEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT------------------------------------------------------------------

Query:  ---------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLG
                             AYMMIEQLNKKGYSGYSL +PSPSEF+SILGLAAPVFITLMSKIVFYTLLIYHATSIGT+TMAAHQVMSQTFYMCSVLG
Subjt:  ---------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLG

Query:  EPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKF
        EPLSQTAQSFMPGFI+GVNRSLDKARMLLKSLLIIG IFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKF
Subjt:  EPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKF

Query:  ISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKL
        ISLSMCGCLSF ALLLL                           V++RGYGL GCWCALVGFQWARF NALRRVLSPNGVLYSSD+SHY++
Subjt:  ISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKL

A0A1S3CEY5 Protein DETOXIFICATION2.7e-21071.02Show/hide
Query:  MAELSLSLASFTSQLPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSS-SSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQID
        MA+LSLSL  F+   PKMPFK LHSPS SS  PQIH  KF NPL HSRPSF FTPTI FP+S SSP  + V SPITRRF++PHDDHEREVS++EI  + +
Subjt:  MAELSLSLASFTSQLPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSS-SSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQID

Query:  DGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
        +GVQGNEQLL TGI +L SQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSA+ELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
Subjt:  DGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN

Query:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------
        EVQHHISVLLFVGLMSG LMLLVTKLLG +ALT                                                                   
Subjt:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------

Query:  --------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE
                            AYMMIEQLNKKGYSGYSL VPSP EF+SILGLAAPVFITLMSKIVFYTLLIYHATS+GTYTMAAHQVMSQTFYMCSVLGE
Subjt:  --------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE

Query:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
        PLSQTAQSFMPGFI+GVNRSLDKARMLLKSLLIIG IFGLVLG IGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
Subjt:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI

Query:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKL
        SLSMCGCLSF ALLLL                          +V+NRGYGL GCWCALVGFQWARF +ALRRVLSPNGVLYSSD+SHY++
Subjt:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKL

A0A5C7I673 Protein DETOXIFICATION6.0e-24250.73Show/hide
Query:  VPHDDHEREVSNLEIEDQIDDGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFM
        + +D     +S  + E++ ++   G E       + L +Q + NQMKEIV FT PA GLWICGPLMSLIDTAVIGQGS+IELAALGPATV+CDY +YVFM
Subjt:  VPHDDHEREVSNLEIEDQIDDGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFM

Query:  FLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-----------------------------------------------
        FLSIATSNMVAT+LAKQDKNEVQH ISVLLFVGL  G LM L T+  G  ALT                                               
Subjt:  FLSIATSNMVATALAKQDKNEVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-----------------------------------------------

Query:  -------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGF
                +MMI+ LNKKGY+ ++  VPSPSE  +I GLA PVF+T+M+KI FY+L+IY ATS+GT T+AAHQVM Q F MC+V GEPLSQ AQSFMP  
Subjt:  -------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGF

Query:  INGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAAL
        I G NRSL KARMLLKSL IIGA  GLVLG IGT VPWLFP++FTP+  +IQEMHKVL+PYFLAL + P+T  LEGTLLAGRDLKFISLSM GC S  AL
Subjt:  INGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAAL

Query:  LLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLNFAPFNSQAPKMPFKILHYHSSS
        +LL                          +VS+RGYGL GCW AL GFQWARF  +L RVLS +G+L+S DL+ Y                         
Subjt:  LLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLNFAPFNSQAPKMPFKILHYHSSS

Query:  SPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLPLHFSSQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFT
            +    LR F P      H  L     S  + S  +PL  S                E E E  VEV+          E+L +Q + +QMKEI+ FT
Subjt:  SPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLPLHFSSQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFT

Query:  GPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALT
        GPA  LWICGPLMSLIDTAV+GQGS++ELAALGP TV CD  SY+FMFLSIATSNM        DKN VQH IS+LLF+GL+ G LMLL TK  G  ALT
Subjt:  GPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALT

Query:  AFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGY
           G KN  ++PA+NTY+                                                                 QVI+AYMMI+ LNKKGY
Subjt:  AFVGIKNADIIPASNTYI-----------------------------------------------------------------QVISAYMMIEALNKKGY

Query:  NGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLL
        N +++SVPS  + L I  +AAP+F+ M+SKV FY+L+IYFATSMGT T+AAHQ     V+IQTF M TV GEPLSQTAQSFMP  + G NR+L KA MLL
Subjt:  NGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQ-----VMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLL

Query:  KSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYA
        KSL+IIG I GL LG +GTSVPWLFPN+FTP+  IIQEMHKVL+P+F+ L +TP    LEGTL                     L++SSRGYGLTG WYA
Subjt:  KSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTL---------------------LVISSRGYGLTGCWYA

Query:  LVGFQWARFLSALRRILSPHGILYS
        LVGF WARF  AL+R+LSP GIL+S
Subjt:  LVGFQWARFLSALRRILSPHGILYS

A0A5D3BWW2 Protein DETOXIFICATION2.7e-21071.02Show/hide
Query:  MAELSLSLASFTSQLPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSS-SSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQID
        MA+LSLSL  F+   PKMPFK LHSPS SS  PQIH  KF NPL HSRPSF FTPTI FP+S SSP  + V SPITRRF++PHDDHEREVS++EI  + +
Subjt:  MAELSLSLASFTSQLPKMPFKILHSPS-SSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSS-SSPSSIVVCSPITRRFAVPHDDHEREVSNLEIEDQID

Query:  DGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
        +GVQGNEQLL TGI +L SQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSA+ELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN
Subjt:  DGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKN

Query:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------
        EVQHHISVLLFVGLMSG LMLLVTKLLG +ALT                                                                   
Subjt:  EVQHHISVLLFVGLMSGFLMLLVTKLLGLVALT-------------------------------------------------------------------

Query:  --------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE
                            AYMMIEQLNKKGYSGYSL VPSP EF+SILGLAAPVFITLMSKIVFYTLLIYHATS+GTYTMAAHQVMSQTFYMCSVLGE
Subjt:  --------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE

Query:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
        PLSQTAQSFMPGFI+GVNRSLDKARMLLKSLLIIG IFGLVLG IGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI
Subjt:  PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFI

Query:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKL
        SLSMCGCLSF ALLLL                          +V+NRGYGL GCWCALVGFQWARF +ALRRVLSPNGVLYSSD+SHY++
Subjt:  SLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKL

A0A7J6FFZ4 Protein DETOXIFICATION3.1e-23050.1Show/hide
Query:  PFKILHSPSSSITPQI-HNHKFLNPLSHSRPSFPFTPTIRF--PSSSSPSSIVVCSPITRRFAVPHDDHE----REVSNLE--IEDQIDDGVQGNEQLLG
        P+ +LH+P S    Q+  +  F N  S+S  S P     RF   +S+SP S+       RRF +    +     RE+S+ E    D+   GV   E    
Subjt:  PFKILHSPSSSITPQI-HNHKFLNPLSHSRPSFPFTPTIRF--PSSSSPSSIVVCSPITRRFAVPHDDHE----REVSNLE--IEDQIDDGVQGNEQLLG

Query:  TGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF
        TG   LG+QG+ +QMKEI  FTGPA GLWICGPLMSLIDTAVIGQ S++ELAALGPATVLCD  SY+FMFLSIATSNMVATALA++DK EVQHHISVLLF
Subjt:  TGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISVLLF

Query:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS
        VGL  GF+MLL T+  G  ALT                      AYMMIE LNKKGY+ YS+ +PSP + ++I  LAAPVFITLM+KI FYTLL+Y ATS
Subjt:  VGLMSGFLMLLVTKLLGLVALT----------------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATS

Query:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL
        +GT T AAHQVM Q  +MC+V GEPLSQTAQSFMP  I+GV R+L+KAR LLKSL+++G +FG+VLGT GT +PWLFPN++TP+  II+EMH VLIPYFL
Subjt:  IGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFL

Query:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP
         +L  P T  LEGTLLAGRDLKFIS++M GC    A+LL                          ++VS +G+GL G W  L     +R  N  +  +  
Subjt:  ALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSP

Query:  N---GVLYSSDLS-----HYKLNFAPF--NS-------QAPKMPFK-ILHYHSSSSPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLP-LHFS
        N    ++Y +  S     H  L       NS        +P  P K ++  H S        N LRP S  S  F +   ++P     +++  LP L F 
Subjt:  N---GVLYSSDLS-----HYKLNFAPF--NS-------QAPKMPFK-ILHYHSSSSPRIRYPNILRPFSPPSFPFTHQSLSSPAISFPSSSSPLP-LHFS

Query:  SQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSY
        S+ R R +    A+  + ++  G    EN  + G        +G+  Q+KEI  FTGPA+GLWICGPLMSLIDT VIGQGS++ELAALGP TV CDY SY
Subjt:  SQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSY

Query:  VFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPS
        VFMFLSIATSNM        DKNEVQHHISVLLF+GL  GFLM   T+  G  +LTAF G  N  I+PA+NTY+QV++ YMM+E LNKKGYN Y+LS+PS
Subjt:  VFMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGYNGYSLSVPS

Query:  SGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVL
          E ++IL LAAPVF+TM SKV FYSLLIYFATSMGT +MAAHQVMIQTFCMCTVW                                            
Subjt:  SGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVL

Query:  GTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLS
        G+IGT++PWLFPN+FTP+  +IQEMHKVLIPY LA+V TP  H LEGTLL                        +IS++G+GL  CWY L  FQWARF  
Subjt:  GTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLL------------------------VISSRGYGLTGCWYALVGFQWARFLS

Query:  ALRRILSPHGILYSNE----NKEIHKAV
        ALRR+LSP+G+L+S +    N E  KAV
Subjt:  ALRRILSPHGILYSNE----NKEIHKAV

SwissProt top hitse value%identityAlignment
Q8W4G3 Protein DETOXIFICATION 46, chloroplastic5.6e-12046.72Show/hide
Query:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDD--HEREVSNLEIEDQIDDGVQGN-------EQ
        PK+PF       SS+T +  N           PSF   P+ R  + S P S +  +   R  A  + +   + E  N  I +   D   G+        +
Subjt:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDD--HEREVSNLEIEDQIDDGVQGN-------EQ

Query:  LLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV
        +    +D+L +Q +  QMKEIV FTGPA GLW+CGPLMSLIDTAVIGQGS++ELAALGPATV+CDY  Y FMFLS+ATSN+VAT+LA+QDK+EVQH IS+
Subjt:  LLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV

Query:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------
        LLF+GL  G  M+++T+L G  ALT                                                                           
Subjt:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------

Query:  ------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQS
                    AYMM++ LNKKGYS +S CVPSPSE ++I GLAAPVFIT+MSK++FYTLL+Y ATS+GT  +AAHQVM Q + M +V GEPLSQTAQS
Subjt:  ------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQS

Query:  FMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCL
        FMP  + G+NR+L KAR+LLKSL+IIGA  G+V+GTIGT VPWLFP +FT +  +  EMHKV+IPYFLAL I P+T  LEGTLLAGRDL++ISLSM GCL
Subjt:  FMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCL

Query:  SFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHY
        + A LLL+++                          SN G+GL GCW ALVGFQWARF  +L R+LS +GVLYS D S Y
Subjt:  SFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHY

Q945F0 Protein DETOXIFICATION 47, chloroplastic6.6e-11349.9Show/hide
Query:  IRRRFAVPH-EASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV
        IRRR  +     + + I+ EI  E +E E+  G    +L  Q +  QMKEIV FTGPA+G+WICGPLMSLIDT VIGQGS++ELAALGP TVLCD+ SYV
Subjt:  IRRRFAVPH-EASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV

Query:  FMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYI---------------------------
        FMFLS+ATSNM        DK E QH ISVLLFIGL+ G +MLLLT+L GP A+TAF   KN +I+PA+N YI                           
Subjt:  FMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYI---------------------------

Query:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG
                                              Q++SAYMM+++LNK+GYN YS ++PS  E   I  LAAPVF+++ SK+ FYS +IY ATSMG
Subjt:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG

Query:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL
        TH +AAHQVM QT+ MC VWGEPLSQTAQSFMP ++ G NR+L KA  LLKSL+IIGA  GLVLG IGT+VP LFP ++T +  II EMH++LIP+F+AL
Subjt:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL

Query:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN
           P T  LEGTLL                         ++  GYGL GCW+ LVGFQW RF   LRR+LSP GIL S+
Subjt:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN

Arabidopsis top hitse value%identityAlignment
AT2G21340.1 MATE efflux family protein4.0e-12146.72Show/hide
Query:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDD--HEREVSNLEIEDQIDDGVQGN-------EQ
        PK+PF       SS+T +  N           PSF   P+ R  + S P S +  +   R  A  + +   + E  N  I +   D   G+        +
Subjt:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDD--HEREVSNLEIEDQIDDGVQGN-------EQ

Query:  LLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV
        +    +D+L +Q +  QMKEIV FTGPA GLW+CGPLMSLIDTAVIGQGS++ELAALGPATV+CDY  Y FMFLS+ATSN+VAT+LA+QDK+EVQH IS+
Subjt:  LLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV

Query:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------
        LLF+GL  G  M+++T+L G  ALT                                                                           
Subjt:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------

Query:  ------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQS
                    AYMM++ LNKKGYS +S CVPSPSE ++I GLAAPVFIT+MSK++FYTLL+Y ATS+GT  +AAHQVM Q + M +V GEPLSQTAQS
Subjt:  ------------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQS

Query:  FMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCL
        FMP  + G+NR+L KAR+LLKSL+IIGA  G+V+GTIGT VPWLFP +FT +  +  EMHKV+IPYFLAL I P+T  LEGTLLAGRDL++ISLSM GCL
Subjt:  FMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCL

Query:  SFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHY
        + A LLL+++                          SN G+GL GCW ALVGFQWARF  +L R+LS +GVLYS D S Y
Subjt:  SFAALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHY

AT2G21340.2 MATE efflux family protein1.8e-12146.97Show/hide
Query:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDD--HEREVSNLEIEDQIDDGVQGN-------EQ
        PK+PF       SS+T +  N           PSF   P+ R  + S P S +  +   R  A  + +   + E  N  I +   D   G+        +
Subjt:  PKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDD--HEREVSNLEIEDQIDDGVQGN-------EQ

Query:  LLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV
        +    +D+L +Q +  QMKEIV FTGPA GLW+CGPLMSLIDTAVIGQGS++ELAALGPATV+CDY  Y FMFLS+ATSN+VAT+LA+QDK+EVQH IS+
Subjt:  LLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQHHISV

Query:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------
        LLF+GL  G  M+++T+L G  ALT                                                                           
Subjt:  LLFVGLMSGFLMLLVTKLLGLVALT---------------------------------------------------------------------------

Query:  ---------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMP
                 AYMM++ LNKKGYS +S CVPSPSE ++I GLAAPVFIT+MSK++FYTLL+Y ATS+GT  +AAHQVM Q + M +V GEPLSQTAQSFMP
Subjt:  ---------AYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGEPLSQTAQSFMP

Query:  GFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFA
          + G+NR+L KAR+LLKSL+IIGA  G+V+GTIGT VPWLFP +FT +  +  EMHKV+IPYFLAL I P+T  LEGTLLAGRDL++ISLSM GCL+ A
Subjt:  GFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSFA

Query:  ALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHY
         LLL+++                          SN G+GL GCW ALVGFQWARF  +L R+LS +GVLYS D S Y
Subjt:  ALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHY

AT4G39030.1 MATE efflux family protein4.7e-11449.9Show/hide
Query:  IRRRFAVPH-EASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV
        IRRR  +     + + I+ EI  E +E E+  G    +L  Q +  QMKEIV FTGPA+G+WICGPLMSLIDT VIGQGS++ELAALGP TVLCD+ SYV
Subjt:  IRRRFAVPH-EASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAVELAALGPATVLCDYTSYV

Query:  FMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYI---------------------------
        FMFLS+ATSNM        DK E QH ISVLLFIGL+ G +MLLLT+L GP A+TAF   KN +I+PA+N YI                           
Subjt:  FMFLSIATSNM--------DKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYI---------------------------

Query:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG
                                              Q++SAYMM+++LNK+GYN YS ++PS  E   I  LAAPVF+++ SK+ FYS +IY ATSMG
Subjt:  --------------------------------------QVISAYMMIEALNKKGYNGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMG

Query:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL
        TH +AAHQVM QT+ MC VWGEPLSQTAQSFMP ++ G NR+L KA  LLKSL+IIGA  GLVLG IGT+VP LFP ++T +  II EMH++LIP+F+AL
Subjt:  THTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVLGTIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLAL

Query:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN
           P T  LEGTLL                         ++  GYGL GCW+ LVGFQW RF   LRR+LSP GIL S+
Subjt:  VITPPTHCLEGTLLV------------------------ISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACGTGGCTTCACTTCATCTTCGTTCTTCTTCACTCCTTACTCACTACACTCCCATGGCGGAGCTTTCGCTCTCTCTCGCTTCCTTCACTTCTCAGCTGCCAAAAAT
GCCTTTCAAAATCCTCCATTCTCCTTCTTCTTCAATCACTCCTCAAATCCATAACCACAAATTTCTCAATCCTCTCTCTCACTCTCGCCCTTCCTTTCCCTTCACCCCCA
CCATTCGCTTTCCTTCTTCTTCCTCCCCTTCATCCATCGTTGTATGCTCGCCGATTACCCGCCGTTTCGCGGTTCCTCACGATGATCATGAGCGGGAAGTCAGTAACCTT
GAGATTGAGGACCAAATTGACGATGGAGTACAGGGAAATGAGCAGTTATTGGGCACTGGAATAGATGAATTGGGGAGCCAAGGGTTGTTGAATCAGATGAAGGAGATTGT
AACGTTTACCGGACCTGCCATTGGGTTGTGGATTTGTGGACCATTGATGAGTCTCATTGACACTGCGGTTATAGGCCAGGGGAGCGCCATTGAGCTTGCTGCTTTAGGCC
CAGCGACAGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTCAGTATCGCAACTTCAAATATGGTAGCCACGGCCCTTGCCAAACAGGATAAAAACGAAGTGCAA
CATCACATATCTGTATTGCTATTTGTTGGGCTGATGTCTGGTTTCTTGATGCTCTTAGTTACCAAACTATTGGGTTTAGTGGCGCTAACTGCTTATATGATGATAGAGCA
ACTAAACAAGAAAGGATATAGTGGATATTCTCTCTGTGTTCCCTCGCCTAGTGAATTTGTGTCAATACTTGGACTTGCTGCTCCTGTATTTATTACACTGATGTCAAAGA
TAGTTTTTTATACTCTCCTCATCTATCATGCTACGTCTATAGGCACATACACCATGGCTGCTCATCAGGTCATGAGTCAAACATTCTACATGTGTAGCGTATTGGGTGAA
CCGCTTTCTCAAACTGCCCAATCATTTATGCCTGGATTCATAAATGGAGTGAATCGTAGTCTGGATAAGGCTCGGATGCTACTCAAGTCACTCTTGATCATAGGAGCTAT
CTTTGGTTTGGTATTAGGGACTATCGGAACGTTAGTTCCTTGGTTGTTCCCCAATCTCTTTACACCTGAAGTGAAGATTATTCAAGAGATGCATAAAGTGTTGATTCCAT
ATTTCTTGGCCTTACTCATAATGCCTGCAACGCTTTGCTTGGAAGGGACGTTATTGGCTGGACGAGACCTAAAATTTATTAGTTTATCAATGTGCGGATGCCTTTCTTTT
GCTGCCCTTCTATTGCTGGTAATGTTTTATCTCCCAACCTTCCTCCAACCTCATTTCTTCTCCTCTTTGTGCATTAACTTCATGGCTTGCTCCCAGGTTGTAAGCAATAG
GGGTTATGGTTTGGTGGGCTGCTGGTGCGCGCTCGTCGGATTTCAATGGGCTCGGTTTGTTAACGCTCTTCGACGTGTCCTCTCTCCCAATGGAGTGCTTTACTCCAGTG
ATTTAAGCCATTATAAACTCAACTTCGCTCCTTTCAATTCTCAGGCGCCAAAAATGCCTTTCAAAATCCTCCATTATCATTCTTCATCCTCTCCTCGAATCCGTTACCCC
AATATTCTCAGACCATTCTCTCCCCCTTCCTTTCCCTTCACCCATCAATCTCTCTCCTCCCCCGCCATTTCCTTTCCTTCCTCTTCCTCCCCTTTACCTCTCCATTTCTC
GTCGCAGATTCGCCGCCGATTCGCGGTTCCTCACGAAGCCAGCAGCCTCGAGATTGAGAGCGAAATTGGCGTTGAAGTACAAGAAAATGAACAATTATCCGGAACTGGAG
GCGAAGAATTGGGGAGCCAAGGGTTGAGTAGTCAGATGAAGGAGATTGTAACGTTTACTGGACCTGCCATTGGATTGTGGATTTGTGGACCATTGATGAGTCTCATTGAC
ACTGCGGTTATTGGCCAGGGAAGCGCTGTTGAGCTTGCCGCTTTAGGCCCAGCGACTGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTTAGTATTGCAACTTC
AAACATGGATAAAAACGAAGTGCAGCATCACATATCTGTATTGCTGTTTATCGGGTTGATGTCTGGCTTCTTGATGCTCTTACTTACCAAACTATTGGGTCCAGTGGCGC
TAACTGCTTTTGTCGGAATAAAGAATGCAGACATCATACCTGCATCAAACACGTATATCCAGGTTATTTCTGCTTATATGATGATAGAAGCACTGAATAAGAAAGGATAC
AATGGATATTCTCTATCGGTTCCCTCGTCTGGTGAATTTTTGTCAATACTTGGACTGGCTGCTCCTGTATTTCTAACAATGATGTCAAAGGTGGTTTTTTATTCTCTCCT
CATTTATTTTGCTACATCTATGGGCACACATACCATGGCTGCTCATCAGGTCATGATTCAAACATTTTGTATGTGTACCGTTTGGGGTGAACCTCTTTCTCAAACTGCTC
AGTCATTTATGCCTGGATTGATAAACGGAGTGAATCGAAGTTTGGATAAGGCTTGGATGTTGCTAAAGTCACTCTTGATTATAGGAGCTATATTTGGCTTGGTATTAGGG
ACTATTGGAACTTCAGTTCCTTGGTTGTTCCCCAATCTCTTCACACCCGAAGGGAAGATTATTCAGGAGATGCATAAAGTGTTGATTCCATATTTCTTGGCACTAGTCAT
AACACCCCCGACTCATTGCTTAGAAGGGACGTTATTGGTTATTAGCAGTAGGGGTTATGGTTTGACAGGCTGCTGGTATGCCTTGGTTGGATTTCAATGGGCTCGGTTTC
TCAGCGCTCTTCGACGCATCCTCTCGCCTCATGGAATACTTTACAGCAATGAAAACAAGGAAATCCACAAAGCAGTGTTCTCAAAGGAAGATGAAATCAACAGCATAGCA
AGGACTTGTAAATTCTTTAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACGTGGCTTCACTTCATCTTCGTTCTTCTTCACTCCTTACTCACTACACTCCCATGGCGGAGCTTTCGCTCTCTCTCGCTTCCTTCACTTCTCAGCTGCCAAAAAT
GCCTTTCAAAATCCTCCATTCTCCTTCTTCTTCAATCACTCCTCAAATCCATAACCACAAATTTCTCAATCCTCTCTCTCACTCTCGCCCTTCCTTTCCCTTCACCCCCA
CCATTCGCTTTCCTTCTTCTTCCTCCCCTTCATCCATCGTTGTATGCTCGCCGATTACCCGCCGTTTCGCGGTTCCTCACGATGATCATGAGCGGGAAGTCAGTAACCTT
GAGATTGAGGACCAAATTGACGATGGAGTACAGGGAAATGAGCAGTTATTGGGCACTGGAATAGATGAATTGGGGAGCCAAGGGTTGTTGAATCAGATGAAGGAGATTGT
AACGTTTACCGGACCTGCCATTGGGTTGTGGATTTGTGGACCATTGATGAGTCTCATTGACACTGCGGTTATAGGCCAGGGGAGCGCCATTGAGCTTGCTGCTTTAGGCC
CAGCGACAGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTCAGTATCGCAACTTCAAATATGGTAGCCACGGCCCTTGCCAAACAGGATAAAAACGAAGTGCAA
CATCACATATCTGTATTGCTATTTGTTGGGCTGATGTCTGGTTTCTTGATGCTCTTAGTTACCAAACTATTGGGTTTAGTGGCGCTAACTGCTTATATGATGATAGAGCA
ACTAAACAAGAAAGGATATAGTGGATATTCTCTCTGTGTTCCCTCGCCTAGTGAATTTGTGTCAATACTTGGACTTGCTGCTCCTGTATTTATTACACTGATGTCAAAGA
TAGTTTTTTATACTCTCCTCATCTATCATGCTACGTCTATAGGCACATACACCATGGCTGCTCATCAGGTCATGAGTCAAACATTCTACATGTGTAGCGTATTGGGTGAA
CCGCTTTCTCAAACTGCCCAATCATTTATGCCTGGATTCATAAATGGAGTGAATCGTAGTCTGGATAAGGCTCGGATGCTACTCAAGTCACTCTTGATCATAGGAGCTAT
CTTTGGTTTGGTATTAGGGACTATCGGAACGTTAGTTCCTTGGTTGTTCCCCAATCTCTTTACACCTGAAGTGAAGATTATTCAAGAGATGCATAAAGTGTTGATTCCAT
ATTTCTTGGCCTTACTCATAATGCCTGCAACGCTTTGCTTGGAAGGGACGTTATTGGCTGGACGAGACCTAAAATTTATTAGTTTATCAATGTGCGGATGCCTTTCTTTT
GCTGCCCTTCTATTGCTGGTAATGTTTTATCTCCCAACCTTCCTCCAACCTCATTTCTTCTCCTCTTTGTGCATTAACTTCATGGCTTGCTCCCAGGTTGTAAGCAATAG
GGGTTATGGTTTGGTGGGCTGCTGGTGCGCGCTCGTCGGATTTCAATGGGCTCGGTTTGTTAACGCTCTTCGACGTGTCCTCTCTCCCAATGGAGTGCTTTACTCCAGTG
ATTTAAGCCATTATAAACTCAACTTCGCTCCTTTCAATTCTCAGGCGCCAAAAATGCCTTTCAAAATCCTCCATTATCATTCTTCATCCTCTCCTCGAATCCGTTACCCC
AATATTCTCAGACCATTCTCTCCCCCTTCCTTTCCCTTCACCCATCAATCTCTCTCCTCCCCCGCCATTTCCTTTCCTTCCTCTTCCTCCCCTTTACCTCTCCATTTCTC
GTCGCAGATTCGCCGCCGATTCGCGGTTCCTCACGAAGCCAGCAGCCTCGAGATTGAGAGCGAAATTGGCGTTGAAGTACAAGAAAATGAACAATTATCCGGAACTGGAG
GCGAAGAATTGGGGAGCCAAGGGTTGAGTAGTCAGATGAAGGAGATTGTAACGTTTACTGGACCTGCCATTGGATTGTGGATTTGTGGACCATTGATGAGTCTCATTGAC
ACTGCGGTTATTGGCCAGGGAAGCGCTGTTGAGCTTGCCGCTTTAGGCCCAGCGACTGTTTTATGTGATTATACGAGCTATGTGTTCATGTTTCTTAGTATTGCAACTTC
AAACATGGATAAAAACGAAGTGCAGCATCACATATCTGTATTGCTGTTTATCGGGTTGATGTCTGGCTTCTTGATGCTCTTACTTACCAAACTATTGGGTCCAGTGGCGC
TAACTGCTTTTGTCGGAATAAAGAATGCAGACATCATACCTGCATCAAACACGTATATCCAGGTTATTTCTGCTTATATGATGATAGAAGCACTGAATAAGAAAGGATAC
AATGGATATTCTCTATCGGTTCCCTCGTCTGGTGAATTTTTGTCAATACTTGGACTGGCTGCTCCTGTATTTCTAACAATGATGTCAAAGGTGGTTTTTTATTCTCTCCT
CATTTATTTTGCTACATCTATGGGCACACATACCATGGCTGCTCATCAGGTCATGATTCAAACATTTTGTATGTGTACCGTTTGGGGTGAACCTCTTTCTCAAACTGCTC
AGTCATTTATGCCTGGATTGATAAACGGAGTGAATCGAAGTTTGGATAAGGCTTGGATGTTGCTAAAGTCACTCTTGATTATAGGAGCTATATTTGGCTTGGTATTAGGG
ACTATTGGAACTTCAGTTCCTTGGTTGTTCCCCAATCTCTTCACACCCGAAGGGAAGATTATTCAGGAGATGCATAAAGTGTTGATTCCATATTTCTTGGCACTAGTCAT
AACACCCCCGACTCATTGCTTAGAAGGGACGTTATTGGTTATTAGCAGTAGGGGTTATGGTTTGACAGGCTGCTGGTATGCCTTGGTTGGATTTCAATGGGCTCGGTTTC
TCAGCGCTCTTCGACGCATCCTCTCGCCTCATGGAATACTTTACAGCAATGAAAACAAGGAAATCCACAAAGCAGTGTTCTCAAAGGAAGATGAAATCAACAGCATAGCA
AGGACTTGTAAATTCTTTAAGTGA
Protein sequenceShow/hide protein sequence
MHVASLHLRSSSLLTHYTPMAELSLSLASFTSQLPKMPFKILHSPSSSITPQIHNHKFLNPLSHSRPSFPFTPTIRFPSSSSPSSIVVCSPITRRFAVPHDDHEREVSNL
EIEDQIDDGVQGNEQLLGTGIDELGSQGLLNQMKEIVTFTGPAIGLWICGPLMSLIDTAVIGQGSAIELAALGPATVLCDYTSYVFMFLSIATSNMVATALAKQDKNEVQ
HHISVLLFVGLMSGFLMLLVTKLLGLVALTAYMMIEQLNKKGYSGYSLCVPSPSEFVSILGLAAPVFITLMSKIVFYTLLIYHATSIGTYTMAAHQVMSQTFYMCSVLGE
PLSQTAQSFMPGFINGVNRSLDKARMLLKSLLIIGAIFGLVLGTIGTLVPWLFPNLFTPEVKIIQEMHKVLIPYFLALLIMPATLCLEGTLLAGRDLKFISLSMCGCLSF
AALLLLVMFYLPTFLQPHFFSSLCINFMACSQVVSNRGYGLVGCWCALVGFQWARFVNALRRVLSPNGVLYSSDLSHYKLNFAPFNSQAPKMPFKILHYHSSSSPRIRYP
NILRPFSPPSFPFTHQSLSSPAISFPSSSSPLPLHFSSQIRRRFAVPHEASSLEIESEIGVEVQENEQLSGTGGEELGSQGLSSQMKEIVTFTGPAIGLWICGPLMSLID
TAVIGQGSAVELAALGPATVLCDYTSYVFMFLSIATSNMDKNEVQHHISVLLFIGLMSGFLMLLLTKLLGPVALTAFVGIKNADIIPASNTYIQVISAYMMIEALNKKGY
NGYSLSVPSSGEFLSILGLAAPVFLTMMSKVVFYSLLIYFATSMGTHTMAAHQVMIQTFCMCTVWGEPLSQTAQSFMPGLINGVNRSLDKAWMLLKSLLIIGAIFGLVLG
TIGTSVPWLFPNLFTPEGKIIQEMHKVLIPYFLALVITPPTHCLEGTLLVISSRGYGLTGCWYALVGFQWARFLSALRRILSPHGILYSNENKEIHKAVFSKEDEINSIA
RTCKFFK