; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G014590 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G014590
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionHXXXD-type acyl-transferase family protein
Genome locationCG_Chr11:27814511..27836663
RNA-Seq ExpressionClCG11G014590
SyntenyClCG11G014590
Gene Ontology termsGO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR001005 - SANT/Myb domain
IPR003480 - Transferase
IPR009057 - Homeobox-like domain superfamily
IPR017930 - Myb domain
IPR023213 - Chloramphenicol acetyltransferase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8528364.1 hypothetical protein F0562_035719 [Nyssa sinensis]4.1e-21945Show/hide
Query:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE
        + ++K SSVVPA+ TG+DKV ELT +DL MKLHYIRG+Y F+ +E V+ L IYDLKKP+F  L+ YY  SGR+R+   +  RPFIKCNDSGVRIVEA C 
Subjt:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE

Query:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRST
        K++DEWL++     K    D  L++ Q +GPDLGFSPL FIQ T FKCGG SVGLSW HI+GD+FSASTFIN WG I+  +   Q    P    + F  +
Subjt:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRST

Query:  RLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILS-VVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGM
            P  +K +DP GD W+ +++CKM T S  +T +QLD ILS   G  +A     F+  +A+ WKSL+K+R    + R ++I +    + E EI  NG+
Subjt:  RLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILS-VVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGM

Query:  EMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPR-D
         +S VEADFPV  A   EL  LI +K++DE   IEE+VE+G  +SDFI YGA LTFV+LEEANIYG EL+GQKPV  NY I G+G+ GVVLVLP P    
Subjt:  EMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPR-D

Query:  DGRDGGRTVTVILPEKQLPDLIDELQKQWEIHFCACVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTI-----SDKRFQPHRRQSPALHLFLCDFI
         G   GRT+TV LP  Q+ +L DEL+K+W+        I ++    TV   S  L  P +         Q++     S+K      ++S  L       +
Subjt:  DGRDGGRTVTVILPEKQLPDLIDELQKQWEIHFCACVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTI-----SDKRFQPHRRQSPALHLFLCDFI

Query:  TSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDF
          +TM+E+ +KQ+ G+I+ E+ STLL+RYS  T+ T+L+EVAQV  V IDW+ LV+ TS+GISN REYQ+LWRHLAYR+TL E +D   +PLD DSDL++
Subjt:  TSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDF

Query:  EIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGA
        E+E FP+VSSE+S EAAA VKVLIA+G P++S +P  + VEAPLTI I N Q+S    +N Q A  +QG ++TIP+ +Q+ P+P   A E  D NG+   
Subjt:  EIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGA

Query:  NAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHG
            +K+RK WS AED+EL+AAV+KCGEGNWA ILKGDFKGDRTASQLSQ   +                                          KR  
Subjt:  NAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHG

Query:  NLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNIN----------SSIVSPASG-----AEALVQMQNQSPQISMPSRPLLVEPLPSAVKSG
        NLNVG+   S   +AQ+ AA RA+S AL++P+ ++ TA+ +I+          S+   P +G       ++ Q+Q+QSPQ S+P+    +    S+ KS 
Subjt:  NLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNIN----------SSIVSPASG-----AEALVQMQNQSPQISMPSRPLLVEPLPSAVKSG

Query:  INTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNA----PIHLDGRPSVHYISPGKTPTPGSSHVGG
        + + K S   KS  +  S+V+A AVAAGA I +  DAASLLK  Q K AI I     S  +  VAGNA      H D  P+VHYI  G   TP SS+   
Subjt:  INTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNA----PIHLDGRPSVHYISPGKTPTPGSSHVGG

Query:  KST--------MGCNNSVKAVSPKVLHNRSTAILT-NPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLA
         S          G  NSV+   P V  N S A  T N  S+  +  T  P K EV  +EER +       KE+ +E+ +A
Subjt:  KST--------MGCNNSVKAVSPKVLHNRSTAILT-NPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLA

XP_008466161.1 PREDICTED: uncharacterized protein LOC103503656 isoform X2 [Cucumis melo]2.3e-21476.26Show/hide
Query:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+ REYQLLWRHLAYRHTLLE+M SVTD LDYDSDLDFE+EP
Subjt:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS
        FPSV SESS EAAACVKVLIANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP+  A EVFDVNGAAGA+AAS
Subjt:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS

Query:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV
        +KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRTASQLSQ   V                                          KR  NLN+
Subjt:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV

Query:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDS
        GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV S AS +E+ VQMQNQSPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSDS
Subjt:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDS

Query:  IVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRS
        IVRATAVAAGARIVS SDAASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GKSTM  NNS+KAVSPK+LH+RS
Subjt:  IVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRS

Query:  TAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG
         AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITAKEEFRENS  NDVKIRG
Subjt:  TAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG

XP_038898516.1 protein ECERIFERUM 2 [Benincasa hispida]7.7e-21888.76Show/hide
Query:  DGGKNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVR
        DG KN+LISELKFSSVVPAKATGD+KV+ELTAIDLAMKLHYIRGVY FR SEEVRNLTIYDLKKPLF LLE YYVVSGRIRRRIED DRPFIKCNDSGVR
Subjt:  DGGKNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVR

Query:  IVEANCEKSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGL
        IVEA+CEK+I+EWLSI  GDDK  HRD CLVH+QAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTH+LGDIFSASTFIN WG IMNNRPA  L PAPA L
Subjt:  IVEANCEKSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGL

Query:  IWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEI
          P RSTRLSTPPVKRLDPTGDLWIGSSDCKMAT SFRIT EQLDRIL V GRNRAVNFSTFEAIAAIFWK LSKIRLEDS+SRTISIYSTK PNREGEI
Subjt:  IWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEI

Query:  PRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPG
        PRNGMEMSGVEA+FPVAGAAEGELAE+IVKKKIDEGGEI  +VEK ++ESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPG
Subjt:  PRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPG

Query:  PPRDDGRD-GGRTVTVILPEKQLPDLIDELQKQWEI
        PP  DGRD GG TVTVILPEK+LPDLIDELQKQW I
Subjt:  PPRDDGRD-GGRTVTVILPEKQLPDLIDELQKQWEI

XP_038898738.1 uncharacterized protein LOC120086263 isoform X1 [Benincasa hispida]6.1e-22380.8Show/hide
Query:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MIERKE  RTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDW  LVKNTSTGISNVREYQLLWRHLAYRHT LENMDSVTDPLDYDSDLDFEIEP
Subjt:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS
        FPSVSSESS EAAACVKVLIANGIPSESDVP+SSAVEAPLTIGISNSQ+ST NL+N Q ACLMQGMSVTIPLS+QRQPIPM +ATEV DVNGAA ANAAS
Subjt:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS

Query:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV
        +KRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQ   V                                          KR GNLNV
Subjt:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV

Query:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDS
        GAST+STTQKAQIDAAHRALSFALDLPVNNSKT ANSNINSS+VS ASGAEA VQMQNQSPQISMPSRPLLVEPLPS+VKSGI TSKNSL+MKSTHNSDS
Subjt:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDS

Query:  IVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRS
        IVRATAVAAGARIVS SDAASLLK AQ K AIHIKSKC           +P+HLD RPSVHYIS GKTPTPGS+ V GKSTM  NNSVKAVSPKV HNRS
Subjt:  IVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRS

Query:  TAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENS
        TAI TNPPSD+VSPTTESPLKQ+VNSSEERKI + IITAKEEFRE +
Subjt:  TAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENS

XP_038898739.1 uncharacterized protein LOC120086263 isoform X2 [Benincasa hispida]2.5e-22480.95Show/hide
Query:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MIERKE  RTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDW  LVKNTSTGISNVREYQLLWRHLAYRHT LENMDSVTDPLDYDSDLDFEIEP
Subjt:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS
        FPSVSSESS EAAACVKVLIANGIPSESDVP+SSAVEAPLTIGISNSQ+ST NL+N Q ACLMQGMSVTIPLS+QRQPIPM +ATEV DVNGAA ANAAS
Subjt:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS

Query:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV
        +KRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQ   V                                          KR GNLNV
Subjt:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV

Query:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSI
        GAST+STTQKAQIDAAHRALSFALDLPVNNSKTANSNINSS+VS ASGAEA VQMQNQSPQISMPSRPLLVEPLPS+VKSGI TSKNSL+MKSTHNSDSI
Subjt:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDSI

Query:  VRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRST
        VRATAVAAGARIVS SDAASLLK AQ K AIHIKSKC           +P+HLD RPSVHYIS GKTPTPGS+ V GKSTM  NNSVKAVSPKV HNRST
Subjt:  VRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRST

Query:  AILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENS
        AI TNPPSD+VSPTTESPLKQ+VNSSEERKI + IITAKEEFRE +
Subjt:  AILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENS

TrEMBL top hitse value%identityAlignment
A0A1S3CQJ6 uncharacterized protein LOC103503656 isoform X18.1e-21375.58Show/hide
Query:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPL-----DYDSDLD
        MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+ REYQLLWRHLAYRHTLLE+M SVTD L     DYDSDLD
Subjt:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPL-----DYDSDLD

Query:  FEIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAG
        FE+EPFPSV SESS EAAACVKVLIANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP+  A EVFDVNGAAG
Subjt:  FEIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAG

Query:  ANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRH
        A+AAS+KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRTASQLSQ   V                                          KR 
Subjt:  ANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRH

Query:  GNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKST
         NLN+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV S AS +E+ VQMQNQSPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ ST
Subjt:  GNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKST

Query:  HNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKV
        HNSDSIVRATAVAAGARIVS SDAASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GKSTM  NNS+KAVSPK+
Subjt:  HNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKV

Query:  LHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG
        LH+RS AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITAKEEFRENS  NDVKIRG
Subjt:  LHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG

A0A1S3CRZ4 uncharacterized protein LOC103503656 isoform X21.1e-21476.26Show/hide
Query:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+ REYQLLWRHLAYRHTLLE+M SVTD LDYDSDLDFE+EP
Subjt:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS
        FPSV SESS EAAACVKVLIANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP+  A EVFDVNGAAGA+AAS
Subjt:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS

Query:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV
        +KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRTASQLSQ   V                                          KR  NLN+
Subjt:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV

Query:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDS
        GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV S AS +E+ VQMQNQSPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSDS
Subjt:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSDS

Query:  IVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRS
        IVRATAVAAGARIVS SDAASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GKSTM  NNS+KAVSPK+LH+RS
Subjt:  IVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNRS

Query:  TAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG
         AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITAKEEFRENS  NDVKIRG
Subjt:  TAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG

A0A5A7T5C8 HTH myb-type domain-containing protein2.0e-21175.44Show/hide
Query:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPL-----DYDSDLD
        MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+ REYQLLWRHLAYRHTLLE+M SVTD L     DYDSDLD
Subjt:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPL-----DYDSDLD

Query:  FEIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAG
        FE+EPFPSV SESS EAAACVKVLIANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP+  A EVFDVNGAAG
Subjt:  FEIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAG

Query:  ANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRH
        A+AAS+KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRTASQLSQ   V                                          KR 
Subjt:  ANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRH

Query:  GNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKS
         NLN+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNINSSIV S AS +E+ VQMQNQSPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ S
Subjt:  GNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKS

Query:  THNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPK
        THNSDSIVRATAVAAGARIVS SDAASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GKSTM  NNS+KAVSPK
Subjt:  THNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPK

Query:  VLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG
        +LH+RS AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITAKEEFRENS  NDVKIRG
Subjt:  VLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG

A0A5D3E5P5 HTH myb-type domain-containing protein2.8e-21376.12Show/hide
Query:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP
        MI +KE +RTGTISMEDCSTLL RYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGIS+ REYQLLWRHLAYRHTLLE+M SVTD LDYDSDLDFE+EP
Subjt:  MIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEP

Query:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS
        FPSV SESS EAAACVKVLIANGIP+ESDVP SSAVEAPLTI ISNSQ  TDN  N Q A L QG+SVTIPLSIQRQPIP+  A EVFDVNGAAGA+AAS
Subjt:  FPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAAS

Query:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV
        +KRRKPWSKAEDLEL+AAVEKCGEGNWANILKGDFKGDRTASQLSQ   V                                          KR  NLN+
Subjt:  QKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNV

Query:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSD
        GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNINSSIV S AS +E+ VQMQNQSPQISMPSRPLLV+PLPSAVKSGINTSKNSL++ STHNSD
Subjt:  GASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMKSTHNSD

Query:  SIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNR
        SIVRATAVAAGARIVS SDAASL+K  QTK AIHIKSKC            P+HLD RP+VHYIS GKTPTP S++V GKSTM  NNS+KAVSPK+LH+R
Subjt:  SIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNSVKAVSPKVLHNR

Query:  STAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG
        S AI TN PS+QVSPTTESPLKQEVNSSEERK P+ IITAKEEFRENS  NDVKIRG
Subjt:  STAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG

A0A5J5ADN2 HTH myb-type domain-containing protein2.0e-21945Show/hide
Query:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE
        + ++K SSVVPA+ TG+DKV ELT +DL MKLHYIRG+Y F+ +E V+ L IYDLKKP+F  L+ YY  SGR+R+   +  RPFIKCNDSGVRIVEA C 
Subjt:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE

Query:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRST
        K++DEWL++     K    D  L++ Q +GPDLGFSPL FIQ T FKCGG SVGLSW HI+GD+FSASTFIN WG I+  +   Q    P    + F  +
Subjt:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRST

Query:  RLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILS-VVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGM
            P  +K +DP GD W+ +++CKM T S  +T +QLD ILS   G  +A     F+  +A+ WKSL+K+R    + R ++I +    + E EI  NG+
Subjt:  RLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILS-VVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGM

Query:  EMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPR-D
         +S VEADFPV  A   EL  LI +K++DE   IEE+VE+G  +SDFI YGA LTFV+LEEANIYG EL+GQKPV  NY I G+G+ GVVLVLP P    
Subjt:  EMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPR-D

Query:  DGRDGGRTVTVILPEKQLPDLIDELQKQWEIHFCACVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTI-----SDKRFQPHRRQSPALHLFLCDFI
         G   GRT+TV LP  Q+ +L DEL+K+W+        I ++    TV   S  L  P +         Q++     S+K      ++S  L       +
Subjt:  DGRDGGRTVTVILPEKQLPDLIDELQKQWEIHFCACVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTI-----SDKRFQPHRRQSPALHLFLCDFI

Query:  TSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDF
          +TM+E+ +KQ+ G+I+ E+ STLL+RYS  T+ T+L+EVAQV  V IDW+ LV+ TS+GISN REYQ+LWRHLAYR+TL E +D   +PLD DSDL++
Subjt:  TSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDF

Query:  EIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGA
        E+E FP+VSSE+S EAAA VKVLIA+G P++S +P  + VEAPLTI I N Q+S    +N Q A  +QG ++TIP+ +Q+ P+P   A E  D NG+   
Subjt:  EIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGA

Query:  NAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHG
            +K+RK WS AED+EL+AAV+KCGEGNWA ILKGDFKGDRTASQLSQ   +                                          KR  
Subjt:  NAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHG

Query:  NLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNIN----------SSIVSPASG-----AEALVQMQNQSPQISMPSRPLLVEPLPSAVKSG
        NLNVG+   S   +AQ+ AA RA+S AL++P+ ++ TA+ +I+          S+   P +G       ++ Q+Q+QSPQ S+P+    +    S+ KS 
Subjt:  NLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNIN----------SSIVSPASG-----AEALVQMQNQSPQISMPSRPLLVEPLPSAVKSG

Query:  INTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNA----PIHLDGRPSVHYISPGKTPTPGSSHVGG
        + + K S   KS  +  S+V+A AVAAGA I +  DAASLLK  Q K AI I     S  +  VAGNA      H D  P+VHYI  G   TP SS+   
Subjt:  INTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNA----PIHLDGRPSVHYISPGKTPTPGSSHVGG

Query:  KST--------MGCNNSVKAVSPKVLHNRSTAILT-NPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLA
         S          G  NSV+   P V  N S A  T N  S+  +  T  P K EV  +EER +       KE+ +E+ +A
Subjt:  KST--------MGCNNSVKAVSPKVLHNRSTAILT-NPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLA

SwissProt top hitse value%identityAlignment
A0A2H5AIZ1 Hydroxycinnamoyltransferase3.2e-1728.38Show/hide
Query:  LISELKFSSVV-PAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYD---LKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIV
        +I  +K +++V P++ T   ++   + +DL +   +   VY +R      +   +D   LK+ L   L  +Y ++GR+ R  EDG R  I CN  GVR V
Subjt:  LISELKFSSVV-PAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYD---LKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIV

Query:  EANCEKSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMN----------NRPAHQ
         A  + +IDE+     GD         L+     G D+   PL  +Q+T FKCGG S+G+   H + D  S   FIN+W  I            +R   +
Subjt:  EANCEKSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMN----------NRPAHQ

Query:  LR--PAPAGLIWPFR-STRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRT
         R  P+P+     ++ +  ++T P    DPT    + S     A   F++T +QLD + S V    +  +S++  +A   W+  S  R    D RT
Subjt:  LR--PAPAGLIWPFR-STRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRT

A0PDV5 Rosmarinate synthase1.4e-1528.83Show/hide
Query:  ELKFSSVVPAKATGDDKVRELTAIDLAMKLHY-IRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEK
        E+K S+++   A        L+ +DL    +Y    V+ +             LK+ L   L ++Y  +GR++    +G+R  I CN+ G+ +VEA C+ 
Subjt:  ELKFSSVVPAKATGDDKVRELTAIDLAMKLHY-IRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEK

Query:  SIDEWLSIIEGDDKFLHRDGC-LVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWP-FRS
        ++DE      GD  F  R    L+        +   PL   QLTRFKCGG+++G++  H L D  +A  FIN W        AH  R APA    P F  
Subjt:  SIDEWLSIIEGDDKFLHRDGC-LVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWP-FRS

Query:  TRLS--TPPVKRLD-------PTGDLWIGSSDCKMATLSFRITGEQLDRILS-----VVGRNRAVNFSTFEAIAAIFWKSL
        + LS   PP  +         PT +  +  +D  +A   F++T +QL+ + S             ++STFE +A   W+S+
Subjt:  TRLS--TPPVKRLD-------PTGDLWIGSSDCKMATLSFRITGEQLDRILS-----VVGRNRAVNFSTFEAIAAIFWKSL

Q39048 Protein ECERIFERUM 24.3e-7839.82Show/hide
Query:  KNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF---PLLEQYYVVSGRIRRRIEDGDR-----PFIKCN
        + S ++ ++ SSVVPA   G++K R+LT +DLAMKLHY+R VY F+G+   R+ T+ D+K  +F    LL+ Y+ VSGRIR    D D      P+I+CN
Subjt:  KNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF---PLLEQYYVVSGRIRRRIEDGDR-----PFIKCN

Query:  DSGVRIVEANCEK-SIDEWLSIIEGDDKFL-HRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMN-NRPAHQ
        DSG+R+VEAN E+ ++++WL +   DD+ + HR   LV+   +GPDL FSPL F+Q+T+FKCGGL +GLSW HILGD+FSASTF+   G +++ + P   
Subjt:  DSGVRIVEANCEK-SIDEWLSIIEGDDKFL-HRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMN-NRPAHQ

Query:  LRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTK
        + P    L    R+       ++++D  G+ W+ ++ CKM    F  +   +D +++     R   FS  + + A+ WKSL  IR E +++  I+I   K
Subjt:  LRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTK

Query:  CPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGEN
           +        + +S VE +  + G +  ELA LI  +K +E G I+ ++E+ K  SDF  YGA LTFV+L+E ++Y  E+ G KP  VNY I GVG+ 
Subjt:  CPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGEN

Query:  GVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDEL
        GVVLV P       ++  R V+V++PE+ L  L +E+
Subjt:  GVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDEL

Q9LIS1 Protein ECERIFERUM 26-like9.6e-7036.49Show/hide
Query:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSID
        + S+V  +  +      E T +DLAMKLHY++ VY++  +   R+LT+ D+K PLF +  Q   + GR RR   +  RP++KCND G R VE++C+ +++
Subjt:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSID

Query:  EWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRSTRLST
        EWL +    D+ +  D  LV+ Q +GPDL FSPL +IQ+TRF CGGL++GLSW HI+GD FS S F N W         +  + +     +   ++    
Subjt:  EWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRSTRLST

Query:  P-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGV
        P  VK++D  GDLW+  ++ KM T SF +T   L     V G         FE +  I WK ++ +R E S   TI++  +     +    RNG  +S +
Subjt:  P-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGV

Query:  EADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGG
          DF VA A+  E+ + I + K DE   I+E+V+   + SDFI YGA LTFVD+ E + Y  ++ G+ P  V   + G+G++G V+VLPG   ++     
Subjt:  EADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGG

Query:  RTVTVILPEKQLPDLIDELQK-QWEIHFCACVS
        R VTV LP       +DE++K +WE+  C  ++
Subjt:  RTVTVILPEKQLPDLIDELQK-QWEIHFCACVS

Q9SVM9 Protein ECERIFERUM 267.1e-6535.13Show/hide
Query:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE
        +  ++ S+V   + T      E T +DLAMKLHY++  Y++  +E  R+LT+  LK+ +F L +Q    +GR  RR  D  RP+IKCND G R VE  C 
Subjt:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE

Query:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAH-QLRPAPAGLIWPFRS
         +++EWLS     D+ +  D  LV+   IGP+L FSPL ++Q+TRFKCGGL +GLSW +I+GD FS     N W   +     +    P+     +   +
Subjt:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAH-QLRPAPAGLIWPFRS

Query:  TRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRIT-GEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNG
          +  P  +KR++P GDLW+  +D K+A   F ++  +Q+       G +   +   FE +A I WK ++K+R+E     T++I      + +    RN 
Subjt:  TRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRIT-GEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNG

Query:  MEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRD
          +S V  DFPVA A   EL + + + K DE   IEE+ E      DF+ YGA+LTF+DL   ++Y  ++ G+ P  V   + G+GE G+V+V      +
Subjt:  MEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRD

Query:  DGRDGGRTVTVILPEKQLPDLIDELQK
        +     R VTV LPE+++  +  E +K
Subjt:  DGRDGGRTVTVILPEKQLPDLIDELQK

Arabidopsis top hitse value%identityAlignment
AT1G09710.1 Homeodomain-like superfamily protein1.2e-5137.47Show/hide
Query:  QRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSE
        +R   I+  D +TLL RY + TI  +L+E++  S  ++DW+ LVK T+TGI+N REYQLLWRHL+YRH LL   D    PLD DSD++ E+E  P+VS E
Subjt:  QRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSE

Query:  SSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNS--QASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRK
        +S EA A VKV+ A+ + SESD+   S VEAPLTI I  +  + S +  ++P  +   +GM++  P+ +Q+       +TE  + NG+AG + A +++RK
Subjt:  SSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNS--QASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRK

Query:  PWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNVGASTT
         WS  ED EL AAV++CGEGNWA+I+KGDF+G+RTASQLSQ                       R   + K                + H + +V     
Subjt:  PWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNVGASTT

Query:  STTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMK----STHNSDSIV
          T+ A++ A + ALS AL     ++K A   + ++     +  EA     +Q  Q S P    +V+ LP A  S +  +K+ ++ K    ST  SD +V
Subjt:  STTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMK----STHNSDSIV

Query:  RATAVAAGARIVSLSDAASLLKVAQTK
         A +VAA A +  +  AAS  KV   K
Subjt:  RATAVAAGARIVSLSDAASLLKVAQTK

AT1G09710.2 Homeodomain-like superfamily protein3.5e-5136.89Show/hide
Query:  QRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSE
        +R   I+  D +TLL RY + TI  +L+E++  S  ++DW+ LVK T+TGI+N REYQLLWRHL+YRH LL   D    PLD DSD++ E+E  P+VS E
Subjt:  QRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRIDWDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSE

Query:  SSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNS--QASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRK
        +S EA A VKV+ A+ + SESD+   S VEAPLTI I  +  + S +  ++P  +   +GM++  P+ +Q+       +TE  + NG+AG + A +++RK
Subjt:  SSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNS--QASTDNLKNPQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRK

Query:  PWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQ---ILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKREL-WGLKRHGNLNVG
         WS  ED EL AAV++CGEGNWA+I+KGDF+G+RTASQLSQ   ++      S    Q G++       V       L +     +L  G     +    
Subjt:  PWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQ---ILGVFSPLSLWPHQFGMKCLPRPRLVAVEKPKILDDVDRKREL-WGLKRHGNLNVG

Query:  ASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMK----STHNS
        +S    T+ A +      L+  L    N      ++  +   + A+G  +     +Q  Q S P    +V+ LP A  S +  +K+ ++ K    ST  S
Subjt:  ASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPSAVKSGINTSKNSLIMK----STHNS

Query:  DSIVRATAVAAGARIVSLSDAASLLKVAQTK
        D +V A +VAA A +  +  AAS  KV   K
Subjt:  DSIVRATAVAAGARIVSLSDAASLLKVAQTK

AT3G23840.1 HXXXD-type acyl-transferase family protein6.8e-7136.49Show/hide
Query:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSID
        + S+V  +  +      E T +DLAMKLHY++ VY++  +   R+LT+ D+K PLF +  Q   + GR RR   +  RP++KCND G R VE++C+ +++
Subjt:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKSID

Query:  EWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRSTRLST
        EWL +    D+ +  D  LV+ Q +GPDL FSPL +IQ+TRF CGGL++GLSW HI+GD FS S F N W         +  + +     +   ++    
Subjt:  EWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRSTRLST

Query:  P-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGV
        P  VK++D  GDLW+  ++ KM T SF +T   L     V G         FE +  I WK ++ +R E S   TI++  +     +    RNG  +S +
Subjt:  P-PVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGV

Query:  EADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGG
          DF VA A+  E+ + I + K DE   I+E+V+   + SDFI YGA LTFVD+ E + Y  ++ G+ P  V   + G+G++G V+VLPG   ++     
Subjt:  EADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGG

Query:  RTVTVILPEKQLPDLIDELQK-QWEIHFCACVS
        R VTV LP       +DE++K +WE+  C  ++
Subjt:  RTVTVILPEKQLPDLIDELQK-QWEIHFCACVS

AT4G13840.1 HXXXD-type acyl-transferase family protein5.0e-6635.13Show/hide
Query:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE
        +  ++ S+V   + T      E T +DLAMKLHY++  Y++  +E  R+LT+  LK+ +F L +Q    +GR  RR  D  RP+IKCND G R VE  C 
Subjt:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCE

Query:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAH-QLRPAPAGLIWPFRS
         +++EWLS     D+ +  D  LV+   IGP+L FSPL ++Q+TRFKCGGL +GLSW +I+GD FS     N W   +     +    P+     +   +
Subjt:  KSIDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAH-QLRPAPAGLIWPFRS

Query:  TRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRIT-GEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNG
          +  P  +KR++P GDLW+  +D K+A   F ++  +Q+       G +   +   FE +A I WK ++K+R+E     T++I      + +    RN 
Subjt:  TRLSTP-PVKRLDPTGDLWIGSSDCKMATLSFRIT-GEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNG

Query:  MEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRD
          +S V  DFPVA A   EL + + + K DE   IEE+ E      DF+ YGA+LTF+DL   ++Y  ++ G+ P  V   + G+GE G+V+V      +
Subjt:  MEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRD

Query:  DGRDGGRTVTVILPEKQLPDLIDELQK
        +     R VTV LPE+++  +  E +K
Subjt:  DGRDGGRTVTVILPEKQLPDLIDELQK

AT4G24510.1 HXXXD-type acyl-transferase family protein3.0e-7939.82Show/hide
Query:  KNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF---PLLEQYYVVSGRIRRRIEDGDR-----PFIKCN
        + S ++ ++ SSVVPA   G++K R+LT +DLAMKLHY+R VY F+G+   R+ T+ D+K  +F    LL+ Y+ VSGRIR    D D      P+I+CN
Subjt:  KNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLF---PLLEQYYVVSGRIRRRIEDGDR-----PFIKCN

Query:  DSGVRIVEANCEK-SIDEWLSIIEGDDKFL-HRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMN-NRPAHQ
        DSG+R+VEAN E+ ++++WL +   DD+ + HR   LV+   +GPDL FSPL F+Q+T+FKCGGL +GLSW HILGD+FSASTF+   G +++ + P   
Subjt:  DSGVRIVEANCEK-SIDEWLSIIEGDDKFL-HRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMN-NRPAHQ

Query:  LRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTK
        + P    L    R+       ++++D  G+ W+ ++ CKM    F  +   +D +++     R   FS  + + A+ WKSL  IR E +++  I+I   K
Subjt:  LRPAPAGLIWPFRSTRLSTPPVKRLDPTGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTK

Query:  CPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGEN
           +        + +S VE +  + G +  ELA LI  +K +E G I+ ++E+ K  SDF  YGA LTFV+L+E ++Y  E+ G KP  VNY I GVG+ 
Subjt:  CPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIVKKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGEN

Query:  GVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDEL
        GVVLV P       ++  R V+V++PE+ L  L +E+
Subjt:  GVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGCGGTAAAAACAGTCTAATTTCCGAGTTAAAATTTTCGTCGGTGGTTCCGGCAAAGGCGACCGGCGACGACAAGGTCCGGGAATTAACGGCGATCGATCTGGC
GATGAAGCTTCATTATATTAGAGGCGTTTATTTGTTCAGAGGGAGCGAAGAAGTGAGAAATTTGACGATTTATGACCTGAAAAAACCTTTGTTTCCGTTGTTGGAGCAAT
ACTACGTCGTTTCGGGGAGGATTCGAAGGAGAATCGAAGATGGAGATCGGCCGTTCATTAAGTGTAATGATAGTGGAGTGAGAATTGTGGAAGCAAATTGTGAGAAAAGT
ATTGATGAATGGCTTTCGATTATTGAGGGAGACGATAAATTTCTGCATCGCGATGGCTGTTTGGTTCATACTCAAGCCATTGGTCCCGATCTTGGATTTTCCCCTCTTGC
TTTCATCCAGCTGACTCGGTTCAAGTGTGGCGGTCTCTCCGTGGGCCTCAGTTGGACTCACATTCTCGGCGATATCTTCTCCGCCTCCACCTTCATCAACGCATGGGGTT
CCATCATGAACAACCGCCCGGCCCACCAACTCCGTCCGGCGCCGGCTGGTCTTATCTGGCCGTTCAGATCAACCAGACTGTCCACACCGCCGGTCAAGAGGCTCGACCCG
ACCGGAGACCTTTGGATCGGGTCAAGCGACTGCAAAATGGCGACGCTGTCATTTCGAATCACGGGGGAGCAATTGGATCGAATATTGAGCGTCGTCGGCCGGAATCGAGC
GGTGAACTTCTCAACTTTCGAAGCTATTGCTGCGATTTTCTGGAAATCTTTGTCGAAAATACGGCTTGAGGACTCGGATTCGAGGACGATCTCGATCTATTCGACGAAAT
GCCCTAACAGAGAGGGTGAAATTCCGAGGAACGGAATGGAGATGAGCGGCGTCGAGGCCGATTTTCCAGTCGCAGGAGCGGCGGAAGGCGAATTGGCGGAGCTGATTGTG
AAGAAGAAAATTGATGAGGGCGGAGAAATCGAAGAAGTGGTGGAGAAAGGAAAGGAGGAATCGGATTTCATAGCGTACGGAGCGAGATTGACGTTCGTTGATTTGGAAGA
AGCGAATATTTACGGCTTCGAATTGGAAGGGCAGAAGCCAGTTCATGTGAATTATGAAATTGGGGGAGTTGGTGAAAACGGCGTCGTTTTGGTACTTCCAGGACCACCGC
GTGACGACGGAAGAGACGGTGGCCGAACGGTGACGGTTATCTTGCCGGAGAAACAGCTGCCGGACCTTATCGATGAACTGCAGAAGCAGTGGGAGATCCACTTCTGTGCT
TGCGTGTCTATCATTGAGCACTGCACCCTCACCACCGTCGCCGATAAGTCCGAATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAAT
TTCCGATAAGCGATTTCAGCCTCATCGGCGACAGTCACCGGCGCTGCACCTCTTCTTATGCGATTTCATTACGAGCTCTACAATGATTGAGAGGAAAGAGAAGCAAAGGA
CAGGGACAATTAGTATGGAAGATTGTTCCACTCTATTGGAAAGATATTCAGTAAGGACGATATTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGAT
TGGGACAAATTGGTGAAGAACACGTCCACTGGGATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCATTTGGCTTATCGTCACACATTACTGGAGAACATGGATTC
TGTTACTGATCCTCTGGATTATGATAGTGACTTAGATTTTGAAATAGAACCTTTTCCATCTGTTAGCAGTGAGTCCTCGTATGAAGCTGCAGCATGTGTAAAGGTACTGA
TTGCTAATGGTATACCAAGTGAGTCAGATGTTCCAACTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTAAAAAT
CCTCAATATGCTTGTTTGATGCAAGGGATGTCTGTTACAATTCCACTTTCCATTCAGAGGCAGCCGATTCCAATGGCAGCAGCAACTGAAGTATTTGATGTGAATGGAGC
AGCTGGTGCTAATGCAGCTTCTCAGAAAAGAAGGAAGCCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGCGAATA
TCTTGAAAGGAGACTTCAAAGGGGATAGAACTGCTTCACAGCTATCTCAGATTCTTGGAGTATTCTCTCCTTTGTCCCTTTGGCCACACCAATTTGGTATGAAGTGTTTG
CCTAGGCCGAGGTTGGTTGCCGTGGAAAAGCCAAAAATTCTTGATGATGTCGATCGCAAGAGGGAGCTTTGGGGGTTGAAGCGACATGGTAATTTGAATGTGGGAGCTAG
CACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGCAAATTCAAATATAAACA
GTAGCATTGTTTCTCCTGCAAGTGGTGCCGAAGCTTTGGTTCAAATGCAGAACCAGTCTCCACAGATTTCCATGCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCA
GCAGTGAAATCTGGAATCAACACTTCCAAGAATTCATTGATTATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATTGT
TTCTCTGTCCGATGCTGCATCTTTACTGAAAGTTGCACAAACAAAAAAGGCCATCCACATAAAGTCCAAATGTGTTTCATCAACCCAATCACCTGTGGCTGGAAATGCAC
CAATCCACTTGGATGGACGCCCCAGTGTACATTATATTTCCCCAGGAAAAACACCGACTCCAGGGTCAAGCCATGTCGGCGGTAAATCTACTATGGGGTGCAATAACTCA
GTGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTCTACTGCTATTTTGACAAACCCGCCATCAGACCAAGTAAGCCCAACAACTGAGTCTCCACTGAAGCAAGAGGT
TAACAGTTCAGAAGAACGGAAAATTCCCAAGCCAATCATTACTGCAAAAGAGGAGTTCCGAGAAAACAGCTTGGCAAATGATGTCAAGATTAGGGGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGCGGTAAAAACAGTCTAATTTCCGAGTTAAAATTTTCGTCGGTGGTTCCGGCAAAGGCGACCGGCGACGACAAGGTCCGGGAATTAACGGCGATCGATCTGGC
GATGAAGCTTCATTATATTAGAGGCGTTTATTTGTTCAGAGGGAGCGAAGAAGTGAGAAATTTGACGATTTATGACCTGAAAAAACCTTTGTTTCCGTTGTTGGAGCAAT
ACTACGTCGTTTCGGGGAGGATTCGAAGGAGAATCGAAGATGGAGATCGGCCGTTCATTAAGTGTAATGATAGTGGAGTGAGAATTGTGGAAGCAAATTGTGAGAAAAGT
ATTGATGAATGGCTTTCGATTATTGAGGGAGACGATAAATTTCTGCATCGCGATGGCTGTTTGGTTCATACTCAAGCCATTGGTCCCGATCTTGGATTTTCCCCTCTTGC
TTTCATCCAGCTGACTCGGTTCAAGTGTGGCGGTCTCTCCGTGGGCCTCAGTTGGACTCACATTCTCGGCGATATCTTCTCCGCCTCCACCTTCATCAACGCATGGGGTT
CCATCATGAACAACCGCCCGGCCCACCAACTCCGTCCGGCGCCGGCTGGTCTTATCTGGCCGTTCAGATCAACCAGACTGTCCACACCGCCGGTCAAGAGGCTCGACCCG
ACCGGAGACCTTTGGATCGGGTCAAGCGACTGCAAAATGGCGACGCTGTCATTTCGAATCACGGGGGAGCAATTGGATCGAATATTGAGCGTCGTCGGCCGGAATCGAGC
GGTGAACTTCTCAACTTTCGAAGCTATTGCTGCGATTTTCTGGAAATCTTTGTCGAAAATACGGCTTGAGGACTCGGATTCGAGGACGATCTCGATCTATTCGACGAAAT
GCCCTAACAGAGAGGGTGAAATTCCGAGGAACGGAATGGAGATGAGCGGCGTCGAGGCCGATTTTCCAGTCGCAGGAGCGGCGGAAGGCGAATTGGCGGAGCTGATTGTG
AAGAAGAAAATTGATGAGGGCGGAGAAATCGAAGAAGTGGTGGAGAAAGGAAAGGAGGAATCGGATTTCATAGCGTACGGAGCGAGATTGACGTTCGTTGATTTGGAAGA
AGCGAATATTTACGGCTTCGAATTGGAAGGGCAGAAGCCAGTTCATGTGAATTATGAAATTGGGGGAGTTGGTGAAAACGGCGTCGTTTTGGTACTTCCAGGACCACCGC
GTGACGACGGAAGAGACGGTGGCCGAACGGTGACGGTTATCTTGCCGGAGAAACAGCTGCCGGACCTTATCGATGAACTGCAGAAGCAGTGGGAGATCCACTTCTGTGCT
TGCGTGTCTATCATTGAGCACTGCACCCTCACCACCGTCGCCGATAAGTCCGAATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAAT
TTCCGATAAGCGATTTCAGCCTCATCGGCGACAGTCACCGGCGCTGCACCTCTTCTTATGCGATTTCATTACGAGCTCTACAATGATTGAGAGGAAAGAGAAGCAAAGGA
CAGGGACAATTAGTATGGAAGATTGTTCCACTCTATTGGAAAGATATTCAGTAAGGACGATATTTACTTTGCTTCGGGAGGTGGCCCAGGTTTCGGGAGTGAGAATTGAT
TGGGACAAATTGGTGAAGAACACGTCCACTGGGATTTCTAATGTTCGGGAGTATCAGTTGTTATGGCGGCATTTGGCTTATCGTCACACATTACTGGAGAACATGGATTC
TGTTACTGATCCTCTGGATTATGATAGTGACTTAGATTTTGAAATAGAACCTTTTCCATCTGTTAGCAGTGAGTCCTCGTATGAAGCTGCAGCATGTGTAAAGGTACTGA
TTGCTAATGGTATACCAAGTGAGTCAGATGTTCCAACTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTAAAAAT
CCTCAATATGCTTGTTTGATGCAAGGGATGTCTGTTACAATTCCACTTTCCATTCAGAGGCAGCCGATTCCAATGGCAGCAGCAACTGAAGTATTTGATGTGAATGGAGC
AGCTGGTGCTAATGCAGCTTCTCAGAAAAGAAGGAAGCCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGCGAATA
TCTTGAAAGGAGACTTCAAAGGGGATAGAACTGCTTCACAGCTATCTCAGATTCTTGGAGTATTCTCTCCTTTGTCCCTTTGGCCACACCAATTTGGTATGAAGTGTTTG
CCTAGGCCGAGGTTGGTTGCCGTGGAAAAGCCAAAAATTCTTGATGATGTCGATCGCAAGAGGGAGCTTTGGGGGTTGAAGCGACATGGTAATTTGAATGTGGGAGCTAG
CACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGCAAATTCAAATATAAACA
GTAGCATTGTTTCTCCTGCAAGTGGTGCCGAAGCTTTGGTTCAAATGCAGAACCAGTCTCCACAGATTTCCATGCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCA
GCAGTGAAATCTGGAATCAACACTTCCAAGAATTCATTGATTATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATTGT
TTCTCTGTCCGATGCTGCATCTTTACTGAAAGTTGCACAAACAAAAAAGGCCATCCACATAAAGTCCAAATGTGTTTCATCAACCCAATCACCTGTGGCTGGAAATGCAC
CAATCCACTTGGATGGACGCCCCAGTGTACATTATATTTCCCCAGGAAAAACACCGACTCCAGGGTCAAGCCATGTCGGCGGTAAATCTACTATGGGGTGCAATAACTCA
GTGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTCTACTGCTATTTTGACAAACCCGCCATCAGACCAAGTAAGCCCAACAACTGAGTCTCCACTGAAGCAAGAGGT
TAACAGTTCAGAAGAACGGAAAATTCCCAAGCCAATCATTACTGCAAAAGAGGAGTTCCGAGAAAACAGCTTGGCAAATGATGTCAAGATTAGGGGCTGACCAAACATAA
AAGGAAGCAAGACCAAATCTATACCATATAATCATGGGGACCATATCACAAAGCTCTACAGGCATGTTAATAGCAGGGTTGTGGGGTCAATGCAAAAGGCCAATGACAAG
ACTAAAATTTGGAAGCTCATATTCTTCAGCAAGACAAGGAAAATGTATACTGTTTATCTGGGATTGAAGGTATAATTCATTCTTGAAATTAATGGATGTTCTTCCTTTCA
TTACATTGGCGAAATTACACCCTCAAAAAAAGCTCATCACTGAGGTAGGGGCCCTTTAGCTAGATCACTATATTGTTTCTGTAGTTGTGGACTTGCACATGAGAACAGTG
ATGGCTTTTGCCTCTTTAACTTCACCCCAAACAAGAAAGACCGTAAAGGAATTCGTGTTCGGCTTAACGATTTCGATTGTTGTATGCTCTTCCCTGCTGAAATGCCAAAT
TCTTGATCAAGTCTCTCCAAAGCCTTCTCAACTTGGACTTGTAGTACTCTTACCCGACCTTGACCAACTTGTAACTCATCAGCTATCTTCCTATTTTCTTGCTTCATGTT
GAGTACTTCACCTTGAAACTTTGCAGCTTGATAATCACTCAGTTCTGTCTTTTCTTCAGCTGAACCTTCATCTGTGATTCTTGATATATCACTTTGTATGTCACATAAGG
ATGAGAATCTATTGCATAGTTCATCTTTCAATACTGCACTATGTTCCAACCAAAGTGATAA
Protein sequenceShow/hide protein sequence
MDGGKNSLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYLFRGSEEVRNLTIYDLKKPLFPLLEQYYVVSGRIRRRIEDGDRPFIKCNDSGVRIVEANCEKS
IDEWLSIIEGDDKFLHRDGCLVHTQAIGPDLGFSPLAFIQLTRFKCGGLSVGLSWTHILGDIFSASTFINAWGSIMNNRPAHQLRPAPAGLIWPFRSTRLSTPPVKRLDP
TGDLWIGSSDCKMATLSFRITGEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLSKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGVEADFPVAGAAEGELAELIV
KKKIDEGGEIEEVVEKGKEESDFIAYGARLTFVDLEEANIYGFELEGQKPVHVNYEIGGVGENGVVLVLPGPPRDDGRDGGRTVTVILPEKQLPDLIDELQKQWEIHFCA
CVSIIEHCTLTTVADKSEFLLQPQRLTFVFRFRLQTISDKRFQPHRRQSPALHLFLCDFITSSTMIERKEKQRTGTISMEDCSTLLERYSVRTIFTLLREVAQVSGVRID
WDKLVKNTSTGISNVREYQLLWRHLAYRHTLLENMDSVTDPLDYDSDLDFEIEPFPSVSSESSYEAAACVKVLIANGIPSESDVPTSSAVEAPLTIGISNSQASTDNLKN
PQYACLMQGMSVTIPLSIQRQPIPMAAATEVFDVNGAAGANAASQKRRKPWSKAEDLELMAAVEKCGEGNWANILKGDFKGDRTASQLSQILGVFSPLSLWPHQFGMKCL
PRPRLVAVEKPKILDDVDRKRELWGLKRHGNLNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSPASGAEALVQMQNQSPQISMPSRPLLVEPLPS
AVKSGINTSKNSLIMKSTHNSDSIVRATAVAAGARIVSLSDAASLLKVAQTKKAIHIKSKCVSSTQSPVAGNAPIHLDGRPSVHYISPGKTPTPGSSHVGGKSTMGCNNS
VKAVSPKVLHNRSTAILTNPPSDQVSPTTESPLKQEVNSSEERKIPKPIITAKEEFRENSLANDVKIRG