CuGenDBv2

Lsi04G022820 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G022820
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: HXXXD-type acyl-transferase family protein
Genome location: chr04:29842122..29858004
RNA-Seq Expression: Lsi04G022820
Synteny: Lsi04G022820
Gene Ontology terms: GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domains:
  IPR001005 - SANT/Myb domain
  IPR003480 - Transferase
  IPR009057 - Homeobox-like domain superfamily
  IPR017930 - Myb domain
  IPR023213 - Chloramphenicol acetyltransferase-like domain superfamily
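
The genome location field above uses a `chromosome:start..end` notation. As a minimal sketch of how such a string could be split into its parts, the snippet below parses it and reports the span length. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive at both ends.
        return self.end - self.start + 1


# Matches strings such as "chr04:29842122..29858004".
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a 'chrom:start..end' string into its three components."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a chrom:start..end location: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["chrom"], start, end)


loc = parse_location("chr04:29842122..29858004")
print(loc.chrom, loc.start, loc.end, loc.length)
```

If the coordinates turn out to be 0-based or half-open, only the `length` property would need to change.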


Homology
GenBank top hits | e value | % identity
TYK31274.1 uncharacterized protein E5676_scaffold455G005670 [Cucumis melo var. makuwa] | 1.7e-206 | 80.96
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV
        S  L  YSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EPFP V SES NEAAACVKV
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV

Query:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA
        LIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AASRKR+KPWSKAEDLEL+AA
Subjt:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA

Query:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SSASGAEASVQMQ
        VEKCGEGNW NILKGDFKGDRTASQLSQ            N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNINSSIV SSAS +E+SVQMQ
Subjt:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SSASGAEASVQMQ

Query:  NQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDAR
        NQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC            PMHLDAR
Subjt:  NQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDAR

Query:  PSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG
        P+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAKEEFRENS  NDVKIRG
Subjt:  PSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG

XP_008466161.1 PREDICTED: uncharacterized protein LOC103503656 isoform X2 [Cucumis melo] | 6.7e-208 | 81.12
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV
        S  L  YSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EPFP V SES NEAAACVKV
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV

Query:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA
        LIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AASRKR+KPWSKAEDLEL+AA
Subjt:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA

Query:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SSASGAEASVQMQN
        VEKCGEGNW NILKGDFKGDRTASQLSQ            N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV SSAS +E+SVQMQN
Subjt:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SSASGAEASVQMQN

Query:  QSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARP
        QSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC            PMHLDARP
Subjt:  QSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARP

Query:  SVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG
        +VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAKEEFRENS  NDVKIRG
Subjt:  SVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG

XP_038898516.1 protein ECERIFERUM 2 [Benincasa hispida] | 5.7e-215 | 87.19
Query:  MDGGKNNTLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSG
        MD G  NTLISELKFSSVVPAKATGD+KV+ELTAIDLAMKLHYIRGVYFF+ SEEVRNLTIYDLKKPLF LLE YYVVSGRIRRRIED DR FIKCNDSG
Subjt:  MDGGKNNTLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSG

Query:  VRIVEADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYITNDRPANQLRPAQAG
        VRIVEADCEK+I+EWLSI GDDK SHRD CLV +QAIGPDLGFSPLAFIQLTRFKCGGLSVG+SWTHVLGDIFSASTFIN WGYI N+RPA+ L PA A 
Subjt:  VRIVEADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYITNDRPANQLRPAQAG

Query:  LIWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGE
        L  PSRST+LSTPPVKRLDPTGDLWIGS DCKMATRSFRI AEQLDRIL V GRNRAVNFSTFEAIAAIFWK L+KIRLEDS+SRTISIYSTK PNREGE
Subjt:  LIWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGE

Query:  IPRNGMEMSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLT
        IPRNGMEMSGVEA+FPV  AAEGELAE+IVKKKIDEGGEI  +VEKE++ESDFIAYGARLTFVDLEEANIYGFELEGQ+P+HVNYEIGGVGENGVVLVL 
Subjt:  IPRNGMEMSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLT

Query:  GPLRGDGKDGGGRTVTVILPEKQLPDLIDELQKQWEI
        GP  GDG+DGGG TVTVILPEK+LPDLIDELQKQW I
Subjt:  GPLRGDGKDGGGRTVTVILPEKQLPDLIDELQKQWEI

XP_038898738.1 uncharacterized protein LOC120086263 isoform X1 [Benincasa hispida] | 1.4e-213 | 84.66
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV
        S  L  YSVR IFTLLREVA+VSGVRIDW  LVKNTSTGISNV EYQLLWRHL YRHT LENMDSVTDPLDYDSDLDFEIEPFP VSSES NEAAACVKV
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV

Query:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA
        LIAN IP+ESDVPSSSAVEAPLTIGISNSQ+ST NLEN QSACLMQGMSVTIPLS+QRQPIP+P ATEV DVNGAA ANAASRKR+KPWSKAEDLELMAA
Subjt:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA

Query:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIVSSASGAEASVQMQN
        VEKCGEGNW NILKGDFKGDRTASQLSQ            NVGAST+STTQKAQIDAAHRALSFALDLPVNNSKT ANSNINSS+VSSASGAEASVQMQN
Subjt:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIVSSASGAEASVQMQN

Query:  QSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARP
        QSP IS+PSRPLLVEPLPS+VKSGI TSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSD  SLLKA Q +NAI IKSKC           +PMHLDARP
Subjt:  QSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARP

Query:  SVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENS
        SVHYISTGKTPTPGSN+V GKSTM+GNNS+KAVSPKV HNR TAI TNPPSDRVSPTTESPLKQ+VNSSEERKI++ IITAKEEFRE +
Subjt:  SVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENS

XP_038898739.1 uncharacterized protein LOC120086263 isoform X2 [Benincasa hispida] | 5.7e-215 | 84.84
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV
        S  L  YSVR IFTLLREVA+VSGVRIDW  LVKNTSTGISNV EYQLLWRHL YRHT LENMDSVTDPLDYDSDLDFEIEPFP VSSES NEAAACVKV
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV

Query:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA
        LIAN IP+ESDVPSSSAVEAPLTIGISNSQ+ST NLEN QSACLMQGMSVTIPLS+QRQPIP+P ATEV DVNGAA ANAASRKR+KPWSKAEDLELMAA
Subjt:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA

Query:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGAEASVQMQNQ
        VEKCGEGNW NILKGDFKGDRTASQLSQ            NVGAST+STTQKAQIDAAHRALSFALDLPVNNSKTANSNINSS+VSSASGAEASVQMQNQ
Subjt:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGAEASVQMQNQ

Query:  SPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARPS
        SP IS+PSRPLLVEPLPS+VKSGI TSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSD  SLLKA Q +NAI IKSKC           +PMHLDARPS
Subjt:  SPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARPS

Query:  VHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENS
        VHYISTGKTPTPGSN+V GKSTM+GNNS+KAVSPKV HNR TAI TNPPSDRVSPTTESPLKQ+VNSSEERKI++ IITAKEEFRE +
Subjt:  VHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENS

TrEMBL top hits | e value | % identity
A0A1S3CQJ6 uncharacterized protein LOC103503656 isoform X1 | 2.3e-206 | 80.32
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLDFEIEPFPFVSSESLNEAA
        S  L  YSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD L     DYDSDLDFE+EPFP V SES NEAA
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLDFEIEPFPFVSSESLNEAA

Query:  ACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDL
        ACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AASRKR+KPWSKAEDL
Subjt:  ACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDL

Query:  ELMAAVEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SSASGAEAS
        EL+AAVEKCGEGNW NILKGDFKGDRTASQLSQ            N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV SSAS +E+S
Subjt:  ELMAAVEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SSASGAEAS

Query:  VQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMH
        VQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC            PMH
Subjt:  VQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMH

Query:  LDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVK
        LDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAKEEFRENS  NDVK
Subjt:  LDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVK

Query:  IRG
        IRG
Subjt:  IRG

A0A1S3CRZ4 uncharacterized protein LOC103503656 isoform X2 | 3.3e-208 | 81.12
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV
        S  L  YSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EPFP V SES NEAAACVKV
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV

Query:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA
        LIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AASRKR+KPWSKAEDLEL+AA
Subjt:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA

Query:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SSASGAEASVQMQN
        VEKCGEGNW NILKGDFKGDRTASQLSQ            N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KTANSNINSSIV SSAS +E+SVQMQN
Subjt:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIV-SSASGAEASVQMQN

Query:  QSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARP
        QSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC            PMHLDARP
Subjt:  QSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDARP

Query:  SVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG
        +VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAKEEFRENS  NDVKIRG
Subjt:  SVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG

A0A5A7T5C8 HTH myb-type domain-containing protein | 5.8e-205 | 80.16
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLDFEIEPFPFVSSESLNEAA
        S  L  YSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD L     DYDSDLDFE+EPFP V SES NEAA
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPL-----DYDSDLDFEIEPFPFVSSESLNEAA

Query:  ACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDL
        ACVKVLIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AASRKR+KPWSKAEDL
Subjt:  ACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDL

Query:  ELMAAVEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SSASGAEA
        EL+AAVEKCGEGNW NILKGDFKGDRTASQLSQ            N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNINSSIV SSAS +E+
Subjt:  ELMAAVEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SSASGAEA

Query:  SVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPM
        SVQMQNQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC            PM
Subjt:  SVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPM

Query:  HLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDV
        HLDARP+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAKEEFRENS  NDV
Subjt:  HLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDV

Query:  KIRG
        KIRG
Subjt:  KIRG

A0A5D3E5P5 HTH myb-type domain-containing protein | 8.0e-207 | 80.96
Query:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV
        S  L  YSVR IFTLLREVA+VSGVRIDWDKLVKNTSTGIS+  EYQLLWRHL YRHTLLE+M SVTD LDYDSDLDFE+EPFP V SES NEAAACVKV
Subjt:  SPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKV

Query:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA
        LIAN IP ESDVP+SSAVEAPLTI ISNSQ  TDN +N QSA L QG+SVTIPLSIQRQPIP+P A EVFDVNGAAGA+AASRKR+KPWSKAEDLEL+AA
Subjt:  LIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAA

Query:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SSASGAEASVQMQ
        VEKCGEGNW NILKGDFKGDRTASQLSQ            N+GAST+STTQKAQIDAAHRAL+FALDLPVNN+KT ANSNINSSIV SSAS +E+SVQMQ
Subjt:  VEKCGEGNWGNILKGDFKGDRTASQLSQ------------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKT-ANSNINSSIV-SSASGAEASVQMQ

Query:  NQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDAR
        NQSP IS+PSRPLLV+PLPSAVKSGINTSKNSLM+ STHNSDSIVRATAVAAGARIVSPSD  SL+KATQT+NAI IKSKC            PMHLDAR
Subjt:  NQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNAPMHLDAR

Query:  PSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG
        P+VHYISTGKTPTP SNYVSGKSTMVGNNSMKAVSPK+LH+R  AI TN PS++VSPTTESPLKQEVNSSEERK  + IITAKEEFRENS  NDVKIRG
Subjt:  PSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG

A0A5J5ADN2 HTH myb-type domain-containing protein | 5.2e-206 | 45.16
Query:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCE
        + ++K SSVVPA+ TG+DKV ELT +DL MKLHYIRG+YFFK +E V+ L IYDLKKP+F  L+ YY  SGR+R+   +  R FIKCNDSGVRIVEA C 
Subjt:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCE

Query:  KSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYI-TNDRPANQLRPAQAGLIWPSRST
        K++DEWL++    K    D  L+  Q +GPDLGFSPL FIQ T FKCGG SVG+SW H++GD+FSASTFIN WG I     P   LR           S 
Subjt:  KSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYI-TNDRPANQLRPAQAGLIWPSRST

Query:  KLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILS-VVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGEIPRNGME
              +K +DP GD W+ + +CKM T S  + A+QLD ILS   G  +A     F+  +A+ WKSLAK+R    + R ++I +    + E EI  NG+ 
Subjt:  KLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILS-VVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGEIPRNGME

Query:  MSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLTGPLRGDG
        +S VEADFPV  A   EL  LI +K++DE   IEEMVE+   +SDFI YGA LTFV+LEEANIYG EL+GQ+P+  NY I G+G+ GVVLVL  P     
Subjt:  MSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLTGPLRGDG

Query:  KDGGGRTVTVILPEKQLPDLIDELQKQWEIAKIQ---------SVCTCSELHFCACVSIIEQCTLT-TVADKSGFLLQPQRLTFVFRFRLQTISEKRFQ-
            GRT+TV LP  Q+ +L DEL+K+W+ A I+         SV         A +S+  Q   + T ++KS      + +T + R  +   ++K+ + 
Subjt:  KDGGGRTVTVILPEKQLPDLIDELQKQWEIAKIQ---------SVCTCSELHFCACVSIIEQCTLT-TVADKSGFLLQPQRLTFVFRFRLQTISEKRFQ-

Query:  --SHRRRSPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNE
          +    S  L  YS   + T+L+EVA+V  V IDW+ LV+ TS+GISN  EYQ+LWRHL YR+TL E +D   +PLD DSDL++E+E FP VSSE+  E
Subjt:  --SHRRRSPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNE

Query:  AAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAE
        AAA VKVLIA+  PT+S +P+ + VEAPLTI I N Q+S    EN Q A  +QG ++TIP+ +Q+ P+P   A E  D NG+       RK++K WS AE
Subjt:  AAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAE

Query:  DLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSQ----------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNIN----------SSIV
        D+EL+AAV+KCGEGNW  ILKGDFKGDRTASQLSQ          N+   + S   +AQ+ AA RA+S AL++P+ ++ TA+ +I+          S+  
Subjt:  DLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSQ----------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNIN----------SSIV

Query:  SSASGAEASV-----QMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV
           +G   SV     Q+Q+QSP  S+P+    +    S+ KS + + K S   KS  +  S+V+A AVAAGA I +P D  SLLKATQ +NAI I     
Subjt:  SSASGAEASV-----QMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCV

Query:  SSIQSPVVGNA----PMHLDARPSVHYISTGKTPTPGSNY---------VSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNS
        S ++  V GNA      H DA P+VHYI TG   TP S+Y          SG     GN+   A  P  L+    A  +N  S+  +  T  P K EV  
Subjt:  SSIQSPVVGNA----PMHLDARPSVHYISTGKTPTPGSNY---------VSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNS

Query:  SEERKIAKPIITAKEEFRENSVA
        +EER ++      KE+ +E+ VA
Subjt:  SEERKIAKPIITAKEEFRENSVA

SwissProt top hits | e value | % identity
A0A2H5AIZ1 Hydroxycinnamoyltransferase | 7.9e-18 | 28.47
Query:  LISELKFSSVV-PAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYD---LKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIV
        +I  +K +++V P++ T   ++   + +DL +   +   VYF++      +   +D   LK+ L   L  +Y ++GR+ R  EDG R  I CN  GVR V
Subjt:  LISELKFSSVV-PAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYD---LKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIV

Query:  EADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYITN----------DRPANQL
         A+ + +IDE+    GD   +     L+P    G D+   PL  +Q+T FKCGG S+G+   H + D  S   FIN+W  I            DR   + 
Subjt:  EADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYITN----------DRPANQL

Query:  R-PAQAGL--IWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRT
        R P       I    +  ++T P    DPT    + S     A   F++  +QLD + S V    +  +S++  +A   W+  +  R    D RT
Subjt:  R-PAQAGL--IWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRT

A0PDV5 Rosmarinate synthase | 1.3e-17 | 27.4
Query:  ELKFSSVVPAKATGDDKVRELTAIDLAMKLHY-IRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCEK
        E+K S+++   A        L+ +DL    +Y    V+F+             LK+ L   L ++Y  +GR++    +G+R  I CN+ G+ +VEA+C+ 
Subjt:  ELKFSSVVPAKATGDDKVRELTAIDLAMKLHY-IRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCEK

Query:  SIDEWLSIEGDDKFSHRDGC-LVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYITNDRPANQLRPAQAGLIWPSRSTK
        ++DE     GD  F+ R    L+P       +   PL   QLTRFKCGG+++G++  H L D  +A  FIN W +++   PA    P      +   S  
Subjt:  SIDEWLSIEGDDKFSHRDGC-LVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYITNDRPANQLRPAQAGLIWPSRSTK

Query:  LSTPPVKRLD-------PTGDLWIGSIDCKMATRSFRIKAEQLDRILS-----VVGRNRAVNFSTFEAIAAIFWKSLAKIR
           PP  +         PT +  +   D  +A   F++  +QL+ + S             ++STFE +A   W+S+   R
Subjt:  LSTPPVKRLD-------PTGDLWIGSIDCKMATRSFRIKAEQLDRILS-----VVGRNRAVNFSTFEAIAAIFWKSLAKIR

Q39048 Protein ECERIFERUM 2 | 2.9e-76 | 39.72
Query:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLF---SLLEQYYVVSGRIRRRIEDGDRA-----FIKCNDSGV
        ++ ++ SSVVPA   G++K R+LT +DLAMKLHY+R VYFFKG+   R+ T+ D+K  +F   SLL+ Y+ VSGRIR    D D +     +I+CNDSG+
Subjt:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLF---SLLEQYYVVSGRIRRRIEDGDRA-----FIKCNDSGV

Query:  RIVEADCEK-SIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQA
        R+VEA+ E+ ++++WL ++ D    HR   LV    +GPDL FSPL F+Q+T+FKCGGL +G+SW H+LGD+FSASTF+   G  ++   P   + P   
Subjt:  RIVEADCEK-SIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQA

Query:  GLIWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREG
         L   +R+       ++++D  G+ W+ +  CKM    F      +D +++     R   FS  + + A+ WKSL  IR E +++  I+I   K   +  
Subjt:  GLIWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREG

Query:  EIPRNGMEMSGVEADFPVDE-AAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLV
              + +S VE +   DE     ELA LI  +K +E G I+ M+E++K  SDF  YGA LTFV+L+E ++Y  E+ G +P  VNY I GVG+ GVVLV
Subjt:  EIPRNGMEMSGVEADFPVDE-AAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLV

Query:  LTGPLRGDGKDGGGRTVTVILPEKQLPDLIDEL
                 K    R V+V++PE+ L  L +E+
Subjt:  LTGPLRGDGKDGGGRTVTVILPEKQLPDLIDEL

Q9LIS1 Protein ECERIFERUM 26-like | 7.3e-64 | 36.72
Query:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCEKSID
        + S+V  +  +      E T +DLAMKLHY++ VY +      R+LT+ D+K PLFS+  Q   + GR RR   +  R ++KCND G R VE+ C+ +++
Subjt:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCEKSID

Query:  EWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAW------GYITNDRPANQLRPAQAGLIWPSRS
        EWL +         D  LV  Q +GPDL FSPL +IQ+TRF CGGL++G+SW H++GD FS S F N W      G I   + +   R  Q        S
Subjt:  EWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAW------GYITNDRPANQLRPAQAGLIWPSRS

Query:  TKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGEIPRNGME
        T      VK++D  GDLW+   + KM T SF +    L     V G         FE +  I WK +A +R E S   TI++  +     +    RNG  
Subjt:  TKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGEIPRNGME

Query:  MSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLTGPLRGDG
        +S +  DF V EA+  E+ + I + K DE   I+E+V+   + SDFI YGA LTFVD+ E + Y  ++ G+ P  V   + G+G++G V+VL G +  + 
Subjt:  MSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLTGPLRGDG

Query:  KDGGGRTVTVILPEKQLPDLIDELQK-QWEIAK
             R VTV LP       +DE++K +WE+ K
Subjt:  KDGGGRTVTVILPEKQLPDLIDELQK-QWEIAK

Q9SVM9 Protein ECERIFERUM 26 | 3.1e-62 | 35.33
Query:  GKNNTLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRI
        G+    +  ++ S+V   + T      E T +DLAMKLHY++  Y +  +E  R+LT+  LK+ +F L +Q    +GR  RR  D  R +IKCND G R 
Subjt:  GKNNTLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRI

Query:  VEADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQAGLI
        VE  C  +++EWLS + D      D  LV    IGP+L FSPL ++Q+TRFKCGGL +G+SW +++GD FS     N W   IT ++      P+     
Subjt:  VEADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQAGLI

Query:  WPSRSTKLSTP-PVKRLDPTGDLWIGSIDCKMATRSFRIK-AEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGE
        + S +  +  P  +KR++P GDLW+   D K+A   F +  A+Q+       G +   +   FE +A I WK +AK+R+E     T++I      + +  
Subjt:  WPSRSTKLSTP-PVKRLDPTGDLWIGSIDCKMATRSFRIK-AEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGE

Query:  IPRNGMEMSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLT
          RN   +S V  DFPV EA   EL + + + K DE   IEE+ E      DF+ YGA+LTF+DL   ++Y  ++ G+ P  V   + G+GE G+V+V  
Subjt:  IPRNGMEMSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLT

Query:  GPLRGDGKDGGGRTVTVILPEKQLPDLIDELQK
             +      R VTV LPE+++  +  E +K
Subjt:  GPLRGDGKDGGGRTVTVILPEKQLPDLIDELQK

Arabidopsis top hits | e value | % identity
AT1G09710.1 Homeodomain-like superfamily protein | 1.8e-49 | 39.6
Query:  YSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKVLIANCI
        Y +  I  +L+E++  S  ++DW+ LVK T+TGI+N  EYQLLWRHL YRH LL   D    PLD DSD++ E+E  P VS E+  EA A VKV+ A+ +
Subjt:  YSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKVLIANCI

Query:  PTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAAVEKC
         +ESD+   S VEAPLTI I  +  + S +  E+P S+   +GM++  P+ +Q+       +TE  + NG+AG + A R+++K WS  ED EL AAV++C
Subjt:  PTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAAVEKC

Query:  GEGNWGNILKGDFKGDRTASQLSQ-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN--------SNINSSIVSSASGAEASVQMQNQS
        GEGNW +I+KGDF+G+RTASQLSQ           ST+ +    Q   A  A++ AL L + N   +N        +  + +I  + +   +S Q Q QS
Subjt:  GEGNWGNILKGDFKGDRTASQLSQ-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTAN--------SNINSSIVSSASGAEASVQMQNQS

Query:  PHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGA
          I + + P     LP+A    +  +  S    ST  SD +V A +VAA A
Subjt:  PHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGA

AT1G09710.2 Homeodomain-like superfamily protein | 4.9e-47 | 40.27
Query:  YSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKVLIANCI
        Y +  I  +L+E++  S  ++DW+ LVK T+TGI+N  EYQLLWRHL YRH LL   D    PLD DSD++ E+E  P VS E+  EA A VKV+ A+ +
Subjt:  YSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLLWRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKVLIANCI

Query:  PTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAAVEKC
         +ESD+   S VEAPLTI I  +  + S +  E+P S+   +GM++  P+ +Q+       +TE  + NG+AG + A R+++K WS  ED EL AAV++C
Subjt:  PTESDVPSSSAVEAPLTIGISNS--QASTDNLENPQSACLMQGMSVTIPLSIQRQPIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAAVEKC

Query:  GEGNWGNILKGDFKGDRTASQLSQ-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGAEASVQMQNQSPHISIP
        GEGNW +I+KGDF+G+RTASQLSQ           ST+ +    Q   A  A++ AL L + N   +N     +    +  A +S+ +  +   + +P
Subjt:  GEGNWGNILKGDFKGDRTASQLSQ-------NVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNINSSIVSSASGAEASVQMQNQSPHISIP

AT3G23840.1 HXXXD-type acyl-transferase family protein | 5.2e-65 | 36.72
Query:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCEKSID
        + S+V  +  +      E T +DLAMKLHY++ VY +      R+LT+ D+K PLFS+  Q   + GR RR   +  R ++KCND G R VE+ C+ +++
Subjt:  KFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCEKSID

Query:  EWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAW------GYITNDRPANQLRPAQAGLIWPSRS
        EWL +         D  LV  Q +GPDL FSPL +IQ+TRF CGGL++G+SW H++GD FS S F N W      G I   + +   R  Q        S
Subjt:  EWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAW------GYITNDRPANQLRPAQAGLIWPSRS

Query:  TKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGEIPRNGME
        T      VK++D  GDLW+   + KM T SF +    L     V G         FE +  I WK +A +R E S   TI++  +     +    RNG  
Subjt:  TKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGEIPRNGME

Query:  MSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLTGPLRGDG
        +S +  DF V EA+  E+ + I + K DE   I+E+V+   + SDFI YGA LTFVD+ E + Y  ++ G+ P  V   + G+G++G V+VL G +  + 
Subjt:  MSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLTGPLRGDG

Query:  KDGGGRTVTVILPEKQLPDLIDELQK-QWEIAK
             R VTV LP       +DE++K +WE+ K
Subjt:  KDGGGRTVTVILPEKQLPDLIDELQK-QWEIAK

AT4G13840.1 HXXXD-type acyl-transferase family protein | 2.2e-63 | 35.33
Query:  GKNNTLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRI
        G+    +  ++ S+V   + T      E T +DLAMKLHY++  Y +  +E  R+LT+  LK+ +F L +Q    +GR  RR  D  R +IKCND G R 
Subjt:  GKNNTLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRI

Query:  VEADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQAGLI
        VE  C  +++EWLS + D      D  LV    IGP+L FSPL ++Q+TRFKCGGL +G+SW +++GD FS     N W   IT ++      P+     
Subjt:  VEADCEKSIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQAGLI

Query:  WPSRSTKLSTP-PVKRLDPTGDLWIGSIDCKMATRSFRIK-AEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGE
        + S +  +  P  +KR++P GDLW+   D K+A   F +  A+Q+       G +   +   FE +A I WK +AK+R+E     T++I      + +  
Subjt:  WPSRSTKLSTP-PVKRLDPTGDLWIGSIDCKMATRSFRIK-AEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGE

Query:  IPRNGMEMSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLT
          RN   +S V  DFPV EA   EL + + + K DE   IEE+ E      DF+ YGA+LTF+DL   ++Y  ++ G+ P  V   + G+GE G+V+V  
Subjt:  IPRNGMEMSGVEADFPVDEAAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLT

Query:  GPLRGDGKDGGGRTVTVILPEKQLPDLIDELQK
             +      R VTV LPE+++  +  E +K
Subjt:  GPLRGDGKDGGGRTVTVILPEKQLPDLIDELQK

AT4G24510.1 HXXXD-type acyl-transferase family protein | 2.0e-77 | 39.72
Query:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLF---SLLEQYYVVSGRIRRRIEDGDRA-----FIKCNDSGV
        ++ ++ SSVVPA   G++K R+LT +DLAMKLHY+R VYFFKG+   R+ T+ D+K  +F   SLL+ Y+ VSGRIR    D D +     +I+CNDSG+
Subjt:  ISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLF---SLLEQYYVVSGRIRRRIEDGDRA-----FIKCNDSGV

Query:  RIVEADCEK-SIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQA
        R+VEA+ E+ ++++WL ++ D    HR   LV    +GPDL FSPL F+Q+T+FKCGGL +G+SW H+LGD+FSASTF+   G  ++   P   + P   
Subjt:  RIVEADCEK-SIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWG-YITNDRPANQLRPAQA

Query:  GLIWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREG
         L   +R+       ++++D  G+ W+ +  CKM    F      +D +++     R   FS  + + A+ WKSL  IR E +++  I+I   K   +  
Subjt:  GLIWPSRSTKLSTPPVKRLDPTGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREG

Query:  EIPRNGMEMSGVEADFPVDE-AAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLV
              + +S VE +   DE     ELA LI  +K +E G I+ M+E++K  SDF  YGA LTFV+L+E ++Y  E+ G +P  VNY I GVG+ GVVLV
Subjt:  EIPRNGMEMSGVEADFPVDE-AAEGELAELIVKKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLV

Query:  LTGPLRGDGKDGGGRTVTVILPEKQLPDLIDEL
                 K    R V+V++PE+ L  L +E+
Subjt:  LTGPLRGDGKDGGGRTVTVILPEKQLPDLIDEL


Sequences
CDS sequence
ATGGACGGCGGTAAAAACAATACTCTAATTTCCGAGTTAAAATTTTCGTCGGTGGTTCCGGCAAAGGCGACCGGAGACGACAAGGTCCGGGAACTGACGGCGATCGATCT
GGCGATGAAGCTTCATTACATAAGAGGCGTTTATTTCTTCAAAGGGAGCGAAGAGGTGAGAAATTTGACGATTTATGACCTAAAAAAACCTTTGTTCTCGTTGTTGGAGC
AATACTACGTCGTTTCGGGGAGGATTCGAAGGAGAATCGAAGATGGAGATCGGGCGTTCATTAAGTGTAATGATAGTGGAGTGAGAATTGTGGAAGCAGATTGTGAGAAA
AGTATTGATGAATGGCTTTCGATTGAGGGAGATGATAAATTTTCGCATCGCGATGGCTGCTTGGTTCCTACTCAAGCCATTGGTCCTGATCTTGGATTCTCCCCTCTTGC
TTTCATCCAGCTGACTCGGTTCAAGTGCGGCGGACTCTCCGTGGGCATTAGTTGGACTCACGTTCTCGGCGATATCTTCTCCGCCTCCACCTTCATCAACGCATGGGGTT
ACATCACGAACGACCGCCCGGCCAACCAACTCCGTCCGGCGCAGGCTGGCCTTATTTGGCCGTCCAGATCAACCAAACTGTCCACACCGCCGGTCAAGAGGCTCGACCCG
ACCGGAGACCTCTGGATCGGGTCAATCGACTGCAAAATGGCGACGCGGTCATTTCGAATCAAGGCGGAGCAATTGGATCGAATCTTGAGCGTTGTCGGCCGGAATCGAGC
GGTGAACTTCTCAACTTTCGAAGCTATTGCTGCGATTTTCTGGAAATCTCTGGCGAAAATACGGCTTGAGGACTCGGATTCGAGGACGATCTCGATCTATTCGACGAAAT
GCCCTAACAGAGAGGGTGAAATTCCGAGGAACGGAATGGAGATGAGCGGCGTCGAGGCCGATTTTCCTGTCGATGAAGCGGCGGAAGGTGAATTAGCGGAGCTGATTGTG
AAGAAGAAAATTGATGAGGGCGGAGAAATCGAGGAAATGGTGGAGAAAGAAAAGGAGGAATCAGACTTCATAGCATACGGAGCGAGATTGACGTTTGTTGATTTGGAAGA
AGCGAATATTTACGGCTTCGAATTGGAAGGGCAGAGGCCAATTCATGTGAATTATGAGATTGGGGGAGTTGGTGAAAACGGCGTCGTTTTGGTACTCACTGGACCGCTGC
GTGGCGACGGAAAAGACGGCGGTGGCCGAACGGTGACGGTTATCTTGCCGGAGAAACAGCTGCCGGACCTTATCGATGAACTGCAGAAGCAGTGGGAGATCGCAAAGATT
CAGTCGGTTTGCACTTGCAGTGAGCTGCACTTCTGTGCCTGCGTGTCTATCATTGAGCAGTGCACTCTCACCACCGTCGCCGATAAGTCCGGATTTCTTTTGCAACCGCA
ACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAATTTCCGAGAAGCGATTTCAGTCTCATCGGCGACGGTCACCGACGCTGCACCTATATTCGGTAAGAGCAATTT
TTACTTTGCTTCGAGAGGTGGCCCGGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGAAGAACACGTCCACTGGGATTTCTAATGTTTGGGAGTATCAGTTGTTA
TGGCGGCATTTGCCTTATCGTCACACGTTACTGGAGAACATGGATTCTGTTACTGATCCACTGGATTATGATAGTGACTTAGATTTTGAAATAGAACCTTTTCCATTTGT
CAGCAGTGAGTCCTTGAATGAAGCTGCAGCATGTGTAAAGGTACTGATTGCTAATTGCATACCAACTGAGTCAGATGTTCCAAGTAGTTCTGCAGTTGAGGCCCCATTGA
CTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGGGATGTCTGTTACAATTCCACTTTCCATTCAGAGACAG
CCGATTCCATTGCCAGTAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTGCTAATGCAGCTTCTCGAAAAAGAAAAAAACCTTGGTCGAAGGCAGAGGATTTGGA
ATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGGGAATATCTTGAAAGGAGACTTCAAGGGGGATAGAACTGCTTCACAGCTATCTCAGAATGTGGGAGCTA
GCACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGAATAACTCAAAAACAGCAAATTCAAACATAAAC
AGTAGCATTGTTTCTTCTGCAAGTGGTGCCGAAGCTTCAGTTCAAATGCAGAATCAGTCTCCACATATTTCCATTCCTTCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTC
AGCAGTGAAATCTGGGATCAACACTTCCAAAAATTCATTGATGATGAAGTCTACTCACAATTCTGATTCTATAGTTAGAGCAACTGCAGTAGCTGCAGGGGCCCGAATCG
TTTCTCCGTCTGATACTGTATCTCTACTGAAAGCGACGCAAACGAGAAATGCCATCCGCATAAAGTCCAAATGTGTTTCATCAATCCAATCACCCGTGGTTGGGAATGCA
CCAATGCACTTGGATGCACGCCCCAGTGTACATTATATTTCCACAGGAAAAACACCAACTCCAGGCTCAAACTATGTGAGTGGTAAATCTACTATGGTAGGCAATAACTC
AATGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTGTACTGCTATTTTGACAAACCCGCCATCAGACCGAGTAAGCCCAACAACTGAGTCTCCATTGAAGCAAGAGG
TTAACAGTTCAGAAGAACGCAAAATTGCCAAGCCAATCATTACTGCAAAAGAGGAGTTTCGAGAAAACAGCGTGGCAAATGATGTCAAGATTAGGGGCTGA
mRNA sequence
AGGACACGAAGCTCCTTCACTACACCACAAGTCTCCGCCATTCACAATCTATAAATTCAACTTCTCTGATCACTCGTAGTCCAGCCAATTTCATAAAATTCCTCTTCGCG
TTTCCAGTTTTACCCCTCCACGGCGGCGCAATATGGACGGCGGTAAAAACAATACTCTAATTTCCGAGTTAAAATTTTCGTCGGTGGTTCCGGCAAAGGCGACCGGAGAC
GACAAGGTCCGGGAACTGACGGCGATCGATCTGGCGATGAAGCTTCATTACATAAGAGGCGTTTATTTCTTCAAAGGGAGCGAAGAGGTGAGAAATTTGACGATTTATGA
CCTAAAAAAACCTTTGTTCTCGTTGTTGGAGCAATACTACGTCGTTTCGGGGAGGATTCGAAGGAGAATCGAAGATGGAGATCGGGCGTTCATTAAGTGTAATGATAGTG
GAGTGAGAATTGTGGAAGCAGATTGTGAGAAAAGTATTGATGAATGGCTTTCGATTGAGGGAGATGATAAATTTTCGCATCGCGATGGCTGCTTGGTTCCTACTCAAGCC
ATTGGTCCTGATCTTGGATTCTCCCCTCTTGCTTTCATCCAGCTGACTCGGTTCAAGTGCGGCGGACTCTCCGTGGGCATTAGTTGGACTCACGTTCTCGGCGATATCTT
CTCCGCCTCCACCTTCATCAACGCATGGGGTTACATCACGAACGACCGCCCGGCCAACCAACTCCGTCCGGCGCAGGCTGGCCTTATTTGGCCGTCCAGATCAACCAAAC
TGTCCACACCGCCGGTCAAGAGGCTCGACCCGACCGGAGACCTCTGGATCGGGTCAATCGACTGCAAAATGGCGACGCGGTCATTTCGAATCAAGGCGGAGCAATTGGAT
CGAATCTTGAGCGTTGTCGGCCGGAATCGAGCGGTGAACTTCTCAACTTTCGAAGCTATTGCTGCGATTTTCTGGAAATCTCTGGCGAAAATACGGCTTGAGGACTCGGA
TTCGAGGACGATCTCGATCTATTCGACGAAATGCCCTAACAGAGAGGGTGAAATTCCGAGGAACGGAATGGAGATGAGCGGCGTCGAGGCCGATTTTCCTGTCGATGAAG
CGGCGGAAGGTGAATTAGCGGAGCTGATTGTGAAGAAGAAAATTGATGAGGGCGGAGAAATCGAGGAAATGGTGGAGAAAGAAAAGGAGGAATCAGACTTCATAGCATAC
GGAGCGAGATTGACGTTTGTTGATTTGGAAGAAGCGAATATTTACGGCTTCGAATTGGAAGGGCAGAGGCCAATTCATGTGAATTATGAGATTGGGGGAGTTGGTGAAAA
CGGCGTCGTTTTGGTACTCACTGGACCGCTGCGTGGCGACGGAAAAGACGGCGGTGGCCGAACGGTGACGGTTATCTTGCCGGAGAAACAGCTGCCGGACCTTATCGATG
AACTGCAGAAGCAGTGGGAGATCGCAAAGATTCAGTCGGTTTGCACTTGCAGTGAGCTGCACTTCTGTGCCTGCGTGTCTATCATTGAGCAGTGCACTCTCACCACCGTC
GCCGATAAGTCCGGATTTCTTTTGCAACCGCAACGCTTAACATTCGTTTTCCGGTTTCGTCTTCAAACAATTTCCGAGAAGCGATTTCAGTCTCATCGGCGACGGTCACC
GACGCTGCACCTATATTCGGTAAGAGCAATTTTTACTTTGCTTCGAGAGGTGGCCCGGGTTTCGGGAGTGAGAATTGATTGGGACAAATTGGTGAAGAACACGTCCACTG
GGATTTCTAATGTTTGGGAGTATCAGTTGTTATGGCGGCATTTGCCTTATCGTCACACGTTACTGGAGAACATGGATTCTGTTACTGATCCACTGGATTATGATAGTGAC
TTAGATTTTGAAATAGAACCTTTTCCATTTGTCAGCAGTGAGTCCTTGAATGAAGCTGCAGCATGTGTAAAGGTACTGATTGCTAATTGCATACCAACTGAGTCAGATGT
TCCAAGTAGTTCTGCAGTTGAGGCCCCATTGACTATAGGTATATCCAATAGTCAAGCATCTACAGACAATCTTGAAAATCCTCAATCTGCTTGTTTGATGCAAGGGATGT
CTGTTACAATTCCACTTTCCATTCAGAGACAGCCGATTCCATTGCCAGTAGCAACTGAAGTATTTGATGTGAATGGAGCAGCTGGTGCTAATGCAGCTTCTCGAAAAAGA
AAAAAACCTTGGTCGAAGGCAGAGGATTTGGAATTGATGGCTGCTGTGGAGAAGTGTGGTGAAGGAAACTGGGGGAATATCTTGAAAGGAGACTTCAAGGGGGATAGAAC
TGCTTCACAGCTATCTCAGAATGTGGGAGCTAGCACCACAAGTACTACCCAGAAAGCTCAGATTGATGCTGCACACCGTGCATTGTCCTTTGCCCTTGATTTGCCTGTGA
ATAACTCAAAAACAGCAAATTCAAACATAAACAGTAGCATTGTTTCTTCTGCAAGTGGTGCCGAAGCTTCAGTTCAAATGCAGAATCAGTCTCCACATATTTCCATTCCT
TCAAGGCCGCTGCTGGTAGAGCCTTTGCCTTCAGCAGTGAAATCTGGGATCAACACTTCCAAAAATTCATTGATGATGAAGTCTACTCACAATTCTGATTCTATAGTTAG
AGCAACTGCAGTAGCTGCAGGGGCCCGAATCGTTTCTCCGTCTGATACTGTATCTCTACTGAAAGCGACGCAAACGAGAAATGCCATCCGCATAAAGTCCAAATGTGTTT
CATCAATCCAATCACCCGTGGTTGGGAATGCACCAATGCACTTGGATGCACGCCCCAGTGTACATTATATTTCCACAGGAAAAACACCAACTCCAGGCTCAAACTATGTG
AGTGGTAAATCTACTATGGTAGGCAATAACTCAATGAAGGCTGTCTCACCAAAAGTTCTGCATAATCGTTGTACTGCTATTTTGACAAACCCGCCATCAGACCGAGTAAG
CCCAACAACTGAGTCTCCATTGAAGCAAGAGGTTAACAGTTCAGAAGAACGCAAAATTGCCAAGCCAATCATTACTGCAAAAGAGGAGTTTCGAGAAAACAGCGTGGCAA
ATGATGTCAAGATTAGGGGCTGACCAAACATAGAAGGTAGCAATAGCAAGAACAAATCTATACCATATAATCACGGGGGGCATAGCACAGATAATGAACAGCTGAGCTCG
ACAAGCATGTTTAGCAGATGAATGCACAGAAATATCAGTTGGTGGTGTGGGGTCAATGCAAACGCCCAATGACAAGACTAAAATTTGGAAGCTCATATTCTTCAACAAGA
AAAGGAAATTGTATAGCTACTGTATATCTGTTATTGAAGTTGTGGACTCGCACATGAGAAGAGCGATGGCTTCTGCCTCTTTAACTTAACGCCAAACAAGAAAGACCGTA
AAGGAATTCGTGTTCGGCTCAACGATTTCGATTGTTGCATGCTCTTGCCTGCTGAAATACCAAATTCTTGATCAAGTCTCTCCAAAGCCTTCTCAACTTGAACTTGTAGT
ACTCTTACCCGACCCTGACCGACTTGTAACTCATCGGCTATCTTCCTATTTTCTTGCTTCATGTTGAGTACTTCACCTTGAAACTTTG
Protein sequence
MDGGKNNTLISELKFSSVVPAKATGDDKVRELTAIDLAMKLHYIRGVYFFKGSEEVRNLTIYDLKKPLFSLLEQYYVVSGRIRRRIEDGDRAFIKCNDSGVRIVEADCEK
SIDEWLSIEGDDKFSHRDGCLVPTQAIGPDLGFSPLAFIQLTRFKCGGLSVGISWTHVLGDIFSASTFINAWGYITNDRPANQLRPAQAGLIWPSRSTKLSTPPVKRLDP
TGDLWIGSIDCKMATRSFRIKAEQLDRILSVVGRNRAVNFSTFEAIAAIFWKSLAKIRLEDSDSRTISIYSTKCPNREGEIPRNGMEMSGVEADFPVDEAAEGELAELIV
KKKIDEGGEIEEMVEKEKEESDFIAYGARLTFVDLEEANIYGFELEGQRPIHVNYEIGGVGENGVVLVLTGPLRGDGKDGGGRTVTVILPEKQLPDLIDELQKQWEIAKI
QSVCTCSELHFCACVSIIEQCTLTTVADKSGFLLQPQRLTFVFRFRLQTISEKRFQSHRRRSPTLHLYSVRAIFTLLREVARVSGVRIDWDKLVKNTSTGISNVWEYQLL
WRHLPYRHTLLENMDSVTDPLDYDSDLDFEIEPFPFVSSESLNEAAACVKVLIANCIPTESDVPSSSAVEAPLTIGISNSQASTDNLENPQSACLMQGMSVTIPLSIQRQ
PIPLPVATEVFDVNGAAGANAASRKRKKPWSKAEDLELMAAVEKCGEGNWGNILKGDFKGDRTASQLSQNVGASTTSTTQKAQIDAAHRALSFALDLPVNNSKTANSNIN
SSIVSSASGAEASVQMQNQSPHISIPSRPLLVEPLPSAVKSGINTSKNSLMMKSTHNSDSIVRATAVAAGARIVSPSDTVSLLKATQTRNAIRIKSKCVSSIQSPVVGNA
PMHLDARPSVHYISTGKTPTPGSNYVSGKSTMVGNNSMKAVSPKVLHNRCTAILTNPPSDRVSPTTESPLKQEVNSSEERKIAKPIITAKEEFRENSVANDVKIRG