; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC03g0065 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC03g0065
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationMC03:945561..948003
RNA-Seq ExpressionMC03g0065
SyntenyMC03g0065
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7037005.1 putative N-acetyltransferase HLS1 [Cucurbita argyrosperma subsp. argyrosperma]1.44e-26590.23Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        M Y EEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
         P FRRRGIG +LVRRLEEWF ANDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPY+LPSNIQIS LKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID VLKHKLSLG+WVAYYKDD     KFET G  SEM IPK WAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG    + KLV+ALCQ+VHNMAA ARDCKVIVTEIGGED+LR+EIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

XP_008457342.1 PREDICTED: probable N-acetyltransferase HLS1 [Cucumis melo]1.63e-26690.98Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MG  EEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
         P FRRRGIGCSLVRRLEEWF  NDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR Y+LPSNIQI+RLKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID VLKHKLSLGTWVAYYKDD     KFET  S SE+ IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG  +  GKLVRALCQ+VHNMAA ARDCKVIVTEIGGED+LREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

XP_011658711.1 probable N-acetyltransferase HLS1 [Cucumis sativus]4.01e-26791.23Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MG  EEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
         P FRRRGIGCSLVRRLEEWF  NDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR Y LPSNIQI+RLKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID VLKHKLSLGTWVAYYKDD     KFET GS SE+ IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG  +  GKLVRALCQ+VHNMAA ARDCKVIVTEIGGED+LREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

XP_022150962.1 probable N-acetyltransferase HLS1 [Momordica charantia]1.31e-294100Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
        VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGV
        HDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGV
Subjt:  HDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGV

Query:  HREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        HREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  HREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

XP_038894892.1 probable N-acetyltransferase HLS1 [Benincasa hispida]8.45e-26991.73Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MG  EEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
        VP FRRRGIGCSLVRRLEEWFA NDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR Y+LPSNIQI+RLKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID+VLKHKLSLGTWVAYYKDD     KFET GS SE+AIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG  +  GKLVRALCQ+VHNMAA ARDCKVIVTEIGGED+LREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRP LFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

TrEMBL top hitse value%identityAlignment
A0A0A0M0V6 N-acetyltransferase domain-containing protein1.94e-26791.23Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MG  EEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
         P FRRRGIGCSLVRRLEEWF  NDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR Y LPSNIQI+RLKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID VLKHKLSLGTWVAYYKDD     KFET GS SE+ IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG  +  GKLVRALCQ+VHNMAA ARDCKVIVTEIGGED+LREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

A0A1S3C5Y9 probable N-acetyltransferase HLS17.90e-26790.98Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MG  EEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
         P FRRRGIGCSLVRRLEEWF  NDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR Y+LPSNIQI+RLKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID VLKHKLSLGTWVAYYKDD     KFET  S SE+ IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG  +  GKLVRALCQ+VHNMAA ARDCKVIVTEIGGED+LREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

A0A5D3BD96 Putative N-acetyltransferase HLS17.90e-26790.98Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MG  EEILIIRSYDGQSADR RVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
         P FRRRGIGCSLVRRLEEWF  NDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYR Y+LPSNIQI+RLKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID VLKHKLSLGTWVAYYKDD     KFET  S SE+ IPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG  +  GKLVRALCQ+VHNMAA ARDCKVIVTEIGGED+LREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

A0A6J1DC73 probable N-acetyltransferase HLS16.35e-295100Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
        VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGV
        HDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGV
Subjt:  HDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGV

Query:  HREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        HREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  HREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

A0A6J1K8Q3 probable N-acetyltransferase HLS16.99e-26690.23Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV
        M Y EEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQ+VGVIQGSIK+VTVHQAPK RAKVGYVLGLRV
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRV

Query:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP
         P FRRRGIG +LVRRLEEWF ANDVDY YMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPY+LPSNIQIS LKVDVAEFLY K+MASTEFFP
Subjt:  VPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFP

Query:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY
        HDID VLKHKLSLG+WVAYYKDD     KFET G  SEM IPK WAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKV+DKIFPCLKLPSIPDFYEPFGFY
Subjt:  HDIDRVLKHKLSLGTWVAYYKDD-----KFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFY

Query:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
        FMYGVHREG    + KLV+ALCQ+VHNMAA ARDCKVIVTEIGGED+LR+EIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV
Subjt:  FMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV

SwissProt top hitse value%identityAlignment
O64815 Probable N-acetyltransferase HLS1-like7.1e-9545.91Show/hide
Query:  IRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV----DNQIVGVIQGSIKIVT----------VHQAPKG-------
        +R YD  S D A VED+ERRCEVGP+ ++ LFTD +GDPICR+R+SP Y MLVAE+      ++VG+I+G IK VT           H   +        
Subjt:  IRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV----DNQIVGVIQGSIKIVT----------VHQAPKG-------

Query:  -RAKVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEF
           K+ Y+LGLRV P  RR+GIG  LV+ +E+WF+ N  +Y+Y ATE DN ASV LF  K GY  FR P+ILVNPV  +R  N+   + + +L+   AE 
Subjt:  -RAKVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEF

Query:  LYGKYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEM---------AIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFP
        LY    ++TEFFP DID VL +KLSLGT+VA  +   +   GSGS             P SWA+LSVWN  + F+L +  A     + +++++++DK  P
Subjt:  LYGKYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEM---------AIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFP

Query:  CLKLPSIPDFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKE-ARNSLHELT
         LK+PSIP  + PFG +FMYG+  EG  +   K+V+ALC H HN+A E   C V+  E+ GE+ LR  IPHWK+LSC EDLWCIK L ++ +  S+ + T
Subjt:  CLKLPSIPDFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKE-ARNSLHELT

Query:  KTPPTTRPALFVDPRE
        K+PP    ++FVDPRE
Subjt:  KTPPTTRPALFVDPRE

Q42381 Probable N-acetyltransferase HLS12.9e-9648.04Show/hide
Query:  IIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV---DNQIVGVIQGSIKIVTVHQ------------APKGRAKVG
        ++R YD  + D   VED+ERRCEVGPS ++ LFTD +GDPICRIR+SP Y MLVAE+     +IVG+I+G IK VT  Q                  K+ 
Subjt:  IIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV---DNQIVGVIQGSIKIVTVHQ------------APKGRAKVG

Query:  YVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYM
        YVLGLRV P  RR+GIG  LV+ +EEWF  N  +Y+Y+ATE DN+ASV LF  K GY+ FR P+ILVNPV  +R  N+   + + +L+   AE LY    
Subjt:  YVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYM

Query:  ASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAI-------PKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIP
        ++TEFFP DID VL +KLSLGT+VA  +   +  +GSGS           P+SWA+LSVWN  + F L +  A     +  ++++V+DK  P LKLPSIP
Subjt:  ASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAI-------PKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIP

Query:  DFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNS-LHELTKTPPTTRP
          +EPFG +FMYG+  EG  +   K+V++LC H HN+ A+A  C V+  E+ GED LR  IPHWK+LSC EDLWCIK L  +  +  + + TK+PP    
Subjt:  DFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNS-LHELTKTPPTTRP

Query:  ALFVDPRE
        ++FVDPRE
Subjt:  ALFVDPRE

Arabidopsis top hitse value%identityAlignment
AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein5.1e-9645.91Show/hide
Query:  IRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV----DNQIVGVIQGSIKIVT----------VHQAPKG-------
        +R YD  S D A VED+ERRCEVGP+ ++ LFTD +GDPICR+R+SP Y MLVAE+      ++VG+I+G IK VT           H   +        
Subjt:  IRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV----DNQIVGVIQGSIKIVT----------VHQAPKG-------

Query:  -RAKVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEF
           K+ Y+LGLRV P  RR+GIG  LV+ +E+WF+ N  +Y+Y ATE DN ASV LF  K GY  FR P+ILVNPV  +R  N+   + + +L+   AE 
Subjt:  -RAKVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEF

Query:  LYGKYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEM---------AIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFP
        LY    ++TEFFP DID VL +KLSLGT+VA  +   +   GSGS             P SWA+LSVWN  + F+L +  A     + +++++++DK  P
Subjt:  LYGKYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEM---------AIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFP

Query:  CLKLPSIPDFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKE-ARNSLHELT
         LK+PSIP  + PFG +FMYG+  EG  +   K+V+ALC H HN+A E   C V+  E+ GE+ LR  IPHWK+LSC EDLWCIK L ++ +  S+ + T
Subjt:  CLKLPSIPDFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKE-ARNSLHELT

Query:  KTPPTTRPALFVDPRE
        K+PP    ++FVDPRE
Subjt:  KTPPTTRPALFVDPRE

AT2G23060.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.2e-7643.72Show/hide
Query:  MLVAEV----DNQIVGVIQGSIKIVT----------VHQAPKG--------RAKVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDN
        MLVAE+      ++VG+I+G IK VT           H   +           K+ Y+LGLRV P  RR+GIG  LV+ +E+WF+ N  +Y+Y ATE DN
Subjt:  MLVAEV----DNQIVGVIQGSIKIVT----------VHQAPKG--------RAKVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDN

Query:  EASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEM---
         ASV LF  K GY  FR P+ILVNPV  +R  N+   + + +L+   AE LY    ++TEFFP DID VL +KLSLGT+VA  +   +   GSGS     
Subjt:  EASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEM---

Query:  ------AIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEAR
                P SWA+LSVWN  + F+L +  A     + +++++++DK  P LK+PSIP  + PFG +FMYG+  EG  +   K+V+ALC H HN+A E  
Subjt:  ------AIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEAR

Query:  DCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKE-ARNSLHELTKTPPTTRPALFVDPRE
         C V+  E+ GE+ LR  IPHWK+LSC EDLWCIK L ++ +  S+ + TK+PP    ++FVDPRE
Subjt:  DCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKE-ARNSLHELTKTPPTTRPALFVDPRE

AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.0e-11354.52Show/hide
Query:  EEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRVVPMF
        +E ++IR YD +  DR ++  +E+ CE+G   +  LFTDT+GDPICRIRNSP + MLVA V N++VG IQGS+K V  H       +VGYVLGLRVVP +
Subjt:  EEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRVVPMF

Query:  RRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYM-ASTEFFPHDI
        RRRGIG  LVR+LEEWF +++ DY YMATEKDNEAS  LFI +LGY  FR PAILVNPV   R   LPS+I I +LKV  AE LY + + A+TEFFP DI
Subjt:  RRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYM-ASTEFFPHDI

Query:  DRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGVHRE
        +++L++KLS+GTWVAYY           + +   +SWAMLSVW+S +VFKLR+ +APLS L+ T+ SK+       L L  +PD + PFGFYF+YGVH E
Subjt:  DRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGVHRE

Query:  GAASTSGKLVRALCQHVHNMAA--EARDCKVIVTEI----GGEDALREEIPHWKLLSCPEDLWCIKALKKEARN-SLHELTKTPPTTRPALFVDPREV
        G     GKLVRALC+HVHNMAA  +   CKV+V E+     G+D+L+  IPHWK+LSC +D+WCIK LK E     L E +K    +R +LFVDPREV
Subjt:  GAASTSGKLVRALCQHVHNMAA--EARDCKVIVTEI----GGEDALREEIPHWKLLSCPEDLWCIKALKKEARN-SLHELTKTPPTTRPALFVDPREV

AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.1e-9748.04Show/hide
Query:  IIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV---DNQIVGVIQGSIKIVTVHQ------------APKGRAKVG
        ++R YD  + D   VED+ERRCEVGPS ++ LFTD +GDPICRIR+SP Y MLVAE+     +IVG+I+G IK VT  Q                  K+ 
Subjt:  IIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEV---DNQIVGVIQGSIKIVTVHQ------------APKGRAKVG

Query:  YVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYM
        YVLGLRV P  RR+GIG  LV+ +EEWF  N  +Y+Y+ATE DN+ASV LF  K GY+ FR P+ILVNPV  +R  N+   + + +L+   AE LY    
Subjt:  YVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYM

Query:  ASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAI-------PKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIP
        ++TEFFP DID VL +KLSLGT+VA  +   +  +GSGS           P+SWA+LSVWN  + F L +  A     +  ++++V+DK  P LKLPSIP
Subjt:  ASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAI-------PKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIP

Query:  DFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNS-LHELTKTPPTTRP
          +EPFG +FMYG+  EG  +   K+V++LC H HN+ A+A  C V+  E+ GED LR  IPHWK+LSC EDLWCIK L  +  +  + + TK+PP    
Subjt:  DFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNS-LHELTKTPPTTRP

Query:  ALFVDPRE
        ++FVDPRE
Subjt:  ALFVDPRE

AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein7.3e-8744.06Show/hide
Query:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVT-----VHQAPK-----GRA
        MG G  ++++R YD    D   VE+LE  CEVG      L  D MGDP+ RIR SP + MLVAE+ N+IVG+I+G+IK+VT     + QA          
Subjt:  MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVT-----VHQAPK-----GRA

Query:  KVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYG
        K+ +V GLRV P +RR GIG  LV+RLEEWF  ND  Y+Y+ TE DN ASVKLF  K GY+ FR P  LVNPV ++R   +   ++I +L    AE LY 
Subjt:  KVGYVLGLRVVPMFRRRGIGCSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYG

Query:  KYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYE
           ++TEFFP DI+ +L +KLSLGT++A  +        SGS      SWA++S+WNS +V++L++  A     +  +S++V D  FP LK+PS P+ ++
Subjt:  KYMASTEFFPHDIDRVLKHKLSLGTWVAYYKDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYE

Query:  PFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVD
         F  +FMYG+  EG    + ++V ALC H HN+A ++  C V+  E+   + LR  IPHWK+LS PEDLWC+K L+ +  +   + TK+PP    ++FVD
Subjt:  PFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDCKVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVD

Query:  PREV
        PRE+
Subjt:  PREV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATACGGAGAAGAGATTTTGATAATAAGAAGCTACGATGGGCAATCTGCAGATAGAGCTAGAGTGGAAGATCTAGAGAGAAGATGCGAGGTAGGCCCATCTGAACG
AGTTTTTCTCTTCACAGACACCATGGGTGACCCCATTTGTAGGATCAGAAATAGTCCCTTATACAAGATGCTGGTGGCAGAGGTGGATAACCAAATAGTTGGAGTAATTC
AGGGCTCGATAAAGATCGTTACGGTTCACCAAGCGCCCAAGGGCCGGGCCAAGGTTGGGTATGTTTTAGGCCTTCGCGTTGTTCCGATGTTCCGCCGCCGAGGGATTGGC
TGCAGCCTTGTCCGACGATTGGAGGAGTGGTTTGCTGCCAACGACGTGGACTACACCTATATGGCGACAGAGAAAGACAATGAAGCCTCCGTCAAGCTCTTCATCAACAA
GCTCGGGTACACCAATTTTAGAGTTCCGGCGATCCTCGTCAACCCGGTGAAACATTACCGCCCTTATAACCTCCCTTCCAACATCCAAATTTCCCGCCTGAAAGTCGATG
TGGCCGAGTTTCTCTACGGAAAATACATGGCCTCCACCGAGTTCTTTCCCCATGACATCGACCGCGTGCTCAAACACAAGCTCAGCCTTGGAACATGGGTGGCTTATTAC
AAAGACGATAAATTCGAAACAACTGGCAGCGGATCAGAAATGGCAATTCCAAAGAGCTGGGCAATGCTGAGTGTATGGAATAGTGGGGAGGTGTTCAAACTGCGATTGGG
AAAGGCACCATTGTCATGCTTGATATACACAGAGAGCTCGAAGGTGATGGACAAGATCTTCCCATGTCTGAAACTTCCCTCAATACCGGACTTCTACGAGCCGTTCGGAT
TCTACTTCATGTATGGGGTTCACCGGGAGGGGGCAGCATCGACATCAGGGAAGCTGGTGAGGGCACTGTGCCAGCACGTGCACAACATGGCGGCGGAGGCAAGGGACTGT
AAAGTGATAGTTACAGAGATTGGAGGAGAAGACGCACTGAGAGAAGAGATTCCACATTGGAAATTGCTGTCATGCCCTGAAGATTTATGGTGCATAAAGGCATTGAAGAA
AGAAGCAAGAAATAGCCTCCATGAGTTGACAAAAACCCCACCAACTACAAGACCAGCCCTTTTTGTAGACCCAAGAGAGGTATGA
mRNA sequenceShow/hide mRNA sequence
GAAAAAAATGAGAGAGAGAGAGAAAGAGAAGGATCCCACAAATGGAACCCCACCAAATTTTGGTATAAAATGCCAATCATCAGACTCCAAAATTTCCAAGCCCCCCAAGT
TTTTTTCCCCCTCCACAAAAGAGAAAGCTAAGTAGCTAGCTGCTAGCAATATACCTTTAAAAGGGTTGATTGAACTTGCAGTGTGTGTATATATATGTAATAGCTAGAGA
ACTGCTGTTTTTTTTTTTTTCTTCTTTTCTGTTTTTTCTGTTTGTTTCCCACGAAAAGTGGGGAGATTTTTCTTTTAAAGAAATGGGATACGGAGAAGAGATTTTGATAA
TAAGAAGCTACGATGGGCAATCTGCAGATAGAGCTAGAGTGGAAGATCTAGAGAGAAGATGCGAGGTAGGCCCATCTGAACGAGTTTTTCTCTTCACAGACACCATGGGT
GACCCCATTTGTAGGATCAGAAATAGTCCCTTATACAAGATGCTGGTGGCAGAGGTGGATAACCAAATAGTTGGAGTAATTCAGGGCTCGATAAAGATCGTTACGGTTCA
CCAAGCGCCCAAGGGCCGGGCCAAGGTTGGGTATGTTTTAGGCCTTCGCGTTGTTCCGATGTTCCGCCGCCGAGGGATTGGCTGCAGCCTTGTCCGACGATTGGAGGAGT
GGTTTGCTGCCAACGACGTGGACTACACCTATATGGCGACAGAGAAAGACAATGAAGCCTCCGTCAAGCTCTTCATCAACAAGCTCGGGTACACCAATTTTAGAGTTCCG
GCGATCCTCGTCAACCCGGTGAAACATTACCGCCCTTATAACCTCCCTTCCAACATCCAAATTTCCCGCCTGAAAGTCGATGTGGCCGAGTTTCTCTACGGAAAATACAT
GGCCTCCACCGAGTTCTTTCCCCATGACATCGACCGCGTGCTCAAACACAAGCTCAGCCTTGGAACATGGGTGGCTTATTACAAAGACGATAAATTCGAAACAACTGGCA
GCGGATCAGAAATGGCAATTCCAAAGAGCTGGGCAATGCTGAGTGTATGGAATAGTGGGGAGGTGTTCAAACTGCGATTGGGAAAGGCACCATTGTCATGCTTGATATAC
ACAGAGAGCTCGAAGGTGATGGACAAGATCTTCCCATGTCTGAAACTTCCCTCAATACCGGACTTCTACGAGCCGTTCGGATTCTACTTCATGTATGGGGTTCACCGGGA
GGGGGCAGCATCGACATCAGGGAAGCTGGTGAGGGCACTGTGCCAGCACGTGCACAACATGGCGGCGGAGGCAAGGGACTGTAAAGTGATAGTTACAGAGATTGGAGGAG
AAGACGCACTGAGAGAAGAGATTCCACATTGGAAATTGCTGTCATGCCCTGAAGATTTATGGTGCATAAAGGCATTGAAGAAAGAAGCAAGAAATAGCCTCCATGAGTTG
ACAAAAACCCCACCAACTACAAGACCAGCCCTTTTTGTAGACCCAAGAGAGGTATGAGAATATCAAAATCATATCAATATCAATAATAAATCCAAACTTAATTACTTAAT
CACAAAACGCAGCAAGAATCCAACATATATCAATTACTTAATCCCTTCTCTTCTCCAAAAAATCTCACCTTTTATGTACATATCATCATATATACACCAATAATAAAATG
AGTTGACTTTTGAATTTGCCCCTAAAAACTCTTACTTACCTTACCTAATTGTGTTTTAAGAATTTTGTTCAATGAAGGAGAGGACCATAAAGGAAAGAACCAAACAAAGA
TATTATATGAATTAAGATTTAAACCTAACTTACTGTAATTAAGATGATCCTTTTTTTCTACAAATGGAAATAACTAGGCTACACACGAGTGAGTAAATAGGGATCTTATT
CGAGAGAGAATTAATGGTGGATGGCGTGGGAAAGATAATTGGGTTGAAAAGTGAAAGTGTAACTTTAAAAGAGGGTTGGGGGAAAGTGTTGGGAGAGGGATTTAAACCCC
ACCTCATGTGTAAATGTAGCTTAGATAAGTCTATATCCAGAAATCCATGTTGAGTCGAGTGGTGGGAGCTTTTATGTTGTAACTTTCACAACGGTGACAAAAAATCTTGA
AAGATTTCCCTAATTAGGAGCCACCTTCGCAGTGTCCTCTGCAACTCTAAATAATAATACTTTTCGGGACAACTCAATCATCAGCCCAAAGGTGAAAGGTGGTCCTTTCT
TTATTATTCCACCTGGCTCCTACCCTTTTTCTTCCTTTCTTTCTCTCTTTCCTCTCTGCCACTCAGTAAAAACTATC
Protein sequenceShow/hide protein sequence
MGYGEEILIIRSYDGQSADRARVEDLERRCEVGPSERVFLFTDTMGDPICRIRNSPLYKMLVAEVDNQIVGVIQGSIKIVTVHQAPKGRAKVGYVLGLRVVPMFRRRGIG
CSLVRRLEEWFAANDVDYTYMATEKDNEASVKLFINKLGYTNFRVPAILVNPVKHYRPYNLPSNIQISRLKVDVAEFLYGKYMASTEFFPHDIDRVLKHKLSLGTWVAYY
KDDKFETTGSGSEMAIPKSWAMLSVWNSGEVFKLRLGKAPLSCLIYTESSKVMDKIFPCLKLPSIPDFYEPFGFYFMYGVHREGAASTSGKLVRALCQHVHNMAAEARDC
KVIVTEIGGEDALREEIPHWKLLSCPEDLWCIKALKKEARNSLHELTKTPPTTRPALFVDPREV