; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0700 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0700
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationMC02:5656175..5659183
RNA-Seq ExpressionMC02g0700
SyntenyMC02g0700
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035352.1 putative N-acetyltransferase HLS1-like protein [Cucurbita argyrosperma subsp. argyrosperma]4.30e-19569.11Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+GPSKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P    KVGYILGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIGSSLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+I+IQ+LKIEEAE IYKKHM + EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYG
        FPKDINSIL+N LSLGTW+A+Y       +S    +  +SWAVVSLWNSGEVFKLRLGKAP  W++YTKSL+ ++K+LP LKV  V D+F+ FGFYFVYG
Subjt:  FPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYG

Query:  VHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        +HHEG  S RLV  LC+YVHN+AL + +DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LKS   + + LLEW   PPNR LFVDPREV
Subjt:  VHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

XP_022148434.1 probable N-acetyltransferase HLS1 [Momordica charantia]6.03e-299100Show/hide
Query:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
        MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
Subjt:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV

Query:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
        APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
Subjt:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP

Query:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
        KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
Subjt:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH

Query:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
Subjt:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

XP_022947633.1 probable N-acetyltransferase HLS1-like [Cucurbita moschata]8.36e-19568.66Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+GPSKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P   AKVGY+LGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIGSSLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+IKIQ+LKIEEAE IYKKHM + EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF
        FPKDINSIL+N LSLGTW+A+Y       SA A         +  +SWAVVSLWNSGEVFKLRLGKAP  W++YTKSL+ ++K+LP LKV  V D+F+ F
Subjt:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF

Query:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR
        GFYFVYG+HHEG  S RLV  LC+YVHN+AL + +DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LKS   + + LLEW   PPNR LFVDPR
Subjt:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR

Query:  EV
        EV
Subjt:  EV

XP_023007288.1 probable N-acetyltransferase HLS1-like [Cucurbita maxima]2.52e-19669.15Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+G SKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P   AKVGYILGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIG SLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+IKIQ+LKIEEAE IYKKHMA+ EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF
        FPKDINSIL+N LSLGTW+A+Y       SA A         + P SWAVVSLWNSGEVFKLRLGKAP  W++YTKSLK +DK+LP LKV  V D+F+ F
Subjt:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF

Query:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR
        GFYFVYG+HHEG  S RLV  LC++VHN+AL + +DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LK   G+ + LLEW   PPNR LFVDPR
Subjt:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR

Query:  EV
        EV
Subjt:  EV

XP_038902314.1 probable N-acetyltransferase HLS1-like [Benincasa hispida]1.31e-19869.81Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAP-SFPAKVGYILG
        +   G VIR Y+     D+A+V+DLERRC++G SKRVFLFTD LGDPICRIRNSP+YKMLVAEW+KE+VGVIQGSIK V    HK  P     KVGYILG
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAP-SFPAKVGYILG

Query:  LRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAE
        LRVAPPYR RGIGS LVR LE WF  NDVDY CMAT KDNHASLNLFINN RYIKFRT RIL +PV NRPY I+ S+I IQ+LKIEEAEAIYKKHMA+ E
Subjt:  LRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAE

Query:  FFPKDINSILRNKLSLGTWMAYY--------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFE
        FFPKDI SIL+NKLSLGTWMA +        S   GG+    +  T +SWA+ SLWNSGEVFKLRLGKAP  W+IYTKSLK +DKILP  K+  V DFF+
Subjt:  FFPKDINSILRNKLSLGTWMAYY--------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFE

Query:  PFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALK--SRD-CKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKSNN----GDHNH---LLEWTK
        PFGFYFVYG+HHEGPFS RLV ALC++VHN+ALK  SRD CK IVTEIGG EDD LK EIPHWKLLSC +D WCIK L++NN     DH+H   +LEWT 
Subjt:  PFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALK--SRD-CKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKSNN----GDHNH---LLEWTK

Query:  APPNRALFVDPREV
        APPNR LFVDPREV
Subjt:  APPNRALFVDPREV

TrEMBL top hitse value%identityAlignment
A0A0A0KE16 N-acetyltransferase domain-containing protein2.08e-19467.87Show/hide
Query:  MGFKGLVIRSYD-----GQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAP-SFPAKVG
        M F G VIRSY+     GQF D+A+V+DLERRCE+G SKRVFLFTD LGDPICRIRNSP+YKMLVAE +KE+VGVIQGSIK V    HK  P     KVG
Subjt:  MGFKGLVIRSYD-----GQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAP-SFPAKVG

Query:  YILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM
        Y+LGLRVAPPYR RG+G++LVR LE WF  NDVDY CMA  KDNHASLNLFINN RYIKFRT RIL NPV N PY I+ S+IKIQ+LKIE+AEAIYKKHM
Subjt:  YILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM

Query:  ATAEFFPKDINSILRNKLSLGTWMA-----YYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFF
        A+ E FPKDI +IL+NKLSLGTWMA     +Y   +    +G  G   +SWA+VSLWNSGEVF+LRLGKAP AW+IYTKSLK +DKILP  K+  V +FF
Subjt:  ATAEFFPKDINSILRNKLSLGTWMA-----YYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFF

Query:  EPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKS--------NNGDHN-HLLE
        +PFGFYFVYG+HHEGPFS RLV ALC++VHNMA+   K  +CK IVTEI G EDD LK EIPHWKLLSC +D WCIK LKS        N+ DH+ H+LE
Subjt:  EPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKS--------NNGDHN-HLLE

Query:  WTKAPPNRALFVDPREV
        WT  PP R LFVDPREV
Subjt:  WTKAPPNRALFVDPREV

A0A5D3CAW1 Putative N-acetyltransferase HLS1-like3.54e-19467.38Show/hide
Query:  MGFKGLVIRSYD----GQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAP-SFPAKVGY
        M F G +IRSY+    GQ  D+A+V+DLERRCE+G SKRVFLFTD LGDPICRIRNSP+YKMLVAE +KE+VGVIQGSIK V    HK  P     KVGY
Subjt:  MGFKGLVIRSYD----GQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAP-SFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        ILGLRVAPPYR RGIG++LVR LE WF  NDVDY CMAT KDNHASLNLFINN RYIKFRT RIL NPV N PY+I+ S+IKIQ+L+IEEAEAIYKKHMA
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG
        + E FP+DI +IL+NKLSLGTWMA +            S+ AGG+   E+  + +SWA+VSLWNSGEVFKLRLGKAP  W+IYTKSLK +DKI P  K+ 
Subjt:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG

Query:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKS--------NNGDH
         V +FF+PFGFYFVYG+HHEGPFS RLV ALC++VHNMA+   K  +CK IVTEIGG EDD LK EIPHWKLLSC +D WCIK LKS        N+ DH
Subjt:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKS--------NNGDH

Query:  NH-LLEWTKAPPNRALFVDPREV
        +H +LEWT  PP R LFVDPREV
Subjt:  NH-LLEWTKAPPNRALFVDPREV

A0A6J1D3Z5 probable N-acetyltransferase HLS12.92e-299100Show/hide
Query:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
        MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
Subjt:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV

Query:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
        APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
Subjt:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP

Query:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
        KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
Subjt:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH

Query:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
Subjt:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

A0A6J1G758 probable N-acetyltransferase HLS1-like4.05e-19568.66Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+GPSKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P   AKVGY+LGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIGSSLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+IKIQ+LKIEEAE IYKKHM + EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF
        FPKDINSIL+N LSLGTW+A+Y       SA A         +  +SWAVVSLWNSGEVFKLRLGKAP  W++YTKSL+ ++K+LP LKV  V D+F+ F
Subjt:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF

Query:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR
        GFYFVYG+HHEG  S RLV  LC+YVHN+AL + +DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LKS   + + LLEW   PPNR LFVDPR
Subjt:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR

Query:  EV
        EV
Subjt:  EV

A0A6J1L7A2 probable N-acetyltransferase HLS1-like1.22e-19669.15Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+G SKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P   AKVGYILGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIG SLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+IKIQ+LKIEEAE IYKKHMA+ EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF
        FPKDINSIL+N LSLGTW+A+Y       SA A         + P SWAVVSLWNSGEVFKLRLGKAP  W++YTKSLK +DK+LP LKV  V D+F+ F
Subjt:  FPKDINSILRNKLSLGTWMAYY-------SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPF

Query:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR
        GFYFVYG+HHEG  S RLV  LC++VHN+AL + +DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LK   G+ + LLEW   PPNR LFVDPR
Subjt:  GFYFVYGVHHEGPFSGRLVRALCQYVHNMALKS-RDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPR

Query:  EV
        EV
Subjt:  EV

SwissProt top hitse value%identityAlignment
O64815 Probable N-acetyltransferase HLS1-like7.8e-9445.12Show/hide
Query:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F
        +R YD   D A V D+ERRCEVGP+ ++ LFTD LGDPICR+R+SP Y MLVAE     +KELVG+I+G IKTV  G         H  + +        
Subjt:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F

Query:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI
          K+ YILGLRV+P +R +GIG  LV+ +E WFS N  +YS  AT  DNHAS+NLF     Y +FRT  IL NPV      I   ++ + +L+  +AE +
Subjt:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI

Query:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV
        Y+   +T EFFP+DI+S+L NKLSLGT++A      Y +G+       K     P SWAV+S+WN  + F+L +  A     + +K+ + VDK LP+LK+
Subjt:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV

Query:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP
         S+   F PFG +F+YG+  EGP + ++V+ALC + HN+A K   C V+  E+ GE+  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP
Subjt:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP

Query:  NRALFVDPRE
          ++FVDPRE
Subjt:  NRALFVDPRE

Q42381 Probable N-acetyltransferase HLS16.0e-9445.79Show/hide
Query:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY
        V+R YD   D   V D+ERRCEVGPS ++ LFTD LGDPICRIR+SP Y MLVAE    +KE+VG+I+G IKTV  G      HK+          K+ Y
Subjt:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        +LGLRV+P +R +GIG  LV+ +E WF  N  +YS +AT  DN AS+NLF     Y +FRT  IL NPV      +   ++ + +L+  +AE +Y+   +
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF
        T EFFP+DI+S+L NKLSLGT++A      Y +G+G      K     P SWAV+S+WN  + F L +  A     +  K+ + VDK LP+LK+ S+   
Subjt:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF

Query:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV
        FEPFG +F+YG+  EGP + ++V++LC + HN+A K+  C V+  E+ GED  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP  ++FV
Subjt:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV

Query:  DPRE
        DPRE
Subjt:  DPRE

Arabidopsis top hitse value%identityAlignment
AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein5.6e-9545.12Show/hide
Query:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F
        +R YD   D A V D+ERRCEVGP+ ++ LFTD LGDPICR+R+SP Y MLVAE     +KELVG+I+G IKTV  G         H  + +        
Subjt:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F

Query:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI
          K+ YILGLRV+P +R +GIG  LV+ +E WFS N  +YS  AT  DNHAS+NLF     Y +FRT  IL NPV      I   ++ + +L+  +AE +
Subjt:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI

Query:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV
        Y+   +T EFFP+DI+S+L NKLSLGT++A      Y +G+       K     P SWAV+S+WN  + F+L +  A     + +K+ + VDK LP+LK+
Subjt:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV

Query:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP
         S+   F PFG +F+YG+  EGP + ++V+ALC + HN+A K   C V+  E+ GE+  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP
Subjt:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP

Query:  NRALFVDPRE
          ++FVDPRE
Subjt:  NRALFVDPRE

AT2G23060.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.7e-7542.94Show/hide
Query:  MLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------FPAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDN
        MLVAE     +KELVG+I+G IKTV  G         H  + +          K+ YILGLRV+P +R +GIG  LV+ +E WFS N  +YS  AT  DN
Subjt:  MLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------FPAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDN

Query:  HASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK
        HAS+NLF     Y +FRT  IL NPV      I   ++ + +L+  +AE +Y+   +T EFFP+DI+S+L NKLSLGT++A      Y +G+       K
Subjt:  HASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK

Query:  --GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVI
             P SWAV+S+WN  + F+L +  A     + +K+ + VDK LP+LK+ S+   F PFG +F+YG+  EGP + ++V+ALC + HN+A K   C V+
Subjt:  --GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVI

Query:  VTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPRE
          E+ GE+  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP  ++FVDPRE
Subjt:  VTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPRE

AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein7.2e-10350.13Show/hide
Query:  LVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRVAPPYR
        +VIR YD + DR ++  +E+ CE+G   +  LFTDTLGDPICRIRNSP + MLVA    +LVG IQGS+K V    H  +     +VGY+LGLRV P YR
Subjt:  LVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRVAPPYR

Query:  HRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM-ATAEFFPKDIN
         RGIGS LVR LE WF  ++ DY+ MAT KDN AS  LFI    Y+ FR   IL NPV        PS I I++LK++EAE++Y++++ AT EFFP DIN
Subjt:  HRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM-ATAEFFPKDIN

Query:  SILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGP
         ILRNKLS+GTW+AYY+            +   SWA++S+W+S +VFKLR+ +AP+++++ TK  K     L  L +  + D F PFGFYF+YGVH EGP
Subjt:  SILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGP

Query:  FSGRLVRALCQYVHNMALKSRD--CKVIVTEI---GGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
          G+LVRALC++VHNMA  +    CKV+V E+      DD L++ IPHWK+LSC  D+WCIK LK      + L E +K+    +LFVDPREV
Subjt:  FSGRLVRALCQYVHNMALKSRD--CKVIVTEI---GGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.3e-9545.79Show/hide
Query:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY
        V+R YD   D   V D+ERRCEVGPS ++ LFTD LGDPICRIR+SP Y MLVAE    +KE+VG+I+G IKTV  G      HK+          K+ Y
Subjt:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        +LGLRV+P +R +GIG  LV+ +E WF  N  +YS +AT  DN AS+NLF     Y +FRT  IL NPV      +   ++ + +L+  +AE +Y+   +
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF
        T EFFP+DI+S+L NKLSLGT++A      Y +G+G      K     P SWAV+S+WN  + F L +  A     +  K+ + VDK LP+LK+ S+   
Subjt:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF

Query:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV
        FEPFG +F+YG+  EGP + ++V++LC + HN+A K+  C V+  E+ GED  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP  ++FV
Subjt:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV

Query:  DPRE
        DPRE
Subjt:  DPRE

AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein8.9e-8542.46Show/hide
Query:  GFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAG------GHKAAPSF-PAKVGY
        GF  +V+R YD + D   V +LE  CEVG      L  D +GDP+ RIR SP + MLVAE   E+VG+I+G+IK V  G          +P     K+ +
Subjt:  GFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAG------GHKAAPSF-PAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        + GLRV+P YR  GIG  LV+ LE WF  ND  YS + T  DN AS+ LF     Y KFRT   L NPV N    +   ++KI +L   +AE++Y+   +
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFY
        T EFFP DINSIL NKLSLGT++A      G + SG   +   SWAV+S+WNS +V++L++  A     +  KS +  D   P+LK+ S  + F+ F  +
Subjt:  TAEFFPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFY

Query:  FVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        F+YG+  EGP +  +V ALC + HN+A KS  C V+  E+    + L+  IPHWK+LS P+DLWC+K L+ ++      ++WTK+PP  ++FVDPRE+
Subjt:  FVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTTTAAGGGCTTGGTTATTAGAAGCTATGATGGTCAATTCGACAGAGCTCGAGTGGTGGATCTTGAAAGAAGATGCGAGGTTGGCCCATCAAAACGTGTGTTTCT
CTTCACAGATACTTTGGGTGACCCCATTTGTAGGATTCGTAACAGTCCTTTGTATAAGATGCTGGTTGCGGAGTGGGAGAAGGAGCTGGTCGGCGTGATTCAAGGGTCTA
TAAAGACCGTGGTGGCTGGTGGTCATAAGGCGGCACCCAGTTTTCCGGCCAAAGTGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCCGTATCGCCACCGTGGGATTGGG
TCCAGCCTCGTCCGCCATTTGGAACACTGGTTCTCCTTCAATGATGTTGATTATTCTTGCATGGCCACTCATAAAGATAACCACGCCTCCCTCAATCTCTTCATTAACAA
CTTCAGGTATATTAAGTTCAGAACAGCAAGGATTCTGGCAAACCCAGTAACAAATCGTCCCTACCAAATCGATCCATCAAAAATCAAGATCCAACGGCTGAAAATAGAGG
AAGCAGAAGCAATATACAAAAAACACATGGCCACAGCCGAGTTCTTCCCCAAAGACATAAACAGCATATTAAGGAACAAGCTGAGCCTAGGGACATGGATGGCATATTAT
TCCGCCGGCGCCGGCGGCGACTTTTCAGGCGAAAAAGGGGAAACTCCGGCGAGCTGGGCCGTGGTGAGCCTATGGAACAGCGGGGAAGTTTTCAAGCTAAGGCTAGGGAA
AGCCCCAGTGGCTTGGATTATATACACCAAGAGTTTAAAATTTGTGGACAAAATATTGCCATGGCTCAAGGTGGGTTCGGTGGCGGATTTTTTCGAGCCATTTGGGTTCT
ATTTTGTATACGGGGTGCACCACGAGGGCCCATTCTCTGGGCGGCTGGTTCGAGCGCTGTGCCAATACGTGCACAACATGGCCCTGAAATCGAGGGACTGTAAAGTCATA
GTTACTGAGATTGGAGGGGAAGATGATGTGCTGAAGAAGGAGATTCCTCATTGGAAATTGCTGTCTTGTCCTCAAGATTTGTGGTGCATTAAGGGCTTGAAAAGTAATAA
TGGGGATCATAATCATCTCTTGGAATGGACTAAGGCCCCACCAAATAGAGCTCTCTTTGTGGACCCAAGAGAGGTATAA
mRNA sequenceShow/hide mRNA sequence
CCACGTTCTATCATTAATCACTTTTTGTCTCGTATTTAACACTTTGAAACAAATTAGGTTTGATGTTGATACAATTTACAATGCCCCTCTTCATTCCCTCACTTGTGTAT
AAATTCTCCTCATTTTTATCAATACCCTTCAACCTCATCTTTTATGCAATATAGTTTGATCTGCCTTTTCCTTAATTAATCTTGAGGTTTTGGAATAATGGGGTTTAAGG
GCTTGGTTATTAGAAGCTATGATGGTCAATTCGACAGAGCTCGAGTGGTGGATCTTGAAAGAAGATGCGAGGTTGGCCCATCAAAACGTGTGTTTCTCTTCACAGATACT
TTGGGTGACCCCATTTGTAGGATTCGTAACAGTCCTTTGTATAAGATGCTGGTTGCGGAGTGGGAGAAGGAGCTGGTCGGCGTGATTCAAGGGTCTATAAAGACCGTGGT
GGCTGGTGGTCATAAGGCGGCACCCAGTTTTCCGGCCAAAGTGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCCGTATCGCCACCGTGGGATTGGGTCCAGCCTCGTCC
GCCATTTGGAACACTGGTTCTCCTTCAATGATGTTGATTATTCTTGCATGGCCACTCATAAAGATAACCACGCCTCCCTCAATCTCTTCATTAACAACTTCAGGTATATT
AAGTTCAGAACAGCAAGGATTCTGGCAAACCCAGTAACAAATCGTCCCTACCAAATCGATCCATCAAAAATCAAGATCCAACGGCTGAAAATAGAGGAAGCAGAAGCAAT
ATACAAAAAACACATGGCCACAGCCGAGTTCTTCCCCAAAGACATAAACAGCATATTAAGGAACAAGCTGAGCCTAGGGACATGGATGGCATATTATTCCGCCGGCGCCG
GCGGCGACTTTTCAGGCGAAAAAGGGGAAACTCCGGCGAGCTGGGCCGTGGTGAGCCTATGGAACAGCGGGGAAGTTTTCAAGCTAAGGCTAGGGAAAGCCCCAGTGGCT
TGGATTATATACACCAAGAGTTTAAAATTTGTGGACAAAATATTGCCATGGCTCAAGGTGGGTTCGGTGGCGGATTTTTTCGAGCCATTTGGGTTCTATTTTGTATACGG
GGTGCACCACGAGGGCCCATTCTCTGGGCGGCTGGTTCGAGCGCTGTGCCAATACGTGCACAACATGGCCCTGAAATCGAGGGACTGTAAAGTCATAGTTACTGAGATTG
GAGGGGAAGATGATGTGCTGAAGAAGGAGATTCCTCATTGGAAATTGCTGTCTTGTCCTCAAGATTTGTGGTGCATTAAGGGCTTGAAAAGTAATAATGGGGATCATAAT
CATCTCTTGGAATGGACTAAGGCCCCACCAAATAGAGCTCTCTTTGTGGACCCAAGAGAGGTATAAGAAAAAAAAAATTATATTATTTTTATCGGAGGATAGAGCGGAGA
TGTAACAAGGAAGAGACAATCGCCCTTAATTTGTCTTAGATTCAAAAAGTATGTAGTATTGTATTCGCTGAATACTAATGGGAATATATAGATATTAATTAATCAATGAA
TAGCATGTGTATACCGTCGTGGGTCTTCTCAATATCTAAATAATGAATGAAACGATAACCGTACTTTTTTGTTGATCAAAACGTACACACATTTATTTTATTGATTATCA
AGCCACCCTTTACTCTAGTTCCTTTTGCATGGGTGAAAATATCGAGACCATTATTTGTTTTAATCATAAAATAATAATATATGATGACATTTAGG
Protein sequenceShow/hide protein sequence
MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRVAPPYRHRGIG
SSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFPKDINSILRNKLSLGTWMAYY
SAGAGGDFSGEKGETPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVI
VTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV