; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS014186 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS014186
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationscaffold5:1280696..1283161
RNA-Seq ExpressionMS014186
SyntenyMS014186
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035352.1 putative N-acetyltransferase HLS1-like protein [Cucurbita argyrosperma subsp. argyrosperma]1.5e-15569.11Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+GPSKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P    KVGYILGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIGSSLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+I+IQ+LKIEEAE IYKKHM + EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYG
        FPKDINSIL+N LSLGTW+A+Y       +S       +SWAVVSLWNSGEVFKLRLGKAP  W++YTKSL+ ++K+LP LKV  V D+F+ FGFYFVYG
Subjt:  FPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYG

Query:  VHHEGPFSGRLVRALCQYVHNMAL-KSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        +HHEG  S RLV  LC+YVHN+AL  ++DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LKS   + + LLEW   PPNR LFVDPREV
Subjt:  VHHEGPFSGRLVRALCQYVHNMAL-KSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

XP_008465276.1 PREDICTED: probable N-acetyltransferase HLS1-like [Cucumis melo]2.0e-15567.38Show/hide
Query:  MGFKGLVIRSY----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGY
        M F G +IRSY    +GQ  D+A+V+DLERRCE+G SKRVFLFTD LGDPICRIRNSP+YKMLVAE +KE+VGVIQGSIK V    HK   P    KVGY
Subjt:  MGFKGLVIRSY----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        ILGLRVAPPYR RGIG++LVR LE WF  NDVDY CMAT KDNHASLNLFINN RYIKFRT RIL NPV N PY+I+ S+IKIQ+L+IEEAEAIYKKHMA
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG
        + E FP+DI +IL+NKLSLGTWMA +            S+ AGG+   E+  + +SWA+VSLWNSGEVFKLRLGKAP  W+IYTKSLK +DKI P  K+ 
Subjt:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG

Query:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH
         V +FF+PFGFYFVYG+HHEGPFS RLV ALC++VHNMA+   K  +CK IVTEIGG EDD LK EIPHWKLLSC +D WCIK LK        SN+ DH
Subjt:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH

Query:  -NHLLEWTKAPPNRALFVDPREV
         +H+LEWT  PP R LFVDPREV
Subjt:  -NHLLEWTKAPPNRALFVDPREV

XP_022148434.1 probable N-acetyltransferase HLS1 [Momordica charantia]1.7e-23499.74Show/hide
Query:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
        MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
Subjt:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV

Query:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
        APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
Subjt:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP

Query:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
        KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKG+TPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
Subjt:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH

Query:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
Subjt:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

XP_023007288.1 probable N-acetyltransferase HLS1-like [Cucurbita maxima]2.4e-15669.25Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+G SKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P   AKVGYILGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIG SLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+IKIQ+LKIEEAE IYKKHMA+ EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYYSA-----GAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGF
        FPKDINSIL+N LSLGTW+A+Y        A  D        P SWAVVSLWNSGEVFKLRLGKAP  W++YTKSLK +DK+LP LKV  V D+F+ FGF
Subjt:  FPKDINSILRNKLSLGTWMAYYSA-----GAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGF

Query:  YFVYGVHHEGPFSGRLVRALCQYVHNMAL-KSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        YFVYG+HHEG  S RLV  LC++VHN+AL  ++DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LK   G+ + LLEW   PPNR LFVDPREV
Subjt:  YFVYGVHHEGPFSGRLVRALCQYVHNMAL-KSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

XP_038902314.1 probable N-acetyltransferase HLS1-like [Benincasa hispida]3.0e-15970Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGYILG
        +   G VIR Y+     D+A+V+DLERRC++G SKRVFLFTD LGDPICRIRNSP+YKMLVAEW+KE+VGVIQGSIK V    HK   P    KVGYILG
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGYILG

Query:  LRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAE
        LRVAPPYR RGIGS LVR LE WF  NDVDY CMAT KDNHASLNLFINN RYIKFRT RIL +PV NRPY I+ S+I IQ+LKIEEAEAIYKKHMA+ E
Subjt:  LRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAE

Query:  FFPKDINSILRNKLSLGTWMAYYS----AGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGF
        FFPKDI SIL+NKLSLGTWMA +             G +  T +SWA+ SLWNSGEVFKLRLGKAP  W+IYTKSLK +DKILP  K+  V DFF+PFGF
Subjt:  FFPKDINSILRNKLSLGTWMAYYS----AGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGF

Query:  YFVYGVHHEGPFSGRLVRALCQYVHNMALK--SRD-CKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKSNN----GDH---NHLLEWTKAPPN
        YFVYG+HHEGPFS RLV ALC++VHN+ALK  SRD CK IVTEIGG EDD LK EIPHWKLLSC +D WCIK L++NN     DH   +H+LEWT APPN
Subjt:  YFVYGVHHEGPFSGRLVRALCQYVHNMALK--SRD-CKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLKSNN----GDH---NHLLEWTKAPPN

Query:  RALFVDPREV
        R LFVDPREV
Subjt:  RALFVDPREV

TrEMBL top hitse value%identityAlignment
A0A0A0KE16 N-acetyltransferase domain-containing protein1.7e-15567.87Show/hide
Query:  MGFKGLVIRSY-----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVG
        M F G VIRSY     +GQF D+A+V+DLERRCE+G SKRVFLFTD LGDPICRIRNSP+YKMLVAE +KE+VGVIQGSIK V    HK   P    KVG
Subjt:  MGFKGLVIRSY-----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVG

Query:  YILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM
        Y+LGLRVAPPYR RG+G++LVR LE WF  NDVDY CMA  KDNHASLNLFINN RYIKFRT RIL NPV N PY I+ S+IKIQ+LKIE+AEAIYKKHM
Subjt:  YILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM

Query:  ATAEFFPKDINSILRNKLSLGTWMA-----YYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFF
        A+ E FPKDI +IL+NKLSLGTWMA     +Y   +    +G  G   +SWA+VSLWNSGEVF+LRLGKAP AW+IYTKSLK +DKILP  K+  V +FF
Subjt:  ATAEFFPKDINSILRNKLSLGTWMA-----YYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFF

Query:  EPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH-NHLLE
        +PFGFYFVYG+HHEGPFS RLV ALC++VHNMA+   K  +CK IVTEI G EDD LK EIPHWKLLSC +D WCIK LK        SN+ DH +H+LE
Subjt:  EPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH-NHLLE

Query:  WTKAPPNRALFVDPREV
        WT  PP R LFVDPREV
Subjt:  WTKAPPNRALFVDPREV

A0A1S3CNW9 probable N-acetyltransferase HLS1-like9.8e-15667.38Show/hide
Query:  MGFKGLVIRSY----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGY
        M F G +IRSY    +GQ  D+A+V+DLERRCE+G SKRVFLFTD LGDPICRIRNSP+YKMLVAE +KE+VGVIQGSIK V    HK   P    KVGY
Subjt:  MGFKGLVIRSY----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        ILGLRVAPPYR RGIG++LVR LE WF  NDVDY CMAT KDNHASLNLFINN RYIKFRT RIL NPV N PY+I+ S+IKIQ+L+IEEAEAIYKKHMA
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG
        + E FP+DI +IL+NKLSLGTWMA +            S+ AGG+   E+  + +SWA+VSLWNSGEVFKLRLGKAP  W+IYTKSLK +DKI P  K+ 
Subjt:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG

Query:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH
         V +FF+PFGFYFVYG+HHEGPFS RLV ALC++VHNMA+   K  +CK IVTEIGG EDD LK EIPHWKLLSC +D WCIK LK        SN+ DH
Subjt:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH

Query:  -NHLLEWTKAPPNRALFVDPREV
         +H+LEWT  PP R LFVDPREV
Subjt:  -NHLLEWTKAPPNRALFVDPREV

A0A5D3CAW1 Putative N-acetyltransferase HLS1-like9.8e-15667.38Show/hide
Query:  MGFKGLVIRSY----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGY
        M F G +IRSY    +GQ  D+A+V+DLERRCE+G SKRVFLFTD LGDPICRIRNSP+YKMLVAE +KE+VGVIQGSIK V    HK   P    KVGY
Subjt:  MGFKGLVIRSY----DGQF-DRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHK-AAPSFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        ILGLRVAPPYR RGIG++LVR LE WF  NDVDY CMAT KDNHASLNLFINN RYIKFRT RIL NPV N PY+I+ S+IKIQ+L+IEEAEAIYKKHMA
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG
        + E FP+DI +IL+NKLSLGTWMA +            S+ AGG+   E+  + +SWA+VSLWNSGEVFKLRLGKAP  W+IYTKSLK +DKI P  K+ 
Subjt:  TAEFFPKDINSILRNKLSLGTWMAYY------------SAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVG

Query:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH
         V +FF+PFGFYFVYG+HHEGPFS RLV ALC++VHNMA+   K  +CK IVTEIGG EDD LK EIPHWKLLSC +D WCIK LK        SN+ DH
Subjt:  SVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMAL---KSRDCKVIVTEIGG-EDDVLKKEIPHWKLLSCPQDLWCIKGLK--------SNNGDH

Query:  -NHLLEWTKAPPNRALFVDPREV
         +H+LEWT  PP R LFVDPREV
Subjt:  -NHLLEWTKAPPNRALFVDPREV

A0A6J1D3Z5 probable N-acetyltransferase HLS18.1e-23599.74Show/hide
Query:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
        MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV
Subjt:  MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRV

Query:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
        APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP
Subjt:  APPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFP

Query:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
        KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKG+TPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH
Subjt:  KDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVH

Query:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
Subjt:  HEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

A0A6J1L7A2 probable N-acetyltransferase HLS1-like1.2e-15669.25Show/hide
Query:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL
        MG K  VIR+Y+     DRA+V DLE+RCE+G SKRVFLFTDTLGDPICRIR+SPLYKMLVAEW  E+VGVIQGSIKT  +  HK  P   AKVGYILGL
Subjt:  MGFKGLVIRSYDGQ--FDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGL

Query:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF
        RVAPP+R RGIG SLV  LE WF  NDVDY CMAT KDNHAS+NLFIN+ RY+KFRT RIL NPVTN PY+I+ S+IKIQ+LKIEEAE IYKKHMA+ EF
Subjt:  RVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEF

Query:  FPKDINSILRNKLSLGTWMAYYSA-----GAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGF
        FPKDINSIL+N LSLGTW+A+Y        A  D        P SWAVVSLWNSGEVFKLRLGKAP  W++YTKSLK +DK+LP LKV  V D+F+ FGF
Subjt:  FPKDINSILRNKLSLGTWMAYYSA-----GAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGF

Query:  YFVYGVHHEGPFSGRLVRALCQYVHNMAL-KSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        YFVYG+HHEG  S RLV  LC++VHN+AL  ++DCK IVTEIGGEDD LK  IPHWKLLSC +DLWC+K LK   G+ + LLEW   PPNR LFVDPREV
Subjt:  YFVYGVHHEGPFSGRLVRALCQYVHNMAL-KSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

SwissProt top hitse value%identityAlignment
O64815 Probable N-acetyltransferase HLS1-like6.0e-9445.12Show/hide
Query:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F
        +R YD   D A V D+ERRCEVGP+ ++ LFTD LGDPICR+R+SP Y MLVAE     +KELVG+I+G IKTV  G         H  + +        
Subjt:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F

Query:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI
          K+ YILGLRV+P +R +GIG  LV+ +E WFS N  +YS  AT  DNHAS+NLF     Y +FRT  IL NPV      I   ++ + +L+  +AE +
Subjt:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI

Query:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV
        Y+   +T EFFP+DI+S+L NKLSLGT++A      Y +G+       K     P SWAV+S+WN  + F+L +  A     + +K+ + VDK LP+LK+
Subjt:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV

Query:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP
         S+   F PFG +F+YG+  EGP + ++V+ALC + HN+A K   C V+  E+ GE+  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP
Subjt:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP

Query:  NRALFVDPRE
          ++FVDPRE
Subjt:  NRALFVDPRE

Q42381 Probable N-acetyltransferase HLS14.6e-9445.79Show/hide
Query:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY
        V+R YD   D   V D+ERRCEVGPS ++ LFTD LGDPICRIR+SP Y MLVAE    +KE+VG+I+G IKTV  G      HK+          K+ Y
Subjt:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        +LGLRV+P +R +GIG  LV+ +E WF  N  +YS +AT  DN AS+NLF     Y +FRT  IL NPV      +   ++ + +L+  +AE +Y+   +
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF
        T EFFP+DI+S+L NKLSLGT++A      Y +G+G      K     P SWAV+S+WN  + F L +  A     +  K+ + VDK LP+LK+ S+   
Subjt:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF

Query:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV
        FEPFG +F+YG+  EGP + ++V++LC + HN+A K+  C V+  E+ GED  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP  ++FV
Subjt:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV

Query:  DPRE
        DPRE
Subjt:  DPRE

Arabidopsis top hitse value%identityAlignment
AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.3e-9545.12Show/hide
Query:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F
        +R YD   D A V D+ERRCEVGP+ ++ LFTD LGDPICR+R+SP Y MLVAE     +KELVG+I+G IKTV  G         H  + +        
Subjt:  IRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------F

Query:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI
          K+ YILGLRV+P +R +GIG  LV+ +E WFS N  +YS  AT  DNHAS+NLF     Y +FRT  IL NPV      I   ++ + +L+  +AE +
Subjt:  PAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAI

Query:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV
        Y+   +T EFFP+DI+S+L NKLSLGT++A      Y +G+       K     P SWAV+S+WN  + F+L +  A     + +K+ + VDK LP+LK+
Subjt:  YKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKV

Query:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP
         S+   F PFG +F+YG+  EGP + ++V+ALC + HN+A K   C V+  E+ GE+  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP
Subjt:  GSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPP

Query:  NRALFVDPRE
          ++FVDPRE
Subjt:  NRALFVDPRE

AT2G23060.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.7e-7542.94Show/hide
Query:  MLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------FPAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDN
        MLVAE     +KELVG+I+G IKTV  G         H  + +          K+ YILGLRV+P +R +GIG  LV+ +E WFS N  +YS  AT  DN
Subjt:  MLVAE----WEKELVGVIQGSIKTVVAG--------GHKAAPS-------FPAKVGYILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDN

Query:  HASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK
        HAS+NLF     Y +FRT  IL NPV      I   ++ + +L+  +AE +Y+   +T EFFP+DI+S+L NKLSLGT++A      Y +G+       K
Subjt:  HASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK

Query:  --GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVI
             P SWAV+S+WN  + F+L +  A     + +K+ + VDK LP+LK+ S+   F PFG +F+YG+  EGP + ++V+ALC + HN+A K   C V+
Subjt:  --GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVI

Query:  VTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPRE
          E+ GE+  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP  ++FVDPRE
Subjt:  VTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPRE

AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.2e-10250.13Show/hide
Query:  LVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRVAPPYR
        +VIR YD + DR ++  +E+ CE+G   +  LFTDTLGDPICRIRNSP + MLVA    +LVG IQGS+K V    H  +     +VGY+LGLRV P YR
Subjt:  LVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRVAPPYR

Query:  HRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM-ATAEFFPKDIN
         RGIGS LVR LE WF  ++ DY+ MAT KDN AS  LFI    Y+ FR   IL NPV        PS I I++LK++EAE++Y++++ AT EFFP DIN
Subjt:  HRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHM-ATAEFFPKDIN

Query:  SILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGP
         ILRNKLS+GTW+AYY+                SWA++S+W+S +VFKLR+ +AP+++++ TK  K     L  L +  + D F PFGFYF+YGVH EGP
Subjt:  SILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGP

Query:  FSGRLVRALCQYVHNMALKSRD--CKVIVTEI---GGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
          G+LVRALC++VHNMA  +    CKV+V E+      DD L++ IPHWK+LSC  D+WCIK LK      + L E +K+    +LFVDPREV
Subjt:  FSGRLVRALCQYVHNMALKSRD--CKVIVTEI---GGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV

AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.3e-9545.79Show/hide
Query:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY
        V+R YD   D   V D+ERRCEVGPS ++ LFTD LGDPICRIR+SP Y MLVAE    +KE+VG+I+G IKTV  G      HK+          K+ Y
Subjt:  VIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEW---EKELVGVIQGSIKTVVAG-----GHKA----APSFPAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        +LGLRV+P +R +GIG  LV+ +E WF  N  +YS +AT  DN AS+NLF     Y +FRT  IL NPV      +   ++ + +L+  +AE +Y+   +
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF
        T EFFP+DI+S+L NKLSLGT++A      Y +G+G      K     P SWAV+S+WN  + F L +  A     +  K+ + VDK LP+LK+ S+   
Subjt:  TAEFFPKDINSILRNKLSLGTWMAY-----YSAGAGGDFSGEK--GQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADF

Query:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV
        FEPFG +F+YG+  EGP + ++V++LC + HN+A K+  C V+  E+ GED  L++ IPHWK+LSC +DLWCIK L  +  D   + +WTK+PP  ++FV
Subjt:  FEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFV

Query:  DPRE
        DPRE
Subjt:  DPRE

AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.5e-8442.46Show/hide
Query:  GFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAG------GHKAAPSF-PAKVGY
        GF  +V+R YD + D   V +LE  CEVG      L  D +GDP+ RIR SP + MLVAE   E+VG+I+G+IK V  G          +P     K+ +
Subjt:  GFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAG------GHKAAPSF-PAKVGY

Query:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA
        + GLRV+P YR  GIG  LV+ LE WF  ND  YS + T  DN AS+ LF     Y KFRT   L NPV N    +   ++KI +L   +AE++Y+   +
Subjt:  ILGLRVAPPYRHRGIGSSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMA

Query:  TAEFFPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFY
        T EFFP DINSIL NKLSLGT++A      G + SG       SWAV+S+WNS +V++L++  A     +  KS +  D   P+LK+ S  + F+ F  +
Subjt:  TAEFFPKDINSILRNKLSLGTWMAYYSAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFY

Query:  FVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV
        F+YG+  EGP +  +V ALC + HN+A KS  C V+  E+    + L+  IPHWK+LS P+DLWC+K L+ ++      ++WTK+PP  ++FVDPRE+
Subjt:  FVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVIVTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTTTAAGGGCTTGGTTATTAGAAGCTATGATGGTCAATTCGACAGAGCTCGAGTGGTGGATCTTGAAAGAAGATGCGAGGTTGGCCCATCAAAACGTGTGTTTCT
CTTCACAGATACTTTGGGTGACCCCATTTGTAGGATTCGTAACAGTCCTTTGTATAAGATGCTGGTTGCGGAGTGGGAGAAGGAGCTGGTCGGCGTGATTCAAGGGTCTA
TAAAGACCGTGGTGGCTGGTGGTCATAAGGCGGCGCCCAGTTTTCCGGCCAAAGTGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCCGTATCGCCACCGTGGGATTGGG
TCCAGCCTCGTCCGCCATTTGGAACACTGGTTCTCCTTCAATGATGTTGATTATTCTTGCATGGCCACTCATAAAGATAACCACGCCTCCCTCAATCTCTTCATTAACAA
CTTCAGGTATATTAAGTTCAGAACAGCAAGGATTCTGGCAAACCCAGTAACAAATCGTCCCTACCAAATCGATCCATCAAAAATCAAGATCCAACGGCTGAAAATAGAGG
AAGCAGAAGCAATATACAAAAAACACATGGCCACAGCCGAGTTCTTCCCCAAAGACATAAACAGCATATTAAGGAACAAGCTGAGCCTAGGGACATGGATGGCATATTAT
TCCGCCGGCGCCGGCGGAGACTTTTCAGGCGAAAAAGGGCAAACTCCGGCGAGCTGGGCCGTGGTGAGCCTATGGAACAGCGGGGAAGTTTTCAAGCTAAGGCTAGGGAA
AGCCCCAGTGGCTTGGATTATATACACCAAGAGTTTAAAATTTGTGGACAAAATATTGCCATGGCTCAAGGTCGGTTCGGTGGCGGATTTTTTCGAGCCATTTGGGTTCT
ATTTTGTATACGGGGTGCACCACGAGGGCCCATTCTCTGGGCGGCTGGTTCGAGCGCTGTGCCAATACGTGCACAACATGGCCCTGAAATCGAGGGACTGTAAAGTCATA
GTTACTGAGATTGGAGGGGAAGATGATGTGCTGAAGAAGGAGATTCCTCATTGGAAATTGCTGTCTTGTCCTCAAGATTTGTGGTGCATTAAGGGCTTGAAAAGTAATAA
TGGGGATCATAATCATCTCTTGGAATGGACTAAGGCCCCACCAAATAGAGCTCTCTTTGTGGACCCAAGAGAGGTA
mRNA sequenceShow/hide mRNA sequence
ATGGGGTTTAAGGGCTTGGTTATTAGAAGCTATGATGGTCAATTCGACAGAGCTCGAGTGGTGGATCTTGAAAGAAGATGCGAGGTTGGCCCATCAAAACGTGTGTTTCT
CTTCACAGATACTTTGGGTGACCCCATTTGTAGGATTCGTAACAGTCCTTTGTATAAGATGCTGGTTGCGGAGTGGGAGAAGGAGCTGGTCGGCGTGATTCAAGGGTCTA
TAAAGACCGTGGTGGCTGGTGGTCATAAGGCGGCGCCCAGTTTTCCGGCCAAAGTGGGCTACATTCTTGGCCTGAGAGTGGCGCCGCCGTATCGCCACCGTGGGATTGGG
TCCAGCCTCGTCCGCCATTTGGAACACTGGTTCTCCTTCAATGATGTTGATTATTCTTGCATGGCCACTCATAAAGATAACCACGCCTCCCTCAATCTCTTCATTAACAA
CTTCAGGTATATTAAGTTCAGAACAGCAAGGATTCTGGCAAACCCAGTAACAAATCGTCCCTACCAAATCGATCCATCAAAAATCAAGATCCAACGGCTGAAAATAGAGG
AAGCAGAAGCAATATACAAAAAACACATGGCCACAGCCGAGTTCTTCCCCAAAGACATAAACAGCATATTAAGGAACAAGCTGAGCCTAGGGACATGGATGGCATATTAT
TCCGCCGGCGCCGGCGGAGACTTTTCAGGCGAAAAAGGGCAAACTCCGGCGAGCTGGGCCGTGGTGAGCCTATGGAACAGCGGGGAAGTTTTCAAGCTAAGGCTAGGGAA
AGCCCCAGTGGCTTGGATTATATACACCAAGAGTTTAAAATTTGTGGACAAAATATTGCCATGGCTCAAGGTCGGTTCGGTGGCGGATTTTTTCGAGCCATTTGGGTTCT
ATTTTGTATACGGGGTGCACCACGAGGGCCCATTCTCTGGGCGGCTGGTTCGAGCGCTGTGCCAATACGTGCACAACATGGCCCTGAAATCGAGGGACTGTAAAGTCATA
GTTACTGAGATTGGAGGGGAAGATGATGTGCTGAAGAAGGAGATTCCTCATTGGAAATTGCTGTCTTGTCCTCAAGATTTGTGGTGCATTAAGGGCTTGAAAAGTAATAA
TGGGGATCATAATCATCTCTTGGAATGGACTAAGGCCCCACCAAATAGAGCTCTCTTTGTGGACCCAAGAGAGGTA
Protein sequenceShow/hide protein sequence
MGFKGLVIRSYDGQFDRARVVDLERRCEVGPSKRVFLFTDTLGDPICRIRNSPLYKMLVAEWEKELVGVIQGSIKTVVAGGHKAAPSFPAKVGYILGLRVAPPYRHRGIG
SSLVRHLEHWFSFNDVDYSCMATHKDNHASLNLFINNFRYIKFRTARILANPVTNRPYQIDPSKIKIQRLKIEEAEAIYKKHMATAEFFPKDINSILRNKLSLGTWMAYY
SAGAGGDFSGEKGQTPASWAVVSLWNSGEVFKLRLGKAPVAWIIYTKSLKFVDKILPWLKVGSVADFFEPFGFYFVYGVHHEGPFSGRLVRALCQYVHNMALKSRDCKVI
VTEIGGEDDVLKKEIPHWKLLSCPQDLWCIKGLKSNNGDHNHLLEWTKAPPNRALFVDPREV