; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G007370 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G007370
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationCmo_Chr02:4632030..4633476
RNA-Seq ExpressionCmoCh02G007370
SyntenyCmoCh02G007370
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605401.1 putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma subsp. sororia]7.9e-22998.21Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
        MGSKDFVIRNYEESRLSD+AQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLA KVGY+LGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEI+IQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSA-AADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPWS+ AADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSA-AADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALK+EEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

KAG7035352.1 putative N-acetyltransferase HLS1-like protein [Cucurbita argyrosperma subsp. argyrosperma]7.2e-23098.97Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLA KVGY+LGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEI+IQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPW-SAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPW SAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPW-SAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

XP_022947633.1 probable N-acetyltransferase HLS1-like [Cucurbita moschata]2.6e-232100Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
        PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH

Query:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
Subjt:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

XP_023007288.1 probable N-acetyltransferase HLS1-like [Cucurbita maxima]2.6e-22496.4Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
        MGSK+FVIRNYEESRLSDRAQVADLEQRCEIG SKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSS HK PGLAAKVGY+LGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIG SLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTN+PYKINQSEIKIQKLKIEEAEEIYKKHM STEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
        PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADI  SWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSL+MM+KMLPCLKVILVPDYFKAFGFYFVYGLHH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH

Query:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        EGACSERLVGVLCE+VHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALK EEDSLLEWKNGPPNRPLFVDPREV
Subjt:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

XP_023532249.1 probable N-acetyltransferase HLS1-like [Cucurbita pepo subsp. pepo]3.2e-22296.4Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIG SKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAF STHK PGLAAKVGY+LGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEI+IQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
        PKDINSILKNNLSLGTWVAHYKKQPP      DIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSL+MMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH

Query:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        EGACSERLVGVLCE+VHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS+LEWKNGPPNRPLFVDPREV
Subjt:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

TrEMBL top hitse value%identityAlignment
A0A0A0KE16 N-acetyltransferase domain-containing protein1.3e-16569.76Show/hide
Query:  FVIRNYE----ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAAKVGYLLGL
        FVIR+YE    E + SD+AQV DLE+RCEIG SKRVFLFTD LGDPICRIR+SP+YKMLVAE + EVVGVIQGSIK  F + HK   PGL  KVGY+LGL
Subjt:  FVIRNYE----ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAAKVGYLLGL

Query:  RVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEF
        RVAPP+RRRG+G++LV  LEDWFV+NDVDYCCMA EKDNHAS+NLFIN++RY+KFRTGRILVNPV N+PY IN SEIKIQKLKIE+AE IYKKHM STE 
Subjt:  RVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEF

Query:  FPKDINSILKNNLSLGTWVAHYKKQPPPWSAAADI-----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYF
        FPKDI +ILKN LSLGTW+A++K+Q  P  +++       +SSWA+VSLWNSGEVF+LRLGKAPF WV+YTKSL++M+K+LPC K++LVP++FK FGFYF
Subjt:  FPKDINSILKNNLSLGTWVAHYKKQPPPWSAAADI-----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYF

Query:  VYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSE------------EDSLLEWKNGPPN
        VYGLHHEG  SERLVG LC++VHN+A++N+KD  CKAIVTEI G EDD+LKM IPHWKLLSC ED WC+K+LKS+            +D +LEW N PP 
Subjt:  VYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSE------------EDSLLEWKNGPPN

Query:  RPLFVDPREV
        R LFVDPREV
Subjt:  RPLFVDPREV

A0A1S3CNW9 probable N-acetyltransferase HLS1-like7.1e-16769.64Show/hide
Query:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAAKVGYLLGLR
        F+IR+YE   E +LSD+AQV DLE+RCEIG SKRVFLFTD LGDPICRIR+SP+YKMLVAE + EVVGVIQGSIK  F + HK   PGL  KVGY+LGLR
Subjt:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPP+RRRGIG++LV  LEDWFV+NDVDYCCMATEKDNHAS+NLFIN++RY+KFRTGRILVNPV N+PYKIN SEIKIQKL+IEEAE IYKKHM STE F
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAAD-----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA
        P+DI +ILKN LSLGTW+A++K+Q  P  +++              SSWA+VSLWNSGEVFKLRLGKAPFPWV+YTKSL++M+K+ PC K++LVP++FK 
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAAD-----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA

Query:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK
        FGFYFVYGLHHEG  SERLVG LC++VHN+A++N+KD  CKAIVTEIGG EDD+LKM IPHWKLLSC ED WC+K+LKS++++            +LEW 
Subjt:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK

Query:  NGPPNRPLFVDPREV
        N PP R LFVDPREV
Subjt:  NGPPNRPLFVDPREV

A0A5D3CAW1 Putative N-acetyltransferase HLS1-like7.1e-16769.64Show/hide
Query:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAAKVGYLLGLR
        F+IR+YE   E +LSD+AQV DLE+RCEIG SKRVFLFTD LGDPICRIR+SP+YKMLVAE + EVVGVIQGSIK  F + HK   PGL  KVGY+LGLR
Subjt:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPP+RRRGIG++LV  LEDWFV+NDVDYCCMATEKDNHAS+NLFIN++RY+KFRTGRILVNPV N+PYKIN SEIKIQKL+IEEAE IYKKHM STE F
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAAD-----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA
        P+DI +ILKN LSLGTW+A++K+Q  P  +++              SSWA+VSLWNSGEVFKLRLGKAPFPWV+YTKSL++M+K+ PC K++LVP++FK 
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAAD-----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA

Query:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK
        FGFYFVYGLHHEG  SERLVG LC++VHN+A++N+KD  CKAIVTEIGG EDD+LKM IPHWKLLSC ED WC+K+LKS++++            +LEW 
Subjt:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK

Query:  NGPPNRPLFVDPREV
        N PP R LFVDPREV
Subjt:  NGPPNRPLFVDPREV

A0A6J1G758 probable N-acetyltransferase HLS1-like1.3e-232100Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
        PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH

Query:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
Subjt:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

A0A6J1L7A2 probable N-acetyltransferase HLS1-like1.3e-22496.4Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR
        MGSK+FVIRNYEESRLSDRAQVADLEQRCEIG SKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSS HK PGLAAKVGY+LGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIG SLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTN+PYKINQSEIKIQKLKIEEAEEIYKKHM STEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH
        PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADI  SWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSL+MM+KMLPCLKVILVPDYFKAFGFYFVYGLHH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHH

Query:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        EGACSERLVGVLCE+VHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALK EEDSLLEWKNGPPNRPLFVDPREV
Subjt:  EGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

SwissProt top hitse value%identityAlignment
O64815 Probable N-acetyltransferase HLS1-like5.2e-9044.07Show/hide
Query:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P
        +R Y+ S+  D A V D+E+RCE+GP+ ++ LFTD LGDPICR+RHSP Y MLVAE       E+VG+I+G IKT            TH +         
Subjt:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P

Query:  GLAAKVGYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEA
         L  K+ Y+LGLRV+P  RR+GIG  LV  +EDWF  N  +Y   ATE DNHASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +A
Subjt:  GLAAKVGYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEA

Query:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC
        E +Y+   ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A        SWAV+S+WN  + F+L +  A     V +K+ RM++K LP 
Subjt:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC

Query:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN
        LK+  +P  F+ FG +F+YG+  EG  +E++V  LC++ HNLA      C  +  E+ GE + L+  IPHWK+LSC+EDLWC+K L  +  + S+ +W  
Subjt:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN

Query:  GPPNRPLFVDPRE
         PP   +FVDPRE
Subjt:  GPPNRPLFVDPRE

Q42381 Probable N-acetyltransferase HLS19.2e-8744.72Show/hide
Query:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAAKV
        V+R Y+ +R  D   V D+E+RCE+GPS ++ LFTD LGDPICRIRHSP Y MLVAE      E+VG+I+G IKT          HK        L  K+
Subjt:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAAKV

Query:  GYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEAEEIYKK
         Y+LGLRV+P  RR+GIG  LV  +E+WF  N  +Y  +ATE DN ASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +AE +Y+ 
Subjt:  GYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEAEEIYKK

Query:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV
          ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A        SWAV+S+WN  + F L +  A     V  K+ R+++K LP LK+  +
Subjt:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV

Query:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP
        P  F+ FG +F+YG+  EG  + ++V  LC + HNLA   A  C  +  E+ GE D L+  IPHWK+LSC EDLWC+K L  +  D ++ +W   PP   
Subjt:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP

Query:  LFVDPRE
        +FVDPRE
Subjt:  LFVDPRE

Arabidopsis top hitse value%identityAlignment
AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.7e-9144.07Show/hide
Query:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P
        +R Y+ S+  D A V D+E+RCE+GP+ ++ LFTD LGDPICR+RHSP Y MLVAE       E+VG+I+G IKT            TH +         
Subjt:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P

Query:  GLAAKVGYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEA
         L  K+ Y+LGLRV+P  RR+GIG  LV  +EDWF  N  +Y   ATE DNHASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +A
Subjt:  GLAAKVGYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEA

Query:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC
        E +Y+   ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A        SWAV+S+WN  + F+L +  A     V +K+ RM++K LP 
Subjt:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC

Query:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN
        LK+  +P  F+ FG +F+YG+  EG  +E++V  LC++ HNLA      C  +  E+ GE + L+  IPHWK+LSC+EDLWC+K L  +  + S+ +W  
Subjt:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN

Query:  GPPNRPLFVDPRE
         PP   +FVDPRE
Subjt:  GPPNRPLFVDPRE

AT2G23060.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.8e-7242.27Show/hide
Query:  MLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------PGLAAKVGYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDN
        MLVAE       E+VG+I+G IKT            TH +          L  K+ Y+LGLRV+P  RR+GIG  LV  +EDWF  N  +Y   ATE DN
Subjt:  MLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------PGLAAKVGYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDN

Query:  HASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEAEEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAA
        HASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +AE +Y+   ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A
Subjt:  HASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEAEEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAA

Query:  DI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCK
                SWAV+S+WN  + F+L +  A     V +K+ RM++K LP LK+  +P  F+ FG +F+YG+  EG  +E++V  LC++ HNLA      C 
Subjt:  DI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCK

Query:  AIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKNGPPNRPLFVDPRE
         +  E+ GE + L+  IPHWK+LSC+EDLWC+K L  +  + S+ +W   PP   +FVDPRE
Subjt:  AIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKNGPPNRPLFVDPRE

AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.4e-10048.08Show/hide
Query:  KDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLRVAP
        ++ VIR Y++ R  DR Q+  +E+ CEIG   +  LFTDTLGDPICRIR+SP + MLVA   N++VG IQGS+K      H +   + +VGY+LGLRV P
Subjt:  KDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLRVAP

Query:  PFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHM-TSTEFFPK
         +RRRGIGS LV  LE+WF +++ DY  MATEKDN AS  LFI  + YV FR   ILVNPV         S+I I+KLK++EAE +Y++++  +TEFFP 
Subjt:  PFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHM-TSTEFFPK

Query:  DINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEG
        DIN IL+N LS+GTWVA+Y           D   SWA++S+W+S +VFKLR+ +AP  +++ TK  ++    L  L + ++PD F  FGFYF+YG+H EG
Subjt:  DINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEG

Query:  ACSERLVGVLCEYVHNL-ALSNAKDCKAIVTEI---GGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
            +LV  LCE+VHN+ AL++   CK +V E+      DD L+  IPHWK+LSC +D+WC+K LK E++     +       LFVDPREV
Subjt:  ACSERLVGVLCEYVHNL-ALSNAKDCKAIVTEI---GGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein6.5e-8844.72Show/hide
Query:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAAKV
        V+R Y+ +R  D   V D+E+RCE+GPS ++ LFTD LGDPICRIRHSP Y MLVAE      E+VG+I+G IKT          HK        L  K+
Subjt:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAAKV

Query:  GYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEAEEIYKK
         Y+LGLRV+P  RR+GIG  LV  +E+WF  N  +Y  +ATE DN ASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +AE +Y+ 
Subjt:  GYLLGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIKIQKLKIEEAEEIYKK

Query:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV
          ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A        SWAV+S+WN  + F L +  A     V  K+ R+++K LP LK+  +
Subjt:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSAAADI----RSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV

Query:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP
        P  F+ FG +F+YG+  EG  + ++V  LC + HNLA   A  C  +  E+ GE D L+  IPHWK+LSC EDLWC+K L  +  D ++ +W   PP   
Subjt:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP

Query:  LFVDPRE
        +FVDPRE
Subjt:  LFVDPRE

AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein8.6e-8041.18Show/hide
Query:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK-------RPGL-AAKVGYLLG
        V+R Y+  R  D   V +LE+ CE+G      L  D +GDP+ RIR SP + MLVAE  NE+VG+I+G+IK      +         P +   K+ ++ G
Subjt:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK-------RPGL-AAKVGYLLG

Query:  LRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTE
        LRV+P +RR GIG  LV  LE+WF+ ND  Y  + TE DN ASV LF     Y KFRT   LVNPV N+   +++  +KI KL   +AE +Y+   ++TE
Subjt:  LRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTE

Query:  FFPKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGL
        FFP DINSIL N LSLGT++A  +       +  D   SWAV+S+WNS +V++L++  A     +  KS R+ +   P LK+   P+ FK+F  +F+YG+
Subjt:  FFPKDINSILKNNLSLGTWVAHYKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGL

Query:  HHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
          EG  +  +V  LC + HNLA  +   C  +  E+    + L++ IPHWK+LS  EDLWC+K L+ ++D  ++W   PP   +FVDPRE+
Subjt:  HHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCGAAAGATTTCGTTATACGAAACTACGAAGAGAGTCGATTATCAGATAGAGCTCAAGTAGCTGATCTCGAACAACGATGTGAAATTGGGCCATCGAAACGTGT
ATTTCTCTTCACTGACACATTGGGTGACCCCATTTGTAGGATTCGTCACAGTCCCTTATACAAAATGCTGGTGGCAGAGTGGAACAACGAGGTGGTTGGAGTCATTCAAG
GCTCGATAAAGACAGCGTTTTCTAGTACTCATAAACGGCCGGGTTTGGCGGCGAAAGTGGGCTACCTTCTTGGGCTGAGAGTTGCACCGCCGTTTCGCCGCCGTGGGATT
GGATCCAGCCTTGTTCATGATTTGGAAGATTGGTTTGTAGCTAATGACGTTGATTACTGTTGCATGGCTACTGAGAAAGATAATCATGCCTCTGTTAATCTCTTCATTAA
TCACATGAGGTACGTAAAATTCAGAACAGGAAGAATTCTGGTGAACCCAGTAACAAATTATCCATACAAAATCAACCAATCAGAAATCAAGATTCAAAAGCTGAAAATTG
AAGAAGCAGAAGAAATTTACAAAAAACACATGACATCAACGGAGTTCTTCCCCAAAGACATAAACAGCATATTGAAGAACAATTTGAGCTTAGGGACATGGGTAGCACAT
TACAAGAAACAGCCGCCACCGTGGTCGGCGGCGGCCGACATACGGTCGAGCTGGGCTGTGGTGAGTCTATGGAACAGCGGGGAAGTTTTCAAGTTAAGGCTAGGAAAAGC
TCCATTTCCATGGGTGGTTTATACAAAGAGCTTAAGAATGATGGAGAAAATGTTGCCTTGTTTGAAGGTGATTTTGGTGCCTGATTATTTCAAGGCATTTGGGTTTTATT
TTGTTTATGGGTTGCATCATGAAGGGGCTTGTTCTGAGAGATTGGTTGGGGTATTGTGTGAATATGTTCATAATTTGGCTTTGAGTAATGCGAAGGATTGTAAGGCTATT
GTTACAGAGATTGGTGGGGAAGATGATGAGCTGAAGATGGCTATTCCTCATTGGAAATTGCTATCATGTTCGGAAGATTTGTGGTGTGTTAAGGCCTTGAAGAGTGAGGA
GGATAGTCTCTTGGAATGGAAAAATGGCCCACCAAATAGACCTCTCTTTGTAGACCCAAGAGAGGTATGA
mRNA sequenceShow/hide mRNA sequence
CTCTTAATTTAAGAGTTTGATCCATTTTTTAATTAATGGGATCGAAAGATTTCGTTATACGAAACTACGAAGAGAGTCGATTATCAGATAGAGCTCAAGTAGCTGATCTC
GAACAACGATGTGAAATTGGGCCATCGAAACGTGTATTTCTCTTCACTGACACATTGGGTGACCCCATTTGTAGGATTCGTCACAGTCCCTTATACAAAATGCTGGTGGC
AGAGTGGAACAACGAGGTGGTTGGAGTCATTCAAGGCTCGATAAAGACAGCGTTTTCTAGTACTCATAAACGGCCGGGTTTGGCGGCGAAAGTGGGCTACCTTCTTGGGC
TGAGAGTTGCACCGCCGTTTCGCCGCCGTGGGATTGGATCCAGCCTTGTTCATGATTTGGAAGATTGGTTTGTAGCTAATGACGTTGATTACTGTTGCATGGCTACTGAG
AAAGATAATCATGCCTCTGTTAATCTCTTCATTAATCACATGAGGTACGTAAAATTCAGAACAGGAAGAATTCTGGTGAACCCAGTAACAAATTATCCATACAAAATCAA
CCAATCAGAAATCAAGATTCAAAAGCTGAAAATTGAAGAAGCAGAAGAAATTTACAAAAAACACATGACATCAACGGAGTTCTTCCCCAAAGACATAAACAGCATATTGA
AGAACAATTTGAGCTTAGGGACATGGGTAGCACATTACAAGAAACAGCCGCCACCGTGGTCGGCGGCGGCCGACATACGGTCGAGCTGGGCTGTGGTGAGTCTATGGAAC
AGCGGGGAAGTTTTCAAGTTAAGGCTAGGAAAAGCTCCATTTCCATGGGTGGTTTATACAAAGAGCTTAAGAATGATGGAGAAAATGTTGCCTTGTTTGAAGGTGATTTT
GGTGCCTGATTATTTCAAGGCATTTGGGTTTTATTTTGTTTATGGGTTGCATCATGAAGGGGCTTGTTCTGAGAGATTGGTTGGGGTATTGTGTGAATATGTTCATAATT
TGGCTTTGAGTAATGCGAAGGATTGTAAGGCTATTGTTACAGAGATTGGTGGGGAAGATGATGAGCTGAAGATGGCTATTCCTCATTGGAAATTGCTATCATGTTCGGAA
GATTTGTGGTGTGTTAAGGCCTTGAAGAGTGAGGAGGATAGTCTCTTGGAATGGAAAAATGGCCCACCAAATAGACCTCTCTTTGTAGACCCAAGAGAGGTATGAAAGAA
TAAATGAAAAGGGTTAAACCCTTTCCCGACACGTGGTCTCGATCTCGATTCAATCATTGCCTGATGTTATCATGGTAGAATGAAGAAAGAAGAAAAACACGTTCATC
Protein sequenceShow/hide protein sequence
MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAAKVGYLLGLRVAPPFRRRGI
GSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIKIQKLKIEEAEEIYKKHMTSTEFFPKDINSILKNNLSLGTWVAH
YKKQPPPWSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAI
VTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV