; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg23864 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg23864
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionN-acetyltransferase domain-containing protein
Genome locationCarg_Chr02:4110765..4112137
RNA-Seq ExpressionCarg23864
SyntenyCarg23864
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605401.1 putative N-acetyltransferase HLS1-like protein, partial [Cucurbita argyrosperma subsp. sororia]1.3e-23199.23Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
        MGSKDFVIRNYEESRLSD+AQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPWSS AADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALK+EEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

KAG7035352.1 putative N-acetyltransferase HLS1-like protein [Cucurbita argyrosperma subsp. argyrosperma]9.1e-233100Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

XP_022947633.1 probable N-acetyltransferase HLS1-like [Cucurbita moschata]5.5e-23098.97Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLA KVGY+LGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEI+IQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPW SAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

XP_023007288.1 probable N-acetyltransferase HLS1-like [Cucurbita maxima]1.9e-22295.9Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
        MGSK+FVIRNYEESRLSDRAQVADLEQRCEIG SKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSS HK PGLA KVGYILGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIG SLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTN+PYKINQSEI+IQKLKIEEAEEIYKKHM STEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPW SAAADI  SWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSL+MM+KMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCE+VHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALK EEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

XP_023532249.1 probable N-acetyltransferase HLS1-like [Cucurbita pepo subsp. pepo]3.2e-22296.41Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIG SKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAF STHK PGLA KVGYILGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPP       DIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSL+MMEKMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCE+VHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS+LEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

TrEMBL top hitse value%identityAlignment
A0A0A0KE16 N-acetyltransferase domain-containing protein2.1e-16670Show/hide
Query:  FVIRNYE----ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAVKVGYILGL
        FVIR+YE    E + SD+AQV DLE+RCEIG SKRVFLFTD LGDPICRIR+SP+YKMLVAE + EVVGVIQGSIK  F + HK   PGL VKVGY+LGL
Subjt:  FVIRNYE----ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAVKVGYILGL

Query:  RVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEF
        RVAPP+RRRG+G++LV  LEDWFV+NDVDYCCMA EKDNHAS+NLFIN++RY+KFRTGRILVNPV N+PY IN SEI+IQKLKIE+AE IYKKHM STE 
Subjt:  RVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEF

Query:  FPKDINSILKNNLSLGTWVAHYKKQPPPWSSAAA----DIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYF
        FPKDI +ILKN LSLGTW+A++K+Q  P  S+++    + +SSWA+VSLWNSGEVF+LRLGKAPF WV+YTKSL++M+K+LPC K++LVP++FK FGFYF
Subjt:  FPKDINSILKNNLSLGTWVAHYKKQPPPWSSAAA----DIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYF

Query:  VYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSE------------EDSLLEWKNGPPN
        VYGLHHEG  SERLVG LC++VHN+A++N+KD  CKAIVTEI G EDD+LKM IPHWKLLSC ED WC+K+LKS+            +D +LEW N PP 
Subjt:  VYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSE------------EDSLLEWKNGPPN

Query:  RPLFVDPREV
        R LFVDPREV
Subjt:  RPLFVDPREV

A0A1S3CNW9 probable N-acetyltransferase HLS1-like8.5e-16870.12Show/hide
Query:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAVKVGYILGLR
        F+IR+YE   E +LSD+AQV DLE+RCEIG SKRVFLFTD LGDPICRIR+SP+YKMLVAE + EVVGVIQGSIK  F + HK   PGL VKVGYILGLR
Subjt:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPP+RRRGIG++LV  LEDWFV+NDVDYCCMATEKDNHAS+NLFIN++RY+KFRTGRILVNPV N+PYKIN SEI+IQKL+IEEAE IYKKHM STE F
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAAD----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA
        P+DI +ILKN LSLGTW+A++K+Q  P  S+++             SSWA+VSLWNSGEVFKLRLGKAPFPWV+YTKSL++M+K+ PC K++LVP++FK 
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAAD----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA

Query:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK
        FGFYFVYGLHHEG  SERLVG LC++VHN+A++N+KD  CKAIVTEIGG EDD+LKM IPHWKLLSC ED WC+K+LKS++++            +LEW 
Subjt:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK

Query:  NGPPNRPLFVDPREV
        N PP R LFVDPREV
Subjt:  NGPPNRPLFVDPREV

A0A5D3CAW1 Putative N-acetyltransferase HLS1-like8.5e-16870.12Show/hide
Query:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAVKVGYILGLR
        F+IR+YE   E +LSD+AQV DLE+RCEIG SKRVFLFTD LGDPICRIR+SP+YKMLVAE + EVVGVIQGSIK  F + HK   PGL VKVGYILGLR
Subjt:  FVIRNYE---ESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK--RPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPP+RRRGIG++LV  LEDWFV+NDVDYCCMATEKDNHAS+NLFIN++RY+KFRTGRILVNPV N+PYKIN SEI+IQKL+IEEAE IYKKHM STE F
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAAD----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA
        P+DI +ILKN LSLGTW+A++K+Q  P  S+++             SSWA+VSLWNSGEVFKLRLGKAPFPWV+YTKSL++M+K+ PC K++LVP++FK 
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAAD----------IRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKA

Query:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK
        FGFYFVYGLHHEG  SERLVG LC++VHN+A++N+KD  CKAIVTEIGG EDD+LKM IPHWKLLSC ED WC+K+LKS++++            +LEW 
Subjt:  FGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKD--CKAIVTEIGG-EDDELKMAIPHWKLLSCSEDLWCVKALKSEEDS------------LLEWK

Query:  NGPPNRPLFVDPREV
        N PP R LFVDPREV
Subjt:  NGPPNRPLFVDPREV

A0A6J1G758 probable N-acetyltransferase HLS1-like2.7e-23098.97Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
        MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLA KVGY+LGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEI+IQKLKIEEAEEIYKKHMTSTEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPW SAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

A0A6J1L7A2 probable N-acetyltransferase HLS1-like9.2e-22395.9Show/hide
Query:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR
        MGSK+FVIRNYEESRLSDRAQVADLEQRCEIG SKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSS HK PGLA KVGYILGLR
Subjt:  MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLR

Query:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF
        VAPPFRRRGIG SLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTN+PYKINQSEI+IQKLKIEEAEEIYKKHM STEFF
Subjt:  VAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFF

Query:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH
        PKDINSILKNNLSLGTWVAHYKKQPPPW SAAADI  SWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSL+MM+KMLPCLKVILVPDYFKAFGFYFVYGLH
Subjt:  PKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLH

Query:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        HEGACSERLVGVLCE+VHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALK EEDSLLEWKNGPPNRPLFVDPREV
Subjt:  HEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

SwissProt top hitse value%identityAlignment
O64815 Probable N-acetyltransferase HLS1-like1.8e-9044.31Show/hide
Query:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P
        +R Y+ S+  D A V D+E+RCE+GP+ ++ LFTD LGDPICR+RHSP Y MLVAE       E+VG+I+G IKT            TH +         
Subjt:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P

Query:  GLAVKVGYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEA
         L  K+ YILGLRV+P  RR+GIG  LV  +EDWF  N  +Y   ATE DNHASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +A
Subjt:  GLAVKVGYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEA

Query:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC
        E +Y+   ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A  +     SWAV+S+WN  + F+L +  A     V +K+ RM++K LP 
Subjt:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC

Query:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN
        LK+  +P  F+ FG +F+YG+  EG  +E++V  LC++ HNLA      C  +  E+ GE + L+  IPHWK+LSC+EDLWC+K L  +  + S+ +W  
Subjt:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN

Query:  GPPNRPLFVDPRE
         PP   +FVDPRE
Subjt:  GPPNRPLFVDPRE

Q42381 Probable N-acetyltransferase HLS13.2e-8744.72Show/hide
Query:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAVKV
        V+R Y+ +R  D   V D+E+RCE+GPS ++ LFTD LGDPICRIRHSP Y MLVAE      E+VG+I+G IKT          HK        L  K+
Subjt:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAVKV

Query:  GYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEAEEIYKK
         Y+LGLRV+P  RR+GIG  LV  +E+WF  N  +Y  +ATE DN ASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +AE +Y+ 
Subjt:  GYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEAEEIYKK

Query:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV
          ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A  +     SWAV+S+WN  + F L +  A     V  K+ R+++K LP LK+  +
Subjt:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV

Query:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP
        P  F+ FG +F+YG+  EG  + ++V  LC + HNLA   A  C  +  E+ GE D L+  IPHWK+LSC EDLWC+K L  +  D ++ +W   PP   
Subjt:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP

Query:  LFVDPRE
        +FVDPRE
Subjt:  LFVDPRE

Arabidopsis top hitse value%identityAlignment
AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.3e-9144.31Show/hide
Query:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P
        +R Y+ S+  D A V D+E+RCE+GP+ ++ LFTD LGDPICR+RHSP Y MLVAE       E+VG+I+G IKT            TH +         
Subjt:  IRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------P

Query:  GLAVKVGYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEA
         L  K+ YILGLRV+P  RR+GIG  LV  +EDWF  N  +Y   ATE DNHASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +A
Subjt:  GLAVKVGYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEA

Query:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC
        E +Y+   ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A  +     SWAV+S+WN  + F+L +  A     V +K+ RM++K LP 
Subjt:  EEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPC

Query:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN
        LK+  +P  F+ FG +F+YG+  EG  +E++V  LC++ HNLA      C  +  E+ GE + L+  IPHWK+LSC+EDLWC+K L  +  + S+ +W  
Subjt:  LKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKN

Query:  GPPNRPLFVDPRE
         PP   +FVDPRE
Subjt:  GPPNRPLFVDPRE

AT2G23060.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.0e-7242.54Show/hide
Query:  MLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------PGLAVKVGYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDN
        MLVAE       E+VG+I+G IKT            TH +          L  K+ YILGLRV+P  RR+GIG  LV  +EDWF  N  +Y   ATE DN
Subjt:  MLVAE----WNNEVVGVIQGSIKTA--------FSSTHKR--------PGLAVKVGYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDN

Query:  HASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEAEEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAA
        HASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +AE +Y+   ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A
Subjt:  HASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEAEEIYKKHMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAA

Query:  ADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCK
          +     SWAV+S+WN  + F+L +  A     V +K+ RM++K LP LK+  +P  F+ FG +F+YG+  EG  +E++V  LC++ HNLA      C 
Subjt:  ADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCK

Query:  AIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKNGPPNRPLFVDPRE
         +  E+ GE + L+  IPHWK+LSC+EDLWC+K L  +  + S+ +W   PP   +FVDPRE
Subjt:  AIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE--EDSLLEWKNGPPNRPLFVDPRE

AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein8.8e-10148.21Show/hide
Query:  KDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLRVAP
        ++ VIR Y++ R  DR Q+  +E+ CEIG   +  LFTDTLGDPICRIR+SP + MLVA   N++VG IQGS+K      H +   +V+VGY+LGLRV P
Subjt:  KDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLRVAP

Query:  PFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHM-TSTEFFPK
         +RRRGIGS LV  LE+WF +++ DY  MATEKDN AS  LFI  + YV FR   ILVNPV         S+I I+KLK++EAE +Y++++  +TEFFP 
Subjt:  PFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHM-TSTEFFPK

Query:  DINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHE
        DIN IL+N LS+GTWVA+Y            D   SWA++S+W+S +VFKLR+ +AP  +++ TK  ++    L  L + ++PD F  FGFYF+YG+H E
Subjt:  DINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHE

Query:  GACSERLVGVLCEYVHNL-ALSNAKDCKAIVTEI---GGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        G    +LV  LCE+VHN+ AL++   CK +V E+      DD L+  IPHWK+LSC +D+WC+K LK E++     +       LFVDPREV
Subjt:  GACSERLVGVLCEYVHNL-ALSNAKDCKAIVTEI---GGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV

AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.3e-8844.72Show/hide
Query:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAVKV
        V+R Y+ +R  D   V D+E+RCE+GPS ++ LFTD LGDPICRIRHSP Y MLVAE      E+VG+I+G IKT          HK        L  K+
Subjt:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEW---NNEVVGVIQGSIKTA-----FSSTHKRPG-----LAVKV

Query:  GYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEAEEIYKK
         Y+LGLRV+P  RR+GIG  LV  +E+WF  N  +Y  +ATE DN ASVNLF     Y +FRT  ILVNPV  Y +++N S  + + KL+  +AE +Y+ 
Subjt:  GYILGLRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQS-EIRIQKLKIEEAEEIYKK

Query:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV
          ++TEFFP+DI+S+L N LSLGT+VA      Y      W  +A  +     SWAV+S+WN  + F L +  A     V  K+ R+++K LP LK+  +
Subjt:  HMTSTEFFPKDINSILKNNLSLGTWVA-----HYKKQPPPWSSAAADIR---SSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILV

Query:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP
        P  F+ FG +F+YG+  EG  + ++V  LC + HNLA   A  C  +  E+ GE D L+  IPHWK+LSC EDLWC+K L  +  D ++ +W   PP   
Subjt:  PDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSE-EDSLL-EWKNGPPNRP

Query:  LFVDPRE
        +FVDPRE
Subjt:  LFVDPRE

AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.9e-7941.07Show/hide
Query:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK-------RPGL-AVKVGYILG
        V+R Y+  R  D   V +LE+ CE+G      L  D +GDP+ RIR SP + MLVAE  NE+VG+I+G+IK      +         P +   K+ ++ G
Subjt:  VIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHK-------RPGL-AVKVGYILG

Query:  LRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTE
        LRV+P +RR GIG  LV  LE+WF+ ND  Y  + TE DN ASV LF     Y KFRT   LVNPV N+   +++  ++I KL   +AE +Y+   ++TE
Subjt:  LRVAPPFRRRGIGSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTE

Query:  FFPKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYG
        FFP DINSIL N LSLGT++A   +     S +  D   SWAV+S+WNS +V++L++  A     +  KS R+ +   P LK+   P+ FK+F  +F+YG
Subjt:  FFPKDINSILKNNLSLGTWVAHYKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYG

Query:  LHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV
        +  EG  +  +V  LC + HNLA  +   C  +  E+    + L++ IPHWK+LS  EDLWC+K L+ ++D  ++W   PP   +FVDPRE+
Subjt:  LHHEGACSERLVGVLCEYVHNLALSNAKDCKAIVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTCGAAGGATTTCGTTATACGAAACTACGAAGAGAGTCGATTATCAGATAGAGCTCAAGTAGCTGATCTCGAACAACGATGCGAAATTGGGCCATCGAAACGTGT
GTTTCTCTTCACTGACACATTGGGTGACCCCATTTGTAGGATTCGTCATAGTCCCTTGTATAAAATGCTGGTGGCAGAGTGGAACAACGAGGTGGTTGGAGTCATTCAAG
GCTCGATAAAGACAGCGTTTTCTAGTACTCATAAACGGCCGGGTTTGGCGGTGAAAGTGGGCTACATTCTTGGGCTGAGAGTTGCGCCGCCGTTTCGCCGCCGTGGGATT
GGATCCAGCCTTGTCCATGATTTGGAAGATTGGTTTGTAGCTAATGACGTTGATTACTGTTGCATGGCTACTGAGAAAGATAATCATGCCTCTGTTAATCTCTTCATTAA
TCACATGAGGTATGTAAAATTCAGAACGGGAAGAATTCTGGTGAACCCAGTAACAAATTATCCATACAAAATCAACCAATCAGAAATCAGAATTCAAAAGCTGAAAATTG
AAGAAGCAGAAGAAATTTACAAAAAACACATGACATCAACGGAGTTCTTCCCCAAAGACATAAACAGCATATTGAAGAACAATTTGAGCTTAGGGACATGGGTGGCACAT
TACAAGAAACAGCCGCCACCGTGGTCGTCGGCGGCGGCCGACATACGGTCGAGCTGGGCTGTGGTGAGTCTATGGAACAGTGGGGAAGTTTTTAAGTTAAGACTAGGAAA
AGCTCCATTTCCATGGGTGGTTTATACAAAGAGCTTAAGAATGATGGAGAAAATGTTGCCTTGTTTGAAGGTGATTTTGGTGCCTGATTATTTCAAGGCATTTGGGTTTT
ATTTTGTTTATGGGTTGCATCATGAAGGGGCTTGTTCTGAGAGATTGGTTGGGGTATTGTGTGAGTATGTTCATAATTTGGCTTTGAGTAATGCGAAGGATTGTAAGGCT
ATTGTTACAGAGATTGGTGGGGAAGATGATGAGCTGAAGATGGCTATTCCTCATTGGAAATTGCTATCATGTTCGGAAGATTTGTGGTGTGTTAAGGCCTTGAAGAGTGA
GGAGGATAGTCTCTTGGAATGGAAAAATGGCCCACCAAATAGACCTCTCTTTGTAGACCCAAGAGAGGTATGA
mRNA sequenceShow/hide mRNA sequence
TATGAGTATCGTTATTGTTTTTCCCTTAATTTAAGAGTTCGATCCATTTTTTAATTAATGGGGTCGAAGGATTTCGTTATACGAAACTACGAAGAGAGTCGATTATCAGA
TAGAGCTCAAGTAGCTGATCTCGAACAACGATGCGAAATTGGGCCATCGAAACGTGTGTTTCTCTTCACTGACACATTGGGTGACCCCATTTGTAGGATTCGTCATAGTC
CCTTGTATAAAATGCTGGTGGCAGAGTGGAACAACGAGGTGGTTGGAGTCATTCAAGGCTCGATAAAGACAGCGTTTTCTAGTACTCATAAACGGCCGGGTTTGGCGGTG
AAAGTGGGCTACATTCTTGGGCTGAGAGTTGCGCCGCCGTTTCGCCGCCGTGGGATTGGATCCAGCCTTGTCCATGATTTGGAAGATTGGTTTGTAGCTAATGACGTTGA
TTACTGTTGCATGGCTACTGAGAAAGATAATCATGCCTCTGTTAATCTCTTCATTAATCACATGAGGTATGTAAAATTCAGAACGGGAAGAATTCTGGTGAACCCAGTAA
CAAATTATCCATACAAAATCAACCAATCAGAAATCAGAATTCAAAAGCTGAAAATTGAAGAAGCAGAAGAAATTTACAAAAAACACATGACATCAACGGAGTTCTTCCCC
AAAGACATAAACAGCATATTGAAGAACAATTTGAGCTTAGGGACATGGGTGGCACATTACAAGAAACAGCCGCCACCGTGGTCGTCGGCGGCGGCCGACATACGGTCGAG
CTGGGCTGTGGTGAGTCTATGGAACAGTGGGGAAGTTTTTAAGTTAAGACTAGGAAAAGCTCCATTTCCATGGGTGGTTTATACAAAGAGCTTAAGAATGATGGAGAAAA
TGTTGCCTTGTTTGAAGGTGATTTTGGTGCCTGATTATTTCAAGGCATTTGGGTTTTATTTTGTTTATGGGTTGCATCATGAAGGGGCTTGTTCTGAGAGATTGGTTGGG
GTATTGTGTGAGTATGTTCATAATTTGGCTTTGAGTAATGCGAAGGATTGTAAGGCTATTGTTACAGAGATTGGTGGGGAAGATGATGAGCTGAAGATGGCTATTCCTCA
TTGGAAATTGCTATCATGTTCGGAAGATTTGTGGTGTGTTAAGGCCTTGAAGAGTGAGGAGGATAGTCTCTTGGAATGGAAAAATGGCCCACCAAATAGACCTCTCTTTG
TAGACCCAAGAGAGGTATGA
Protein sequenceShow/hide protein sequence
MGSKDFVIRNYEESRLSDRAQVADLEQRCEIGPSKRVFLFTDTLGDPICRIRHSPLYKMLVAEWNNEVVGVIQGSIKTAFSSTHKRPGLAVKVGYILGLRVAPPFRRRGI
GSSLVHDLEDWFVANDVDYCCMATEKDNHASVNLFINHMRYVKFRTGRILVNPVTNYPYKINQSEIRIQKLKIEEAEEIYKKHMTSTEFFPKDINSILKNNLSLGTWVAH
YKKQPPPWSSAAADIRSSWAVVSLWNSGEVFKLRLGKAPFPWVVYTKSLRMMEKMLPCLKVILVPDYFKAFGFYFVYGLHHEGACSERLVGVLCEYVHNLALSNAKDCKA
IVTEIGGEDDELKMAIPHWKLLSCSEDLWCVKALKSEEDSLLEWKNGPPNRPLFVDPREV