; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012010 (gene) of Snake gourd v1 genome

Gene IDTan0012010
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationLG04:83930459..83934208
RNA-Seq ExpressionTan0012010
SyntenyTan0012010
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606840.1 putative N-acetyltransferase HLS1, partial [Cucurbita argyrosperma subsp. sororia]6.2e-22492.67Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MV+KIADEYR QVVSNTEENR L VVREYCEERDK+SVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDI+GVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVST HRRLGVG KLV+HLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP++ 
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        AKVYRHLFANAEFF  DIDAILSNKLNLGTFMAVPKKLLPKW+PETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG+SMRG NG RLMKSLC FVHNMAKDD GCGAVV EV QQDPVR+AIPHW+RLSWN DLWCIKKLADLQ  ECERSE SDWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

KAG7036544.1 putative N-acetyltransferase HLS1, partial [Cucurbita argyrosperma subsp. argyrosperma]2.8e-22492.91Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MV+KIADEYR QVVSNTEENR L VVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDI+GVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVST HRRLGVG KLV+HLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP++ 
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        AKVYRHLFANAEFF  DIDAILSNKLNLGTFMAVPKKLLPKW+PETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG+SMRG NG RLMKSLC FVHNMAKDD GCGAVV EV QQDPVR+AIPHW+RLSWN DLWCIKKLADLQ  ECERSE SDWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

XP_022949178.1 probable N-acetyltransferase HLS1 isoform X1 [Cucurbita moschata]6.2e-22492.67Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MV+KIADEYR QVVSNTEENR L VVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDI+GVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVST HRRLGVG KLV+HLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP++ 
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        AKVYRHLFANAEFF  DIDAILSNKLNLGTFMAVPKKLLPKW+PETGILPQSFAILSVWNTKEVFKLQVKGVS+LTYACCMGSRLLDSWLPWLRLPSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG+SMRG NG RLMKSLC FVHNMAKDD GCGAVV EV QQDPVR+AIPHW+RLSWN DLWCIKKLADLQ  ECERSE SDWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

XP_022998449.1 probable N-acetyltransferase HLS1 isoform X1 [Cucurbita maxima]3.3e-22593.15Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MV+KIADEYR QVVSNTEENR L VVREYCEE+DKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDI+GVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVST HRRLGVG KLV+HLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP++ 
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        AKVYRHLFANAEFF TDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG+SMRG NG RLMKSLC FVHNMAKDD GCGAVV EV QQDPVR+AIPHW+RLSWN DLWCIKKLADLQ  ECERSE SDWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

XP_038903362.1 probable N-acetyltransferase HLS1 isoform X2 [Benincasa hispida]1.0e-22692.67Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MVLKIADEYRLQV SNTEEN   V+VREYCEE DKVSVEKMERQCDVGQKGKPSIFTDLLGDPICR+RHFPLHVMLVAEYG+AR+I+GVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCI+LFTQKFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP+IA
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
         K+YRHLFANAEFFATDIDAIL NKLNLGTFMAVPKKLLPKW+PETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLR+PSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG++MRG NG RLMKSLC FVHNMAKDD GCGAVVTEVGQQDPVR+AIPHWRRLSWN DLWCIKKL DLQGD CERS+TSDWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

TrEMBL top hitse value%identityAlignment
A0A0A0L9Z5 N-acetyltransferase domain-containing protein1.4e-22189.98Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MVLKIADEYRLQV SNTEENR LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICR+RHFP HVMLVAEYG+AR+IVGVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQH+EEWCKQKGADYAYIATDCANQP I+LFTQKF+YTKFRSPTVLVQPVHAHYK IGS I+IVR+PP++A
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
         K+YRHLFANAEFFA DIDAIL NKLNLGTFMAVPKKLLPKW+PETGILPQSFA+LSVWNTKEVFKLQVKG+SKLTYACCMGSRLLDSWLPWLR+PSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG++MRG NG RLMKSLC FVHNMAKDD GCGA+VTEVGQQDPVR+AIPHW+RLSWN DLWCIKKL DL+GD  E S+T DWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

A0A5A7T819 Putative N-acetyltransferase HLS16.9e-22190.22Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MVLKIADEYRL V SNTEENR LVVVREYCEERDK SVEKMERQCDVGQKGKPSIFTDLLGDPICR+RHFP HVMLVAEYG+AR+IVGVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQP I+LFT+KFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP++A
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
         K+YR+LFANAEFFA DIDAIL NKLNLGTFMA+PKKLLPKW+PETGILPQSFA+LSVWNTKEVFKLQVKG+SKLTYACCMGSRLLDSWLPWLR+PSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG++MRG NG RLMKSLC FVHNMAKDD GCGAVVTEVGQQDPVR+AIPHWRRLSWN DLWCIKKL DL+GD  E S+T DWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

A0A6J1DJ28 probable N-acetyltransferase HLS13.9e-22491.93Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MVLKIADEYR  VVSNTE N  LVVVREYCEERDKVSVEKME QCDVGQKGKPSIFTDLLGDPICRIR FPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVS+THRRLGVGTKLVQHLEEWCKQKGADYAYIAT+CANQP INLFTQKFSYTKFRSPTVLVQPVHAHYK IGS I IVR+PP++A
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
         KVY HLFAN+EFF+ DIDAILSNKLNLGTFMAVPKKLLPKW+PETGILPQSFA+LSVWNTKEVFKLQVKG+SKLTYACCMGSRLLDSWLPWLRLPSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG+SM G NGPRLMKSLC FVHNMAKDD GCGAVVTEVGQQDPVR+AIPHWRRLSWN DLWCIKKLA+LQGDECERSETSDWIKSPPSS+
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

A0A6J1GBB5 probable N-acetyltransferase HLS1 isoform X13.0e-22492.67Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MV+KIADEYR QVVSNTEENR L VVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDI+GVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVST HRRLGVG KLV+HLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP++ 
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        AKVYRHLFANAEFF  DIDAILSNKLNLGTFMAVPKKLLPKW+PETGILPQSFAILSVWNTKEVFKLQVKGVS+LTYACCMGSRLLDSWLPWLRLPSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG+SMRG NG RLMKSLC FVHNMAKDD GCGAVV EV QQDPVR+AIPHW+RLSWN DLWCIKKLADLQ  ECERSE SDWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

A0A6J1KED0 probable N-acetyltransferase HLS1 isoform X11.6e-22593.15Show/hide
Query:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG
        MV+KIADEYR QVVSNTEENR L VVREYCEE+DKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDI+GVIRGCIKHVTTG
Subjt:  MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTG

Query:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA
        HSHHVLKLAYILGLRVST HRRLGVG KLV+HLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYK IGS IAIVRIPP++ 
Subjt:  HSHHVLKLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIA

Query:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        AKVYRHLFANAEFF TDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
Subjt:  AKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VFSQFGVYFLYG+SMRG NG RLMKSLC FVHNMAKDD GCGAVV EV QQDPVR+AIPHW+RLSWN DLWCIKKLADLQ  ECERSE SDWIKSPPSSA
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRDI
        GIFVDPRDI
Subjt:  GIFVDPRDI

SwissProt top hitse value%identityAlignment
O64815 Probable N-acetyltransferase HLS1-like3.3e-10346.51Show/hide
Query:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG--EARDIVGVIRGCIKHVTTGHSHHVL--------------
        LV VREY   +D  +VE +ER+C+VG  GK S+FTDLLGDPICR+RH P ++MLVAE G  E +++VG+IRGCIK VT G +   L              
Subjt:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG--EARDIVGVIRGCIKHVTTGHSHHVL--------------

Query:  -----KLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAA
             KLAYILGLRVS THRR G+G KLV+ +E+W  Q GA+Y+Y AT+  N   +NLFT K  Y +FR+P++LV PV+AH  +I  R+ ++++ P+ A 
Subjt:  -----KLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAA

Query:  KVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWL
         +YR  F+  EFF  DID++L+NKL+LGTF+AVP+          W      L   P S+A+LSVWN K+ F+L+V+G S+L       +R++D  LP+L
Subjt:  KVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWL

Query:  RLPSFPDVFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWI
        ++PS P VF  FG++F+YG+   G    +++K+LC   HN+AK + GCG V  EV  ++P+R  IPHW+ LS   DLWCIK+L    G++       DW 
Subjt:  RLPSFPDVFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWI

Query:  KSPPSSAGIFVDPRD
        KSPP  + IFVDPR+
Subjt:  KSPPSSAGIFVDPRD

Q42381 Probable N-acetyltransferase HLS15.4e-10649.02Show/hide
Query:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG-EARDIVGVIRGCIKHVTTGHS---HH----------VLKL
        + VVREY   RD V VE +ER+C+VG  GK S+FTDLLGDPICRIRH P ++MLVAE G E ++IVG+IRGCIK VT G     +H            KL
Subjt:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG-EARDIVGVIRGCIKHVTTGHS---HH----------VLKL

Query:  AYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLF
        AY+LGLRVS  HRR G+G KLV+ +EEW +Q GA+Y+YIAT+  NQ  +NLFT K  Y++FR+P++LV PV+AH  ++  R+ ++++ P  A  +YR  F
Subjt:  AYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLF

Query:  ANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        +  EFF  DID++L+NKL+LGTF+AVP+          W      L   P+S+A+LSVWN K+ F L+V+G S+L       +R++D  LP+L+LPS P 
Subjt:  ANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VF  FG++F+YG+   G    +++KSLC   HN+AK   GCG V  EV  +DP+R  IPHW+ LS + DLWCIK+L    GD+       DW KSPP   
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRD
         IFVDPR+
Subjt:  GIFVDPRD

Arabidopsis top hitse value%identityAlignment
AT2G23060.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.3e-10446.51Show/hide
Query:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG--EARDIVGVIRGCIKHVTTGHSHHVL--------------
        LV VREY   +D  +VE +ER+C+VG  GK S+FTDLLGDPICR+RH P ++MLVAE G  E +++VG+IRGCIK VT G +   L              
Subjt:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG--EARDIVGVIRGCIKHVTTGHSHHVL--------------

Query:  -----KLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAA
             KLAYILGLRVS THRR G+G KLV+ +E+W  Q GA+Y+Y AT+  N   +NLFT K  Y +FR+P++LV PV+AH  +I  R+ ++++ P+ A 
Subjt:  -----KLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAA

Query:  KVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWL
         +YR  F+  EFF  DID++L+NKL+LGTF+AVP+          W      L   P S+A+LSVWN K+ F+L+V+G S+L       +R++D  LP+L
Subjt:  KVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWL

Query:  RLPSFPDVFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWI
        ++PS P VF  FG++F+YG+   G    +++K+LC   HN+AK + GCG V  EV  ++P+R  IPHW+ LS   DLWCIK+L    G++       DW 
Subjt:  RLPSFPDVFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWI

Query:  KSPPSSAGIFVDPRD
        KSPP  + IFVDPR+
Subjt:  KSPPSSAGIFVDPRD

AT2G23060.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.2e-8544.63Show/hide
Query:  MLVAEYG--EARDIVGVIRGCIKHVTTGHSHHVL-------------------KLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCAN
        MLVAE G  E +++VG+IRGCIK VT G +   L                   KLAYILGLRVS THRR G+G KLV+ +E+W  Q GA+Y+Y AT+  N
Subjt:  MLVAEYG--EARDIVGVIRGCIKHVTTGHSHHVL-------------------KLAYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCAN

Query:  QPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGI
           +NLFT K  Y +FR+P++LV PV+AH  +I  R+ ++++ P+ A  +YR  F+  EFF  DID++L+NKL+LGTF+AVP+          W      
Subjt:  QPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLFANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGI

Query:  L---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPDVFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVV
        L   P S+A+LSVWN K+ F+L+V+G S+L       +R++D  LP+L++PS P VF  FG++F+YG+   G    +++K+LC   HN+AK + GCG V 
Subjt:  L---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPDVFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVV

Query:  TEVGQQDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWIKSPPSSAGIFVDPRD
         EV  ++P+R  IPHW+ LS   DLWCIK+L    G++       DW KSPP  + IFVDPR+
Subjt:  TEVGQQDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWIKSPPSSAGIFVDPRD

AT2G30090.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.4e-6938.75Show/hide
Query:  EENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTGHSHHVLKLAYILGLRVS
        EE  + VV+R Y + RD++ + +ME+ C++G   +  +FTD LGDPICRIR+ P  +MLVA  G    +VG I+G +K V   H   V ++ Y+LGLRV 
Subjt:  EENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTGHSHHVLKLAYILGLRVS

Query:  TTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVH-AHYKSIGSRIAIVRIPPNIAAKVY-RHLFANAEFFA
         ++RR G+G+ LV+ LEEW +   ADYAY+AT+  N+    LF  +  Y  FR+P +LV PV+      + S I I ++    A  +Y R++ A  EFF 
Subjt:  TTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVH-AHYKSIGSRIAIVRIPPNIAAKVY-RHLFANAEFFA

Query:  TDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPDVFSQFGVYFLYGVSM
         DI+ IL NKL++GT++A    +            +S+A+LSVW++ +VFKL+++            S+L  ++L  L L   PD+F+ FG YFLYGV  
Subjt:  TDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPDVFSQFGVYFLYGVSM

Query:  RGINGPRLMKSLCKFVHNMA--KDDAGCGAVVTEVGQ----QDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWIKSPPSSAGIFVDPRDI
         G +  +L+++LC+ VHNMA   D   C  VV EV +     D ++  IPHW+ LS  +D+WCIK L      +CE+++  D  +   S + +FVDPR++
Subjt:  RGINGPRLMKSLCKFVHNMA--KDDAGCGAVVTEVGQ----QDPVRMAIPHWRRLSW-NDLWCIKKLADLQGDECERSETSDWIKSPPSSAGIFVDPRDI

AT4G37580.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.9e-10749.02Show/hide
Query:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG-EARDIVGVIRGCIKHVTTGHS---HH----------VLKL
        + VVREY   RD V VE +ER+C+VG  GK S+FTDLLGDPICRIRH P ++MLVAE G E ++IVG+IRGCIK VT G     +H            KL
Subjt:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYG-EARDIVGVIRGCIKHVTTGHS---HH----------VLKL

Query:  AYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLF
        AY+LGLRVS  HRR G+G KLV+ +EEW +Q GA+Y+YIAT+  NQ  +NLFT K  Y++FR+P++LV PV+AH  ++  R+ ++++ P  A  +YR  F
Subjt:  AYILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLF

Query:  ANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD
        +  EFF  DID++L+NKL+LGTF+AVP+          W      L   P+S+A+LSVWN K+ F L+V+G S+L       +R++D  LP+L+LPS P 
Subjt:  ANAEFFATDIDAILSNKLNLGTFMAVPKKLL-----PKWNPETGIL---PQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPD

Query:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA
        VF  FG++F+YG+   G    +++KSLC   HN+AK   GCG V  EV  +DP+R  IPHW+ LS + DLWCIK+L    GD+       DW KSPP   
Subjt:  VFSQFGVYFLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWN-DLWCIKKLADLQGDECERSETSDWIKSPPSSA

Query:  GIFVDPRD
         IFVDPR+
Subjt:  GIFVDPRD

AT5G67430.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.0e-9143.25Show/hide
Query:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTGHSH-----------HVLKLAYI
        +VVVREY  +RD  SVE++E  C+VG     S+  DL+GDP+ RIR  P   MLVAE G   +IVG+IRG IK VT G +            +  KLA++
Subjt:  LVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTGHSH-----------HVLKLAYI

Query:  LGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLFANA
         GLRVS  +RR+G+G KLVQ LEEW  +  A Y+Y+ T+  N   + LFT+K  Y+KFR+PT LV PV  H  ++  R+ I+++ P+ A  +YR+ F+  
Subjt:  LGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLFANA

Query:  EFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQ---SFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPDVFSQFGVY
        EFF +DI++IL+NKL+LGT++AVP+      +  +G LP    S+A++S+WN+K+V++LQVKG S+L       +R+ D   P+L++PSFP++F  F ++
Subjt:  EFFATDIDAILSNKLNLGTFMAVPKKLLPKWNPETGILPQ---SFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPDVFSQFGVY

Query:  FLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWNDLWCIKKLADLQGDECERSETSDWIKSPPSSAGIFVDPRDI
        F+YG+   G     ++++LC   HN+A+  +GC  V  EV   +P+R+ IPHW+ LS  DLWC+K+L           +  DW KSPP    IFVDPR+I
Subjt:  FLYGVSMRGINGPRLMKSLCKFVHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWNDLWCIKKLADLQGDECERSETSDWIKSPPSSAGIFVDPRDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTGAAGATAGCTGATGAATATAGGCTACAAGTTGTGTCAAACACAGAGGAAAATAGAAAGTTGGTTGTAGTGAGAGAATATTGTGAAGAGAGAGATAAGGTGTC
AGTGGAGAAGATGGAGAGGCAGTGTGATGTTGGACAGAAAGGGAAGCCTTCTATTTTCACTGATCTTCTGGGTGATCCTATTTGCCGCATTCGCCACTTCCCTTTGCACG
TCATGCTGGTTGCAGAGTATGGAGAAGCAAGAGATATTGTAGGAGTTATAAGAGGATGTATCAAACATGTGACAACAGGTCATTCTCATCATGTTCTAAAGCTGGCTTAT
ATTTTGGGCTTAAGGGTCTCTACCACTCACAGGAGGCTCGGAGTTGGCACAAAACTAGTTCAACATCTAGAAGAATGGTGCAAGCAAAAGGGTGCAGACTATGCATATAT
AGCAACAGACTGCGCCAACCAGCCGTGCATCAACTTGTTTACACAAAAGTTTTCATACACAAAATTCAGATCCCCCACAGTCCTGGTTCAGCCTGTCCACGCCCATTACA
AGTCAATAGGCTCAAGGATTGCCATTGTCCGAATTCCTCCAAACATTGCCGCTAAAGTTTATCGCCATCTCTTTGCAAATGCCGAGTTCTTCGCCACAGACATTGACGCC
ATCTTGTCCAACAAGCTTAACTTGGGCACTTTCATGGCCGTTCCCAAGAAGCTGCTACCCAAATGGAACCCTGAAACAGGAATCCTTCCTCAGAGTTTTGCAATCTTGAG
CGTGTGGAACACTAAAGAAGTTTTCAAGTTGCAGGTGAAGGGAGTGTCCAAGCTAACTTATGCATGTTGCATGGGGAGTAGATTGTTGGATTCATGGCTACCGTGGTTGA
GATTACCTTCATTTCCAGATGTATTTAGCCAATTTGGAGTGTATTTCTTGTATGGAGTATCCATGAGAGGAATTAATGGGCCACGCCTTATGAAGTCTCTATGCAAATTC
GTGCATAACATGGCTAAGGACGATGCAGGATGTGGGGCGGTGGTGACAGAGGTAGGCCAACAGGATCCTGTGAGAATGGCCATTCCCCATTGGAGGAGACTTTCATGGAA
TGATTTATGGTGCATCAAGAAGCTGGCAGATTTGCAGGGGGATGAGTGTGAAAGATCTGAAACGTCTGATTGGATCAAATCTCCACCGTCTTCAGCAGGGATATTTGTTG
ACCCTCGAGACATCTAA
mRNA sequenceShow/hide mRNA sequence
AACATACGGTGAGACCCACATAATTAGAGTTTCTAAAATGGTGTATAAATATGGGGAGATAGTGGGGTTGAAGCTCCAAACACAATCCTAAGAGGGGTTTTAGTACTTAG
TAGAGGTTTCCTCTGTCAAAACTATGGTGCTGAAGATAGCTGATGAATATAGGCTACAAGTTGTGTCAAACACAGAGGAAAATAGAAAGTTGGTTGTAGTGAGAGAATAT
TGTGAAGAGAGAGATAAGGTGTCAGTGGAGAAGATGGAGAGGCAGTGTGATGTTGGACAGAAAGGGAAGCCTTCTATTTTCACTGATCTTCTGGGTGATCCTATTTGCCG
CATTCGCCACTTCCCTTTGCACGTCATGCTGGTTGCAGAGTATGGAGAAGCAAGAGATATTGTAGGAGTTATAAGAGGATGTATCAAACATGTGACAACAGGTCATTCTC
ATCATGTTCTAAAGCTGGCTTATATTTTGGGCTTAAGGGTCTCTACCACTCACAGGAGGCTCGGAGTTGGCACAAAACTAGTTCAACATCTAGAAGAATGGTGCAAGCAA
AAGGGTGCAGACTATGCATATATAGCAACAGACTGCGCCAACCAGCCGTGCATCAACTTGTTTACACAAAAGTTTTCATACACAAAATTCAGATCCCCCACAGTCCTGGT
TCAGCCTGTCCACGCCCATTACAAGTCAATAGGCTCAAGGATTGCCATTGTCCGAATTCCTCCAAACATTGCCGCTAAAGTTTATCGCCATCTCTTTGCAAATGCCGAGT
TCTTCGCCACAGACATTGACGCCATCTTGTCCAACAAGCTTAACTTGGGCACTTTCATGGCCGTTCCCAAGAAGCTGCTACCCAAATGGAACCCTGAAACAGGAATCCTT
CCTCAGAGTTTTGCAATCTTGAGCGTGTGGAACACTAAAGAAGTTTTCAAGTTGCAGGTGAAGGGAGTGTCCAAGCTAACTTATGCATGTTGCATGGGGAGTAGATTGTT
GGATTCATGGCTACCGTGGTTGAGATTACCTTCATTTCCAGATGTATTTAGCCAATTTGGAGTGTATTTCTTGTATGGAGTATCCATGAGAGGAATTAATGGGCCACGCC
TTATGAAGTCTCTATGCAAATTCGTGCATAACATGGCTAAGGACGATGCAGGATGTGGGGCGGTGGTGACAGAGGTAGGCCAACAGGATCCTGTGAGAATGGCCATTCCC
CATTGGAGGAGACTTTCATGGAATGATTTATGGTGCATCAAGAAGCTGGCAGATTTGCAGGGGGATGAGTGTGAAAGATCTGAAACGTCTGATTGGATCAAATCTCCACC
GTCTTCAGCAGGGATATTTGTTGACCCTCGAGACATCTAAGTATCTTTTCTTCTTCTTTTCTTTTCCTTTCTTCTCTCCCTCTTCAGTACAGAAAATGACAAAATCCTGC
TGTGCACTGTCTTACAGTACAAAAGGAATGTTCAAACATCTCTAGTCTAGTCTCTCTCTCTTTTTTTTTTTTTTTTTTAATACTCTTTAGTCAATGACAACTGACAGAAG
CTAAGCTAGCTTACACTTTTTAAGTCACATTCCAATCATTGGTCTTGAACAAGTTCTTGAAGTCAGTTGAGAGCTAATGTGACAAGTGGGTAGAAAATTGCAAAATATCA
TTAGAAATTTGGGTATTTATTAAAATTTACTCAATTTAAATAATTATACATATGTAACTCTAATTTTCTTTCAACTAAAGGCTGGATCAACCTTATATATGAAAATTCAT
CATCTCAACATATGCTCTCCTTTAATTTGGAGATTTTTGTTTTATAAAGTAGAAAATCCTCTCCAAGTCAGTGATAGGAAGAGAAGAAATATGATGGCTGGAGGAAGGAG
GCAATAATCGAATAGGACAATTTGTTTATATATAAAAAAAGTAACAACATACAGTTGGTCCGTGCTCTCAAACCCCCTTGTGCACCATTCTTCCGAATCTTCTTCTCCCC
TTTTTCTTATCTGTTACTGGACTGACTCCATCTCTCTCCAATAACACTATGAACCATAAAAAAAAGATGGATGCTGGAAAACCATCTCTCTCCTCCTATGACCATGAGCC
ATAACAAAAAAAAAAAAAAGGAAGATTGATGCTGGAAAAGTTTACAGTAATTCCATGATATTTCCACTAACATTTAATCACATGATTAGAATTCAAAGTAATCTCATACT
TGTAAATGGATAAAAGTCTACTGTCACATCCGGTCATCTGAAATCAGGAATAGAATGGTTTGAGAGCTTAGAGACATTTATATGGTACCTACAAAGGAAACTCCAACATT
CAAGTTCGTATTTGAGAACGTAGTTCAATAAACTTGTATCCCAGCACGTTCACATATGTGCACAAGAACCTGGCTCCTAGGGGTATACCGTTATTTGTAAAGCTGAGCAA
GCTAGTTACTACAAAGAGTTGAAGATAGCTTCCCAAACTTCCTCTACATATTGCATGATTGTTGGAGATGGTTTAACTGGAGAGTTGTCTGGCTGTCAAATTCTTTAATT
ATAGAAGTCTTGGCCGAGCCAAGCCCCTTGGCTTGTTCAGCTGTTTGAGGAACATAGTGTGTTTTAGCCGCTAAGGATGACTATGATTCTGCAAGGTGTTATAATGGTAG
ACGTTAGAAAAAGACAGTGCATTTATATCTAAATTGAGATATTTCAAGATGAACTTGAATTGGGTGCTTCCATTAAAAACACAAGGTGTCTTCAGAAAAGTCACCTTTCA
CCTTGAAGGGTTAAGATGAAC
Protein sequenceShow/hide protein sequence
MVLKIADEYRLQVVSNTEENRKLVVVREYCEERDKVSVEKMERQCDVGQKGKPSIFTDLLGDPICRIRHFPLHVMLVAEYGEARDIVGVIRGCIKHVTTGHSHHVLKLAY
ILGLRVSTTHRRLGVGTKLVQHLEEWCKQKGADYAYIATDCANQPCINLFTQKFSYTKFRSPTVLVQPVHAHYKSIGSRIAIVRIPPNIAAKVYRHLFANAEFFATDIDA
ILSNKLNLGTFMAVPKKLLPKWNPETGILPQSFAILSVWNTKEVFKLQVKGVSKLTYACCMGSRLLDSWLPWLRLPSFPDVFSQFGVYFLYGVSMRGINGPRLMKSLCKF
VHNMAKDDAGCGAVVTEVGQQDPVRMAIPHWRRLSWNDLWCIKKLADLQGDECERSETSDWIKSPPSSAGIFVDPRDI