; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0018224 (gene) of Chayote v1 genome

Gene IDSed0018224
OrganismSechium edule (Chayote v1)
DescriptionAmino-acid N-acetyltransferase
Genome locationLG14:18310201..18315140
RNA-Seq ExpressionSed0018224
SyntenySed0018224
Gene Ontology termsGO:0006526 - arginine biosynthetic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0004042 - acetyl-CoA:L-glutamate N-acetyltransferase activity (molecular function)
GO:0103045 - methione N-acyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR001048 - Aspartate/glutamate/uridylate kinase
IPR010167 - Amino-acid N-acetyltransferase
IPR016181 - Acyl-CoA N-acyltransferase
IPR033719 - N-acetylglutamate synthase, kinase-like domain
IPR036393 - Acetylglutamate kinase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594971.1 putative amino-acid acetyltransferase NAGS2, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]1.2e-28983.01Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        MAAA+SSS+LHFQ+  EFP KTFPYS  FRNG  GFKF  VE+WD GLKKVGG  R FNRF+ GDGA EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVVIISG+IVSSSYL+ ILKDIAFLHHLGIRFILVPGT VLIDK+LAERGSKPNFVGQYRVTD E+LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    ED +PT                            FQNGVGFG G
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG

Query:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR
        NG WSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTM+ASDLYEGTR+ARV+D++GIRQIIQPLEMAGTLVR
Subjt:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR

Query:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
        RTDEELL+SLDS VVVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
Subjt:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN

Query:  RRRKINLSRKSKYYMKKLLPDRNR
        RR++INLSRKSKYYMKKLLPDRNR
Subjt:  RRRKINLSRKSKYYMKKLLPDRNR

XP_022962975.1 probable amino-acid acetyltransferase NAGS2, chloroplastic isoform X1 [Cucurbita moschata]1.8e-29083.07Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        MAAALSSS+LHFQ+  EFP KTFPYS  FRNG  GFKF  VE+WD GLKKVGG  R FNRF+ GDGA EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVVIISG+IVSSSYL+ ILKDIAFLHHLGIRFILVPGT VLIDK+LAERGSKPNFVGQYRVTD E+LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    ED +PT                            FQNGVGFG G
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG

Query:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR
        NG WSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTM+ASDLYEGTR+ARV+D++GIRQIIQPLEMAGTLVR
Subjt:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR

Query:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
        RTDEELL+SLDS VVVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
Subjt:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN

Query:  RRRKINLSRKSKYYMKKLLPDRNRIG
        RR++INLSRKSKYYMKKLLPDRNR G
Subjt:  RRRKINLSRKSKYYMKKLLPDRNRIG

XP_023003204.1 probable amino-acid acetyltransferase NAGS2, chloroplastic isoform X2 [Cucurbita maxima]1.2e-28982.75Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        MAAALSSS+LHFQ+  EFP KTFPYS  FRNG  GFKF  VE+WD GLKKVGG  R FNRF+ GDGA EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVVIISG+IVSSSYL+ ILKDIAFLHHLGIRFILVPGT VLIDK+LAERGSKPNFVGQYRVTD E+LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        S+RWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    ED +PT                            FQNGVGFG G
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG

Query:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR
        NG WSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTM+ASDLYEGTR+ARV+D++GIRQIIQPLEMAGTLVR
Subjt:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR

Query:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
        RTDEELL+S+DS VVVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
Subjt:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN

Query:  RRRKINLSRKSKYYMKKLLPDRNRIG
        RR+ INLSRKSKYYMKKLLPDRNR G
Subjt:  RRRKINLSRKSKYYMKKLLPDRNRIG

XP_023517796.1 probable amino-acid acetyltransferase NAGS2, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo]9.1e-29082.91Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        MAAALSSS+LHFQ+  EFP KTFP S  FRNG  GFKF RVE+WD GLKKVGG  R FNRF+ GDGA EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVVIISG+IVSSSYL+ ILKDIAFLHHLGIRFILVPGT VLIDK+LAERGSKPNFVGQYRVTD E+LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        SRRWHEVGVSVVSGNFLAAKRRGV+DGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    ED +PT                            FQNGVGFG G
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG

Query:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR
        NG WSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTM+ASDLYEGTR+ARV+D++GIRQIIQPLEMAGTLVR
Subjt:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR

Query:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
        RTDEELL+SLDS VVVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
Subjt:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN

Query:  RRRKINLSRKSKYYMKKLLPDRNRIG
        RR++INLSRKSKYYMKKLLPDRNR G
Subjt:  RRRKINLSRKSKYYMKKLLPDRNRIG

XP_038881460.1 probable amino-acid acetyltransferase NAGS2, chloroplastic isoform X1 [Benincasa hispida]5.0e-28884.01Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNGG--FKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        MAA+LSSSQL+ Q+P  FP K  PY  SFRNG   FKF +VE+WD GLKKVG N RRFN  + G+G  EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNGG--FKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVV+ISG+IVSSSYLD ILKDIAFLHHLGIRFILVPGTHVLIDKLL ERGSKPNFVGQYRVTD +SLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD-------------------PLEDQT-------------PTFQNGVGFGYGNGRWSSEQGFAIGG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    LED T             P FQNGVGFG G G WSSEQGFAIGG
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD-------------------PLEDQT-------------PTFQNGVGFGYGNGRWSSEQGFAIGG

Query:  ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFV
        ERNSH+NGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTR+ARV+DL+GIRQIIQPLEMAGTLVRRTDEELL+SLDSFV
Subjt:  ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFV

Query:  VVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYY
        VVEREGQIIACAALFPFFEE+CGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE RR KINLSRKSKYY
Subjt:  VVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYY

Query:  MKKLLPDRNRIGA
        MKKLLPDRNRIG+
Subjt:  MKKLLPDRNRIGA

TrEMBL top hitse value%identityAlignment
A0A0A0KIJ1 Amino-acid N-acetyltransferase3.1e-28382.87Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNGG--FKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        M+A+LSSSQL+ Q+   FP K  PYS SF+NG   FKF ++E+ D GLKKVG + RRFN F+ G+G  EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNGG--FKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVV+ISG+IVSSSYLD ILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTD +SLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD-------------------PLEDQT-------------PTFQNGVGFGYGNGRWSSEQGFAIGG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    LE  T             P FQNGVGFG+G G WSSEQGF IGG
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD-------------------PLEDQT-------------PTFQNGVGFGYGNGRWSSEQGFAIGG

Query:  ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFV
        ERNSH+NGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTR+ARV+DL+GIRQII PLEMAGTLVRR+DEELL+SLDSFV
Subjt:  ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFV

Query:  VVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYY
        VVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE RR KINLSR SKYY
Subjt:  VVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYY

Query:  MKKLLPDRNRIGA
        MKKLLPDRNRIG+
Subjt:  MKKLLPDRNRIGA

A0A1S3B0M9 Amino-acid N-acetyltransferase9.2e-28884.01Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        M+A+LSSSQL+ Q+   FP K  PYS SF+NG   FKF +VE+W+ GLKKVG N RRFN F+ G+G  EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVV+ISG+IVSSSYLD ILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTD +SLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVV-------------------GKDPLEDQT-------------PTFQNGVGFGYGNGRWSSEQGFAIGG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVV                   GK  LED T             P FQNGVGFG+G G WSSEQGF IGG
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVV-------------------GKDPLEDQT-------------PTFQNGVGFGYGNGRWSSEQGFAIGG

Query:  ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFV
        ERNSH+NGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTR+ARV+DL+GIRQIIQPLEMAGTLVRRTDEELLKSLDSFV
Subjt:  ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFV

Query:  VVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYY
        VVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE RR KINLSRKSKYY
Subjt:  VVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYY

Query:  MKKLLPDRNRIGA
        MKKLLPDRNRIG+
Subjt:  MKKLLPDRNRIGA

A0A6J1GDN0 Amino-acid N-acetyltransferase8.7e-27882Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAA-----EEDG---EDNKFVEWFREAWPYLWAHR
        MA  LSSS+L  Q+  EFP    P +   RNG   FKF R E+W  GLKKVG   R+F  F+ G+GA      EED    ED +FV WFREAWPYLWAHR
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAA-----EEDG---EDNKFVEWFREAWPYLWAHR

Query:  GGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRH
        GGTFVV+ISG+IVSSS LD ILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTD +SLAAAMEAAGGIRVMIEAKLSPGPSICNIRRH
Subjt:  GGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRH

Query:  GDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILD
        GDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILD
Subjt:  GDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILD

Query:  EAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPL----------------------------EDQTPTFQNGVGFGYGNGRWSSEQGFAIGGER
        E GRHISFLTLQEAD LIRERAKQCEIAANYVKVVGK+                                  P FQNGVGFG+G+G WSSEQGF IGGER
Subjt:  EAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPL----------------------------EDQTPTFQNGVGFGYGNGRWSSEQGFAIGGER

Query:  NSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVV
        NSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTR+A V+DL+GIRQIIQPLEMAGTLV RTDEELL+SLDSFVVV
Subjt:  NSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVV

Query:  EREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMK
        EREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE RRR+INLSRKSKYYMK
Subjt:  EREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMK

Query:  KLLPDRNRIGA
        KLLPDRNRIG+
Subjt:  KLLPDRNRIGA

A0A6J1HGN9 Amino-acid N-acetyltransferase8.9e-29183.07Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        MAAALSSS+LHFQ+  EFP KTFPYS  FRNG  GFKF  VE+WD GLKKVGG  R FNRF+ GDGA EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVVIISG+IVSSSYL+ ILKDIAFLHHLGIRFILVPGT VLIDK+LAERGSKPNFVGQYRVTD E+LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    ED +PT                            FQNGVGFG G
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG

Query:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR
        NG WSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTM+ASDLYEGTR+ARV+D++GIRQIIQPLEMAGTLVR
Subjt:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR

Query:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
        RTDEELL+SLDS VVVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
Subjt:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN

Query:  RRRKINLSRKSKYYMKKLLPDRNRIG
        RR++INLSRKSKYYMKKLLPDRNR G
Subjt:  RRRKINLSRKSKYYMKKLLPDRNRIG

A0A6J1KSM6 Amino-acid N-acetyltransferase5.8e-29082.75Show/hide
Query:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG
        MAAALSSS+LHFQ+  EFP KTFPYS  FRNG  GFKF  VE+WD GLKKVGG  R FNRF+ GDGA EE+       ED +FV WFREAWPYLWAHRGG
Subjt:  MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNG--GFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDG------EDNKFVEWFREAWPYLWAHRGG

Query:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
        TFVVIISG+IVSSSYL+ ILKDIAFLHHLGIRFILVPGT VLIDK+LAERGSKPNFVGQYRVTD E+LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD
Subjt:  TFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGD

Query:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA
        S+RWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDV R+RERLDGGC+VILSNLGYSSSGEVLNCNTYEV TACALAIGADKLICIIDGPILDEA
Subjt:  SRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEA

Query:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG
        GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGK+                    ED +PT                            FQNGVGFG G
Subjt:  GRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKD------------------PLEDQTPT----------------------------FQNGVGFGYG

Query:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR
        NG WSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTM+ASDLYEGTR+ARV+D++GIRQIIQPLEMAGTLVR
Subjt:  NGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVR

Query:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
        RTDEELL+S+DS VVVEREGQIIACAALFPFFEERCGEIA+IAVSAECRGQGQGDKLL+YMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN
Subjt:  RTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPEN

Query:  RRRKINLSRKSKYYMKKLLPDRNRIG
        RR+ INLSRKSKYYMKKLLPDRNR G
Subjt:  RRRKINLSRKSKYYMKKLLPDRNRIG

SwissProt top hitse value%identityAlignment
B5X4Z4 Probable amino-acid acetyltransferase NAGS2, chloroplastic4.5e-22372.96Show/hide
Query:  SCGDGAAEEDGEDNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKES
        + GD  AE   +D +FV WFREAWPYLWAHRG TFVVIISG+I++ S  D ILKDIAFLHHLGIRF+LVPGT   ID+LLAERG +  +VG+YRVTD  S
Subjt:  SCGDGAAEEDGEDNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKES

Query:  LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEV
        L AA EAAG I VM+EAKLSPGPSICNIRRHGD  R H++GV V +GNF AAKRRGVVDGVD+GATGEVKK+DV RI ERLDGG +V+L NLG+SSSGEV
Subjt:  LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEV

Query:  LNCNTYEVVTACALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLE---------------------------D
        LNCNTYEV TACALAIGADKLICI+DGPILDE+G  I FLTLQEADML+R+RA+Q +IAANYVK VG   +                            +
Subjt:  LNCNTYEVVTACALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLE---------------------------D

Query:  QTPTFQNGVGFGYGNGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGI
         TP FQNGVGF  GNG W  EQGFAIGG ER S LNGYLSELAAAAFVCRGGV+RVHLLDGTI GVLLLELFKRDGMGTMVASD+YEGTR ARV DL GI
Subjt:  QTPTFQNGVGFGYGNGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGI

Query:  RQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFV
        R II+PLE +G LVRRTDEELL++LDSFVVVEREGQIIACAALFPFF+++CGE+A+IAV+++CRGQGQGDKLL+Y+EKKA+SLGL++LFLLTTRTADWFV
Subjt:  RQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFV

Query:  RRGFSECSFESIPENRRRKINLSRKSKYYMKKLLPDRNRI
        RRGF ECS E IPE+RR++INLSR SKYYMKKL+PDR+ I
Subjt:  RRGFSECSFESIPENRRRKINLSRKSKYYMKKLLPDRNRI

P61919 Amino-acid acetyltransferase7.2e-8038.62Show/hide
Query:  KFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVM
        ++V W R A PY+ AHR  TFVV++ GD V+      I+ D+  LH LG+R +LV G+   I+  LA+RG  P +    R+TD E+L   ++A G +R+ 
Subjt:  KFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVM

Query:  IEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACAL
        IEA+LS   +   ++    SR      + V SGN + A+  GV++GVDY  TGEV++VD   I   LD   +V+LS LGYS +GE+ N    +V T  A+
Subjt:  IEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACAL

Query:  AIGADKLICI-IDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLEDQTPTFQNGVGFGYGNGRWSSEQGFAIGGERNSHLNGYLSE
         + ADKL+    +  +LDE GR            L+RE   Q                  Q P     +G  Y                           
Subjt:  AIGADKLICI-IDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLEDQTPTFQNGVGFGYGNGRWSSEQGFAIGGERNSHLNGYLSE

Query:  LAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACA
        L AAA  CRGGV R H++     G LL ELF RDG GT+VA + +E  R A + D+ G+  +I PLE  G LVRR+ E L + +  F VVEREG IIACA
Subjt:  LAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACA

Query:  ALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMKKL
        AL+P  +   GE+A +AV+ E R  G+GD+LLE +E +A +LG+  LF+LTTRTA WF  RGF   S + +P  R    N  R SK + K +
Subjt:  ALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMKKL

Q4ZZU5 Amino-acid acetyltransferase7.2e-8038.62Show/hide
Query:  KFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVM
        ++V W R A PY+ AHR  TFVV++ GD V+      I+ D+  LH LG+R +LV G+   I+  LA+RG  P +    R+TD E+L   ++A G +R+ 
Subjt:  KFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVM

Query:  IEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACAL
        IEA+LS   +   ++    SR      + V SGN + A+  GV++GVDY  TGEV++VD   I   LD   +V+LS LGYS +GE+ N    +V T  A+
Subjt:  IEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACAL

Query:  AIGADKLICI-IDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLEDQTPTFQNGVGFGYGNGRWSSEQGFAIGGERNSHLNGYLSE
         + ADKL+    +  +LDE GR            L+RE   Q                  Q P     +G  Y                           
Subjt:  AIGADKLICI-IDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLEDQTPTFQNGVGFGYGNGRWSSEQGFAIGGERNSHLNGYLSE

Query:  LAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACA
        L AAA  CRGGV R H++     G LL ELF RDG GT+VA + +E  R A + D+ G+  +I PLE  G LVRR+ E L + +  F VVEREG IIACA
Subjt:  LAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACA

Query:  ALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMKKL
        AL+P  +   GE+A +AV+ E R  G+GD+LLE +E +A +LG+  LF+LTTRTA WF  RGF   S + +P  R    N  R SK + K +
Subjt:  ALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMKKL

Q84JF4 Probable amino-acid acetyltransferase NAGS1, chloroplastic1.4e-21973.95Show/hide
Query:  DNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIR
        D +FV WFREAWPYLWAHR  TFVV ISGD++   Y D +LKDIAFLHHLGI+F+LVPGT V ID+LLAERG +P +VG+YRVTD  SL AA EAAG I 
Subjt:  DNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIR

Query:  VMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTAC
        VMIEAKLSPGPSI NIRRHGDS R HE GV V +GNF AAKRRGVVDGVD+GATG VKK+DV RIRERLD G +V+L NLG+SS+GEVLNCNTYEV TAC
Subjt:  VMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTAC

Query:  ALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVG-------KDPL------------------EDQTPTFQNGVGFGYG
        ALAIGADKLICI+DGP+LDE G  + FLTLQEAD L+R+RA+Q EIAANYVK VG        +PL                  E  +PTFQNGVGF  G
Subjt:  ALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVG-------KDPL------------------EDQTPTFQNGVGFGYG

Query:  NGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLV
        NG WS EQGFAIGG ER S LNGYLSELAAAAFVCRGGV+RVHLLDGTI GVLLLELFKRDGMGTMVASD+YEG R A+V DL GIRQII+PLE +G LV
Subjt:  NGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLV

Query:  RRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE
        RRTDEELL++LDSFVVVEREG IIACAALFPFFEE+CGE+A+IAV+++CRGQGQGDKLL+Y+EKKA++LGL+ LFLLTTRTADWFVRRGF EC  E IPE
Subjt:  RRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE

Query:  NRRRKINLSRKSKYYMKKLLPDRNRI
         RR +INLSR+SKYYMKKLLPDR+ I
Subjt:  NRRRKINLSRKSKYYMKKLLPDRNRI

Q88AR2 Amino-acid acetyltransferase9.4e-8038.62Show/hide
Query:  KFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVM
        ++V W R A PY+ AHR  TFVV++ GD V+      I+ D+  LH LG+R +LV G+   I+  LA+RG  P +    R+TD E+L   ++A G +R+ 
Subjt:  KFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVM

Query:  IEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACAL
        IEA+LS   +   ++    SR      + V SGN + A+  GV++G+DY  TGEV++VD   I   LD   +V+LS LGYS +GE+ N    +V T  A+
Subjt:  IEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACAL

Query:  AIGADKLICI-IDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLEDQTPTFQNGVGFGYGNGRWSSEQGFAIGGERNSHLNGYLSE
         + ADKL+    +  +LDE GR            L+RE   Q                  Q P     +G     G + +E                   
Subjt:  AIGADKLICI-IDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLEDQTPTFQNGVGFGYGNGRWSSEQGFAIGGERNSHLNGYLSE

Query:  LAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACA
        L AAA  CRGGV R H++     G LL ELF RDG GT+VA + +E  R A + D+ G+  +I PLE  G LVRR+ E L + +  F VVEREG IIACA
Subjt:  LAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACA

Query:  ALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMKKL
        AL+P  +   GE+A +AV+ E R  G+GD+LLE +E +A +LG+  LF+LTTRTA WF  RGF   S + +P  R    N  R SK + K +
Subjt:  ALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRKINLSRKSKYYMKKL

Arabidopsis top hitse value%identityAlignment
AT2G22910.1 N-acetyl-l-glutamate synthase 19.6e-22173.95Show/hide
Query:  DNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIR
        D +FV WFREAWPYLWAHR  TFVV ISGD++   Y D +LKDIAFLHHLGI+F+LVPGT V ID+LLAERG +P +VG+YRVTD  SL AA EAAG I 
Subjt:  DNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIR

Query:  VMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTAC
        VMIEAKLSPGPSI NIRRHGDS R HE GV V +GNF AAKRRGVVDGVD+GATG VKK+DV RIRERLD G +V+L NLG+SS+GEVLNCNTYEV TAC
Subjt:  VMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTAC

Query:  ALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVG-------KDPL------------------EDQTPTFQNGVGFGYG
        ALAIGADKLICI+DGP+LDE G  + FLTLQEAD L+R+RA+Q EIAANYVK VG        +PL                  E  +PTFQNGVGF  G
Subjt:  ALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVG-------KDPL------------------EDQTPTFQNGVGFGYG

Query:  NGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLV
        NG WS EQGFAIGG ER S LNGYLSELAAAAFVCRGGV+RVHLLDGTI GVLLLELFKRDGMGTMVASD+YEG R A+V DL GIRQII+PLE +G LV
Subjt:  NGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEMAGTLV

Query:  RRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE
        RRTDEELL++LDSFVVVEREG IIACAALFPFFEE+CGE+A+IAV+++CRGQGQGDKLL+Y+EKKA++LGL+ LFLLTTRTADWFVRRGF EC  E IPE
Subjt:  RRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPE

Query:  NRRRKINLSRKSKYYMKKLLPDRNRI
         RR +INLSR+SKYYMKKLLPDR+ I
Subjt:  NRRRKINLSRKSKYYMKKLLPDRNRI

AT3G57560.1 N-acetyl-l-glutamate kinase3.0e-1229.86Show/hide
Query:  VEWFREAWPYLWAHRGGTFVVIISGDIVSSSYL-DPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMI
        VE   E+ P++   RG T VV   G  ++S  L   ++ D+  L  +G+R ILV G    I++ L +      F    RVTD    A  ME    + +++
Subjt:  VEWFREAWPYLWAHRGGTFVVIISGDIVSSSYL-DPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMI

Query:  EAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALA
          K++    +  I   G +     VG+S   G  L A  R V +    G  GEV +VD + +R  +D G + +++++    SG+  N N   V    A A
Subjt:  EAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALA

Query:  IGADKLICIID
        +GA+KLI + D
Subjt:  IGADKLICIID

AT4G37670.1 N-acetyl-l-glutamate synthase 21.2e-19172.3Show/hide
Query:  SCGDGAAEEDGEDNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKES
        + GD  AE   +D +FV WFREAWPYLWAHRG TFVVIISG+I++ S  D ILKDIAFLHHLGIRF+LVPGT   ID+LLAERG +  +VG+YRVTD  S
Subjt:  SCGDGAAEEDGEDNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKES

Query:  LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEV
        L AA EAAG I VM+EAKLSPGPSICNIRRHGD  R H++GV V +GNF AAKRRGVVDGVD+GATGEVKK+DV RI ERLDGG +V+L NLG+SSSGEV
Subjt:  LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEV

Query:  LNCNTYEVVTACALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLE---------------------------D
        LNCNTYEV TACALAIGADKLICI+DGPILDE+G  I FLTLQEADML+R+RA+Q +IAANYVK VG   +                            +
Subjt:  LNCNTYEVVTACALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLE---------------------------D

Query:  QTPTFQNGVGFGYGNGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGI
         TP FQNGVGF  GNG W  EQGFAIGG ER S LNGYLSELAAAAFVCRGGV+RVHLLDGTI GVLLLELFKRDGMGTMVASD+YEGTR ARV DL GI
Subjt:  QTPTFQNGVGFGYGNGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGI

Query:  RQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLL
        R II+PLE +G LVRRTDEELL++LDSFVVVEREGQIIACAALFPFF+++CGE+A+IAV+++CRGQGQGDKLL
Subjt:  RQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLL

AT4G37670.2 N-acetyl-l-glutamate synthase 23.2e-22472.96Show/hide
Query:  SCGDGAAEEDGEDNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKES
        + GD  AE   +D +FV WFREAWPYLWAHRG TFVVIISG+I++ S  D ILKDIAFLHHLGIRF+LVPGT   ID+LLAERG +  +VG+YRVTD  S
Subjt:  SCGDGAAEEDGEDNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDPILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKES

Query:  LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEV
        L AA EAAG I VM+EAKLSPGPSICNIRRHGD  R H++GV V +GNF AAKRRGVVDGVD+GATGEVKK+DV RI ERLDGG +V+L NLG+SSSGEV
Subjt:  LAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGVDYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEV

Query:  LNCNTYEVVTACALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLE---------------------------D
        LNCNTYEV TACALAIGADKLICI+DGPILDE+G  I FLTLQEADML+R+RA+Q +IAANYVK VG   +                            +
Subjt:  LNCNTYEVVTACALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPLE---------------------------D

Query:  QTPTFQNGVGFGYGNGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGI
         TP FQNGVGF  GNG W  EQGFAIGG ER S LNGYLSELAAAAFVCRGGV+RVHLLDGTI GVLLLELFKRDGMGTMVASD+YEGTR ARV DL GI
Subjt:  QTPTFQNGVGFGYGNGRWSSEQGFAIGG-ERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGI

Query:  RQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFV
        R II+PLE +G LVRRTDEELL++LDSFVVVEREGQIIACAALFPFF+++CGE+A+IAV+++CRGQGQGDKLL+Y+EKKA+SLGL++LFLLTTRTADWFV
Subjt:  RQIIQPLEMAGTLVRRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFV

Query:  RRGFSECSFESIPENRRRKINLSRKSKYYMKKLLPDRNRI
        RRGF ECS E IPE+RR++INLSR SKYYMKKL+PDR+ I
Subjt:  RRGFSECSFESIPENRRRKINLSRKSKYYMKKLLPDRNRI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGCCGCTCTCTCAAGCTCTCAGCTACATTTTCAGAGTCCCATTGAATTTCCTTGTAAGACTTTCCCTTATAGCCACAGTTTCAGAAATGGGGGGTTTAAATTCAA
AAGGGTAGAAATATGGGACAATGGACTGAAGAAAGTTGGGGGAAATTTGAGAAGGTTCAATCGATTCAGCTGCGGAGATGGCGCGGCGGAGGAAGACGGAGAGGATAATA
AATTTGTTGAGTGGTTTAGGGAAGCTTGGCCTTACTTGTGGGCTCATAGAGGAGGAACTTTCGTTGTGATCATTTCTGGTGACATTGTCTCTAGTTCTTACTTAGATCCT
ATTCTTAAGGATATTGCTTTTCTTCATCATCTTGGGATTAGATTCATTCTTGTTCCAGGGACCCATGTGTTAATTGACAAACTTTTGGCTGAGAGAGGAAGCAAGCCGAA
TTTTGTTGGTCAATATAGAGTTACTGATAAAGAATCTCTAGCAGCTGCAATGGAAGCGGCGGGAGGAATCCGTGTGATGATAGAGGCGAAGCTTTCCCCCGGACCTTCTA
TCTGCAATATTCGTAGACATGGTGATAGTAGGCGTTGGCATGAAGTCGGTGTAAGCGTCGTCAGTGGCAACTTTCTTGCTGCAAAGAGAAGGGGAGTTGTTGATGGTGTT
GATTATGGGGCAACGGGTGAAGTTAAGAAAGTAGACGTAACTCGCATTCGTGAGAGGCTTGATGGTGGTTGTATGGTTATATTGAGCAACCTGGGTTATTCTAGTTCTGG
CGAAGTTCTGAATTGCAATACATATGAGGTTGTAACAGCATGTGCCTTGGCCATTGGAGCTGACAAACTTATATGCATTATTGATGGTCCTATTCTTGATGAAGCTGGAC
GCCATATTAGTTTTCTAACCCTTCAAGAAGCAGACATGTTAATACGTGAAAGGGCTAAACAATGTGAGATTGCTGCAAACTATGTCAAAGTTGTTGGTAAGGATCCTTTG
GAAGATCAGACTCCAACATTCCAGAATGGAGTTGGTTTTGGTTATGGGAATGGGCGTTGGTCCTCCGAACAGGGTTTTGCCATCGGAGGCGAGCGGAATAGCCATTTGAA
CGGTTACCTTTCAGAATTGGCTGCTGCAGCTTTTGTTTGCCGGGGTGGTGTTCAAAGAGTACACTTGCTAGATGGCACTATTGGAGGAGTTTTATTGCTGGAATTGTTTA
AGAGAGATGGAATGGGCACTATGGTAGCCAGTGATCTTTACGAGGGGACCCGAATAGCCAGAGTGACTGATCTTCAAGGAATTAGACAAATTATACAACCTTTAGAAATG
GCTGGTACCTTGGTTAGAAGAACGGATGAAGAGTTACTTAAATCATTAGACTCGTTTGTCGTCGTTGAGCGAGAAGGTCAAATCATTGCATGTGCTGCTCTCTTCCCTTT
CTTTGAAGAGAGATGTGGAGAGATTGCTTCCATCGCAGTTTCCGCGGAGTGCAGAGGACAAGGACAAGGCGACAAATTGCTTGAATACATGGAAAAGAAGGCTGCCTCCC
TAGGACTGGATAGGCTCTTTCTGCTGACAACACGAACTGCGGACTGGTTCGTTAGGCGTGGTTTCTCAGAATGTTCTTTTGAATCAATACCTGAAAACAGGAGAAGAAAG
ATAAATCTATCCCGGAAATCCAAGTACTACATGAAGAAACTGTTGCCAGATAGGAATAGGATCGGGGCGTTCGGTTGA
mRNA sequenceShow/hide mRNA sequence
CCTTACTAAACAATCGTGGGGAGTTCAAATATCTCAAGTAATTCCTTTCCTTTCTGTTTTGCTTCAAAAAAAAAAAACAACAATGGCCAGTCGTCACTCTACGATGATTG
CTTTTTAATAATGTTGGAAAACAGCGAATGAGAAAGGCCAAGCTTTCACCACATTTTCTCTCAGCCGCGTCTTCTTTCTCCTACTCTAAGTTTGTGTTTCTCAAATGCGT
GTCCTGTTTTTCCATGCTCCAAACATTGAGACGAACCCTCACCTTATCTAATATATCTGTTTTCGATCCAACAGAATTGTAACAGAGGAGAGTTTTTTGCCCAAAAATCC
TTCGACCCTTTTTCTGGTAGTCTCTTGTTCGGCTTCCTCTGCCATGGCTGCCGCTCTCTCAAGCTCTCAGCTACATTTTCAGAGTCCCATTGAATTTCCTTGTAAGACTT
TCCCTTATAGCCACAGTTTCAGAAATGGGGGGTTTAAATTCAAAAGGGTAGAAATATGGGACAATGGACTGAAGAAAGTTGGGGGAAATTTGAGAAGGTTCAATCGATTC
AGCTGCGGAGATGGCGCGGCGGAGGAAGACGGAGAGGATAATAAATTTGTTGAGTGGTTTAGGGAAGCTTGGCCTTACTTGTGGGCTCATAGAGGAGGAACTTTCGTTGT
GATCATTTCTGGTGACATTGTCTCTAGTTCTTACTTAGATCCTATTCTTAAGGATATTGCTTTTCTTCATCATCTTGGGATTAGATTCATTCTTGTTCCAGGGACCCATG
TGTTAATTGACAAACTTTTGGCTGAGAGAGGAAGCAAGCCGAATTTTGTTGGTCAATATAGAGTTACTGATAAAGAATCTCTAGCAGCTGCAATGGAAGCGGCGGGAGGA
ATCCGTGTGATGATAGAGGCGAAGCTTTCCCCCGGACCTTCTATCTGCAATATTCGTAGACATGGTGATAGTAGGCGTTGGCATGAAGTCGGTGTAAGCGTCGTCAGTGG
CAACTTTCTTGCTGCAAAGAGAAGGGGAGTTGTTGATGGTGTTGATTATGGGGCAACGGGTGAAGTTAAGAAAGTAGACGTAACTCGCATTCGTGAGAGGCTTGATGGTG
GTTGTATGGTTATATTGAGCAACCTGGGTTATTCTAGTTCTGGCGAAGTTCTGAATTGCAATACATATGAGGTTGTAACAGCATGTGCCTTGGCCATTGGAGCTGACAAA
CTTATATGCATTATTGATGGTCCTATTCTTGATGAAGCTGGACGCCATATTAGTTTTCTAACCCTTCAAGAAGCAGACATGTTAATACGTGAAAGGGCTAAACAATGTGA
GATTGCTGCAAACTATGTCAAAGTTGTTGGTAAGGATCCTTTGGAAGATCAGACTCCAACATTCCAGAATGGAGTTGGTTTTGGTTATGGGAATGGGCGTTGGTCCTCCG
AACAGGGTTTTGCCATCGGAGGCGAGCGGAATAGCCATTTGAACGGTTACCTTTCAGAATTGGCTGCTGCAGCTTTTGTTTGCCGGGGTGGTGTTCAAAGAGTACACTTG
CTAGATGGCACTATTGGAGGAGTTTTATTGCTGGAATTGTTTAAGAGAGATGGAATGGGCACTATGGTAGCCAGTGATCTTTACGAGGGGACCCGAATAGCCAGAGTGAC
TGATCTTCAAGGAATTAGACAAATTATACAACCTTTAGAAATGGCTGGTACCTTGGTTAGAAGAACGGATGAAGAGTTACTTAAATCATTAGACTCGTTTGTCGTCGTTG
AGCGAGAAGGTCAAATCATTGCATGTGCTGCTCTCTTCCCTTTCTTTGAAGAGAGATGTGGAGAGATTGCTTCCATCGCAGTTTCCGCGGAGTGCAGAGGACAAGGACAA
GGCGACAAATTGCTTGAATACATGGAAAAGAAGGCTGCCTCCCTAGGACTGGATAGGCTCTTTCTGCTGACAACACGAACTGCGGACTGGTTCGTTAGGCGTGGTTTCTC
AGAATGTTCTTTTGAATCAATACCTGAAAACAGGAGAAGAAAGATAAATCTATCCCGGAAATCCAAGTACTACATGAAGAAACTGTTGCCAGATAGGAATAGGATCGGGG
CGTTCGGTTGAGGCTCGATTGAACATCAAAATCTCTTATAGTAGAAGAAGCATATTCTATATTTTGGTCAAATTTTTATCTTGGCTGTCTTGTTTCACCCTTTACATGTC
AGTCTGTGCCGTTGCCCAACCATGTGGGTTATATCACATGCATATATTGTTCTAAATAGAAGTTGATTACAGGTCGCTATATTTTTGCAATCTGCATCAAAATTTG
Protein sequenceShow/hide protein sequence
MAAALSSSQLHFQSPIEFPCKTFPYSHSFRNGGFKFKRVEIWDNGLKKVGGNLRRFNRFSCGDGAAEEDGEDNKFVEWFREAWPYLWAHRGGTFVVIISGDIVSSSYLDP
ILKDIAFLHHLGIRFILVPGTHVLIDKLLAERGSKPNFVGQYRVTDKESLAAAMEAAGGIRVMIEAKLSPGPSICNIRRHGDSRRWHEVGVSVVSGNFLAAKRRGVVDGV
DYGATGEVKKVDVTRIRERLDGGCMVILSNLGYSSSGEVLNCNTYEVVTACALAIGADKLICIIDGPILDEAGRHISFLTLQEADMLIRERAKQCEIAANYVKVVGKDPL
EDQTPTFQNGVGFGYGNGRWSSEQGFAIGGERNSHLNGYLSELAAAAFVCRGGVQRVHLLDGTIGGVLLLELFKRDGMGTMVASDLYEGTRIARVTDLQGIRQIIQPLEM
AGTLVRRTDEELLKSLDSFVVVEREGQIIACAALFPFFEERCGEIASIAVSAECRGQGQGDKLLEYMEKKAASLGLDRLFLLTTRTADWFVRRGFSECSFESIPENRRRK
INLSRKSKYYMKKLLPDRNRIGAFG