; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030962 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030962
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationchr11:3290788..3293929
RNA-Seq ExpressionLag0030962
SyntenyLag0030962
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140725.1 uncharacterized protein LOC101203967 [Cucumis sativus]2.2e-7585.63Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        M++HVKGP PP PP L   HPNWN  V+RFTAPLALTT+ CRIATTAAAILAAA++IASP PSAA E  S+ +SEQQEES+TLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

XP_008456185.1 PREDICTED: uncharacterized protein LOC103496200 [Cucumis melo]1.6e-7686.78Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        M+SHVKGPFPP PP L   HPNWN  V+RFTAPLALTT+ CRIATTAAAILAAA++IASP PSAA E  S+ +SEQQEES+TLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

XP_022934222.1 uncharacterized protein LOC111441457 isoform X2 [Cucurbita moschata]1.1e-7789.66Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        MISHVKGPFPP P   A AHPNWN+AVVRFTAPLALT A CRIATTAAAILAAA +I SPAPSAA E+ SS +SEQQEESTTLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

XP_022983921.1 uncharacterized protein LOC111482398 isoform X2 [Cucurbita maxima]4.1e-7789.08Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        MISHVKGPFPP P   A AHPNWN+AVVRFTAPLALT A CRI TTAAAILAAA +I SPAPSAA E+ SS +SEQQEESTTLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

XP_023526623.1 uncharacterized protein LOC111790064 isoform X2 [Cucurbita pepo subsp. pepo]1.4e-7789.66Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        MISHVKGPFPP P   A AHPNWN+AVVRFTAPLALT A CRIATTAAAILAAA +I SPAPSAA E+ SS +SEQQEESTTLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

TrEMBL top hitse value%identityAlignment
A0A0A0LA46 Uncharacterized protein1.1e-7585.63Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        M++HVKGP PP PP L   HPNWN  V+RFTAPLALTT+ CRIATTAAAILAAA++IASP PSAA E  S+ +SEQQEES+TLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

A0A1S3C289 uncharacterized protein LOC1034962007.5e-7786.78Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        M+SHVKGPFPP PP L   HPNWN  V+RFTAPLALTT+ CRIATTAAAILAAA++IASP PSAA E  S+ +SEQQEES+TLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

A0A5A7T6W8 Uncharacterized protein7.5e-7786.78Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        M+SHVKGPFPP PP L   HPNWN  V+RFTAPLALTT+ CRIATTAAAILAAA++IASP PSAA E  S+ +SEQQEES+TLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

A0A6J1F235 uncharacterized protein LOC111441457 isoform X25.2e-7889.66Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        MISHVKGPFPP P   A AHPNWN+AVVRFTAPLALT A CRIATTAAAILAAA +I SPAPSAA E+ SS +SEQQEESTTLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

A0A6J1J3Q4 uncharacterized protein LOC111482398 isoform X22.0e-7789.08Show/hide
Query:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC
        MISHVKGPFPP P   A AHPNWN+AVVRFTAPLALT A CRI TTAAAILAAA +I SPAPSAA E+ SS +SEQQEESTTLSNIPQTLSGECAQPSDC
Subjt:  MISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDC

Query:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPI+VFKQGFRSRQYCLVECS+ICNLIGDGDDGP
Subjt:  KKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

SwissProt top hitse value%identityAlignment
P80969 Tyramine N-feruloyltransferase 10/308.7e-1435.94Show/hide
Query:  LFFATESDLSANLFSSSP---FLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFES-----EGESVVVAGFVLFFPNFSSFLGK
        L+ ATES L   LF ++P   F   +V +LE SP PF    +   +  + P+++   L+  ++D E E F+S     E E V +AG+  F+ N+S F  K
Subjt:  LFFATESDLSANLFSSSP---FLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFES-----EGESVVVAGFVLFFPNFSSFLGK

Query:  QGMYVEHLFVRECYQRKGLGKMLLSAVA
         G+Y E L+ RE Y++ G+G +L   VA
Subjt:  QGMYVEHLFVRECYQRKGLGKMLLSAVA

Q9SMB8 Tyramine N-feruloyltransferase 4/115.1e-1435.94Show/hide
Query:  LFFATESDLSANLFSSSP---FLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFES-----EGESVVVAGFVLFFPNFSSFLGK
        L+ ATES L   LF ++P   F   +V +LE SP PF    +   +  + P+++   L+  ++D E E F+S     E E V +AG+  F+ N+S F  K
Subjt:  LFFATESDLSANLFSSSP---FLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFES-----EGESVVVAGFVLFFPNFSSFLGK

Query:  QGMYVEHLFVRECYQRKGLGKMLLSAVA
         G+Y E L+ RE Y++ G+G +L   VA
Subjt:  QGMYVEHLFVRECYQRKGLGKMLLSAVA

Q9ZV05 L-ornithine N5-acetyltransferase NATA11.9e-3255.64Show/hide
Query:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHN-PNPNYTPIVRFAGLKIPLDDPEREIF-ESEGESVVVAGFVLFFPNFSSF
        MAVFE L  LF ATES L++ LF+S PF +VTVF+LE SP+PFP  ++H+  +P++TP +    + +P++DP+RE F   +   VVVAGFVLFFPN+ SF
Subjt:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHN-PNPNYTPIVRFAGLKIPLDDPEREIF-ESEGESVVVAGFVLFFPNFSSF

Query:  LGKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ
        L KQG Y+E +F+RE Y+RKG GK+LL+AVA+Q
Subjt:  LGKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ

Q9ZV06 Probable acetyltransferase NATA1-like1.7e-3356.82Show/hide
Query:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFESEG-ESVVVAGFVLFFPNFSSFL
        MAVFE L  LF ATES L++ LF+S PF S TVF+LE S +PFP   + +P+P++TP  +   L +P+DDPE   F  +    VVVAGFVLFFPN+SSFL
Subjt:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFESEG-ESVVVAGFVLFFPNFSSFL

Query:  GKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ
         K G Y+E +FVRE Y+RKG G MLL+AVA+Q
Subjt:  GKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ

Arabidopsis top hitse value%identityAlignment
AT1G78995.1 unknown protein1.7e-3658.16Show/hide
Query:  LALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDCKKARIQRPKSRKAESCTIKCVGTCIRGGDGSPG
        L   +A  RI+T     +AAA ++      AA    S+ +    ++  TLSN+PQTLSGE     DCKK RIQRPKS+ AE CT+KCV TCIR GD   G
Subjt:  LALTTAGCRIATTAAAILAAASLIASPAPSAAAENGSSMMSEQQEESTTLSNIPQTLSGECAQPSDCKKARIQRPKSRKAESCTIKCVGTCIRGGDGSPG

Query:  EGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP
        EGP+NIRRP++VFKQGFRSR YCLVECS+ICNLIGDGD GP
Subjt:  EGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP

AT2G39020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.2e-3456.82Show/hide
Query:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFESEG-ESVVVAGFVLFFPNFSSFL
        MAVFE L  LF ATES L++ LF+S PF S TVF+LE S +PFP   + +P+P++TP  +   L +P+DDPE   F  +    VVVAGFVLFFPN+SSFL
Subjt:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFESEG-ESVVVAGFVLFFPNFSSFL

Query:  GKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ
         K G Y+E +FVRE Y+RKG G MLL+AVA+Q
Subjt:  GKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ

AT2G39030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.3e-3355.64Show/hide
Query:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHN-PNPNYTPIVRFAGLKIPLDDPEREIF-ESEGESVVVAGFVLFFPNFSSF
        MAVFE L  LF ATES L++ LF+S PF +VTVF+LE SP+PFP  ++H+  +P++TP +    + +P++DP+RE F   +   VVVAGFVLFFPN+ SF
Subjt:  MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHN-PNPNYTPIVRFAGLKIPLDDPEREIF-ESEGESVVVAGFVLFFPNFSSF

Query:  LGKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ
        L KQG Y+E +F+RE Y+RKG GK+LL+AVA+Q
Subjt:  LGKQGMYVEHLFVRECYQRKGLGKMLLSAVAEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTCTTCGAACACCTCTTCGATCTATTCTTCGCCACCGAATCGGACCTCTCCGCCAACCTCTTCTCTTCGTCGCCGTTCCTCTCTGTCACCGTCTTCATCCTTGA
AGCCTCTCCCAATCCCTTCCCTCAAATCTCTTCTCACAACCCGAATCCTAACTACACCCCTATCGTTCGATTCGCCGGTCTGAAGATCCCATTGGACGATCCGGAGAGGG
AGATTTTCGAATCGGAAGGTGAGAGCGTTGTGGTGGCTGGATTTGTTCTGTTTTTCCCTAATTTTTCGTCATTCTTGGGGAAACAGGGGATGTACGTGGAGCATTTGTTT
GTGAGAGAGTGTTACCAGCGAAAGGGATTGGGGAAAATGTTGTTGTCGGCGGTGGCGGAGCAGGGGAGAAGAAGAGAGGGAAGGAGCAGAGCAGAGCACAGTCAAAATCA
AATCCCCAACTCCACAATGATAAGTCATGTAAAAGGTCCATTTCCGCCGTTTCCACCAGCCCTAGCGCATGCGCATCCAAATTGGAACGCCGCCGTCGTCCGATTCACCG
CCCCGCTCGCGCTAACCACAGCCGGTTGCCGGATCGCAACCACCGCCGCGGCGATTTTGGCAGCCGCATCCCTGATCGCTTCGCCGGCGCCGTCCGCTGCAGCCGAGAAC
GGCAGCAGCATGATGTCGGAACAACAAGAAGAAAGTACTACGCTGTCGAACATTCCGCAGACGCTTTCCGGCGAGTGCGCGCAGCCGAGCGACTGCAAGAAGGCGAGAAT
TCAACGGCCGAAGTCGAGGAAGGCGGAGTCGTGCACGATCAAGTGCGTCGGAACTTGCATTCGAGGCGGCGATGGATCTCCCGGAGAAGGACCTCTCAACATCAGGAGAC
CGATTATTGTTTTCAAGCAAGGGTTTCGAAGTCGTCAATATTGTTTGGTGGAGTGTTCTGAAATCTGCAATTTGATCGGAGATGGTGATGATGGACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTCTTCGAACACCTCTTCGATCTATTCTTCGCCACCGAATCGGACCTCTCCGCCAACCTCTTCTCTTCGTCGCCGTTCCTCTCTGTCACCGTCTTCATCCTTGA
AGCCTCTCCCAATCCCTTCCCTCAAATCTCTTCTCACAACCCGAATCCTAACTACACCCCTATCGTTCGATTCGCCGGTCTGAAGATCCCATTGGACGATCCGGAGAGGG
AGATTTTCGAATCGGAAGGTGAGAGCGTTGTGGTGGCTGGATTTGTTCTGTTTTTCCCTAATTTTTCGTCATTCTTGGGGAAACAGGGGATGTACGTGGAGCATTTGTTT
GTGAGAGAGTGTTACCAGCGAAAGGGATTGGGGAAAATGTTGTTGTCGGCGGTGGCGGAGCAGGGGAGAAGAAGAGAGGGAAGGAGCAGAGCAGAGCACAGTCAAAATCA
AATCCCCAACTCCACAATGATAAGTCATGTAAAAGGTCCATTTCCGCCGTTTCCACCAGCCCTAGCGCATGCGCATCCAAATTGGAACGCCGCCGTCGTCCGATTCACCG
CCCCGCTCGCGCTAACCACAGCCGGTTGCCGGATCGCAACCACCGCCGCGGCGATTTTGGCAGCCGCATCCCTGATCGCTTCGCCGGCGCCGTCCGCTGCAGCCGAGAAC
GGCAGCAGCATGATGTCGGAACAACAAGAAGAAAGTACTACGCTGTCGAACATTCCGCAGACGCTTTCCGGCGAGTGCGCGCAGCCGAGCGACTGCAAGAAGGCGAGAAT
TCAACGGCCGAAGTCGAGGAAGGCGGAGTCGTGCACGATCAAGTGCGTCGGAACTTGCATTCGAGGCGGCGATGGATCTCCCGGAGAAGGACCTCTCAACATCAGGAGAC
CGATTATTGTTTTCAAGCAAGGGTTTCGAAGTCGTCAATATTGTTTGGTGGAGTGTTCTGAAATCTGCAATTTGATCGGAGATGGTGATGATGGACCTTGA
Protein sequenceShow/hide protein sequence
MAVFEHLFDLFFATESDLSANLFSSSPFLSVTVFILEASPNPFPQISSHNPNPNYTPIVRFAGLKIPLDDPEREIFESEGESVVVAGFVLFFPNFSSFLGKQGMYVEHLF
VRECYQRKGLGKMLLSAVAEQGRRREGRSRAEHSQNQIPNSTMISHVKGPFPPFPPALAHAHPNWNAAVVRFTAPLALTTAGCRIATTAAAILAAASLIASPAPSAAAEN
GSSMMSEQQEESTTLSNIPQTLSGECAQPSDCKKARIQRPKSRKAESCTIKCVGTCIRGGDGSPGEGPLNIRRPIIVFKQGFRSRQYCLVECSEICNLIGDGDDGP