CuGenDBv2

CSPI07G12830 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G12830
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: N-acetyltransferase domain-containing protein
Genome location: Chr7:11258003..11260927
RNA-Seq Expression: CSPI07G12830
Synteny: CSPI07G12830
Gene Ontology terms:
  GO:0005737 - cytoplasm (cellular component)
  GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domains:
  IPR000182 - GNAT domain
  IPR016181 - Acyl-CoA N-acyltransferase
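
The genome location string can be split into a chromosome name and coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr7:11258003..11260927")
print(chrom, start, end, span_length(start, end))
# prints: Chr7 11258003 11260927 2925
```

Under that assumption the gene spans 2925 bp on Chr7, which is longer than the mRNA listed further down, as expected if the locus contains introns.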


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus] (e value: 1.5e-149; % identity: 99.26)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSA VVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLA EFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo] (e value: 2.8e-140; % identity: 93.36)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSA VVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_016902774.1 PREDICTED: uncharacterized protein LOC103500412 isoform X2 [Cucumis melo] (e value: 2.6e-138; % identity: 92.99)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSA VVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
         RRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_031744292.1 uncharacterized protein LOC101214390 isoform X1 [Cucumis sativus] (e value: 1.4e-147; % identity: 97.11)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSA VVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLA EFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFF------QAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGI
        AVARIQAEAFHEPVLLFNHFFFQFF      QAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGI
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFF------QAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGI

Query:  AVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        AVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida] (e value: 1.7e-124; % identity: 85.45)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSG----VVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEE
        MVHLLPNPLRV  HLRSEPP TAV  R  SG     +WR  GGIKVRSA V+RCSSDYSSP TA     EE +G+ EEI+ENEYLAREFGW VRKLIEEE
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSG----VVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV
        DDL+AVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK+KRLLP G KEYLFVTGIAV
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV

Query:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ ARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0K826 N-acetyltransferase domain-containing protein (e value: 7.2e-150; % identity: 99.26)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSA VVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLA EFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X1 (e value: 1.4e-140; % identity: 93.36)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSA VVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X2 (e value: 1.3e-138; % identity: 92.99)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSA VVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
         RRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1F9A7 uncharacterized protein LOC111443273 (e value: 8.6e-119; % identity: 82.18)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEE
        MVHLLPNPLRV SHLR +PP TAV  R+KSG   RY     GGIKVR A VVRCSSDYSSPI AA  T EE+    E  +EN YLA EFGWKVRKLIEEE
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV
        DDL+ VARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLK+YP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK++RLLP GVKEYLFVTGIAV
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV

Query:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
         Q ARRRKVATALLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQV YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1IJZ4 uncharacterized protein LOC111476967 (e value: 6.0e-120; % identity: 82.91)
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEE
        MVHLLPNPLRV SHLRS+PP TAV  R+KSG   RY     GGIKVR A VVRCSSDYSSPITAA  T EE+    E  +EN YLA EFGWKVRKLIEEE
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV
        DDL+ VARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK++RLLP GVKEYLFVTGIAV
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV

Query:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
         Q ARRRKVATALLKGCDML +VWGFKFLALSAYEDDYGARNLYSKAGYQV YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e value: 1.4e-57; % identity: 58.16)
Query:  ENEYLAREFGWKVRKL-IEEEDDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGE-EEYNFVGVVDVTVAGD
        E +YL  + GW VR+L  ++ED+++ V+ +QAEAFH P+ LF+ FFF FFQAEVLSAL+Y+LKN P DRYACLVAE  SE       + VGVVDVT   +
Subjt:  ENEYLAREFGWKVRKL-IEEEDDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGE-EEYNFVGVVDVTVAGD

Query:  LKIKRLLPPGVKEYLFVTGIAVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKK
          + R   PGV+EYL+V+G+AV+++ RR+K+A+ LLK CD+L  +WGFK LAL AYEDD  ARNLYS AGY V   DPLW STWIGRKR V M K+
Subjt:  LKIKRLLPPGVKEYLFVTGIAVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKK
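
In the alignment blocks above, the middle line repeats a residue letter where the two sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. A rough sketch of how a percent identity could be derived from such a block follows; it assumes identity is counted as identical columns divided by aligned columns (gaps included), and `percent_identity` is a hypothetical helper. The fragment used is the first 16 columns of the Arabidopsis alignment, not the full hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of aligned columns whose midline character is a residue letter.

    ASSUMPTION: letters in the midline mark identities, '+' marks
    conservative substitutions, and spaces mark mismatches or gaps.
    """
    if not query:
        raise ValueError("empty alignment")
    # Trailing blanks of the midline are often stripped; pad it back.
    midline = midline.ljust(len(query))
    identical = sum(1 for ch in midline[: len(query)] if ch.isalpha())
    return 100.0 * identical / len(query)


# First 16 aligned columns of the AT1G72030.1 alignment.
query_fragment = "ENEYLAREFGWKVRKL"
midline_fragment = "E +YL  + GW VR+L"
print(percent_identity(query_fragment, midline_fragment))
# prints: 50.0
```

The value for this short fragment differs from the 58.16 reported for the whole hit because it covers only the first 16 columns.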


Sequences
CDS sequence
ATGGTCCATTTACTTCCAAATCCCCTCCGAGTACCATCGCACCTCCGCTCGGAGCCACCTCTCACAGCGGTTCCGACGAGATCGAAGTCCGGCGTGGTCTGGAGATATGG
AGGAGGAATTAAGGTTAGATCGGCGGCGGTTGTGCGGTGCAGTAGTGATTATTCGAGTCCGATCACGGCGGCGGCGAGAACGGAGGAGGAGTTGGTCGGAGTATCGGAAG
AAATTGATGAGAATGAGTATTTGGCTAGAGAATTTGGATGGAAGGTGAGAAAATTGATTGAAGAAGAAGATGATTTGAAAGCAGTTGCAAGAATTCAAGCCGAAGCTTTT
CATGAACCTGTTCTTCTTTTCAACCATTTTTTCTTCCAATTTTTCCAGGCAGAAGTGCTCTCAGCATTGATTTATAGATTGAAAAATTACCCTCAAGATAGGTATGCTTG
TTTGGTTGCGGAACCGGAGAGTGAAATTGGTGAAGAAGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCGGGAGATTTGAAAATAAAGCGTCTCCTTCCCCCCG
GCGTCAAGGAGTATCTCTTTGTAACTGGAATTGCCGTCGCACAAAATGCCAGAAGACGAAAAGTAGCAACGGCATTATTAAAGGGGTGTGACATGCTTGGTAAGGTTTGG
GGATTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAAGTTTACTATGTTGACCCTCTTTGGAAATCTAC
TTGGATTGGAAGAAAACGTTGTGTTACCATGATTAAAAAGCTCTAG
mRNA sequence
AAAAAATAAGAGTTTTGAGATTAAAAATTCAAAATAATAACGCGACCATTGACCGCTGTTTCCCTTCCAAGAATGGTCCATTTACTTCCAAATCCCCTCCGAGTACCATC
GCACCTCCGCTCGGAGCCACCTCTCACAGCGGTTCCGACGAGATCGAAGTCCGGCGTGGTCTGGAGATATGGAGGAGGAATTAAGGTTAGATCGGCGGCGGTTGTGCGGT
GCAGTAGTGATTATTCGAGTCCGATCACGGCGGCGGCGAGAACGGAGGAGGAGTTGGTCGGAGTATCGGAAGAAATTGATGAGAATGAGTATTTGGCTAGAGAATTTGGA
TGGAAGGTGAGAAAATTGATTGAAGAAGAAGATGATTTGAAAGCAGTTGCAAGAATTCAAGCCGAAGCTTTTCATGAACCTGTTCTTCTTTTCAACCATTTTTTCTTCCA
ATTTTTCCAGGCAGAAGTGCTCTCAGCATTGATTTATAGATTGAAAAATTACCCTCAAGATAGGTATGCTTGTTTGGTTGCGGAACCGGAGAGTGAAATTGGTGAAGAAG
AATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCGGGAGATTTGAAAATAAAGCGTCTCCTTCCCCCCGGCGTCAAGGAGTATCTCTTTGTAACTGGAATTGCCGTC
GCACAAAATGCCAGAAGACGAAAAGTAGCAACGGCATTATTAAAGGGGTGTGACATGCTTGGTAAGGTTTGGGGATTCAAGTTTTTGGCATTAAGTGCATATGAAGATGA
TTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAAGTTTACTATGTTGACCCTCTTTGGAAATCTACTTGGATTGGAAGAAAACGTTGTGTTACCATGATTAAAA
AGCTCTAGATTCATTTTCTTTTAACTATCTATTTACGAG
Protein sequence
MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAAVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLAREFGWKVRKLIEEEDDLKAVARIQAEAF
HEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNARRRKVATALLKGCDMLGKVW
GFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
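
The protein sequence corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code applies (the record does not state which translation table was used), a CDS can be translated as follows; `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGTCCATTTACTTCCA"))  # first six codons of the CDS
# prints: MVHLLP
print(translate("AAAAAGCTCTAG"))  # last four codons, including the stop
# prints: KKL
```

Both fragments agree with the start (`MVHLLP`) and end (`KKL`) of the protein sequence listed above.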