CuGenDBv2

Cucsat.G2757 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G2757
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: N-acetyltransferase domain-containing protein
Genome location: ctg1041:822751..825452
RNA-Seq Expression: Cucsat.G2757
Synteny: Cucsat.G2757
Gene Ontology terms:
GO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domains:
IPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase
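The genome location field above packs a contig name and a coordinate range into one string. The following is a minimal sketch of how it could be split into its parts. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function name parse_location is hypothetical.

```python
import re

# Pattern for "<contig>:<start>..<end>", as in the genome location field.
LOCATION_RE = re.compile(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)")


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Return (seqid, start, end, span) for a 'seqid:start..end' string.

    Assumption: coordinates are 1-based and inclusive, so the span is
    end - start + 1. The record itself does not say which convention applies.
    """
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start = int(match["start"])
    end = int(match["end"])
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return match["seqid"], start, end, end - start + 1


seqid, start, end, span = parse_location("ctg1041:822751..825452")
print(seqid, start, end, span)
```

Under the inclusive convention assumed here, the gene spans 2702 positions on ctg1041.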


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus] | e value: 5.63e-193 | % identity: 100
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
        MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo] | e value: 5.96e-179 | % identity: 93.36
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSAVVVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLA EFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_016902774.1 PREDICTED: uncharacterized protein LOC103500412 isoform X2 [Cucumis melo] | e value: 2.24e-176 | % identity: 92.99
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSAVVVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLA EFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RR KVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_031744292.1 uncharacterized protein LOC101214390 isoform X1 [Cucumis sativus] | e value: 2.75e-190 | % identity: 97.83
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
        MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQ------AEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGI
        AVARIQAEAFHEPVLLFNHFFFQFFQ      AEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGI
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQ------AEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGI

Query:  AVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        AVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida] | e value: 4.16e-158 | % identity: 85.45
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGV----VWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEE
        MVHLLPNPLRV  HLRSEPP TAV  R  SG     +WR  GGIKVRSAVV+RCSSDYSSP TA     EE +G+ EEI+ENEYLA EFGW VRKLIEEE
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGV----VWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV
        DDL+AVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK+KRLLP G KEYLFVTGIAV
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV

Query:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ ARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K826 N-acetyltransferase domain-containing protein | e value: 2.73e-193 | % identity: 100
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
        MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X1 | e value: 2.89e-179 | % identity: 93.36
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSAVVVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLA EFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X2 | e value: 1.08e-176 | % identity: 92.99
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTRSK   +WRYGG IKVRSAVVVRCSSDYSSPITAA  TEEEL+ VSEE+ ENEYLA EFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTGIAVAQNA
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNA

Query:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        RR KVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  RRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1F9A7 uncharacterized protein LOC111443273 | e value: 1.05e-151 | % identity: 82.55
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEE
        MVHLLPNPLRV SHLR +PP TAV  R+KSG   RY     GGIKVR AVVVRCSSDYSSPI AA  T EE+    E  +EN YLA EFGWKVRKLIEEE
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV
        DDL+ VARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLK+YP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK++RLLP GVKEYLFVTGIAV
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV

Query:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
         Q ARRRKVATALLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQV YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1IJZ4 uncharacterized protein LOC111476967 | e value: 5.46e-154 | % identity: 83.64
Query:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEE
        MVHLLPNPLRV SHLRS+PP TAV  R+KSG   RY     GGIKVR AVVVRCSSDYSSPITAA  T EE+    E  +EN YLATEFGWKVRKLIEEE
Subjt:  MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRY----GGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV
        DDL+ VARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK++RLLP GVKEYLFVTGIAV
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAV

Query:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
         Q ARRRKVATALLKGCDML +VWGFKFLALSAYEDDYGARNLYSKAGYQV YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  AQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein | e value: 8.4e-58 | % identity: 53.25
Query:  RSAVVVRCSSDYS-SPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKL-IEEEDDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNY
        R  V +RC+S  S + +T     E EL          +YL ++ GW VR+L  ++ED+++ V+ +QAEAFH P+ LF+ FFF FFQAEVLSAL+Y+LKN 
Subjt:  RSAVVVRCSSDYS-SPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKL-IEEEDDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNY

Query:  PQDRYACLVAEPESEIGE-EEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNL
        P DRYACLVAE  SE       + VGVVDVT   +  + R   PGV+EYL+V+G+AV+++ RR+K+A+ LLK CD+L  +WGFK LAL AYEDD  ARNL
Subjt:  PQDRYACLVAEPESEIGE-EEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNARRRKVATALLKGCDMLGKVWGFKFLALSAYEDDYGARNL

Query:  YSKAGYQVYYVDPLWKSTWIGRKRCVTMIKK
        YS AGY V   DPLW STWIGRKR V M K+
Subjt:  YSKAGYQVYYVDPLWKSTWIGRKRCVTMIKK
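The % identity values reported for the hits above can be approximated from the alignment blocks themselves. The sketch below assumes the usual BLAST-style midline convention: a letter marks an identical column, "+" marks a conservative substitution, and a space marks a mismatch or gap. The function name percent_identity is hypothetical, and this is not the database's own calculation.

```python
def percent_identity(blocks: list[tuple[str, str]]) -> float:
    """Estimate percent identity from (query_line, midline) alignment blocks.

    Every character of the query line, gaps included, counts as one
    alignment column. Only letters in the midline count as identities;
    '+' and spaces do not. Midlines may be shorter than the query line
    if trailing spaces were stripped, so they are read up to the query
    length only.
    """
    columns = 0
    identical = 0
    for query_line, midline in blocks:
        columns += len(query_line)
        identical += sum(ch.isalpha() for ch in midline[: len(query_line)])
    if columns == 0:
        raise ValueError("no alignment columns supplied")
    return 100.0 * identical / columns


# First 24 columns of the Cucumis melo isoform X1 alignment, sequence only
# (the "Query:" prefix and leading indentation removed).
example = [("MVHLLPNPLRVPSHLRSEPPLTAV", "MVHLLPNPLRV SHLRSEPP TAV")]
print(round(percent_identity(example), 2))
```

Passing every block of a hit, rather than this short excerpt, gives a figure that can be compared with the reported value for that hit.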


Sequences
CDS sequence
ATGGTCCATTTACTTCCAAATCCCCTCCGAGTACCATCGCACCTCCGCTCGGAGCCACCTCTAACGGCGGTTCCGACGAGATCGAAGTCCGGCGTGGTCTGGAGATATGG
AGGAGGAATTAAGGTTAGATCGGCGGTGGTTGTGCGGTGCAGTAGTGATTATTCGAGTCCGATCACGGCGGCGGCGAGAACGGAGGAGGAGTTGGTCGGAGTATCGGAAG
AAATTGATGAGAATGAGTATTTGGCTACAGAATTTGGATGGAAGGTGAGAAAATTGATTGAAGAAGAAGATGATTTGAAAGCAGTTGCAAGAATTCAAGCCGAAGCTTTT
CATGAACCTGTTCTTCTTTTCAACCATTTTTTCTTCCAATTTTTCCAGGCAGAAGTGCTCTCAGCATTGATTTATAGATTGAAAAATTACCCTCAAGATAGGTATGCTTG
TTTGGTTGCGGAACCGGAGAGTGAAATTGGTGAAGAAGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCGGGAGATTTGAAAATAAAGCGTCTCCTTCCCCCCG
GCGTCAAGGAGTATCTCTTTGTAACTGGAATTGCCGTCGCACAAAATGCCAGAAGACGAAAAGTAGCAACGGCATTATTAAAGGGGTGTGACATGCTTGGTAAGGTTTGG
GGATTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAAGTTTACTATGTTGACCCTCTTTGGAAATCTAC
TTGGATTGGAAGAAAACGTTGTGTTACCATGATTAAAAAGCTCTAG
mRNA sequence
ATGGTCCATTTACTTCCAAATCCCCTCCGAGTACCATCGCACCTCCGCTCGGAGCCACCTCTAACGGCGGTTCCGACGAGATCGAAGTCCGGCGTGGTCTGGAGATATGG
AGGAGGAATTAAGGTTAGATCGGCGGTGGTTGTGCGGTGCAGTAGTGATTATTCGAGTCCGATCACGGCGGCGGCGAGAACGGAGGAGGAGTTGGTCGGAGTATCGGAAG
AAATTGATGAGAATGAGTATTTGGCTACAGAATTTGGATGGAAGGTGAGAAAATTGATTGAAGAAGAAGATGATTTGAAAGCAGTTGCAAGAATTCAAGCCGAAGCTTTT
CATGAACCTGTTCTTCTTTTCAACCATTTTTTCTTCCAATTTTTCCAGGCAGAAGTGCTCTCAGCATTGATTTATAGATTGAAAAATTACCCTCAAGATAGGTATGCTTG
TTTGGTTGCGGAACCGGAGAGTGAAATTGGTGAAGAAGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCGGGAGATTTGAAAATAAAGCGTCTCCTTCCCCCCG
GCGTCAAGGAGTATCTCTTTGTAACTGGAATTGCCGTCGCACAAAATGCCAGAAGACGAAAAGTAGCAACGGCATTATTAAAGGGGTGTGACATGCTTGGTAAGGTTTGG
GGATTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAAGTTTACTATGTTGACCCTCTTTGGAAATCTAC
TTGGATTGGAAGAAAACGTTGTGTTACCATGATTAAAAAGCTCTAG
Protein sequence
MVHLLPNPLRVPSHLRSEPPLTAVPTRSKSGVVWRYGGGIKVRSAVVVRCSSDYSSPITAAARTEEELVGVSEEIDENEYLATEFGWKVRKLIEEEDDLKAVARIQAEAF
HEPVLLFNHFFFQFFQAEVLSALIYRLKNYPQDRYACLVAEPESEIGEEEYNFVGVVDVTVAGDLKIKRLLPPGVKEYLFVTGIAVAQNARRRKVATALLKGCDMLGKVW
GFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
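The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1). The names CODON_TABLE and translate are hypothetical, and the sequences are expected with the line breaks removed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons become '*'.

    Raises ValueError if the length is not a multiple of three and
    KeyError if a codon contains anything other than A, C, G or T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 27 and last 12 nucleotides of the CDS sequence listed above.
print(translate("ATGGTCCATTTACTTCCAAATCCCCTC"))
print(translate("AAAAAGCTCTAG"))
```

The two fragments translate to the start (MVHLLPNPL) and end (KKL followed by the stop codon) of the listed protein sequence. Running the full CDS through the same function and dropping the trailing "*" allows a complete comparison.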