; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0017285 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0017285
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationchr01:17120899..17124933
RNA-Seq ExpressionPI0017285
SyntenyPI0017285
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus]5.4e-13188.13Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTR KSG + RYGGGIKVRSAVVVRCSSDYSSPI AAA TEEEL+GVSEEIDE+EYLA EFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP DRYA      CL     ESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTG
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG

Query:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        IAVAQNA  R     ++KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo]1.2e-12786.69Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV+SHLRSEPPRTAVPTR K  A+ RYGG IKVRSAVVVRCSSDYSSPI AA  TEEELI VSEE+ E+EYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYA      CL     ESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG

Query:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        IAVAQNA  R     ++KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_016902774.1 PREDICTED: uncharacterized protein LOC103500412 isoform X2 [Cucumis melo]1.6e-12786.64Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV+SHLRSEPPRTAVPTR K  A+ RYGG IKVRSAVVVRCSSDYSSPI AA  TEEELI VSEE+ E+EYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYA      CL     ESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG

Query:  IAVAQNASNRI----IKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        IAVAQNA  ++    +KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  IAVAQNASNRI----IKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_031744292.1 uncharacterized protein LOC101214390 isoform X1 [Cucumis sativus]5.0e-12986.27Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTR KSG + RYGGGIKVRSAVVVRCSSDYSSPI AAA TEEEL+GVSEEIDE+EYLA EFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFF------QAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKE
        AVARIQAEAFHEPVLLFNHFFFQFF      QAEVLSALIYRLKNYP DRYA      CL     ESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKE
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFF------QAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKE

Query:  YLFVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        YLFVTGIAVAQNA  R     ++KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  YLFVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida]1.1e-11280.14Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSG----AILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEE
        MVHLLPNPLRV+ HLRSEPPRTAV  R  SG    AI R  GGIKVRSAVV+RCSSDYSSP     TT EE IG+ EEI+E+EYLAREFGW VRKLIEEE
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSG----AILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYL
        DDL+AVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP DRYA      CL     ESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYL
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYL

Query:  FVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        FVTGIAVAQ A  R     ++KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  FVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

TrEMBL top hitse value%identityAlignment
A0A0A0K826 N-acetyltransferase domain-containing protein2.6e-13188.13Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV SHLRSEPP TAVPTR KSG + RYGGGIKVRSAVVVRCSSDYSSPI AAA TEEEL+GVSEEIDE+EYLA EFGWKVRKLIEEEDDLK
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP DRYA      CL     ESEIGEEEYNFVGVVDVTVAGDLK+KRLLPPGVKEYLFVTG
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG

Query:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        IAVAQNA  R     ++KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X16.0e-12886.69Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV+SHLRSEPPRTAVPTR K  A+ RYGG IKVRSAVVVRCSSDYSSPI AA  TEEELI VSEE+ E+EYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYA      CL     ESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG

Query:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        IAVAQNA  R     ++KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  IAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X27.8e-12886.64Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK
        MVHLLPNPLRV+SHLRSEPPRTAVPTR K  A+ RYGG IKVRSAVVVRCSSDYSSPI AA  TEEELI VSEE+ E+EYLAREFGWKVRKLIEEEDDL+
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLK

Query:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
        AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP +RYA      CL     ESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG
Subjt:  AVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTG

Query:  IAVAQNASNRI----IKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        IAVAQNA  ++    +KGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  IAVAQNASNRI----IKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1F9A7 uncharacterized protein LOC1114432733.1e-10876.95Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGA----ILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEE
        MVHLLPNPLRV+SHLR +PPRTAV  R KSG     ILR  GGIKVR AVVVRCSSDYSSPIAAA  T EE+    E  +E+ YLA EFGWKVRKLIEEE
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGA----ILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYL
        DDL+ VARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLK+YP DRYA      CL     ESE  ++EYNFVGVVDVTVAGDLKV+RLLP GVKEYL
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYL

Query:  FVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        FVTGIAV Q A  R     ++KGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQV YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  FVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1IJZ4 uncharacterized protein LOC1114769672.4e-10876.95Show/hide
Query:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGA----ILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEE
        MVHLLPNPLRV+SHLRS+PPRTAV  R KSG     ILR  GGIKVR AVVVRCSSDYSSPI AA  T EE+    E  +E+ YLA EFGWKVRKLIEEE
Subjt:  MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGA----ILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEE

Query:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYL
        DDL+ VARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYP DRYA      CL     ESE  ++EYNFVGVVDVTVAGDLKV+RLLP GVKEYL
Subjt:  DDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYL

Query:  FVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL
        FVTGIAV Q A  R     ++KGCDML +VWGFKFLALSAYEDDYGARNLYSKAGYQV YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  FVTGIAVAQNASNR-----IIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.9e-4951.98Show/hide
Query:  ESEYLAREFGWKVRKL-IEEEDDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVD
        E +YL  + GW VR+L  ++ED+++ V+ +QAEAFH P+ LF+ FFF FFQAEVLSAL+Y+LKN P DRYA      CL    T         + VGVVD
Subjt:  ESEYLAREFGWKVRKL-IEEEDDLKAVARIQAEAFHEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVD

Query:  VTVAGDLKVKRLLPPGVKEYLFVTGIAVAQN-----ASNRIIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMI
        VT   +  V R   PGV+EYL+V+G+AV+++      ++ ++K CD+L  +WGFK LAL AYEDD  ARNLYS AGY V   DPLW STWIGRKR V M 
Subjt:  VTVAGDLKVKRLLPPGVKEYLFVTGIAVAQN-----ASNRIIKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMI

Query:  KK
        K+
Subjt:  KK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCATTTGCTTCCAAATCCCCTCCGAGTAGCATCGCACCTCCGCTCTGAGCCACCGCGCACGGCGGTTCCGACGAGACCGAAATCCGGCGCGATCTTGAGATATGG
AGGAGGAATTAAGGTTAGATCGGCGGTGGTTGTGCGGTGCAGTAGTGATTATTCGAGTCCGATCGCGGCGGCGGCGACGACTGAGGAGGAATTGATCGGAGTATCGGAAG
AAATTGATGAAAGTGAGTATTTGGCTAGAGAATTTGGATGGAAGGTGAGAAAATTAATTGAAGAAGAAGATGATTTGAAAGCAGTTGCAAGAATTCAAGCCGAAGCTTTT
CATGAACCTGTTCTTCTTTTCAACCACTTTTTCTTCCAATTTTTCCAGGCAGAAGTGCTTTCAGCATTGATTTATAGACTAAAAAATTACCCTCTAGACAGGTATGCTTT
CTTTTTTAAAATAGTATGCTTGTTTGGTTGCGGAACCGAAAGTGAAATTGGTGAAGAAGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCCGGAGATTTAAAAG
TAAAGCGTCTTCTTCCCCCCGGCGTCAAGGAGTATCTCTTTGTAACTGGAATTGCCGTCGCACAAAATGCCAGCAACCGCATTATTAAGGGGTGTGACATGCTTGGTAAG
GTTTGGGGATTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAAGTTTACTATGTTGACCCTCTTTGGAA
ATCTACTTGGATTGGAAGAAAACGTTGTGTTACTATGATTAAAAAGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCCATTTGCTTCCAAATCCCCTCCGAGTAGCATCGCACCTCCGCTCTGAGCCACCGCGCACGGCGGTTCCGACGAGACCGAAATCCGGCGCGATCTTGAGATATGG
AGGAGGAATTAAGGTTAGATCGGCGGTGGTTGTGCGGTGCAGTAGTGATTATTCGAGTCCGATCGCGGCGGCGGCGACGACTGAGGAGGAATTGATCGGAGTATCGGAAG
AAATTGATGAAAGTGAGTATTTGGCTAGAGAATTTGGATGGAAGGTGAGAAAATTAATTGAAGAAGAAGATGATTTGAAAGCAGTTGCAAGAATTCAAGCCGAAGCTTTT
CATGAACCTGTTCTTCTTTTCAACCACTTTTTCTTCCAATTTTTCCAGGCAGAAGTGCTTTCAGCATTGATTTATAGACTAAAAAATTACCCTCTAGACAGGTATGCTTT
CTTTTTTAAAATAGTATGCTTGTTTGGTTGCGGAACCGAAAGTGAAATTGGTGAAGAAGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCCGGAGATTTAAAAG
TAAAGCGTCTTCTTCCCCCCGGCGTCAAGGAGTATCTCTTTGTAACTGGAATTGCCGTCGCACAAAATGCCAGCAACCGCATTATTAAGGGGTGTGACATGCTTGGTAAG
GTTTGGGGATTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAAGTTTACTATGTTGACCCTCTTTGGAA
ATCTACTTGGATTGGAAGAAAACGTTGTGTTACTATGATTAAAAAGCTCTAG
Protein sequenceShow/hide protein sequence
MVHLLPNPLRVASHLRSEPPRTAVPTRPKSGAILRYGGGIKVRSAVVVRCSSDYSSPIAAAATTEEELIGVSEEIDESEYLAREFGWKVRKLIEEEDDLKAVARIQAEAF
HEPVLLFNHFFFQFFQAEVLSALIYRLKNYPLDRYAFFFKIVCLFGCGTESEIGEEEYNFVGVVDVTVAGDLKVKRLLPPGVKEYLFVTGIAVAQNASNRIIKGCDMLGK
VWGFKFLALSAYEDDYGARNLYSKAGYQVYYVDPLWKSTWIGRKRCVTMIKKL