; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G10580 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G10580
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionN-acetyltransferase domain-containing protein
Genome locationClcChr09:9371917..9375012
RNA-Seq ExpressionClc09G10580
SyntenyClc09G10580
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus]5.9e-12079.38Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWR-NGGIKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRV SHLRSEPP T VPTR+KS      +WR  GGIKVRSAVVVRC SDYSSPI AAA TEE+ +GVSE ID+ EYLA EFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWR-NGGIKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDL+AVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YP +RYACLVAEPESE  ++EYNFVGVVDVTVAGDLK+KRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ AR                    +T LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGY+VYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo]6.5e-11979.73Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRVSSHLRSEPPRT VPTR+K       +WR GG IKVRSAVVVRC SDYSSPI AA  TEE+ I VSE + + EYLAREFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDLRAVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YP  RYACLVAEPESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ AR                    +T LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGY+VYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

XP_016902774.1 PREDICTED: uncharacterized protein LOC103500412 isoform X2 [Cucumis melo]2.9e-11979.73Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRVSSHLRSEPPRT VPTR+K       +WR GG IKVRSAVVVRC SDYSSPI AA  TEE+ I VSE + + EYLAREFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDLRAVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YP  RYACLVAEPESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ AR  +                 +T LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGY+VYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

XP_023535575.1 uncharacterized protein LOC111796973 [Cucurbita pepo subsp. pepo]1.7e-11978.89Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD
        MVHLLPNPLRVSSHLR +PPRT V TRAKSSTG  EI RNGGIKVR AVVVRC SDYSSPI AA    + I   E+ D+  YLA EFGWKVRKL+EEEDD
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD

Query:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ
        LR VARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKSYP +RYACLVAEPESERCKDEYNFVGVVDVTVAGDLKV+RLLPAG KEYLFVTGIAV Q
Subjt:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ

Query:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        TAR                    +T LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGY+V YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida]9.1e-12982.7Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD
        MVHLLPNPLRVS HLRSEPPRT V  R  S TGGC IWRNGGIKVRSAVV+RC SDYSSP    TT E+SIG+ E I++ EYLAREFGW VRKL+EEEDD
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD

Query:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ
        LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLK+YPP+RYACLVAEPESE CKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ
Subjt:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ

Query:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        TAR                    +T LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGY+VYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

TrEMBL top hitse value%identityAlignment
A0A0A0K826 N-acetyltransferase domain-containing protein2.9e-12079.38Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWR-NGGIKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRV SHLRSEPP T VPTR+KS      +WR  GGIKVRSAVVVRC SDYSSPI AAA TEE+ +GVSE ID+ EYLA EFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWR-NGGIKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDL+AVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YP +RYACLVAEPESE  ++EYNFVGVVDVTVAGDLK+KRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ AR                    +T LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGY+VYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X13.2e-11979.73Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRVSSHLRSEPPRT VPTR+K       +WR GG IKVRSAVVVRC SDYSSPI AA  TEE+ I VSE + + EYLAREFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDLRAVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YP  RYACLVAEPESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ AR                    +T LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGY+VYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X21.4e-11979.73Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRVSSHLRSEPPRT VPTR+K       +WR GG IKVRSAVVVRC SDYSSPI AA  TEE+ I VSE + + EYLAREFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGG-IKVRSAVVVRCGSDYSSPI-AAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDLRAVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YP  RYACLVAEPESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        AQ AR  +                 +T LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGY+VYYVDPLWKSTWIGRKRCVTMIKKL
Subjt:  AQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1F9A7 uncharacterized protein LOC1114432734.1e-11978.2Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD
        MVHLLPNPLRVSSHLR +PPRT V  RAKS TG  EI RNGGIKVR AVVVRC SDYSSPIAAA    + I   E+ ++  YLA EFGWKVRKL+EEEDD
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD

Query:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ
        LR VARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKSYPP+RYACLVAEPESE CKDEYNFVGVVDVTVAGDLKV+RLLPAG KEYLFVTGIAV Q
Subjt:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ

Query:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        TAR                    +T LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGY+V YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

A0A6J1IJZ4 uncharacterized protein LOC1114769673.5e-11877.51Show/hide
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD
        MVHLLPNPLRVSSHLRS+PPRT V  RAKS TG  EI RNGGIKVR AVVVRC SDYSSPI AA    + I   E+ ++  YLA EFGWKVRKL+EEEDD
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDD

Query:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ
        LR VARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YPP+RYACLVAEPESE CKDEYNFVGVVDVTVAGDLKV+RLLPAG KEYLFVTGIAV Q
Subjt:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQ

Query:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL
        TAR                    +T LLKGCDML +VWGFKFLALSAYEDDYGARNLYSKAGY+V YVDPLWKS+WIGRKRCVTM+K+L
Subjt:  TARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.3e-5348.57Show/hide
Query:  RSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKL-MEEEDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPP
        R  V +RC S  S     + T+ ++  +     + +YL  + GW VR+L  ++ED++R V+ +QAEAFH P+ LF+DFFF FFQAEVLSAL+Y+LK+ PP
Subjt:  RSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKL-MEEEDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPP

Query:  NRYACLVAEPESE-RCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFL
        +RYACLVAE  SE       + VGVVDVT   +  V R  P G +EYL+V+G+AV+++ R                    ++TLLK CD+L  +WGFK L
Subjt:  NRYACLVAEPESE-RCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQTARFLIFDYLTSHLLLSLTLDVPSTTLLKGCDMLGKVWGFKFL

Query:  ALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKK
        AL AYEDD  ARNLYS AGY V   DPLW STWIGRKR V M K+
Subjt:  ALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCATTTGCTTCCAAATCCCCTTCGAGTTTCATCGCACCTCCGCTCGGAACCTCCGCGCACGGTGGTTCCAACGAGAGCGAAGTCCAGCACCGGCGGCTGCGAGAT
CTGGAGAAATGGAGGAATTAAGGTGAGATCGGCGGTAGTTGTGCGGTGTGGTAGTGATTATTCGAGTCCGATCGCAGCGGCGACGACGGAGGAGGACTCGATCGGAGTAT
CGGAAGTAATTGATCAAAAGGAGTATTTGGCAAGAGAATTTGGATGGAAGGTGAGAAAATTGATGGAAGAAGAAGATGATTTGAGAGCGGTTGCAAGAATTCAAGCCGAA
GCTTTTCATGAACCTGTTCTTCTTTTCAACGATTTTTTCTTCCAATTTTTCCAGGCAGAAGTGCTTTCAGCATTGATTTATAGACTCAAAAGTTACCCTCCAAACAGGTA
TGCTTGTTTGGTTGCGGAGCCGGAAAGTGAACGTTGTAAAGATGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCCGGAGATTTGAAAGTAAAGCGCCTCCTTC
CCGCCGGTGAAAAGGAGTATCTCTTTGTAACCGGAATCGCCGTCGCACAAACTGCCAGGTTTTTAATTTTCGATTACTTAACTTCCCATTTATTGCTTTCTCTCACACTC
GATGTTCCATCAACGACACTATTGAAGGGGTGTGACATGCTTGGGAAGGTTTGGGGATTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGTGCTCGTAATTT
GTATAGTAAAGCAGGCTATGAGGTTTACTATGTTGACCCTCTTTGGAAATCTACTTGGATTGGAAGAAAACGTTGTGTTACCATGATTAAAAAGCTCTAG
mRNA sequenceShow/hide mRNA sequence
CGCGACTATTGACCGCGTTTTCAACGACGGCGACCTAATCCCCTTCCAAGAATGGTCCATTTGCTTCCAAATCCCCTTCGAGTTTCATCGCACCTCCGCTCGGAACCTCC
GCGCACGGTGGTTCCAACGAGAGCGAAGTCCAGCACCGGCGGCTGCGAGATCTGGAGAAATGGAGGAATTAAGGTGAGATCGGCGGTAGTTGTGCGGTGTGGTAGTGATT
ATTCGAGTCCGATCGCAGCGGCGACGACGGAGGAGGACTCGATCGGAGTATCGGAAGTAATTGATCAAAAGGAGTATTTGGCAAGAGAATTTGGATGGAAGGTGAGAAAA
TTGATGGAAGAAGAAGATGATTTGAGAGCGGTTGCAAGAATTCAAGCCGAAGCTTTTCATGAACCTGTTCTTCTTTTCAACGATTTTTTCTTCCAATTTTTCCAGGCAGA
AGTGCTTTCAGCATTGATTTATAGACTCAAAAGTTACCCTCCAAACAGGTATGCTTGTTTGGTTGCGGAGCCGGAAAGTGAACGTTGTAAAGATGAATACAATTTTGTGG
GAGTGGTGGACGTGACGGTGGCCGGAGATTTGAAAGTAAAGCGCCTCCTTCCCGCCGGTGAAAAGGAGTATCTCTTTGTAACCGGAATCGCCGTCGCACAAACTGCCAGG
TTTTTAATTTTCGATTACTTAACTTCCCATTTATTGCTTTCTCTCACACTCGATGTTCCATCAACGACACTATTGAAGGGGTGTGACATGCTTGGGAAGGTTTGGGGATT
CAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGTGCTCGTAATTTGTATAGTAAAGCAGGCTATGAGGTTTACTATGTTGACCCTCTTTGGAAATCTACTTGGA
TTGGAAGAAAACGTTGTGTTACCATGATTAAAAAGCTCTAG
Protein sequenceShow/hide protein sequence
MVHLLPNPLRVSSHLRSEPPRTVVPTRAKSSTGGCEIWRNGGIKVRSAVVVRCGSDYSSPIAAATTEEDSIGVSEVIDQKEYLAREFGWKVRKLMEEEDDLRAVARIQAE
AFHEPVLLFNDFFFQFFQAEVLSALIYRLKSYPPNRYACLVAEPESERCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVAQTARFLIFDYLTSHLLLSLTL
DVPSTTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYEVYYVDPLWKSTWIGRKRCVTMIKKL