CuGenDBv2

HG10022543 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10022543
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: N-acetyltransferase domain-containing protein
Genome location: Chr05:25276399..25287712
RNA-Seq Expression: HG10022543
Synteny: HG10022543
Gene Ontology terms: GO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domains: IPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase
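
The genome location field packs a chromosome name and a coordinate range into one string. A minimal sketch of splitting it apart follows; the helper name `parse_location` is hypothetical, and treating both coordinates as 1-based and inclusive is an assumption that this record does not state.

```python
import re


def parse_location(text: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr05:25276399..25287712")

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr05 25276399 25287712 11314
```

Under that assumption the locus spans 11,314 bp, which is much longer than the 1,134-nt CDS listed in this record.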


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus] (e value: 2.0e-112; % identity: 86.8)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWR-NGGIKVSSAVVVRCSSDYSSPITAAT-TEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRV SHLRSEPP T VPTRSKS    G +WR  GGIKV SAVVVRCSSDYSSPITAA  TEEE +GVSEEID+ EYLA EFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWR-NGGIKVSSAVVVRCSSDYSSPITAAT-TEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDL+AVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK+KRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
         Q ARRRKVAT LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo] (e value: 5.0e-111; % identity: 87.2)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGG-IKVSSAVVVRCSSDYSSPITAA-TTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRVSSHLRSEPPRT VPTRSK      A+WR GG IKV SAVVVRCSSDYSSPITAA  TEEE I VSEE+ + EYLAREFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGG-IKVSSAVVVRCSSDYSSPITAA-TTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDLRAVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
         Q ARRRKVAT LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

XP_022976620.1 uncharacterized protein LOC111476967 [Cucurbita maxima] (e value: 2.9e-111; % identity: 85.66)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEID---QKEYLAREFGWKVRKLMEE
        MVHLLPNPLRVSSHLRS+PPRT V  R+KSGTG   I RNGGIKV  AVVVRCSSDYSSPITAA      IG SEEI+   +  YLA EFGWKVRKL+EE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEID---QKEYLAREFGWKVRKLMEE

Query:  EDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIA
        EDDLR VARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESE CKDEYNFVGVVDVTVAGDLKV+RLLPAG KEYLFVTGIA
Subjt:  EDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIA

Query:  VPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        V QTARRRKVAT LLKGCDML +VWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  VPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

XP_031744292.1 uncharacterized protein LOC101214390 isoform X1 [Cucumis sativus] (e value: 1.9e-110; % identity: 84.77)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWR-NGGIKVSSAVVVRCSSDYSSPITAAT-TEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRV SHLRSEPP T VPTRSKS    G +WR  GGIKV SAVVVRCSSDYSSPITAA  TEEE +GVSEEID+ EYLA EFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWR-NGGIKVSSAVVVRCSSDYSSPITAAT-TEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFF------QAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLF
        DDL+AVARIQAEAFHEPVLLFN FFFQFF      QAEVLSALIYRLKNYP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK+KRLLP G KEYLF
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFF------QAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLF

Query:  VTGIAVPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        VTGIAV Q ARRRKVAT LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  VTGIAVPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida] (e value: 1.0e-119; % identity: 90.73)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEEDD
        MVHLLPNPLRVS HLRSEPPRT V  R  SGTGG AIWRNGGIKV SAVV+RCSSDYSSP    TT EESIG+ EEI++ EYLAREFGW VRKL+EEEDD
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEEDD

Query:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVPQ
        LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV Q
Subjt:  LRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVPQ

Query:  TARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        TARRRKVAT LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  TARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A0A0K826 N-acetyltransferase domain-containing protein (e value: 9.8e-113; % identity: 86.8)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWR-NGGIKVSSAVVVRCSSDYSSPITAAT-TEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRV SHLRSEPP T VPTRSKS    G +WR  GGIKV SAVVVRCSSDYSSPITAA  TEEE +GVSEEID+ EYLA EFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWR-NGGIKVSSAVVVRCSSDYSSPITAAT-TEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDL+AVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP DRYACLVAEPESE  ++EYNFVGVVDVTVAGDLK+KRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
         Q ARRRKVAT LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X1 (e value: 2.4e-111; % identity: 87.2)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGG-IKVSSAVVVRCSSDYSSPITAA-TTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRVSSHLRSEPPRT VPTRSK      A+WR GG IKV SAVVVRCSSDYSSPITAA  TEEE I VSEE+ + EYLAREFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGG-IKVSSAVVVRCSSDYSSPITAA-TTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDLRAVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
         Q ARRRKVAT LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X2 (e value: 3.0e-109; % identity: 86.8)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGG-IKVSSAVVVRCSSDYSSPITAA-TTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE
        MVHLLPNPLRVSSHLRSEPPRT VPTRSK      A+WR GG IKV SAVVVRCSSDYSSPITAA  TEEE I VSEE+ + EYLAREFGWKVRKL+EEE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGG-IKVSSAVVVRCSSDYSSPITAA-TTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEE

Query:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV
        DDLRAVARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYP +RYACLVAEPESE  ++EYNFVGVVDVTVAGDLKVKRLLP G KEYLFVTGIAV
Subjt:  DDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAV

Query:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
         Q A RRKVAT LLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  PQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

A0A6J1F9A7 uncharacterized protein LOC111443273 (e value: 2.0e-110; % identity: 84.86)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEID---QKEYLAREFGWKVRKLMEE
        MVHLLPNPLRVSSHLR +PPRT V  R+KSGTG   I RNGGIKV  AVVVRCSSDYSSPI AA      IG SEEI+   +  YLA EFGWKVRKL+EE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEID---QKEYLAREFGWKVRKLMEE

Query:  EDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIA
        EDDLR VARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLK+YPPDRYACLVAEPESE+CKDEYNFVGVVDVTVAGDLKV+RLLPAG KEYLFVTGIA
Subjt:  EDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIA

Query:  VPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        V QTARRRKVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  VPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

A0A6J1IJZ4 uncharacterized protein LOC111476967 (e value: 1.4e-111; % identity: 85.66)
Query:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEID---QKEYLAREFGWKVRKLMEE
        MVHLLPNPLRVSSHLRS+PPRT V  R+KSGTG   I RNGGIKV  AVVVRCSSDYSSPITAA      IG SEEI+   +  YLA EFGWKVRKL+EE
Subjt:  MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEID---QKEYLAREFGWKVRKLMEE

Query:  EDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIA
        EDDLR VARIQAEAFHEPVLLFN FFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESE CKDEYNFVGVVDVTVAGDLKV+RLLPAG KEYLFVTGIA
Subjt:  EDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIA

Query:  VPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        V QTARRRKVAT LLKGCDML +VWGFKFLALSAYEDDYGARNLYSKAGYQ
Subjt:  VPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAGYQ

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e value: 3.1e-50; % identity: 55.22)
Query:  VVVRCSSDYSSPITAATTEEESIGVSEEIDQKEYLAREFGWKVRKL-MEEEDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRY
        V +RC+S  S  +T+ T        + EI+ K YL  + GW VR+L  ++ED++R V+ +QAEAFH P+ LF+DFFF FFQAEVLSAL+Y+LKN PPDRY
Subjt:  VVVRCSSDYSSPITAATTEEESIGVSEEIDQKEYLAREFGWKVRKL-MEEEDDLRAVARIQAEAFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRY

Query:  ACLVAEPESES-CKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAG
        ACLVAE  SE+      + VGVVDVT   +  V R  P G +EYL+V+G+AV ++ RR+K+A+TLLK CD+L  +WGFK LAL AYEDD  ARNLYS AG
Subjt:  ACLVAEPESES-CKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVPQTARRRKVATTLLKGCDMLGKVWGFKFLALSAYEDDYGARNLYSKAG

Query:  Y
        Y
Subjt:  Y


Sequences
CDS sequence
ATGGTTCATTTGCTTCCAAATCCCCTCCGTGTTTCATCGCATCTCCGCTCGGAGCCACCGCGCACGGTGGTTCCGACAAGATCGAAGTCCGGCACCGGCGGCGGCGCGAT
CTGGAGAAATGGAGGAATTAAGGTGAGCTCGGCGGTGGTTGTGCGGTGTAGTAGTGACTATTCGAGTCCGATTACGGCGGCGACGACGGAGGAGGAATCGATCGGAGTAT
CGGAAGAAATTGATCAAAAAGAGTATTTGGCGAGAGAATTTGGATGGAAGGTGAGAAAATTGATGGAAGAAGAAGATGATTTGAGAGCGGTTGCAAGAATTCAAGCCGAA
GCTTTTCATGAACCTGTTCTTCTTTTCAACGATTTCTTCTTCCAATTTTTCCAGGCAGAAGTGCTTTCAGCGTTGATTTACAGACTGAAAAATTACCCTCCAGACAGGTA
TGCTTGTTTGGTTGCGGAGCCGGAAAGTGAAAGTTGTAAAGATGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCTGGAGATTTGAAAGTAAAACGCCTCCTTC
CCGCTGGCGAAAAGGAGTATCTCTTTGTAACTGGAATTGCCGTCCCACAAACTGCCAGAAGACGCAAAGTAGCAACGACACTACTGAAGGGGTGTGACATGCTTGGGAAG
GTTTGGGGATTCAAATTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAGTCTAAAACTCAGATGAATTCCCTAAG
AAGAAACAAGAAAGAAAATGAAGTTTCAGAGATGAAGAAGAAGAAGAAGAAGAAGATTAAGAAGGGATCAATTATTATTCCATGGGAGCAAAAGAAAGAAATGATTGATA
AAGAAGAAATTCAACTCCACAAAGACTTGGATCAACTTACAAATTGGATAAAAATGGTGGATTCCATGAATGATGAGAAGCTGAAAGAATATCTACAAGATACACCACAA
AAATTCAAGATTCTCAAAATCCCAAAGTGCAACCCTAAGCGCACTGGGCAAAAAATTGAGGATTCTAAATATTGGGCTTCTTACGGGATAATGGCTTCTGTTTGGAAGTT
TCACAAGCAAGACAATGACCAGCAGCTTTCTTAA
mRNA sequence
ATGGTTCATTTGCTTCCAAATCCCCTCCGTGTTTCATCGCATCTCCGCTCGGAGCCACCGCGCACGGTGGTTCCGACAAGATCGAAGTCCGGCACCGGCGGCGGCGCGAT
CTGGAGAAATGGAGGAATTAAGGTGAGCTCGGCGGTGGTTGTGCGGTGTAGTAGTGACTATTCGAGTCCGATTACGGCGGCGACGACGGAGGAGGAATCGATCGGAGTAT
CGGAAGAAATTGATCAAAAAGAGTATTTGGCGAGAGAATTTGGATGGAAGGTGAGAAAATTGATGGAAGAAGAAGATGATTTGAGAGCGGTTGCAAGAATTCAAGCCGAA
GCTTTTCATGAACCTGTTCTTCTTTTCAACGATTTCTTCTTCCAATTTTTCCAGGCAGAAGTGCTTTCAGCGTTGATTTACAGACTGAAAAATTACCCTCCAGACAGGTA
TGCTTGTTTGGTTGCGGAGCCGGAAAGTGAAAGTTGTAAAGATGAATACAATTTTGTGGGAGTGGTGGACGTGACGGTGGCTGGAGATTTGAAAGTAAAACGCCTCCTTC
CCGCTGGCGAAAAGGAGTATCTCTTTGTAACTGGAATTGCCGTCCCACAAACTGCCAGAAGACGCAAAGTAGCAACGACACTACTGAAGGGGTGTGACATGCTTGGGAAG
GTTTGGGGATTCAAATTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGCTATCAGTCTAAAACTCAGATGAATTCCCTAAG
AAGAAACAAGAAAGAAAATGAAGTTTCAGAGATGAAGAAGAAGAAGAAGAAGAAGATTAAGAAGGGATCAATTATTATTCCATGGGAGCAAAAGAAAGAAATGATTGATA
AAGAAGAAATTCAACTCCACAAAGACTTGGATCAACTTACAAATTGGATAAAAATGGTGGATTCCATGAATGATGAGAAGCTGAAAGAATATCTACAAGATACACCACAA
AAATTCAAGATTCTCAAAATCCCAAAGTGCAACCCTAAGCGCACTGGGCAAAAAATTGAGGATTCTAAATATTGGGCTTCTTACGGGATAATGGCTTCTGTTTGGAAGTT
TCACAAGCAAGACAATGACCAGCAGCTTTCTTAA
Protein sequence
MVHLLPNPLRVSSHLRSEPPRTVVPTRSKSGTGGGAIWRNGGIKVSSAVVVRCSSDYSSPITAATTEEESIGVSEEIDQKEYLAREFGWKVRKLMEEEDDLRAVARIQAE
AFHEPVLLFNDFFFQFFQAEVLSALIYRLKNYPPDRYACLVAEPESESCKDEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVTGIAVPQTARRRKVATTLLKGCDMLGK
VWGFKFLALSAYEDDYGARNLYSKAGYQSKTQMNSLRRNKKENEVSEMKKKKKKKIKKGSIIIPWEQKKEMIDKEEIQLHKDLDQLTNWIKMVDSMNDEKLKEYLQDTPQ
KFKILKIPKCNPKRTGQKIEDSKYWASYGIMASVWKFHKQDNDQQLS
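
The protein sequence should be the in-frame translation of the CDS sequence listed in this record. The sketch below shows one way to spot-check that relationship. It assumes the standard nuclear genetic code (NCBI translation table 1); the `translate` helper and the variable names are illustrative and are not part of CuGenDB. Only the first 30 and last 24 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt and last 24 nt of the CDS above (both are in frame).
cds_start = "ATGGTTCATTTGCTTCCAAATCCCCTCCGT"
cds_end = "GACAATGACCAGCAGCTTTCTTAA"

print(translate(cds_start))  # MVHLLPNPLR
print(translate(cds_end))    # DNDQQLS*
```

Both fragments agree with the start (MVHLLPNPLR) and end (DNDQQLS) of the listed protein. The same function can be applied to the full 1,134-nt CDS, which should give 377 residues followed by a stop.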