; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g1198 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g1198
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationMC11:11602414..11607011
RNA-Seq ExpressionMC11g1198
SyntenyMC11g1198
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus]3.15e-13173.9Show/hide
Query:  MVHLLPNYARVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAED-LIGIPAEIETEIEYLASENGWKVRKLIKEEDDL
        MVHLLPN  RV SHL       A P  +  GVVWR GG I+ RSAVV++C+ ++S  ITAAA  E+ L+G+  EI+ E EYLA+E GWKVRKLI+EEDDL
Subjt:  MVHLLPNYARVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAED-LIGIPAEIETEIEYLASENGWKVRKLIKEEDDL

Query:  RAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES
        +AVARIQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP DRYACLVAE E+E   + YNFVGVVDVTVAGDLK+KRLLP G KEYLFV+GIAV ++
Subjt:  RAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES

Query:  ARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        ARRRKVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  ARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo]2.01e-12772.69Show/hide
Query:  MVHLLPNYARVSSHLLP----AAAPASGG--VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR
        MVHLLPN  RVSSHL       A P       +WR GG I+ RSAVV++C+ ++S  ITAA A E+ +   +E   E EYLA E GWKVRKLI+EEDDLR
Subjt:  MVHLLPNYARVSSHLLP----AAAPASGG--VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR

Query:  AVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA
        AVARIQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP +RYACLVAE E+E   + YNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV ++A
Subjt:  AVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA

Query:  RRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        RRRKVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  RRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_022143656.1 uncharacterized protein LOC111013514 [Momordica charantia]5.18e-18699.62Show/hide
Query:  MVHLLPNYARVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVARIQ
        MVHLLPNYARVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVARIQ
Subjt:  MVHLLPNYARVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVARIQ

Query:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
        AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENE+RTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
Subjt:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA

Query:  TTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        TTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
Subjt:  TTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_031744292.1 uncharacterized protein LOC101214390 isoform X1 [Cucumis sativus]1.50e-12872.3Show/hide
Query:  MVHLLPNYARVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAED-LIGIPAEIETEIEYLASENGWKVRKLIKEEDDL
        MVHLLPN  RV SHL       A P  +  GVVWR GG I+ RSAVV++C+ ++S  ITAAA  E+ L+G+  EI+ E EYLA+E GWKVRKLI+EEDDL
Subjt:  MVHLLPNYARVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAED-LIGIPAEIETEIEYLASENGWKVRKLIKEEDDL

Query:  RAVARIQAEAFHEPVFLFNDFFFHFFQ------AEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSG
        +AVARIQAEAFHEPV LFN FFF FFQ      AEVL+ALIYRL+NYP DRYACLVAE E+E   + YNFVGVVDVTVAGDLK+KRLLP G KEYLFV+G
Subjt:  RAVARIQAEAFHEPVFLFNDFFFHFFQ------AEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSG

Query:  IAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        IAV ++ARRRKVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  IAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida]9.76e-13073.82Show/hide
Query:  MVHLLPNYARVSSHL---------LPAAAPASGG-VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE
        MVHLLPN  RVS HL         L      +GG  +WR+GG I+ RSAVVL+C+ ++S   TA    E+ IG+  EI  E EYLA E GW VRKLI+EE
Subjt:  MVHLLPNYARVSSHL---------LPAAAPASGG-VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE

Query:  DDLRAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV
        DDLRAVARIQAEAFHEPV LFNDFFF FFQAEVL+ALIYRL+NYPPDRYACLVAE E+E   D YNFVGVVDVTVAGDLKVKRLLPAGEKEYLFV+GIAV
Subjt:  DDLRAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV

Query:  YESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
         ++ARRRKVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  YESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

TrEMBL top hitse value%identityAlignment
A0A0A0K826 N-acetyltransferase domain-containing protein1.52e-13173.9Show/hide
Query:  MVHLLPNYARVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAED-LIGIPAEIETEIEYLASENGWKVRKLIKEEDDL
        MVHLLPN  RV SHL       A P  +  GVVWR GG I+ RSAVV++C+ ++S  ITAAA  E+ L+G+  EI+ E EYLA+E GWKVRKLI+EEDDL
Subjt:  MVHLLPNYARVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAED-LIGIPAEIETEIEYLASENGWKVRKLIKEEDDL

Query:  RAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES
        +AVARIQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP DRYACLVAE E+E   + YNFVGVVDVTVAGDLK+KRLLP G KEYLFV+GIAV ++
Subjt:  RAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES

Query:  ARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        ARRRKVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  ARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X19.71e-12872.69Show/hide
Query:  MVHLLPNYARVSSHLLP----AAAPASGG--VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR
        MVHLLPN  RVSSHL       A P       +WR GG I+ RSAVV++C+ ++S  ITAA A E+ +   +E   E EYLA E GWKVRKLI+EEDDLR
Subjt:  MVHLLPNYARVSSHLLP----AAAPASGG--VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR

Query:  AVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA
        AVARIQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP +RYACLVAE E+E   + YNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV ++A
Subjt:  AVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA

Query:  RRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        RRRKVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  RRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X23.62e-12572.32Show/hide
Query:  MVHLLPNYARVSSHLLP----AAAPASGG--VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR
        MVHLLPN  RVSSHL       A P       +WR GG I+ RSAVV++C+ ++S  ITAA A E+ +   +E   E EYLA E GWKVRKLI+EEDDLR
Subjt:  MVHLLPNYARVSSHLLP----AAAPASGG--VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR

Query:  AVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA
        AVARIQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP +RYACLVAE E+E   + YNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV ++A
Subjt:  AVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA

Query:  RRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        RR KVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  RRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A6J1CQY1 uncharacterized protein LOC1110135142.51e-18699.62Show/hide
Query:  MVHLLPNYARVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVARIQ
        MVHLLPNYARVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVARIQ
Subjt:  MVHLLPNYARVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVARIQ

Query:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
        AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENE+RTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
Subjt:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA

Query:  TTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        TTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
Subjt:  TTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A6J1IJZ4 uncharacterized protein LOC1114769672.55e-12571.64Show/hide
Query:  MVHLLPNYARVSSHL------LPAAAPASGGV----VWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE
        MVHLLPN  RVSSHL          A A  G     + R+GG I+ R AVV++C+ ++S  ITAA    + I    EI  E  YLA+E GWKVRKLI+EE
Subjt:  MVHLLPNYARVSSHL------LPAAAPASGGV----VWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE

Query:  DDLRAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV
        DDLR VARIQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYPPDRYACLVAE E+E   D YNFVGVVDVTVAGDLKV+RLLPAG KEYLFV+GIAV
Subjt:  DDLRAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV

Query:  YESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
         ++ARRRKVAT LLKGCDMLA+VWGFKFLALSAYEDDYGARNLYSKAGYQ+L  DPLWKSSWIGRKRCVT++K+L
Subjt:  YESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.6e-6159.09Show/hide
Query:  ETEIEYLASENGWKVRKLIK-EEDDLRAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRT-DHYNFVGVVDVTVA
        E E++YL S++GW VR+L + +ED++R V+ +QAEAFH P+ LF+DFFF FFQAEVL+AL+Y+L+N PPDRYACLVAE  +E  T    + VGVVDVT  
Subjt:  ETEIEYLASENGWKVRKLIK-EEDDLRAVARIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRT-DHYNFVGVVDVTVA

Query:  GDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKK
         +  V R  P G +EYL+VSG+AV +S RR+K+A+TLLK CD+L  +WGFK LAL AYEDD  ARNLYS AGY ++  DPLW S+WIGRKR V + K+
Subjt:  GDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKK

AT2G39000.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.0e-0433.73Show/hide
Query:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        G++ V TVA  L  K  L        +VS +AV E+ RR+ +A  L+   + LAK WG + + L    ++ GA  LY   G++
Subjt:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ

AT2G39000.3 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.0e-0433.73Show/hide
Query:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        G++ V TVA  L  K  L        +VS +AV E+ RR+ +A  L+   + LAK WG + + L    ++ GA  LY   G++
Subjt:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCATTTGCTCCCAAATTACGCCCGAGTTTCATCGCACCTCCTCCCGGCGGCGGCTCCGGCGTCCGGCGGCGTCGTATGGAGGAGCGGGGGAAGCATTAGG
GCCAGATCGGCGGTGGTTCTGCAGTGTAATCGGGAGTTTTCTGGTTCGATCACGGCGGCGGCGGCAGCGGAGGACTTGATCGGAATCCCGGCCGAAATCGAAACC
GAAATCGAGTATTTGGCGAGCGAGAACGGATGGAAAGTGAGGAAATTGATTAAAGAAGAAGACGATTTGAGAGCGGTTGCAAGAATTCAAGCCGAAGCTTTCCAT
GAACCTGTTTTTCTTTTCAACGATTTCTTCTTCCACTTCTTCCAGGCAGAAGTGCTTGCAGCGTTGATTTACAGATTGCAGAATTACCCTCCAGACAGGTATGCG
TGTTTGGTTGCGGAGGCAGAGAACGAGGATCGAACGGATCATTACAATTTTGTGGGAGTGGTGGACGTCACGGTGGCCGGAGATCTAAAAGTAAAGCGCCTCCTT
CCCGCCGGCGAAAAAGAGTATCTCTTCGTATCTGGAATTGCAGTCTATGAAAGTGCCAGAAGGCGCAAAGTAGCAACGACATTGTTGAAGGGTTGTGACATGCTT
GCAAAAGTTTGGGGATTCAAGTTTTTGGCACTAAGTGCATATGAAGATGACTATGGGGCTCGTAATTTGTATAGTAAAGCAGGATATCAGCTTCTATCTGCTGAC
CCTCTTTGGAAATCTTCTTGGATTGGAAGAAAACGTTGTGTTACTTTGATTAAAAAGCTCTAG
mRNA sequenceShow/hide mRNA sequence
ACGGGCCTTCCGAGTGTTGGGGGCTGATTTACTTCTCTTGGAACATTTTCTTCCATTTTATTTTTAGTAATTATATTTGCAAATACTTTGTTACCATAGTACTGT
CTTTTAAAATGATAGGGATTAAATTCCATTTCATAATATAAAGTAGCATTTACTGTAGAGAGAAAAATAAAATGATGTAGTAATTCCACGTGGCTGGTTAGATCT
TATGGCTGCCAACAACCAGAAAGTTTCAGCCACTTGGAACCCAAACCTTATCCTCCAATTTCTCACTGTATCACCTCAAAATAAAAATTAAAAATAAAATAAAAA
AAATTACTCATATAATAATAATAATAAATAAATAAATAAATAAATAAATAAATAAAAACCAGAGAAACCAAATTCCAACAAATTCCAACGCAACGAAAACGCGAC
TCAGAACTGCGGTTTCAAGAATGGTCCATTTGCTCCCAAATTACGCCCGAGTTTCATCGCACCTCCTCCCGGCGGCGGCTCCGGCGTCCGGCGGCGTCGTATGGA
GGAGCGGGGGAAGCATTAGGGCCAGATCGGCGGTGGTTCTGCAGTGTAATCGGGAGTTTTCTGGTTCGATCACGGCGGCGGCGGCAGCGGAGGACTTGATCGGAA
TCCCGGCCGAAATCGAAACCGAAATCGAGTATTTGGCGAGCGAGAACGGATGGAAAGTGAGGAAATTGATTAAAGAAGAAGACGATTTGAGAGCGGTTGCAAGAA
TTCAAGCCGAAGCTTTCCATGAACCTGTTTTTCTTTTCAACGATTTCTTCTTCCACTTCTTCCAGGCAGAAGTGCTTGCAGCGTTGATTTACAGATTGCAGAATT
ACCCTCCAGACAGGTATGCGTGTTTGGTTGCGGAGGCAGAGAACGAGGATCGAACGGATCATTACAATTTTGTGGGAGTGGTGGACGTCACGGTGGCCGGAGATC
TAAAAGTAAAGCGCCTCCTTCCCGCCGGCGAAAAAGAGTATCTCTTCGTATCTGGAATTGCAGTCTATGAAAGTGCCAGAAGGCGCAAAGTAGCAACGACATTGT
TGAAGGGTTGTGACATGCTTGCAAAAGTTTGGGGATTCAAGTTTTTGGCACTAAGTGCATATGAAGATGACTATGGGGCTCGTAATTTGTATAGTAAAGCAGGAT
ATCAGCTTCTATCTGCTGACCCTCTTTGGAAATCTTCTTGGATTGGAAGAAAACGTTGTGTTACTTTGATTAAAAAGCTCTAGTTTTCCCTTTCTTTCAACTTTG
CTTTTTAAAACCCAACTCTCAC
Protein sequenceShow/hide protein sequence
MVHLLPNYARVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVARIQAEAFH
EPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDML
AKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL