; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021578 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021578
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationscaffold108:328544..332514
RNA-Seq ExpressionMS021578
SyntenyMS021578
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus]3.0e-10273.16Show/hide
Query:  MVHLLPNYPRVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAA-AAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDL
        MVHLLPN  RV SHL       A P  +  GVVWR GG I+ RSAVV++C+ ++S  ITAAA   E+L+G+  EI+ E EYLA+E GWKVRKLI+EEDDL
Subjt:  MVHLLPNYPRVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAA-AAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDL

Query:  RAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES
        +AVA IQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP DRYACLVAE E+E   + YNFVGVVDVTVAGDLK+KRLLP G KEYLFV+GIAV ++
Subjt:  RAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES

Query:  ARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        ARRRKVAT LLKGCD+L KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  ARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo]2.4e-9971.96Show/hide
Query:  MVHLLPNYPRVSSHLLP----AAAPASG--GVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR
        MVHLLPN  RVSSHL       A P       +WR GG I+ RSAVV++C+ ++S  ITAA A E+ +   +E   E EYLA E GWKVRKLI+EEDDLR
Subjt:  MVHLLPNYPRVSSHLLP----AAAPASG--GVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR

Query:  AVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA
        AVA IQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP +RYACLVAE E+E   + YNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV ++A
Subjt:  AVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA

Query:  RRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        RRRKVAT LLKGCD+L KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  RRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_022143656.1 uncharacterized protein LOC111013514 [Momordica charantia]1.6e-14398.49Show/hide
Query:  MVHLLPNYPRVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVAGIQ
        MVHLLPNY RVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVA IQ
Subjt:  MVHLLPNYPRVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVAGIQ

Query:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
        AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENE+RTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
Subjt:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA

Query:  TTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        TTLLKGCD+LAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
Subjt:  TTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_031744292.1 uncharacterized protein LOC101214390 isoform X1 [Cucumis sativus]2.8e-10071.58Show/hide
Query:  MVHLLPNYPRVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAA-AAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDL
        MVHLLPN  RV SHL       A P  +  GVVWR GG I+ RSAVV++C+ ++S  ITAAA   E+L+G+  EI+ E EYLA+E GWKVRKLI+EEDDL
Subjt:  MVHLLPNYPRVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAA-AAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDL

Query:  RAVAGIQAEAFHEPVFLFNDFFFHFF------QAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSG
        +AVA IQAEAFHEPV LFN FFF FF      QAEVL+ALIYRL+NYP DRYACLVAE E+E   + YNFVGVVDVTVAGDLK+KRLLP G KEYLFV+G
Subjt:  RAVAGIQAEAFHEPVFLFNDFFFHFF------QAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSG

Query:  IAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        IAV ++ARRRKVAT LLKGCD+L KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  IAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida]5.6e-10173.09Show/hide
Query:  MVHLLPNYPRVSSHL---------LPAAAPASGG-VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE
        MVHLLPN  RVS HL         L      +GG  +WR+GG I+ RSAVVL+C+ ++S   T    AE+ IG+  EI  E EYLA E GW VRKLI+EE
Subjt:  MVHLLPNYPRVSSHL---------LPAAAPASGG-VVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE

Query:  DDLRAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV
        DDLRAVA IQAEAFHEPV LFNDFFF FFQAEVL+ALIYRL+NYPPDRYACLVAE E+E   D YNFVGVVDVTVAGDLKVKRLLPAGEKEYLFV+GIAV
Subjt:  DDLRAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV

Query:  YESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
         ++ARRRKVAT LLKGCD+L KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  YESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

TrEMBL top hitse value%identityAlignment
A0A0A0K826 N-acetyltransferase domain-containing protein1.4e-10273.16Show/hide
Query:  MVHLLPNYPRVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAA-AAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDL
        MVHLLPN  RV SHL       A P  +  GVVWR GG I+ RSAVV++C+ ++S  ITAAA   E+L+G+  EI+ E EYLA+E GWKVRKLI+EEDDL
Subjt:  MVHLLPNYPRVSSHLLP----AAAP--ASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAA-AAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDL

Query:  RAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES
        +AVA IQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP DRYACLVAE E+E   + YNFVGVVDVTVAGDLK+KRLLP G KEYLFV+GIAV ++
Subjt:  RAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYES

Query:  ARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        ARRRKVAT LLKGCD+L KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  ARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X11.1e-9971.96Show/hide
Query:  MVHLLPNYPRVSSHLLP----AAAPASG--GVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR
        MVHLLPN  RVSSHL       A P       +WR GG I+ RSAVV++C+ ++S  ITAA A E+ +   +E   E EYLA E GWKVRKLI+EEDDLR
Subjt:  MVHLLPNYPRVSSHLLP----AAAPASG--GVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR

Query:  AVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA
        AVA IQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP +RYACLVAE E+E   + YNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV ++A
Subjt:  AVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA

Query:  RRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        RRRKVAT LLKGCD+L KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  RRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X21.1e-9771.59Show/hide
Query:  MVHLLPNYPRVSSHLLP----AAAPASG--GVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR
        MVHLLPN  RVSSHL       A P       +WR GG I+ RSAVV++C+ ++S  ITAA A E+ +   +E   E EYLA E GWKVRKLI+EEDDLR
Subjt:  MVHLLPNYPRVSSHLLP----AAAPASG--GVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLR

Query:  AVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA
        AVA IQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYP +RYACLVAE E+E   + YNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV ++A
Subjt:  AVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESA

Query:  RRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
         RRKVAT LLKGCD+L KVWGFKFLALSAYEDDYGARNLYSKAGYQ+   DPLWKS+WIGRKRCVT+IKKL
Subjt:  RRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A6J1CQY1 uncharacterized protein LOC1110135147.6e-14498.49Show/hide
Query:  MVHLLPNYPRVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVAGIQ
        MVHLLPNY RVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVA IQ
Subjt:  MVHLLPNYPRVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVAGIQ

Query:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
        AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENE+RTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA
Subjt:  AEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVA

Query:  TTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
        TTLLKGCD+LAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
Subjt:  TTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

A0A6J1IJZ4 uncharacterized protein LOC1114769678.2e-9870.91Show/hide
Query:  MVHLLPNYPRVSSHL------LPAAAPASGGV----VWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE
        MVHLLPN  RVSSHL          A A  G     + R+GG I+ R AVV++C+ ++S  ITAA    + I    EI  E  YLA+E GWKVRKLI+EE
Subjt:  MVHLLPNYPRVSSHL------LPAAAPASGGV----VWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEE

Query:  DDLRAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV
        DDLR VA IQAEAFHEPV LFN FFF FFQAEVL+ALIYRL+NYPPDRYACLVAE E+E   D YNFVGVVDVTVAGDLKV+RLLPAG KEYLFV+GIAV
Subjt:  DDLRAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAV

Query:  YESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL
         ++ARRRKVAT LLKGCD+LA+VWGFKFLALSAYEDDYGARNLYSKAGYQ+L  DPLWKSSWIGRKRCVT++K+L
Subjt:  YESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.6e-6159.09Show/hide
Query:  ETEIEYLASENGWKVRKLIK-EEDDLRAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRT-DHYNFVGVVDVTVA
        E E++YL S++GW VR+L + +ED++R V+ +QAEAFH P+ LF+DFFF FFQAEVL+AL+Y+L+N PPDRYACLVAE  +E  T    + VGVVDVT  
Subjt:  ETEIEYLASENGWKVRKLIK-EEDDLRAVAGIQAEAFHEPVFLFNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRT-DHYNFVGVVDVTVA

Query:  GDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKK
         +  V R  P G +EYL+VSG+AV +S RR+K+A+TLLK CD+L  +WGFK LAL AYEDD  ARNLYS AGY ++  DPLW S+WIGRKR V + K+
Subjt:  GDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKK

AT2G39000.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.0e-0433.73Show/hide
Query:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        G++ V TVA  L  K  L        +VS +AV E+ RR+ +A  L+   + LAK WG + + L    ++ GA  LY   G++
Subjt:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQ

AT2G39000.3 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.0e-0433.73Show/hide
Query:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        G++ V TVA  L  K  L        +VS +AV E+ RR+ +A  L+   + LAK WG + + L    ++ GA  LY   G++
Subjt:  GVVDV-TVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDILAKVWGFKFLALSAYEDDYGARNLYSKAGYQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCATTTGCTCCCAAATTACCCCCGAGTTTCATCGCACCTCCTCCCGGCGGCTGCTCCGGCGTCCGGCGGCGTCGTATGGAGGAGCGGGGGAAGCATTAGGGCCAG
ATCGGCGGTGGTTCTGCAGTGTAATCGGGAGTTTTCTGGTTCGATCACGGCGGCGGCGGCAGCGGAGGACTTGATCGGAATCCCGGCCGAAATCGAAACCGAAATCGAGT
ATTTGGCGAGCGAGAACGGATGGAAAGTGAGGAAATTGATTAAAGAAGAAGACGATTTGAGAGCGGTTGCAGGAATTCAAGCCGAAGCTTTCCATGAACCTGTTTTTCTT
TTCAACGATTTCTTCTTCCACTTCTTCCAGGCAGAAGTGCTTGCAGCGTTGATTTACAGATTGCAGAATTACCCTCCAGACAGGTATGCGTGTTTGGTTGCGGAGGCAGA
GAACGAGGATCGTACGGATCATTACAATTTTGTGGGAGTGGTGGACGTCACGGTGGCCGGAGATCTAAAAGTAAAGCGCCTCCTTCCCGCCGGCGAAAAAGAGTATCTCT
TCGTATCTGGAATTGCAGTCTATGAAAGTGCCAGAAGGCGGAAAGTAGCAACGACATTGTTGAAGGGTTGTGACATTCTTGCAAAAGTTTGGGGATTCAAGTTTTTGGCA
CTAAGTGCATATGAAGATGACTATGGGGCTCGTAATTTGTATAGTAAAGCAGGATATCAGCTTCTATCTGCTGACCCTCTTTGGAAATCTTCTTGGATTGGAAGAAAACG
TTGTGTTACTTTGATTAAAAAGCTC
mRNA sequenceShow/hide mRNA sequence
ATGGTCCATTTGCTCCCAAATTACCCCCGAGTTTCATCGCACCTCCTCCCGGCGGCTGCTCCGGCGTCCGGCGGCGTCGTATGGAGGAGCGGGGGAAGCATTAGGGCCAG
ATCGGCGGTGGTTCTGCAGTGTAATCGGGAGTTTTCTGGTTCGATCACGGCGGCGGCGGCAGCGGAGGACTTGATCGGAATCCCGGCCGAAATCGAAACCGAAATCGAGT
ATTTGGCGAGCGAGAACGGATGGAAAGTGAGGAAATTGATTAAAGAAGAAGACGATTTGAGAGCGGTTGCAGGAATTCAAGCCGAAGCTTTCCATGAACCTGTTTTTCTT
TTCAACGATTTCTTCTTCCACTTCTTCCAGGCAGAAGTGCTTGCAGCGTTGATTTACAGATTGCAGAATTACCCTCCAGACAGGTATGCGTGTTTGGTTGCGGAGGCAGA
GAACGAGGATCGTACGGATCATTACAATTTTGTGGGAGTGGTGGACGTCACGGTGGCCGGAGATCTAAAAGTAAAGCGCCTCCTTCCCGCCGGCGAAAAAGAGTATCTCT
TCGTATCTGGAATTGCAGTCTATGAAAGTGCCAGAAGGCGGAAAGTAGCAACGACATTGTTGAAGGGTTGTGACATTCTTGCAAAAGTTTGGGGATTCAAGTTTTTGGCA
CTAAGTGCATATGAAGATGACTATGGGGCTCGTAATTTGTATAGTAAAGCAGGATATCAGCTTCTATCTGCTGACCCTCTTTGGAAATCTTCTTGGATTGGAAGAAAACG
TTGTGTTACTTTGATTAAAAAGCTC
Protein sequenceShow/hide protein sequence
MVHLLPNYPRVSSHLLPAAAPASGGVVWRSGGSIRARSAVVLQCNREFSGSITAAAAAEDLIGIPAEIETEIEYLASENGWKVRKLIKEEDDLRAVAGIQAEAFHEPVFL
FNDFFFHFFQAEVLAALIYRLQNYPPDRYACLVAEAENEDRTDHYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVYESARRRKVATTLLKGCDILAKVWGFKFLA
LSAYEDDYGARNLYSKAGYQLLSADPLWKSSWIGRKRCVTLIKKL