; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr028354 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr028354
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationtig00153057:2165490..2181981
RNA-Seq ExpressionSgr028354
SyntenySgr028354
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592197.1 hypothetical protein SDJN03_14543, partial [Cucurbita argyrosperma subsp. sororia]2.2e-10575Show/hide
Query:  MVHLLPNYPRVTSHL-----CAAVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK
        MVHLLPN  RV+SHL       AV+ARAKSG+    I RNGGI+ R  AVV++ S DYSS I+AA    IG SE +     E+  E  YLA+E+GWKVRK
Subjt:  MVHLLPNYPRVTSHL-----CAAVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK

Query:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV
        LIEE+DDL+ VARIQAEAFHEP LLFN FFFQFFQAEVLSALIYRLK+YPP+RYACLVAEP+S++C D YNFVGVVDVTVAGDLKV+RLLPAG KEYLFV
Subjt:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV

Query:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        +GIAV ++ARR+KVAT LLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVL  DPLWKSSWIGRKRCVTM+K+L
Subjt:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

XP_022143656.1 uncharacterized protein LOC111013514 [Momordica charantia]8.8e-11078.75Show/hide
Query:  MVHLLPNYPRVTSHLCAAVQARAKSGSVIWRNGG-IRARSAAVVLQWSRDYSSSI--SAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLIEEDDD
        MVHLLPNY RV+SHL  A  A   SG V+WR+GG IRARS AVVLQ +R++S SI  +AA EDLIGI   +     E E EYLASE GWKVRKLI+E+DD
Subjt:  MVHLLPNYPRVTSHLCAAVQARAKSGSVIWRNGG-IRARSAAVVLQWSRDYSSSI--SAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLIEEDDD

Query:  LKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSGIAVSE
        L+ VARIQAEAFHEP  LFNDFFF FFQAEVL+ALIYRL+NYPP+RYACLVAE +++   D YNFVGVVDVTVAGDLKVKRLLPAG+KEYLFVSGIAV E
Subjt:  LKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSGIAVSE

Query:  SARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        SARR+KVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ+LSADPLWKSSWIGRKRCVT+IKKL
Subjt:  SARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

XP_022936794.1 uncharacterized protein LOC111443273 [Cucurbita moschata]2.9e-10575Show/hide
Query:  MVHLLPNYPRVTSHL-----CAAVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK
        MVHLLPN  RV+SHL       AV+ARAKSG+    I RNGGI+ R  AVV++ S DYSS I+AA    IG SE +     E+  E  YLA+E+GWKVRK
Subjt:  MVHLLPNYPRVTSHL-----CAAVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK

Query:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV
        LIEE+DDL+ VARIQAEAFHEP LLFN FFFQFFQAEVLSALIYRLK+YPP+RYACLVAEP+S+ C D YNFVGVVDVTVAGDLKV+RLLPAG KEYLFV
Subjt:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV

Query:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        +GIAV ++ARR+KVAT LLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVL  DPLWKSSWIGRKRCVTM+K+L
Subjt:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

XP_022976620.1 uncharacterized protein LOC111476967 [Cucurbita maxima]2.2e-10575Show/hide
Query:  MVHLLPNYPRVTSHLCA-----AVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK
        MVHLLPN  RV+SHL +     AV+ARAKSG+    I RNGGI+ R  AVV++ S DYSS I+AA    IG SE +     E+  E  YLA+E+GWKVRK
Subjt:  MVHLLPNYPRVTSHLCA-----AVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK

Query:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV
        LIEE+DDL+ VARIQAEAFHEP LLFN FFFQFFQAEVLSALIYRLKNYPP+RYACLVAEP+S+ C D YNFVGVVDVTVAGDLKV+RLLPAG KEYLFV
Subjt:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV

Query:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        +GIAV ++ARR+KVAT LLKGCDMLA+VWGFKFLALSAYEDDYGARNLYSKAGYQVL  DPLWKSSWIGRKRCVTM+K+L
Subjt:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida]2.4e-10775.18Show/hide
Query:  MVHLLPNYPRVTSHLCA-----AVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLI
        MVHLLPN  RV+ HL +     AV AR  SG+    IWRNGGI+ RS AVVL+ S DYSS  +A  E+ IG+ E +       E EYLA E+GW VRKLI
Subjt:  MVHLLPNYPRVTSHLCA-----AVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLI

Query:  EEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSG
        EE+DDL+ VARIQAEAFHEP LLFNDFFFQFFQAEVLSALIYRLKNYPP+RYACLVAEP+S++C D YNFVGVVDVTVAGDLKVKRLLPAG+KEYLFV+G
Subjt:  EEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSG

Query:  IAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        IAV+++ARR+KVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQV   DPLWKS+WIGRKRCVTMIKKL
Subjt:  IAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

TrEMBL top hitse value%identityAlignment
A0A0A0K826 N-acetyltransferase domain-containing protein7.1e-10574.19Show/hide
Query:  MVHLLPNYPRVTSHL-----CAAVQARAKSGSVIWR-NGGIRARSAAVVLQWSRDYSSSISAA---REDLIGISEVVGERIGEVEGEYLASEYGWKVRKL
        MVHLLPN  RV SHL       AV  R+KSG V+WR  GGI+ RS AVV++ S DYSS I+AA    E+L+G+SE +       E EYLA+E+GWKVRKL
Subjt:  MVHLLPNYPRVTSHL-----CAAVQARAKSGSVIWR-NGGIRARSAAVVLQWSRDYSSSISAA---REDLIGISEVVGERIGEVEGEYLASEYGWKVRKL

Query:  IEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVS
        IEE+DDLK VARIQAEAFHEP LLFN FFFQFFQAEVLSALIYRLKNYP +RYACLVAEP+S+   + YNFVGVVDVTVAGDLK+KRLLP G KEYLFV+
Subjt:  IEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVS

Query:  GIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        GIAV+++ARR+KVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQV   DPLWKS+WIGRKRCVTMIKKL
Subjt:  GIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X17.3e-10273.48Show/hide
Query:  MVHLLPNYPRVTSHLCA-----AVQARAKSGSVIWRNGG-IRARSAAVVLQWSRDYSSSISAA---REDLIGISEVVGERIGEVEGEYLASEYGWKVRKL
        MVHLLPN  RV+SHL +     AV  R+K    +WR GG I+ RS AVV++ S DYSS I+AA    E+LI +SE +G      E EYLA E+GWKVRKL
Subjt:  MVHLLPNYPRVTSHLCA-----AVQARAKSGSVIWRNGG-IRARSAAVVLQWSRDYSSSISAA---REDLIGISEVVGERIGEVEGEYLASEYGWKVRKL

Query:  IEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVS
        IEE+DDL+ VARIQAEAFHEP LLFN FFFQFFQAEVLSALIYRLKNYP  RYACLVAEP+S+   + YNFVGVVDVTVAGDLKVKRLLP G KEYLFV+
Subjt:  IEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVS

Query:  GIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        GIAV+++ARR+KVAT LLKGCDML KVWGFKFLALSAYEDDYGARNLYSKAGYQV   DPLWKS+WIGRKRCVTMIKKL
Subjt:  GIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

A0A6J1CQY1 uncharacterized protein LOC1110135144.3e-11078.75Show/hide
Query:  MVHLLPNYPRVTSHLCAAVQARAKSGSVIWRNGG-IRARSAAVVLQWSRDYSSSI--SAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLIEEDDD
        MVHLLPNY RV+SHL  A  A   SG V+WR+GG IRARS AVVLQ +R++S SI  +AA EDLIGI   +     E E EYLASE GWKVRKLI+E+DD
Subjt:  MVHLLPNYPRVTSHLCAAVQARAKSGSVIWRNGG-IRARSAAVVLQWSRDYSSSI--SAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLIEEDDD

Query:  LKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSGIAVSE
        L+ VARIQAEAFHEP  LFNDFFF FFQAEVL+ALIYRL+NYPP+RYACLVAE +++   D YNFVGVVDVTVAGDLKVKRLLPAG+KEYLFVSGIAV E
Subjt:  LKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSGIAVSE

Query:  SARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        SARR+KVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ+LSADPLWKSSWIGRKRCVT+IKKL
Subjt:  SARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

A0A6J1F9A7 uncharacterized protein LOC1114432731.4e-10575Show/hide
Query:  MVHLLPNYPRVTSHL-----CAAVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK
        MVHLLPN  RV+SHL       AV+ARAKSG+    I RNGGI+ R  AVV++ S DYSS I+AA    IG SE +     E+  E  YLA+E+GWKVRK
Subjt:  MVHLLPNYPRVTSHL-----CAAVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK

Query:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV
        LIEE+DDL+ VARIQAEAFHEP LLFN FFFQFFQAEVLSALIYRLK+YPP+RYACLVAEP+S+ C D YNFVGVVDVTVAGDLKV+RLLPAG KEYLFV
Subjt:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV

Query:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        +GIAV ++ARR+KVAT LLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVL  DPLWKSSWIGRKRCVTM+K+L
Subjt:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

A0A6J1IJZ4 uncharacterized protein LOC1114769671.1e-10575Show/hide
Query:  MVHLLPNYPRVTSHLCA-----AVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK
        MVHLLPN  RV+SHL +     AV+ARAKSG+    I RNGGI+ R  AVV++ S DYSS I+AA    IG SE +     E+  E  YLA+E+GWKVRK
Subjt:  MVHLLPNYPRVTSHLCA-----AVQARAKSGS---VIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGE--YLASEYGWKVRK

Query:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV
        LIEE+DDL+ VARIQAEAFHEP LLFN FFFQFFQAEVLSALIYRLKNYPP+RYACLVAEP+S+ C D YNFVGVVDVTVAGDLKV+RLLPAG KEYLFV
Subjt:  LIEEDDDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFV

Query:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL
        +GIAV ++ARR+KVAT LLKGCDMLA+VWGFKFLALSAYEDDYGARNLYSKAGYQVL  DPLWKSSWIGRKRCVTM+K+L
Subjt:  SGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein6.0e-6456.58Show/hide
Query:  LQWSRDYSSSISAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLIEED-DDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPN
        L WSR       A+ E L  +++       E+E +YL S++GW VR+L  +D D+++RV+ +QAEAFH P  LF+DFFF FFQAEVLSAL+Y+LKN PP+
Subjt:  LQWSRDYSSSISAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLIEED-DDLKRVARIQAEAFHEPFLLFNDFFFQFFQAEVLSALIYRLKNYPPN

Query:  RYACLVAEPDSDT-CNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSK
        RYACLVAE  S+T      + VGVVDVT   +  V R  P G +EYL+VSG+AVS+S RRKK+A+TLLK CD+L  +WGFK LAL AYEDD  ARNLYS 
Subjt:  RYACLVAEPDSDT-CNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSK

Query:  AGYQVLSADPLWKSSWIGRKRCVTMIKK
        AGY V+  DPLW S+WIGRKR V M K+
Subjt:  AGYQVLSADPLWKSSWIGRKRCVTMIKK

AT2G39000.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.1e-0434.94Show/hide
Query:  GVVDV-TVAGDLKVKRLLPAGDKEYLFVSGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        G++ V TVA  L  K  L        +VS +AV E+ RRK +A  L+   + LAK WG + + L    ++ GA  LY   G++
Subjt:  GVVDV-TVAGDLKVKRLLPAGDKEYLFVSGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ

AT2G39000.3 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.1e-0434.94Show/hide
Query:  GVVDV-TVAGDLKVKRLLPAGDKEYLFVSGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ
        G++ V TVA  L  K  L        +VS +AV E+ RRK +A  L+   + LAK WG + + L    ++ GA  LY   G++
Subjt:  GVVDV-TVAGDLKVKRLLPAGDKEYLFVSGIAVSESARRKKVATTLLKGCDMLAKVWGFKFLALSAYEDDYGARNLYSKAGYQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCATTTGCTTCCCAATTACCCCCGCGTGACATCGCATCTCTGCGCTGCGGTTCAGGCGAGAGCGAAGTCCGGCAGCGTGATCTGGAGGAATGGGGGAATTAGGGC
GAGATCGGCGGCGGTGGTTCTGCAGTGGAGTAGGGATTATTCGAGCTCGATCTCGGCGGCGAGGGAGGACTTGATCGGCATATCGGAGGTAGTCGGGGAGAGAATTGGGG
AAGTTGAGGGGGAGTATTTGGCCAGCGAGTACGGATGGAAGGTGAGGAAGTTGATCGAAGAAGACGACGACTTGAAGAGAGTCGCCAGAATTCAAGCCGAAGCTTTTCAT
GAACCCTTTCTTCTTTTCAACGACTTCTTCTTCCAATTTTTCCAGGCAGAAGTCCTTTCAGCGTTGATTTACAGATTAAAAAATTACCCTCCAAACAGGTATGCGTGTTT
GGTTGCGGAGCCGGACAGTGATACCTGCAACGATCGCTACAACTTTGTGGGAGTGGTGGATGTGACGGTGGCCGGAGATCTGAAAGTGAAGCGCCTCCTTCCCGCCGGCG
ATAAGGAGTATCTCTTTGTATCTGGAATTGCTGTTTCAGAAAGTGCCAGAAGGAAAAAAGTAGCAACGACACTATTGAAGGGTTGTGACATGCTTGCAAAGGTTTGGGGA
TTCAAGTTTTTGGCATTAAGTGCCTACGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGGTATCAGGTTCTCTCTGCTGACCCTCTTTGGAAGTCTTCTTG
GATTGGAAGGAAACGTTGCGTTACTATGATTAAAAAGCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCCATTTGCTTCCCAATTACCCCCGCGTGACATCGCATCTCTGCGCTGCGGTTCAGGCGAGAGCGAAGTCCGGCAGCGTGATCTGGAGGAATGGGGGAATTAGGGC
GAGATCGGCGGCGGTGGTTCTGCAGTGGAGTAGGGATTATTCGAGCTCGATCTCGGCGGCGAGGGAGGACTTGATCGGCATATCGGAGGTAGTCGGGGAGAGAATTGGGG
AAGTTGAGGGGGAGTATTTGGCCAGCGAGTACGGATGGAAGGTGAGGAAGTTGATCGAAGAAGACGACGACTTGAAGAGAGTCGCCAGAATTCAAGCCGAAGCTTTTCAT
GAACCCTTTCTTCTTTTCAACGACTTCTTCTTCCAATTTTTCCAGGCAGAAGTCCTTTCAGCGTTGATTTACAGATTAAAAAATTACCCTCCAAACAGGTATGCGTGTTT
GGTTGCGGAGCCGGACAGTGATACCTGCAACGATCGCTACAACTTTGTGGGAGTGGTGGATGTGACGGTGGCCGGAGATCTGAAAGTGAAGCGCCTCCTTCCCGCCGGCG
ATAAGGAGTATCTCTTTGTATCTGGAATTGCTGTTTCAGAAAGTGCCAGAAGGAAAAAAGTAGCAACGACACTATTGAAGGGTTGTGACATGCTTGCAAAGGTTTGGGGA
TTCAAGTTTTTGGCATTAAGTGCCTACGAAGATGATTATGGGGCTCGTAATTTGTATAGTAAAGCAGGGTATCAGGTTCTCTCTGCTGACCCTCTTTGGAAGTCTTCTTG
GATTGGAAGGAAACGTTGCGTTACTATGATTAAAAAGCTTTAG
Protein sequenceShow/hide protein sequence
MVHLLPNYPRVTSHLCAAVQARAKSGSVIWRNGGIRARSAAVVLQWSRDYSSSISAAREDLIGISEVVGERIGEVEGEYLASEYGWKVRKLIEEDDDLKRVARIQAEAFH
EPFLLFNDFFFQFFQAEVLSALIYRLKNYPPNRYACLVAEPDSDTCNDRYNFVGVVDVTVAGDLKVKRLLPAGDKEYLFVSGIAVSESARRKKVATTLLKGCDMLAKVWG
FKFLALSAYEDDYGARNLYSKAGYQVLSADPLWKSSWIGRKRCVTMIKKL