CuGenDBv2

Sgr027309 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027309
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: N-acetyltransferase domain-containing protein
Genome location: tig00153048:2970057..2972022
RNA-Seq Expression: Sgr027309
Synteny: Sgr027309
Gene Ontology terms: GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domains: IPR000182 - GNAT domain; IPR016181 - Acyl-CoA N-acyltransferase
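The genome location in the record above follows a `contig:start..end` pattern. A minimal sketch of how such a string could be split into its parts is shown here; the names `Location` and `parse_location` are hypothetical helpers introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of the database)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# contig name, a colon, then "start..end" as plain integers
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00153048:2970057..2972022")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the interval for this gene spans 1966 bases.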


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6588080.1 hypothetical protein SDJN03_16645, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 4.5e-112; identity: 87.93%)
Query:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE
        MDSKSPQTMKEE+SIQ STPLLLP+SKPST N+L+FNR PP DQ+LVHK++LEF QFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQE
Subjt:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE

Query:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV
        FNAIKRRCSG  GQ+CTCIVTV KEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREK+H+CSINKEIPNKYAY++NLCV+KAAR QG+ASNMLKFAV
Subjt:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV

Query:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        ETARSSGIEQVYVHVHRNN PA+ LY+KIGFE
Subjt:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

KAG7021966.1 hypothetical protein SDJN02_15694 [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 4.5e-112; identity: 87.93%)
Query:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE
        MDSKSPQTMKEE+SIQ STPLLLP+SKPST N+L+FNR PP DQ+LVHK++LEF QFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQE
Subjt:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE

Query:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV
        FNAIKRRCSG  GQ+CTCIVTV KEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREK+H+CSINKEIPNKYAY++NLCV+KAAR QG+ASNMLKFAV
Subjt:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV

Query:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        ETARSSGIEQVYVHVHRNN PA+ LY+KIGFE
Subjt:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

XP_022147601.1 uncharacterized protein LOC111016486 [Momordica charantia] (e-value: 4.9e-119; identity: 92.02%)
Query:  PTLWTMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR
        P   TMDSKSPQTM KEE+SI  STPLLLPQSKPS S+DLQFNR PPADQDL+HKRRLEF QFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR
Subjt:  PTLWTMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR

Query:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASN
        KFAEQEFNAIK+RCSGQHGQTCTC VTVRKEQ+HIKRTVIKSVVATLDMSLRHLMHGETFPGEREK+HLCSINKEIPNKYAYIANLCVAKAAR QGIASN
Subjt:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASN

Query:  MLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        MLKFAVETARSSGIEQVYVHVHRNN PAQALY+KIGFE
Subjt:  MLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

XP_023531510.1 uncharacterized protein LOC111793724 [Cucurbita pepo subsp. pepo] (e-value: 5.3e-113; identity: 88.79%)
Query:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE
        MDSKSPQTMKEE+SIQ STPLLLP+SKPST N+L+FNR PP DQDLVHK+RLEF QFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQE
Subjt:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE

Query:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV
        FNAIKRRCSG  GQ+CTCIVTV KEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREK+H+CSINKEIPNKYAY++NLCV+KAAR QG+ASNMLKFAV
Subjt:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV

Query:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        ETARSSGIEQVYVHVHRNN PA+ LY+KIGFE
Subjt:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

XP_038878818.1 uncharacterized protein LOC120070956 isoform X1 [Benincasa hispida] (e-value: 2.4e-113; identity: 88.66%)
Query:  PTLWTMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR
        P   TMDSKS QTM KEEVSIQ STPLLLPQSKP  +N+LQFNR PPA+QDLVHKRRLEF QFVAREAVLDEELWTAAWLRAESHWENR+NERYVDSFKR
Subjt:  PTLWTMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR

Query:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASN
        KFAEQEFNAIK+R SGQHGQTCTC+VTVRKEQKHIK TVIKSVVATLDMS RHLMHGETFPGEREK+H+CSINKEIPNKYAYI+NLCVAKAAR QGIASN
Subjt:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASN

Query:  MLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        MLKFAV TARS GIEQVYVHVHRNN PAQ LYQKIGFE
Subjt:  MLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7UL26 Putative Acyl-CoA N-acyltransferases (NAT) superfamily protein (e-value: 2.7e-107; identity: 84.62%)
Query:  TMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAE
        TMD KS QTM KEEVSIQ STPLL P+SKP  SN LQF+R PP D+DLVH+RRLEF QFVAREAV+DEELWTAAWLRAESHWENR N+RYVDSFKRKFAE
Subjt:  TMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAE

Query:  QEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKF
        QEFNAIK++C GQ+GQTCTCIVTVRKEQKHIKRTVIKSVVATLD+ LRHLMHGE+FPGEREK+H+CSINKEIPNKYAYI+NLCV KAAR QGIA NMLKF
Subjt:  QEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKF

Query:  AVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        AV TA+S GI+QVYVHVHRNN PAQALYQKIGFE
Subjt:  AVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

A0A5D3CIF7 Putative Acyl-CoA N-acyltransferases (NAT) superfamily protein (e-value: 2.7e-107; identity: 84.62%)
Query:  TMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAE
        TMD KS QTM KEEVSIQ STPLL P+SKP  SN LQF+R PP D+DLVH+RRLEF QFVAREAV+DEELWTAAWLRAESHWENR N+RYVDSFKRKFAE
Subjt:  TMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAE

Query:  QEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKF
        QEFNAIK++C GQ+GQTCTCIVTVRKEQKHIKRTVIKSVVATLD+ LRHLMHGE+FPGEREK+H+CSINKEIPNKYAYI+NLCV KAAR QGIA NMLKF
Subjt:  QEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKF

Query:  AVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        AV TA+S GI+QVYVHVHRNN PAQALYQKIGFE
Subjt:  AVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

A0A6J1D2V4 uncharacterized protein LOC111016486 (e-value: 2.4e-119; identity: 92.02%)
Query:  PTLWTMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR
        P   TMDSKSPQTM KEE+SI  STPLLLPQSKPS S+DLQFNR PPADQDL+HKRRLEF QFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR
Subjt:  PTLWTMDSKSPQTM-KEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR

Query:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASN
        KFAEQEFNAIK+RCSGQHGQTCTC VTVRKEQ+HIKRTVIKSVVATLDMSLRHLMHGETFPGEREK+HLCSINKEIPNKYAYIANLCVAKAAR QGIASN
Subjt:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASN

Query:  MLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        MLKFAVETARSSGIEQVYVHVHRNN PAQALY+KIGFE
Subjt:  MLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

A0A6J1EZR4 uncharacterized protein LOC111441020 (e-value: 2.2e-112; identity: 87.93%)
Query:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE
        MDSKSPQTMKEE+SIQ STPLLLP+SKPST N+L+FNR PP DQ+LVHK++LEF QFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQE
Subjt:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE

Query:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV
        FNAIKRRCSG  GQ+CTCIVTV KEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREK+H+CSINKEIPNKYAY++NLCV+KAAR QG+ASNMLKFAV
Subjt:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV

Query:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        ETARSSGIEQVYVHVHRNN PA+ LY+KIGFE
Subjt:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

A0A6J1HY42 uncharacterized protein LOC111467291 (e-value: 2.2e-112; identity: 87.93%)
Query:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE
        MDSKSPQT+KE++SIQ STPLLLP+SKPST N+L+FNR PP DQDLVHK+RLEF QFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQE
Subjt:  MDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQE

Query:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV
        FNAIKRRCSG  GQ+CTCIVTV KEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREK+H+CSINKEIPNKYAY++NLCV+KAAR QG+ASNMLKFAV
Subjt:  FNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAV

Query:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        ETARSSGIEQVYVHVHRNN PA+ LY+KIGFE
Subjt:  ETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT2G06025.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e-value: 1.2e-83; identity: 65.27%)
Query:  LW--TMDSKSPQT-MKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR
        LW  T++S+S  T  KEE+S+Q S P  + QS+P     L+F+R  P + +  H+ R EF +FVAREA+LDEE WTAAWLRAESHWE+R+NERYVD++KR
Subjt:  LW--TMDSKSPQT-MKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKR

Query:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHL-CSINKEIPNKYAYIANLCVAKAARHQGIAS
        KFAEQEFNAIKRRC G  GQ C+CIV V+KE+KHIKR+VIKSVV TLD+S+R+ + GETFPGE+ K+ L CSIN+E  N+Y YIANLCVAK+AR QGIA 
Subjt:  KFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHL-CSINKEIPNKYAYIANLCVAKAARHQGIAS

Query:  NMLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
        NML+FAVE+AR SG+EQVYVHVH+NN+ AQ LYQK GF+
Subjt:  NMLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE

AT4G28030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e-value: 2.5e-12; identity: 27.57%)
Query:  LQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQEFNAIKRRCSG-QHGQTCTCIVTVRKEQKHIKRT
        L+    P A    +    ++ S FV  E+V ++ELW AA LR  +  E   +   +   +R  AE+EF A+K R SG + G T    +        +  +
Subjt:  LQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQEFNAIKRRCSG-QHGQTCTCIVTVRKEQKHIKRT

Query:  V--------------IKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAVETARSSGIEQVYVHVHRN
                        + VV +LD++          P E   T    I  +     AY++N+CVAK     G+   ++  +   A   GI  +YVHV  +
Subjt:  V--------------IKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAVETARSSGIEQVYVHVHRN

Query:  NAPAQALYQKIGFE
        N  A++LY K GFE
Subjt:  NAPAQALYQKIGFE

AT4G28030.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein (e-value: 7.1e-07; identity: 25.23%)
Query:  LQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQEFNAIKRRCSG-QHGQTCTCIVTVRKEQKHIKRT
        L+    P A    +    ++ S FV  E+V ++ELW AA LR  +  E   +   +   +R  AE+EF A+K R SG + G T    +        +  +
Subjt:  LQFNRSPPADQDLVHKRRLEFSQFVAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQEFNAIKRRCSG-QHGQTCTCIVTVRKEQKHIKRT

Query:  V--------------IKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAVETARSSGIEQVYVHVHRN
                        + VV +LD++          P E   T    I  +     AY++N+CVAK     G+   ++    ++ R +G          +
Subjt:  V--------------IKSVVATLDMSLRHLMHGETFPGEREKTHLCSINKEIPNKYAYIANLCVAKAARHQGIASNMLKFAVETARSSGIEQVYVHVHRN

Query:  NAPAQALYQKIGFE
        N  A++LY K GFE
Subjt:  NAPAQALYQKIGFE
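The % identity values reported for the hits above can in principle be recomputed from the alignment blocks. In BLAST-style output the middle row carries a letter where query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. The sketch below is one way to do the count under that convention; the function name `percent_identity` is hypothetical, and it assumes the `Query:` prefix and matching leading indentation have already been stripped from each row.

```python
from typing import Sequence


def percent_identity(query_rows: Sequence[str], midline_rows: Sequence[str]) -> float:
    """Percent identity over all alignment columns.

    query_rows   -- the aligned query segments (gap characters included)
    midline_rows -- the matching middle rows; trailing spaces may be missing
    """
    if len(query_rows) != len(midline_rows):
        raise ValueError("each query row needs exactly one midline row")

    # Alignment length is taken from the query rows, because trailing spaces
    # on the midline are often lost when a page is copied as text.
    total_columns = sum(len(row) for row in query_rows)
    if total_columns == 0:
        raise ValueError("empty alignment")

    # A letter on the midline marks an identical residue pair;
    # '+' and ' ' mark non-identical columns.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    return 100.0 * identical / total_columns


# Toy example: 3 identical columns out of 4
print(percent_identity(["MDSK"], ["MD+K"]))
```

Gap columns count toward the alignment length here, which follows the usual BLAST definition of identity over the aligned region rather than over the full-length query.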


Sequences
CDS sequence
ATGGATACCGTAATCACCACAAATTTCATAGAAATTTCGCCTCGTGGACTATGTAAGTTTGTCTCTGTTCCTCTTCTTCGTTTCCTCTACGCGGAAATCCTCAGTTTCGA
TTCTGGTACCGATTTTCCCTTGCTGCCTGATTTTCCCACGCTCTGGACAATGGATTCTAAATCACCTCAAACGATGAAGGAAGAAGTTTCCATTCAGCGTTCAACGCCAC
TGTTGCTACCGCAGTCGAAACCATCGACGTCAAACGACTTACAATTCAACCGGTCGCCGCCGGCGGACCAAGATTTAGTTCACAAAAGAAGATTAGAGTTCAGTCAATTC
GTAGCGCGCGAGGCTGTGCTTGATGAAGAATTGTGGACGGCAGCATGGCTCCGGGCTGAAAGTCATTGGGAGAATCGAACAAATGAACGATATGTTGATAGCTTCAAAAG
GAAATTTGCAGAACAGGAGTTCAATGCCATTAAAAGGAGATGCAGTGGGCAACATGGACAGACATGCACATGCATTGTCACGGTAAGGAAGGAACAGAAGCATATAAAGC
GTACGGTGATTAAAAGTGTAGTAGCAACTCTTGACATGAGCTTGCGGCATTTGATGCATGGCGAGACTTTTCCAGGGGAAAGAGAGAAGACTCATTTATGCAGCATCAAC
AAAGAGATCCCAAACAAATATGCTTATATTGCAAACCTATGCGTAGCAAAAGCAGCACGTCATCAGGGTATTGCTAGCAATATGTTGAAGTTTGCAGTTGAAACAGCAAG
ATCCAGTGGTATTGAACAGGTATACGTGCATGTACATAGAAACAACGCACCCGCCCAAGCACTGTACCAAAAGATAGGCTTCGAG
mRNA sequence
ATGGATACCGTAATCACCACAAATTTCATAGAAATTTCGCCTCGTGGACTATGTAAGTTTGTCTCTGTTCCTCTTCTTCGTTTCCTCTACGCGGAAATCCTCAGTTTCGA
TTCTGGTACCGATTTTCCCTTGCTGCCTGATTTTCCCACGCTCTGGACAATGGATTCTAAATCACCTCAAACGATGAAGGAAGAAGTTTCCATTCAGCGTTCAACGCCAC
TGTTGCTACCGCAGTCGAAACCATCGACGTCAAACGACTTACAATTCAACCGGTCGCCGCCGGCGGACCAAGATTTAGTTCACAAAAGAAGATTAGAGTTCAGTCAATTC
GTAGCGCGCGAGGCTGTGCTTGATGAAGAATTGTGGACGGCAGCATGGCTCCGGGCTGAAAGTCATTGGGAGAATCGAACAAATGAACGATATGTTGATAGCTTCAAAAG
GAAATTTGCAGAACAGGAGTTCAATGCCATTAAAAGGAGATGCAGTGGGCAACATGGACAGACATGCACATGCATTGTCACGGTAAGGAAGGAACAGAAGCATATAAAGC
GTACGGTGATTAAAAGTGTAGTAGCAACTCTTGACATGAGCTTGCGGCATTTGATGCATGGCGAGACTTTTCCAGGGGAAAGAGAGAAGACTCATTTATGCAGCATCAAC
AAAGAGATCCCAAACAAATATGCTTATATTGCAAACCTATGCGTAGCAAAAGCAGCACGTCATCAGGGTATTGCTAGCAATATGTTGAAGTTTGCAGTTGAAACAGCAAG
ATCCAGTGGTATTGAACAGGTATACGTGCATGTACATAGAAACAACGCACCCGCCCAAGCACTGTACCAAAAGATAGGCTTCGAG
Protein sequence
MDTVITTNFIEISPRGLCKFVSVPLLRFLYAEILSFDSGTDFPLLPDFPTLWTMDSKSPQTMKEEVSIQRSTPLLLPQSKPSTSNDLQFNRSPPADQDLVHKRRLEFSQF
VAREAVLDEELWTAAWLRAESHWENRTNERYVDSFKRKFAEQEFNAIKRRCSGQHGQTCTCIVTVRKEQKHIKRTVIKSVVATLDMSLRHLMHGETFPGEREKTHLCSIN
KEIPNKYAYIANLCVAKAARHQGIASNMLKFAVETARSSGIEQVYVHVHRNNAPAQALYQKIGFE
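The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, not a function provided by the database.

```python
from itertools import product

# Standard genetic code in the conventional compact layout:
# codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; whitespace and line breaks are ignored."""
    sequence = "".join(cds.split()).upper()
    if len(sequence) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons with ambiguous bases (for example N) are rendered as 'X'.
    return "".join(
        CODON_TABLE.get(sequence[i:i + 3], "X")
        for i in range(0, len(sequence), 3)
    )


# First six codons of the CDS above
print(translate("ATGGATACCGTAATCACC"))  # MDTVIT
```

Stop codons are emitted as `*`, so a CDS that includes its terminator would end with that character; comparing the full translation with the listed protein sequence is a quick consistency check on the record.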