CuGenDBv2

HG10013752 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10013752
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Chalcone-flavonone isomerase family protein
Genome location: Chr02:4441712..4444157
RNA-Seq Expression: HG10013752
Synteny: HG10013752
Gene Ontology terms:
GO:0006631 - fatty acid metabolic process (biological process)
GO:0009570 - chloroplast stroma (cellular component)
GO:0005504 - fatty acid binding (molecular function)
GO:0016872 - intramolecular lyase activity (molecular function)
InterPro domains:
IPR016087 - Chalcone isomerase
IPR016088 - Chalcone isomerase, 3-layer sandwich
IPR036298 - Chalcone isomerase superfamily
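
The genome location field can be split into a chromosome name and two coordinates. The sketch below is illustrative only: the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Chr02:4441712..4444157")
# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region spans 2446 bp, which is longer than the 690-nt CDS listed in the Sequences section of this record.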


Homology
GenBank top hits | e value | % identity | Alignment
XP_004146009.1 fatty-acid-binding protein 3, chloroplastic isoform X1 [Cucumis sativus] | 4.1e-92 | 70.92
Query:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTL-NRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS
        MAAN AVNSTPL LP +IPTGK NS+P ICLLTKP IPTLS++   STF+L N NFRF SN SL+ SSSLASVGNAG+VEEPSTNVKFPTSLTLPGCSTS
Subjt:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTL-NRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS

Query:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA
        LSLLGT                                                    + SEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT  DESA
Subjt:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA

Query:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        LSTFR+IFEGRSLKKGTFIFLTWLEPPKMLVSIS DG PTGI+ATIESNNVTS+LFDVFFGDSPVSPTLKASVATGLAAVLK
Subjt:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

XP_008452432.2 PREDICTED: fatty-acid-binding protein 3, chloroplastic isoform X1 [Cucumis melo] | 1.2e-91 | 70.57
Query:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRN-FRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS
        MAAN  VN TPL LP ++PTGK NSNP ICLLTKP IPTLS++   STF+LN N FRF SN SLK SSSLASVGNAG+VEEPSTNVKFPTSLTLPGCSTS
Subjt:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRN-FRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS

Query:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA
        LSLLGT                                                    + SEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT  DESA
Subjt:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA

Query:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        LSTFR+IFEGRSLKKGTFIFLTWLEPPKMLVSIS +GPPTGI+ATIESNNVTSALFDVFFGDSPVS TLKASVATGLAAVLK
Subjt:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

XP_022140846.1 fatty-acid-binding protein 3, chloroplastic [Momordica charantia] | 2.9e-93 | 69.53
Query:  ANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSLSL
        A  A NSTPL+ PS IPT KWNSNPRIC+LTKPP P+LSLYQ QS  +L+ N RFCS+ SLKASSS+ASVGN  YV EPSTNVKF TSL+LPGCSTSLSL
Subjt:  ANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSLSL

Query:  LGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALST
        LGT                                                    +PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT VDE ALST
Subjt:  LGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALST

Query:  FRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        FR+IF+GRSLKKGTFIFLTWLEPPKMLVSIS DG PTG+EATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
Subjt:  FRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

XP_038898313.1 fatty-acid-binding protein 3, chloroplastic isoform X1 [Benincasa hispida] | 1.8e-100 | 74.38
Query:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSL
        MAA+ AVNSTPLRLPSAIPTGKWN NPR+CLLTKP  PT+SLY+ QSTF+LN NFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPG STSL
Subjt:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSL

Query:  SLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESAL
        SLLGT                                                    +PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT VDESAL
Subjt:  SLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESAL

Query:  STFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        STFR+IF+GRSLKKGTFIFLTWLEP KMLVSIS DGPPTGIEATIESNNV SALFDVFFGDSPVSPTLKASVATGLA VLK
Subjt:  STFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

XP_038898314.1 fatty-acid-binding protein 3, chloroplastic isoform X2 [Benincasa hispida] | 2.1e-96 | 72.95
Query:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSL
        MAA+ AVNSTPLRLPSAIPTGKWN NPR+CLLTKP  PT+SLY+ QSTF+LN NFRFCSNLSLKASSSLAS    GYVEEPSTNVKFPTSLTLPG STSL
Subjt:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSL

Query:  SLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESAL
        SLLGT                                                    +PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT VDESAL
Subjt:  SLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESAL

Query:  STFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        STFR+IF+GRSLKKGTFIFLTWLEP KMLVSIS DGPPTGIEATIESNNV SALFDVFFGDSPVSPTLKASVATGLA VLK
Subjt:  STFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

TrEMBL top hits | e value | % identity | Alignment
A0A0A0L368 Chalcone-flavonone isomerase family protein | 2.0e-92 | 70.92
Query:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTL-NRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS
        MAAN AVNSTPL LP +IPTGK NS+P ICLLTKP IPTLS++   STF+L N NFRF SN SL+ SSSLASVGNAG+VEEPSTNVKFPTSLTLPGCSTS
Subjt:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTL-NRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS

Query:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA
        LSLLGT                                                    + SEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT  DESA
Subjt:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA

Query:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        LSTFR+IFEGRSLKKGTFIFLTWLEPPKMLVSIS DG PTGI+ATIESNNVTS+LFDVFFGDSPVSPTLKASVATGLAAVLK
Subjt:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

A0A1S3BTS4 Chalcone-flavonone isomerase family protein | 5.8e-92 | 70.57
Query:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRN-FRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS
        MAAN  VN TPL LP ++PTGK NSNP ICLLTKP IPTLS++   STF+LN N FRF SN SLK SSSLASVGNAG+VEEPSTNVKFPTSLTLPGCSTS
Subjt:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRN-FRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS

Query:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA
        LSLLGT                                                    + SEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT  DESA
Subjt:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA

Query:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        LSTFR+IFEGRSLKKGTFIFLTWLEPPKMLVSIS +GPPTGI+ATIESNNVTSALFDVFFGDSPVS TLKASVATGLAAVLK
Subjt:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

A0A5A7VAQ8 Chalcone-flavonone isomerase family protein | 5.8e-92 | 70.57
Query:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRN-FRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS
        MAAN  VN TPL LP ++PTGK NSNP ICLLTKP IPTLS++   STF+LN N FRF SN SLK SSSLASVGNAG+VEEPSTNVKFPTSLTLPGCSTS
Subjt:  MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRN-FRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTS

Query:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA
        LSLLGT                                                    + SEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT  DESA
Subjt:  LSLLGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESA

Query:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        LSTFR+IFEGRSLKKGTFIFLTWLEPPKMLVSIS +GPPTGI+ATIESNNVTSALFDVFFGDSPVS TLKASVATGLAAVLK
Subjt:  LSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

A0A6J1CH93 Chalcone-flavonone isomerase family protein | 1.4e-93 | 69.53
Query:  ANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSLSL
        A  A NSTPL+ PS IPT KWNSNPRIC+LTKPP P+LSLYQ QS  +L+ N RFCS+ SLKASSS+ASVGN  YV EPSTNVKF TSL+LPGCSTSLSL
Subjt:  ANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSLSL

Query:  LGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALST
        LGT                                                    +PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT VDE ALST
Subjt:  LGT----------------------------------------------------APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALST

Query:  FRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        FR+IF+GRSLKKGTFIFLTWLEPPKMLVSIS DG PTG+EATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
Subjt:  FRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

A0A6J1FD29 Chalcone-flavonone isomerase family protein | 2.3e-72 | 66.53
Query:  AVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCS---NLSLKASSSLASVGNAGYVEEPSTNVKFPTSL---------TL
        AVN+TPLRLP AIPTGKW+SN R+CL  KPP         QST + N++FR CS   N+S       A +G   Y      N      L          +
Subjt:  AVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCS---NLSLKASSSLASVGNAGYVEEPSTNVKFPTSL---------TL

Query:  PGCSTSLSLLGTAPSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATI
           S+  +++  +PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPT VDESALSTFR+IFEGRSLKKGTFIFLTWLEPPKMLVSIS DGPPTGIEATI
Subjt:  PGCSTSLSLLGTAPSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATI

Query:  ESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK
        ESNNVTSALFDVFFGD+PVSPTLKASVA GL+AVLK
Subjt:  ESNNVTSALFDVFFGDSPVSPTLKASVATGLAAVLK

SwissProt top hits | e value | % identity | Alignment
A7ISP5 Chalcone--flavanone isomerase 2-B | 7.5e-04 | 25.2
Query:  PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTV---DESALSTFRNIFEGRSLKKGTFIFLTWLEPPK----MLVSISTDGPPTGIEATIESNNVT
        P EK ++   +R +DG+ +   + +     +++  T    +E A+  FRN F+ ++   G+ +F  + + P      L+    +  P    A I++  ++
Subjt:  PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTV---DESALSTFRNIFEGRSLKKGTFIFLTWLEPPK----MLVSISTDGPPTGIEATIESNNVT

Query:  SALFDVFFGDSPVSPTLKASVAT
         A+ +   G+ PVSP LK S+AT
Subjt:  SALFDVFFGDSPVSPTLKASVAT

Q53B70 Chalcone--flavanone isomerase 1B-2 | 6.8e-05 | 26.67
Query:  PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTT---VDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGP-PTGIEATIESNNVTSAL
        P EK ++   +R +DG+ +   + +     +++  T    +E A+  FRN F+ ++   G+ +F        + +S S D   P    A I++  ++ A+
Subjt:  PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTT---VDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGP-PTGIEATIESNNVTSAL

Query:  FDVFFGDSPVSPTLKASVAT
         +   G+ PVSP LK S+AT
Subjt:  FDVFFGDSPVSPTLKASVAT

Q53B75 Chalcone--flavanone isomerase 1B-1 | 8.8e-05 | 26.67
Query:  PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTV---DESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGP-PTGIEATIESNNVTSAL
        P EK ++   +R +DG+ +   + +     +++  T    +E A+  FRN F+ ++   G+ +F        + +S S D   P    A I++  ++ A+
Subjt:  PSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTV---DESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGP-PTGIEATIESNNVTSAL

Query:  FDVFFGDSPVSPTLKASVAT
         +   G+ PVSP LK S+AT
Subjt:  FDVFFGDSPVSPTLKASVAT

Q9C8L2 Fatty-acid-binding protein 3, chloroplastic | 2.0e-49 | 51.54
Query:  RFCSNLS-LKASSSLASVGNA-GYVEEPSTNVKFPTSLTLPGCSTSLSLLGT------------------------------------------------
        R C  +S +   S+ +SVGNA  Y EE +T+VKF  S+TLPGCS+ LSLLGT                                                
Subjt:  RFCSNLS-LKASSSLASVGNA-GYVEEPSTNVKFPTSLTLPGCSTSLSLLGT------------------------------------------------

Query:  ----APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSA
            A +EKSLQIVLVRDVDGKTFWDALD+AISPRIK+P++ D +ALSTFR IF+ R L KG+ I LTW+    MLVS+S+ G PT ++ATIES NVTSA
Subjt:  ----APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSA

Query:  LFDVFFGDSPVSPTLKASVATGLAAVL
        LFDVFFGDSPVSPTLK+SVA  LA  L
Subjt:  LFDVFFGDSPVSPTLKASVATGLAAVL

Arabidopsis top hits | e value | % identity | Alignment
AT1G53520.1 Chalcone-flavanone isomerase family protein | 1.4e-50 | 51.54
Query:  RFCSNLS-LKASSSLASVGNA-GYVEEPSTNVKFPTSLTLPGCSTSLSLLGT------------------------------------------------
        R C  +S +   S+ +SVGNA  Y EE +T+VKF  S+TLPGCS+ LSLLGT                                                
Subjt:  RFCSNLS-LKASSSLASVGNA-GYVEEPSTNVKFPTSLTLPGCSTSLSLLGT------------------------------------------------

Query:  ----APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSA
            A +EKSLQIVLVRDVDGKTFWDALD+AISPRIK+P++ D +ALSTFR IF+ R L KG+ I LTW+    MLVS+S+ G PT ++ATIES NVTSA
Subjt:  ----APSEKSLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSA

Query:  LFDVFFGDSPVSPTLKASVATGLAAVL
        LFDVFFGDSPVSPTLK+SVA  LA  L
Subjt:  LFDVFFGDSPVSPTLKASVATGLAAVL
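
In the alignment blocks above, the unlabeled middle row follows the usual BLASTP convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one short segment, the last block of the Q53B70 alignment. The function name `midline_counts` is hypothetical. It reads only the query and middle rows, because the Subjt rows in this record repeat the query text.

```python
def midline_counts(query: str, midline: str) -> dict:
    """Count identities and conservative substitutions from a BLAST midline."""
    # Pad in case trailing spaces were stripped from the midline.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query row")
    identities = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    conservative = midline.count("+")                 # '+' = similar residues
    return {
        "length": len(query),
        "identities": identities,
        "conservative": conservative,
        "percent_identity": round(100.0 * identities / len(query), 2),
    }


# Final segment of the Q53B70 alignment, with the 8-character row prefix removed.
query = "FDVFFGDSPVSPTLKASVAT"
midline = " +   G+ PVSP LK S+AT"
print(midline_counts(query, midline))
```

The percentage here covers this 20-residue segment only. The % identity column in the hit tables refers to the whole alignment, so the two numbers are not expected to match.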


Sequences
CDS sequence
ATGGCTGCAAATGGTGCTGTAAATTCCACCCCATTAAGGCTCCCATCAGCCATTCCCACTGGGAAATGGAATTCAAATCCCAGAATTTGCCTTCTAACAAAACCCCCAAT
TCCTACTCTCTCCTTGTACCAATTTCAGTCTACTTTCACTCTCAACAGGAACTTCAGATTTTGTTCGAATCTCTCTCTCAAAGCTTCTTCTTCCCTTGCTTCAGTTGGAA
ATGCAGGATATGTGGAGGAACCTTCAACCAATGTAAAATTTCCGACGTCCTTGACTCTGCCGGGCTGCTCAACCTCGCTGTCACTGCTCGGAACAGCTCCTTCAGAAAAA
TCTTTACAGATTGTTCTTGTCCGAGACGTCGATGGTAAAACTTTCTGGGACGCCTTGGATGATGCCATTTCTCCAAGAATCAAAGCACCAACAACGGTTGATGAATCTGC
ATTATCTACCTTTCGTAACATCTTCGAGGGACGATCTCTTAAGAAAGGAACTTTCATATTCTTGACTTGGTTGGAACCTCCAAAGATGCTTGTTAGCATCTCAACAGATG
GCCCACCAACAGGAATAGAAGCTACAATTGAATCAAATAATGTGACTTCAGCTCTTTTCGATGTTTTTTTCGGGGACTCCCCTGTTTCTCCTACTTTGAAGGCTTCTGTT
GCAACTGGATTGGCTGCAGTTTTGAAATAG
mRNA sequence
ATGGCTGCAAATGGTGCTGTAAATTCCACCCCATTAAGGCTCCCATCAGCCATTCCCACTGGGAAATGGAATTCAAATCCCAGAATTTGCCTTCTAACAAAACCCCCAAT
TCCTACTCTCTCCTTGTACCAATTTCAGTCTACTTTCACTCTCAACAGGAACTTCAGATTTTGTTCGAATCTCTCTCTCAAAGCTTCTTCTTCCCTTGCTTCAGTTGGAA
ATGCAGGATATGTGGAGGAACCTTCAACCAATGTAAAATTTCCGACGTCCTTGACTCTGCCGGGCTGCTCAACCTCGCTGTCACTGCTCGGAACAGCTCCTTCAGAAAAA
TCTTTACAGATTGTTCTTGTCCGAGACGTCGATGGTAAAACTTTCTGGGACGCCTTGGATGATGCCATTTCTCCAAGAATCAAAGCACCAACAACGGTTGATGAATCTGC
ATTATCTACCTTTCGTAACATCTTCGAGGGACGATCTCTTAAGAAAGGAACTTTCATATTCTTGACTTGGTTGGAACCTCCAAAGATGCTTGTTAGCATCTCAACAGATG
GCCCACCAACAGGAATAGAAGCTACAATTGAATCAAATAATGTGACTTCAGCTCTTTTCGATGTTTTTTTCGGGGACTCCCCTGTTTCTCCTACTTTGAAGGCTTCTGTT
GCAACTGGATTGGCTGCAGTTTTGAAATAG
Protein sequence
MAANGAVNSTPLRLPSAIPTGKWNSNPRICLLTKPPIPTLSLYQFQSTFTLNRNFRFCSNLSLKASSSLASVGNAGYVEEPSTNVKFPTSLTLPGCSTSLSLLGTAPSEK
SLQIVLVRDVDGKTFWDALDDAISPRIKAPTTVDESALSTFRNIFEGRSLKKGTFIFLTWLEPPKMLVSISTDGPPTGIEATIESNNVTSALFDVFFGDSPVSPTLKASV
ATGLAAVLK
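
The protein sequence can be checked against the CDS by translating codons with the standard genetic code. The sketch below is a minimal illustration, not the database's own pipeline. The function name `translate` is hypothetical, and only the first and last 30 nucleotides of the CDS above are used to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 nt of the CDS above. The last CDS line starts in frame
# because the six preceding lines total 660 nt.
cds_start = "ATGGCTGCAAATGGTGCTGTAAATTCCACC"
cds_end = "GCAACTGGATTGGCTGCAGTTTTGAAATAG"
print(translate(cds_start))  # start of the protein
print(translate(cds_end))    # end of the protein, stop codon dropped
```

Both fragments agree with the listed protein, which begins MAANGAVNST and ends ATGLAAVLK. Passing the full 690-nt CDS would be expected to give all 229 residues.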