; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh08G004890 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh08G004890
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionPolyketide_cyc domain-containing protein
Genome locationCma_Chr08:2784440..2790424
RNA-Seq ExpressionCmaCh08G004890
SyntenyCmaCh08G004890
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo]1.4e-10889.29Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS PN+L L HRSIRR NG+LFMAIPT RSIN RS SLP L+F+IPR+SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_022148300.1 uncharacterized protein LOC111016984 isoform X1 [Momordica charantia]2.0e-10788.84Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVL MAIPTRRSIN +S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]2.9e-12299.55Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSIN RSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023004647.1 uncharacterized protein LOC111497878 [Cucurbita maxima]5.8e-123100Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]1.9e-12198.66Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRR+IN RSLSLPKLLFRIPR SSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

TrEMBL top hitse value%identityAlignment
A0A0A0K6F1 Polyketide_cyc domain-containing protein6.9e-10687.95Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS+PN+L L HRSIRR NG++FMAIPT RSIN RS SLP L+F+IPR+SSK  RNP+    KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X16.7e-10989.29Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS PN+L L HRSIRR NG+LFMAIPT RSIN RS SLP L+F+IPR+SSK  RNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X19.6e-10888.84Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS AA+TCNSNPN+ NLSH SI+R NGVL MAIPTRRSIN +S SLP+L+FRIPR S K   NP  PRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1H9D8 uncharacterized protein LOC1114612531.4e-12299.55Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSIN RSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC1114978782.8e-123100Show/hide
Query:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKRGLESFATFAKKYQTA
        PLLERLLKRGLESFATFAKKYQTA
Subjt:  PLLERLLKRGLESFATFAKKYQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein3.7e-5148Show/hide
Query:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCTAKMEVDIPASVA
        S+A+ T ++  +T  L  R I +S  VL M       ++  S S P++L  +  SS+   +         IS   K   PVM+WQ+ T KM VD PASVA
Subjt:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCTAKMEVDIPASVA

Query:  YKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP
        YK Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQP  ++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +P
Subjt:  YKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP

Query:  VASALQPLLERLLKRGLESFATFAK
        VA A++P +E++++ GLE FA F K
Subjt:  VASALQPLLERLLKRGLESFATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein9.1e-5047.79Show/hide
Query:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCT-AKMEVDIPASV
        S+A+ T ++  +T  L  R I +S  VL M       ++  S S P++L  +  SS+   +         IS   K   PVM+WQ+ T  KM VD PASV
Subjt:  SLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRR-------NPISPRVKFVSPVMEWQNCT-AKMEVDIPASV

Query:  AYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLS
        AYK Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQP  ++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +
Subjt:  AYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLS

Query:  PVASALQPLLERLLKRGLESFATFAK
        PVA A++P +E++++ GLE FA F K
Subjt:  PVASALQPLLERLLKRGLESFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein4.5e-6572.12Show/hide
Query:  SSSKCRRNP-ISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKI
        SSS  RR+  I P+ +  S  MEWQ+C+ KMEVD+P SVAY  Y DRE+ PKWMPFISSV+VL+D P LSRWSLKYNAFGQDI++SWLARNLQPT NQKI
Subjt:  SSSKCRRNP-ISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKI

Query:  HWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK
        HWRSLEGLPN+G VRF+PKGPSSC+VELTVSYEVP LL+PVAS L+P +E LL+ GLE FA  AK
Subjt:  HWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein1.7e-5667.11Show/hide
Query:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG
        +MEWQ C  KM+V++P SVAY  YS+RE+IPKWM FISSVKVL+D P LSRW+LKY AFGQ++E++WLA+NLQP  NQKIHW SLEGLPN+G VRF+P G
Subjt:  VMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKG

Query:  PSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQT
        PSSC VELT +YEVP LL P A+ALQPL++ L+K  LE FA  AK  +T
Subjt:  PSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKKYQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACTTGCAGCGCTCACCTGTAATTCGAATCCAAACACTCTGAATCTCAGTCATCGATCGATCAGAAGGAGTAATGGCGTCTTATTCATGGCGATTCCCACTCGCAG
AAGCATCAATTTGAGGTCACTATCTCTTCCCAAGCTCCTCTTCAGAATCCCACGGAGTTCTTCGAAGTGCCGCAGAAACCCCATTTCCCCTCGAGTTAAATTCGTCTCCC
CTGTGATGGAATGGCAGAATTGCACGGCCAAGATGGAAGTAGACATTCCTGCCTCGGTTGCTTATAAATGCTACTCAGATCGTGAAGCCATTCCCAAATGGATGCCCTTC
ATTTCATCTGTGAAGGTTTTGGAAGATAATCCTACTTTATCACGGTGGTCATTAAAATACAATGCCTTTGGTCAAGATATTGAGTTCTCTTGGCTTGCCCGGAACTTGCA
GCCGACCCTCAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCAAACAGAGGTGTTGTTCGATTTTATCCAAAAGGCCCATCATCTTGCCTCGTAGAACTGACTG
TCTCTTATGAAGTTCCTCCTCTTTTATCTCCAGTGGCATCTGCATTGCAACCTTTGCTGGAGAGATTACTTAAACGAGGTCTTGAAAGCTTTGCCACATTTGCTAAGAAA
TACCAAACGGCTTGA
mRNA sequenceShow/hide mRNA sequence
GACGGCCTTGATATTGTTGTGGCTCTGAGAGGAGAGAGATTTTCAGAGGCTGTTCTTGCATTTGGTGCCCTCAACTCTCTACTCTCCACTCCCTTTGTGTCCAGTCGCTC
TCTTCTTCTTCACGAATTTCGAATCCCAGTATGTCACTTGCAGCGCTCACCTGTAATTCGAATCCAAACACTCTGAATCTCAGTCATCGATCGATCAGAAGGAGTAATGG
CGTCTTATTCATGGCGATTCCCACTCGCAGAAGCATCAATTTGAGGTCACTATCTCTTCCCAAGCTCCTCTTCAGAATCCCACGGAGTTCTTCGAAGTGCCGCAGAAACC
CCATTTCCCCTCGAGTTAAATTCGTCTCCCCTGTGATGGAATGGCAGAATTGCACGGCCAAGATGGAAGTAGACATTCCTGCCTCGGTTGCTTATAAATGCTACTCAGAT
CGTGAAGCCATTCCCAAATGGATGCCCTTCATTTCATCTGTGAAGGTTTTGGAAGATAATCCTACTTTATCACGGTGGTCATTAAAATACAATGCCTTTGGTCAAGATAT
TGAGTTCTCTTGGCTTGCCCGGAACTTGCAGCCGACCCTCAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCCAAACAGAGGTGTTGTTCGATTTTATCCAAAAG
GCCCATCATCTTGCCTCGTAGAACTGACTGTCTCTTATGAAGTTCCTCCTCTTTTATCTCCAGTGGCATCTGCATTGCAACCTTTGCTGGAGAGATTACTTAAACGAGGT
CTTGAAAGCTTTGCCACATTTGCTAAGAAATACCAAACGGCTTGAAGACGGGTATTTGTGTCATATGGTATGGATATCACCATTTTTTACTAGATACTCTTATTGACAAA
TGTGAAATCTTGAACAGAATACTTTAATACAATCATCTGTTTCAATAAGAAATTAATGTACTTTACAAAAATATTCCGAGTCCAGCTGATTTATTGGGTTATATATGTTC
TTAATCTCAAAAAATTATTTACATCACAATGAATGGGATAGCAGTTCTGAACTCTTCAATCTGTTTGTTACAAATGAGAGTTGTTACAGGAAATGGTAGACTCATTGGCA
TCCACTCTTATCCAACGATCTCTGCAAGCTACAAAGGGCTTGAGCAGAGAGGCTGCAGTATCTCAATTTTCGTTGCTGTGGTTGTGGAGAGTTGTCCCTATGGAAACTAG
AGAAGCAAACAATAACAACGGTGAGATTATCGAATGAGTCTAATCCCAAAGCCCGCAGGACGAGATCTCGAGCGCAACGCTCAGGGTTGTCATGTCGCTGAAGTCCACGG
CGAACTACATCGACTGCATGCTGGCTTGACATCACGTCCCAAATCCCATCGCAGGCTATGATTAAGAACTCATCCTCCTCTGTTAGAACCACCTGCCTGCATTCTGGTTC
TGCAATGAGGGGTGAAGGAGAACCATGGGGGAGGAGCTTCATGTCCCAATCACCCAAGGCCCTGGAAACTGATAAAAGGCCGTTAAGATAGCCACCATCGACGAAGCCAC
CCAACTCTTCTACTCGTTCCTTCTCCGGTGAATAAATTGGCCTGTGGTCTTGAGACATATCTACTGCCTCTCCCTTCCGACTAAGAACAGCCCGGCAATCTCCAACATTA
GCCACCATTAGAAGCCTGAAGAACAGAAACATATGGTCAACAACATAGCTAGCTACCTCCTATTTGTGATTTCTCCCAATCATATGAACAATGCAGTGCTATTCTACAAA
AGCTAGAGATGAAAATCTAAATAGACAGTCGAAACCAGAACAAGACAGCATAATCGTATGAGTAGGTCAACAAAAACCAATAAAACCATGATCAGCCACAAAACTACCAC
CAACTTTTGGATTCTCCTAACCATATAAACAAATGCAGTACCACCCAACAAGGTGGCCGGACACAATCATCTAAAAGACAGTGAATTCGTATGAGTAGATTAACGAAAGG
CAGCAACACCCGACTACATTTTACCGAGATTGACATCAGGTATTGTGAGATCCCACAGGAGGAGAACCAAGCATTCCTTAGTGTGAAACCTCTCCTTATAGACTTGGACT
ATTACAATGGGTACCAGACACCGGGTGACACTGCCCAGTGAGGATACTGGCCCAAGGAGGTAGATTGTAAGATCCCACCTCGGTTGGAGAGAAGAACGAAGTAATGCTTG
CAGAGGTGTGGAAATCTCTCCCCAAGGGTGTGGAAATCTCTCCGTATCAATATCTTTGTCAGGGTGGGGAAATCTCTCCCTATCAAGCCTAGGGGCCTCATGTTACAACC
TAAAAGATGCGTTTTAAAACCCAAGCACTTGGGCTCATGAAGGATGATTACATAAGATTGGAAAAACATACCTTCCAAGCACAAGAGCAGTGAGGGCCGTCGTCCCAGAC
AAATTACTAATCCCAGAATCATCGGCCAAAGCTTGATCGGCCAGAAGAAATGCTTCTCGTAAGAAATTTTCAATCTCCTCCGGCCGGAGTTCATCAACATCAGGCATGTG
AGGAAACCTAACATCTTCAAAAAAGAGTCTAAGAATATTCTTTCTTATATAAGCTGCTGCTTCAGGTCCTCCATGTCCATCAAACACCTGAGATCAGCAAACTCAAGTCA
TAATGACATCCAAAGCTACACAAACAAGGCAACCCACTCAACAAAGAAACTCAACAACTCACCCCATAGAAACCACTTGGCTTAGAAAACTTAAAGAGCGAGCCTAAATG
TGTACACAGATCGTCAATCCTTATGTGTTCATCCTCCATGTCTCTCCTAGGTCCAATATCAGCAAAGCTCCCCGACCGGAGACTTGGAAATTCCCTCTGCTCCATCGAAT
CGACGTCGGTATCTGGAATTTGCTTCACACCCTGAAGAATTAAACATCATAAGAACTCGCACACATACCGAAACCATCGCTACAGAAACAAAATCATGACAATCACTGTT
TCTATACCCAAAAACAAACAAAAACCCCTGAGAAGAAGAAGAAGAAGAAGAAGAACATACCGATTGAGAACTCGAACAAACCGATTCGGAAGCGCGAACACGATCGAGAG
TCGGATCAACCGGAACCGCACCGTCGATGGACTTCATTTCATCAACAAGATGAGCCTCTTTACCGAAATAGGGGACCTCCAAAACAGGGAGGCTCTTCGAACAGAAGACA
ACCTTAGTTTCAGCCACCATTTTTACATTTACCAAAAAATCAGATTCTTCAACTGAGAGATCAAAATCACTATAATCTGCAGAAATGTAAGTTGGAGTTGGAATCGGATT
TTTCTCTCTCTTTGCTATGGAAATTTGAGAATAATGATGAGGAAATTTCAGTAATAAAGAAGTCGGCAATGAAGAAGGGATATATAGTGGGAGCCGCCGCGGGAGGGAGT
AAATTCGTGGAAGGAGACGGAGCACGTAACGCCGTTACAGAGACTTGCCTGTGAAGCCACGTTAACGTTTCCTCATCGACACTTGGCTTTATGAAATTACTTAAACACCC
TTGCTTTAACTTCGTTGACCTTTTTAGCCTCTCT
Protein sequenceShow/hide protein sequence
MSLAALTCNSNPNTLNLSHRSIRRSNGVLFMAIPTRRSINLRSLSLPKLLFRIPRSSSKCRRNPISPRVKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPF
ISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKRGLESFATFAKK
YQTA