CuGenDBv2

Clc09G05080 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc09G05080
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Polyketide_cyc domain-containing protein
Genome location: ClcChr09:4009799..4013338
RNA-Seq Expression: Clc09G05080
Synteny: Clc09G05080
Gene Ontology terms: NA
InterPro domains: IPR005031 - Coenzyme Q-binding protein COQ10, START domain
                  IPR023393 - START-like domain superfamily
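
The genome location above is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the field could be split and the genomic span computed as follows. The function name `parse_location` is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("ClcChr09:4009799..4013338")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3540 bp on ClcChr09, which is longer than the mRNA listed under Sequences and would be consistent with the presence of introns.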


Homology
GenBank top hits (e value, % identity, alignment)
XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo] (e value: 1.3e-112; % identity: 92.86)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MSIA LT NSTPNSL L H SIRRRNGILFMAIPT RSINSRS SLP+LVFKIPR SS RSRNPICPRLKFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLLQRGL+SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata] (e value: 6.9e-111; % identity: 90.18)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN+LNL+H SIRR NG+LFMAIPT RSINSRSLSLP+L+F+IPR SS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

XP_023004647.1 uncharacterized protein LOC111497878 [Cucurbita maxima] (e value: 3.4e-110; % identity: 89.73)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN+LNL+H SIRR NG+LFMAIPT RSIN RSLSLP+L+F+IPR SS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo] (e value: 3.1e-111; % identity: 90.18)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN+LNL+H SIRR NG+LFMAIPT R+INSRSLSLP+L+F+IPRGSS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

XP_038898800.1 uncharacterized protein LOC120086302 [Benincasa hispida] (e value: 1.1e-111; % identity: 92.41)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        M+IAALTCNSTPNS  L H SIRRRNGILFMAIPTCRSI+SRSLSLPELVFKIPR SS R RNPICP LK VSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLLQRGL+SFA FAKKYQT+
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6F1 Polyketide_cyc domain-containing protein (e value: 3.4e-108; % identity: 90.62)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MSIA LT NS+PNSL L H SIRRRNGI+FMAIPT RSINSRS SLP+LVFKIPR SS RSRNP    LKFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLLQRGL+SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X1 (e value: 6.1e-113; % identity: 92.86)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MSIA LT NSTPNSL L H SIRRRNGILFMAIPT RSINSRS SLP+LVFKIPR SS RSRNPICPRLKFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLLQRGL+SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X1 (e value: 1.9e-106; % identity: 87.05)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS AA+TCNS PNS NL+H SI+RRNG+L MAIPT RSINS+S SLPELVF+IPRGS  RS NP  PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPTLNQKIHWRSLEGLPNRGVVRFYPK  SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

A0A6J1H9D8 uncharacterized protein LOC111461253 (e value: 3.3e-111; % identity: 90.18)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN+LNL+H SIRR NG+LFMAIPT RSINSRSLSLP+L+F+IPR SS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC111497878 (e value: 1.7e-110; % identity: 89.73)
Query:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD
        MS+AALTCNS PN+LNL+H SIRR NG+LFMAIPT RSIN RSLSLP+L+F+IPR SS   RNPI PR+KFVSPVMEWQ+CTAKMEVDIPASVAYKCYSD
Subjt:  MSIAALTCNSTPNSLNLAHPSIRRRNGILFMAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPK PSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLRSFAKFAKKYQTA
        PLLERLL+RGL SFA FAKKYQTA
Subjt:  PLLERLLQRGLRSFAKFAKKYQTA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value: 7.2e-50; % identity: 49.03)
Query:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--ICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISS
        R+ + F  IP   ++ S     S S P ++  +   S+N      S N   I    K   PVM+WQD T KM VD PASVAYK Y+DRE  PKWMPF+SS
Subjt:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--ICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISS

Query:  VKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRS
        V+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQP  ++KIHWRS+EG  NRG VRF+P+ PSSCLVE++ SYEVP   +PVA A++P +E++++ GL  
Subjt:  VKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRS

Query:  FAKFAK
        FA F K
Subjt:  FAKFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value: 1.8e-48; % identity: 48.79)
Query:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--ICPRLKFVSPVMEWQDCT-AKMEVDIPASVAYKCYSDREAIPKWMPFIS
        R+ + F  IP   ++ S     S S P ++  +   S+N      S N   I    K   PVM+WQD T  KM VD PASVAYK Y+DRE  PKWMPF+S
Subjt:  RNGILFMAIPTCRSINS----RSLSLPELVFKIPRGSSN-----RSRNP--ICPRLKFVSPVMEWQDCT-AKMEVDIPASVAYKCYSDREAIPKWMPFIS

Query:  SVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLR
        SV+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQP  ++KIHWRS+EG  NRG VRF+P+ PSSCLVE++ SYEVP   +PVA A++P +E++++ GL 
Subjt:  SVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLR

Query:  SFAKFAK
         FA F K
Subjt:  SFAKFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value: 1.5e-63; % identity: 57.21)
Query:  MSIAALTCNS------TPNSLNLAHPSIRRRNGILFMAIP----TCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIP
        MS+ A+  N+      T  + N+  P    R    +  IP    T    +S S S+           S RSR  I P+ +  S  MEWQDC+ KMEVD+P
Subjt:  MSIAALTCNS------TPNSLNLAHPSIRRRNGILFMAIP----TCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIP

Query:  ASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPP
         SVAY  Y DRE+ PKWMPFISSV+VL+D P LSRWSLKYNAFGQDI++SWLARNLQPT NQKIHWRSLEGLPN+G VRF+PK PSSC+VELTVSYEVP 
Subjt:  ASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPP

Query:  LLSPVASALQPLLERLLQRGLRSFAKFAK
        LL+PVAS L+P +E LL+ GL  FA  AK
Subjt:  LLSPVASALQPLLERLLQRGLRSFAKFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein (e value: 9.7e-55; % identity: 65.1)
Query:  VMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKS
        +MEWQ+C  KM+V++P SVAY  YS+RE+IPKWM FISSVKVL+D P LSRW+LKY AFGQ++E++WLA+NLQP  NQKIHW SLEGLPN+G VRF+P  
Subjt:  VMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKS

Query:  PSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQT
        PSSC VELT +YEVP LL P A+ALQPL++ L++  L  FA+ AK  +T
Subjt:  PSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQT


Sequences
CDS sequence
ATGGAATCTGATTATGTAGATATCACTTGTGCTCTCAATAAGCTATCATCTGATATGTCAAAAATAAAAAATGTGGTTGATGCCTCGTTGGTTCTTGCACTTCAGCCTCC
GAAAATCTCTTTCCAGAGCCACAATAATATCAAAGGGCTGTTCTTGCTGTTTGGTGCATCTCAAATCTCTGCTCCAACTCCCTTTAATGTCCTCTTGCTGTTTTCTTCTT
CACTAATCCAAAATCCCACTATGTCAATTGCAGCACTCACCTGCAATTCTACTCCAAACTCTCTCAATCTCGCTCATCCATCGATCAGAAGGAGAAATGGCATCTTGTTC
ATGGCGATTCCCACTTGCAGAAGCATCAATTCGAGGTCACTCTCTCTACCCGAGCTCGTCTTCAAGATCCCACGTGGTTCTTCGAACCGCAGCAGAAACCCCATTTGCCC
TCGACTTAAATTCGTCTCCCCTGTGATGGAATGGCAGGATTGCACGGCTAAGATGGAAGTTGACATACCTGCTTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTA
TTCCCAAATGGATGCCATTCATTTCATCTGTGAAGGTATTGGAAGATAACCCTACATTATCACGGTGGTCACTAAAATATAATGCTTTTGGTCAAGATATCGAGTTCTCT
TGGCTTGCTCGAAACTTGCAGCCGACCCTAAATCAAAAAATTCATTGGCGGTCACTTGAAGGTCTTCCCAACAGAGGTGTTGTACGATTTTACCCAAAGAGCCCCTCATC
TTGCCTCGTAGAATTGACAGTCTCCTATGAAGTTCCTCCTCTTTTGTCTCCAGTGGCCTCTGCACTGCAACCTCTGCTTGAGAGATTACTTCAACGAGGTCTTAGAAGCT
TTGCCAAGTTTGCCAAGAAATACCAAACGGCTTGA
mRNA sequence
ATGGAATCTGATTATGTAGATATCACTTGTGCTCTCAATAAGCTATCATCTGATATGTCAAAAATAAAAAATGTGGTTGATGCCTCGTTGGTTCTTGCACTTCAGCCTCC
GAAAATCTCTTTCCAGAGCCACAATAATATCAAAGGGCTGTTCTTGCTGTTTGGTGCATCTCAAATCTCTGCTCCAACTCCCTTTAATGTCCTCTTGCTGTTTTCTTCTT
CACTAATCCAAAATCCCACTATGTCAATTGCAGCACTCACCTGCAATTCTACTCCAAACTCTCTCAATCTCGCTCATCCATCGATCAGAAGGAGAAATGGCATCTTGTTC
ATGGCGATTCCCACTTGCAGAAGCATCAATTCGAGGTCACTCTCTCTACCCGAGCTCGTCTTCAAGATCCCACGTGGTTCTTCGAACCGCAGCAGAAACCCCATTTGCCC
TCGACTTAAATTCGTCTCCCCTGTGATGGAATGGCAGGATTGCACGGCTAAGATGGAAGTTGACATACCTGCTTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTA
TTCCCAAATGGATGCCATTCATTTCATCTGTGAAGGTATTGGAAGATAACCCTACATTATCACGGTGGTCACTAAAATATAATGCTTTTGGTCAAGATATCGAGTTCTCT
TGGCTTGCTCGAAACTTGCAGCCGACCCTAAATCAAAAAATTCATTGGCGGTCACTTGAAGGTCTTCCCAACAGAGGTGTTGTACGATTTTACCCAAAGAGCCCCTCATC
TTGCCTCGTAGAATTGACAGTCTCCTATGAAGTTCCTCCTCTTTTGTCTCCAGTGGCCTCTGCACTGCAACCTCTGCTTGAGAGATTACTTCAACGAGGTCTTAGAAGCT
TTGCCAAGTTTGCCAAGAAATACCAAACGGCTTGAAGACTCGATTATTGTTTTATGGGTATCATACATCACCCCATACATTACCATTTTTACCATATACTCCTATTGACA
AATGTAAAATCTTGAACAGAATACTTGAACACAATCATCATCTGTTTTTACTAAAGAATTTCATCTTCTTTACAAAAATATGCTGAGCCCAGCTGATTTATTGATATATG
TTCA
Protein sequence
MESDYVDITCALNKLSSDMSKIKNVVDASLVLALQPPKISFQSHNNIKGLFLLFGASQISAPTPFNVLLLFSSSLIQNPTMSIAALTCNSTPNSLNLAHPSIRRRNGILF
MAIPTCRSINSRSLSLPELVFKIPRGSSNRSRNPICPRLKFVSPVMEWQDCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFS
WLARNLQPTLNQKIHWRSLEGLPNRGVVRFYPKSPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLRSFAKFAKKYQTA