; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0002028 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0002028
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPolyketide_cyc domain-containing protein
Genome locationchr01:25583084..25589272
RNA-Seq ExpressionPI0002028
SyntenyPI0002028
Gene Ontology termsNA
InterPro domainsIPR005031 - Coenzyme Q-binding protein COQ10, START domain
IPR023393 - START-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135974.1 uncharacterized protein LOC101216857 isoform X1 [Cucumis sativus]1.2e-11595.54Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIAPLTSNS+PNSL LIHRSIRRRNGI+FMAIPTSRSINSRS SLPDL+FKIPR SSKRSRN    PLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLLQRGLKSFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo]1.9e-11896.88Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIAPLTSNSTPNSL LIHRSIRRRNGILFMAIPTSRSINSRS SLPDL+FKIPR SSKRSRNPI P LKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLLQRGLKSFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata]5.1e-11191.07Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS PN+LNL HRSIRR NG+LFMAIPT RSINSRSLSLP LLF+IPR+SSK  RNPI P +KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo]1.5e-11090.62Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS PN+LNL HRSIRR NG+LFMAIPT R+INSRSLSLP LLF+IPR SSK  RNPI P +KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

XP_038898800.1 uncharacterized protein LOC120086302 [Benincasa hispida]2.7e-11292.86Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        M+IA LT NSTPNSL   HRSIRRRNGILFMAIPT RSI+SRSLSLP+L+FKIPR SSKR RNPI PPLK VSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLLQRGLKSFA FAKKYQT+
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

TrEMBL top hitse value%identityAlignment
A0A0A0K6F1 Polyketide_cyc domain-containing protein5.6e-11695.54Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIAPLTSNS+PNSL LIHRSIRRRNGI+FMAIPTSRSINSRS SLPDL+FKIPR SSKRSRN    PLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLLQRGLKSFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X19.3e-11996.88Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MSIAPLTSNSTPNSL LIHRSIRRRNGILFMAIPTSRSINSRS SLPDL+FKIPR SSKRSRNPI P LKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLLQRGLKSFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X13.4e-10586.16Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS A +T NS PNS NL H SI+RRNG+L MAIPT RSINS+S SLP+L+F+IPR S KRS NP FP +KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNP+LSRWSLKY AFGQDIEFSWLA+NLQPT NQKIHWRSLEGLPNRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

A0A6J1H9D8 uncharacterized protein LOC1114612532.5e-11191.07Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS PN+LNL HRSIRR NG+LFMAIPT RSINSRSLSLP LLF+IPR+SSK  RNPI P +KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

A0A6J1KR05 uncharacterized protein LOC1114978781.2e-11090.62Show/hide
Query:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
        MS+A LT NS PN+LNL HRSIRR NG+LFMAIPT RSIN RSLSLP LLF+IPR+SSK  RNPI P +KFVSPVMEWQNCTAKMEVDIPASVAYKCYSD
Subjt:  MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSD

Query:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPT NQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLQRGLKSFATFAKKYQTA
        PLLERLL+RGL+SFATFAKKYQTA
Subjt:  PLLERLLQRGLKSFATFAKKYQTA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein1.7e-5149.76Show/hide
Query:  RNGILFMAIPTSRSINS----RSLSLPDLLFKIPRASSK-----RSRNPIF---PPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFIS
        R+ + F  IP S ++ S     S S P +L  +  +S+       S N +     P  F  PVM+WQ+ T KM VD PASVAYK Y+DRE  PKWMPF+S
Subjt:  RNGILFMAIPTSRSINS----RSLSLPDLLFKIPRASSK-----RSRNPIF---PPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFIS

Query:  SVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLK
        SV+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQP P++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +PVA A++P +E++++ GL+
Subjt:  SVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLK

Query:  SFATFAK
         FA F K
Subjt:  SFATFAK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein4.1e-5049.52Show/hide
Query:  RNGILFMAIPTSRSINS----RSLSLPDLLFKIPRASSK-----RSRNPIF---PPLKFVSPVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFI
        R+ + F  IP S ++ S     S S P +L  +  +S+       S N +     P  F  PVM+WQ+ T  KM VD PASVAYK Y+DRE  PKWMPF+
Subjt:  RNGILFMAIPTSRSINS----RSLSLPDLLFKIPRASSK-----RSRNPIF---PPLKFVSPVMEWQNCT-AKMEVDIPASVAYKCYSDREAIPKWMPFI

Query:  SSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGL
        SSV+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQP P++KIHWRS+EG  NRG VRF+P+GPSSCLVE++ SYEVP   +PVA A++P +E++++ GL
Subjt:  SSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGL

Query:  KSFATFAK
        + FA F K
Subjt:  KSFATFAK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein1.2e-6572.84Show/hide
Query:  SKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWR
        S+RSR  I P  +  S  MEWQ+C+ KMEVD+P SVAY  Y DRE+ PKWMPFISSV+VL+D P LSRWSLKYNAFGQDI++SWLARNLQPTPNQKIHWR
Subjt:  SKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWR

Query:  SLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLKSFATFAK
        SLEGLPN+G VRF+PKGPSSC+VELTVSYEVP LL+PVAS L+P +E LL+ GL+ FA  AK
Subjt:  SLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLKSFATFAK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein1.3e-5651.53Show/hide
Query:  MSIAPLTSNSTPNSL--NLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFK---IPRASSKRS-RNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVA
        MS   + S + P  L     +R+   R+     A P+SR   S S  +  L       P  S+ RS ++ +F   +    +MEWQ C  KM+V++P SVA
Subjt:  MSIAPLTSNSTPNSL--NLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFK---IPRASSKRS-RNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVA

Query:  YKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP
        Y  YS+RE+IPKWM FISSVKVL+D P LSRW+LKY AFGQ++E++WLA+NLQP PNQKIHW SLEGLPN+G VRF+P GPSSC VELT +YEVP LL P
Subjt:  YKCYSDREAIPKWMPFISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP

Query:  VASALQPLLERLLQRGLKSFATFAKKYQT
         A+ALQPL++ L++  L+ FA  AK  +T
Subjt:  VASALQPLLERLLQRGLKSFATFAKKYQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTATTGCACCACTCACTTCCAATTCCACTCCTAACTCTCTTAATCTCATTCATCGATCAATCAGAAGAAGAAATGGCATCTTATTCATGGCGATTCCCACTTCCAG
AAGCATCAATTCCAGGTCCCTTTCTCTACCCGACCTCCTCTTCAAGATCCCACGTGCTTCTTCCAAGCGCAGCAGAAACCCCATTTTCCCTCCCCTCAAATTCGTCTCCC
CTGTAATGGAATGGCAAAATTGCACGGCTAAGATGGAAGTTGACATACCTGCTTCGGTTGCCTATAAATGCTACTCAGATCGTGAAGCTATTCCCAAATGGATGCCATTC
ATTTCATCTGTGAAGGTATTGGAAGATAATCCTACATTATCACGGTGGTCACTAAAATATAATGCTTTTGGTCAAGATATCGAGTTCTCTTGGCTTGCTCGAAACTTGCA
GCCGACCCCAAATCAAAAAATCCACTGGCGGTCTCTTGAAGGTCTCCCAAACAGAGGTGTTGTACGATTTTATCCGAAGGGCCCTTCATCTTGCCTTGTAGAATTGACAG
TCTCCTATGAAGTCCCTCCTCTTTTGTCTCCAGTGGCTTCTGCACTGCAACCTTTGCTTGAGAGATTACTTCAACGAGGTCTCAAAAGCTTTGCTACGTTTGCTAAGAAA
TACCAAACGGCTTGA
mRNA sequenceShow/hide mRNA sequence
AAAACTTGTTTTTGTTGTCTAGAAAAGAAAATTTCAAATCCACTCAACACTGAAAAAAATCCAAAAAGAAAAGAAAATTATGAAATAATTATTAACTAATCATGGTTTCT
CCAAGAAGAAGAGCCTCCCAATATCTAATTTTTCCATAGCCACAATAACATCAAAGGGCTGTTCTTCTTCCAGTTTATTGCAAGTCAAATCTCTTCTTACCACTCTCTTT
CCATGTCCCTTTGCAATTTTCTTCTTCACTAATCCCCAAACCTCACTATGTCTATTGCACCACTCACTTCCAATTCCACTCCTAACTCTCTTAATCTCATTCATCGATCA
ATCAGAAGAAGAAATGGCATCTTATTCATGGCGATTCCCACTTCCAGAAGCATCAATTCCAGGTCCCTTTCTCTACCCGACCTCCTCTTCAAGATCCCACGTGCTTCTTC
CAAGCGCAGCAGAAACCCCATTTTCCCTCCCCTCAAATTCGTCTCCCCTGTAATGGAATGGCAAAATTGCACGGCTAAGATGGAAGTTGACATACCTGCTTCGGTTGCCT
ATAAATGCTACTCAGATCGTGAAGCTATTCCCAAATGGATGCCATTCATTTCATCTGTGAAGGTATTGGAAGATAATCCTACATTATCACGGTGGTCACTAAAATATAAT
GCTTTTGGTCAAGATATCGAGTTCTCTTGGCTTGCTCGAAACTTGCAGCCGACCCCAAATCAAAAAATCCACTGGCGGTCTCTTGAAGGTCTCCCAAACAGAGGTGTTGT
ACGATTTTATCCGAAGGGCCCTTCATCTTGCCTTGTAGAATTGACAGTCTCCTATGAAGTCCCTCCTCTTTTGTCTCCAGTGGCTTCTGCACTGCAACCTTTGCTTGAGA
GATTACTTCAACGAGGTCTCAAAAGCTTTGCTACGTTTGCTAAGAAATACCAAACGGCTTGAAGACTGAGATATTGTTATAATGGGTATCATAGATTACGATTTTTTTTT
CAAAAGTACTCTTATTGAACAAATGTAAAATCTTGAACACAATAATCTGTTTTACTAAAGAATCTCATTTACTTTACAAAAATATGCTGAGTCCAGCTGATTTATTGATA
TATGTTCATACATTATCTGCAATCTGAAGAAGAGTTGATGGTATCTCAGAAAATTATTTACATCACAATAAATGAGATTGCAGTTACTGAACTCTTCAATCTGTTGTTAC
AAATGAGAGGTTTTACAGGAAAGCAATTACACTCATTGGCATCCACTGTTATCCAACCATTTTTGCAAGCTACAAAGAGCCTGTGCCGAGAGACTGCAGTATCTTAACTT
CCGCTGCTGCGGTAGTGGAGAATTGCCGGGATGGAAACTGGAGAAGCAAACAACAACAACAGTGAGGTTATCAAACGAGTCTAATCGCAAAGCCTGCAGGACAAGGTCTC
GTGCACACCGCTCAGGGTCATCGTGGCGTTGAAGTCCCTGTCGAACTACATTGACTGCTTGCTGGCTTGACATCACATCCCATATCCCGTCGCAGGCTATGATCAAGAAC
TCATCCTCTTCTGTTAGAACCATCTGCCTGCATTCTGGTTCCGCAATGAGAGGTGAAGGAGCACCATCTGGGAGCTTCATGTCCCAATCCCCTAAGGCCCTGGAAACCGA
TAAAACACCGTTGAGATAGCCACCATCAACATAGCCACCCAACTCTTCTACTCGTTGCCTCTCCGGAGAATAAACCGGCCGGTGGTCTTGAGACATATCTACTGCCTCTC
CATTCCGGCTGAGAACTGCTCGGCAGTCTCCAGCATTTGCCACCATTAAAAGCCTGAAGAACAGAAACACACAATCAGCCATATGACAATTGACAATGCAGTATCACTAG
ACTATACAGCGAAATCGAATTGATGGATTAATGGAAACCAATAACAGCAGACTACAAACAGAGATTGATGGATCATTACATTGGAGAGGGCAAACATACCTTCCAAGAAC
AAGAGCTGTCAGGGCCGTCGTCCCGGACGAGCTACTGACACTGGAATCATCAGCAAGAGCACGATCAGCAAGAAGAAATGCTTTTCGGAGGCAAGTTTCAATCTCCTCCG
GCAGGACTTCATCGATATCTGGAATTTGAGGGAAACTAACATCCTCGAAAAAGAGTCTAAGAACATTCTTTCTGATATAAGATGCTGCTTCAGGACCTCCATGTCCATCA
AACACCTGATCAACAGATTCAACACATCAAACTCCCACCTCAAATCACAACAAACAATCATCCATATTATACAAAGAAGACAACCCAAAAACAAAGGACTCAAACTCACC
CCATAGAAAGCACTTGGCCTGGGAAATTTAAAGAGCGATCCTAAATGTGAAGACAAATCATCTATCCTTATATGTTCATCTTCCATGTATCTCCTAGGTCCAATATCAGC
AAAACTCCCCGACCGGAGGCTCGGAAATTCCCTTCCGTTCACCGTATCGACATCGGAGTCTGGAATTTTCTTCATGGGTTTATCCTGAAGAAAGAAATATCATAAAACTA
AAAATCATCATTCCAAGAGACTAAACCCCAAAATTCCCTATAATCGAACCCGATCACTACAGAAACAAGTTCATAACGATCACAGTTTCTTACCCTAAACCAAAGGCTGA
ATTTCAGAAGAACAAAAACCCATAAGAAGGAGAAGAAGAACATACCGATTGAGAACTGGAACAAACCGATTCGGAAGCACGAACACGATCAAGAGTCGGATCAATCGAAA
TCCCACCATCGATGGACTCAATTTCATCAACAAGACAAGTCTCTTTACCAAAATACGGAACCTCCAAAACCGGGAGGCTCTTCGAACAAACAACAACCTCAGCTTCAGCC
ACCATTTTTACATTTACCAAAGAATCAGATTCTTCAACAGAGAGATCAAAATGGTTATAATCAGAAGATGGGCAACTTGGAATTTGAATTGAAATTGGAATTGGAATTGG
AATCGCTATGGGAATTGGAAATTCTATTTTTTTTTTCTTTGTTCAGGAGATTTTAGAACGTTGATCAGAAATTTTTTTGATCAAGTCGGCGATGAAGAAGAATATATAGT
GGAGCCGCCGCGGGTGGGAGTAAATTCGTGGAAGAAGACGGAGCACGTAACGCCGTTATCGGGACTTGTCCGTGAAGACACGTTAATGTTCCCACATTGACGCGTGGCAT
ATGATTTTCATTTTTATTTTTATTTTTATTTTCTTTTTTAGCATTGATGGAATTACTCAAATAGCCTTCCTTTGAATTGCGTTGACTTTTTTAGGTCTTTTTTTAAAAAA
AAAATTTATGGAGTTGGTGTGACGTAGGATAATGGGTGTAGTAGCTATAGAATTTTCTAGCGACTTAATGGAAGCTTCTCTTATTTTCTTTTTTCAA
Protein sequenceShow/hide protein sequence
MSIAPLTSNSTPNSLNLIHRSIRRRNGILFMAIPTSRSINSRSLSLPDLLFKIPRASSKRSRNPIFPPLKFVSPVMEWQNCTAKMEVDIPASVAYKCYSDREAIPKWMPF
ISSVKVLEDNPTLSRWSLKYNAFGQDIEFSWLARNLQPTPNQKIHWRSLEGLPNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLQRGLKSFATFAKK
YQTA