; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC01G012930 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC01G012930
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Description1-deoxy-D-xylulose-5-phosphate synthase
Genome locationCicolChr01:24911403..24915974
RNA-Seq ExpressionCcUC01G012930
SyntenyCcUC01G012930
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019828.1 hypothetical protein SDJN02_18792 [Cucurbita argyrosperma subsp. argyrosperma]7.3e-13891.47Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FYILILGLPLCLFY+WLSS LL KGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGVSLNRRQCL LISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKS+ +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPAVGEEADLGVLVFL VYF LPHYLCIKSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

XP_004152468.1 uncharacterized protein LOC101221499 isoform X1 [Cucumis sativus]3.5e-14093.8Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMALTTL+DGRFSPHHTL YT+NAFFGPDIGSFSDWLSSVLGFSASS+PD IHHP+FYILILGLPLCLFYSWLSSFLLHKGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKS KS+S+QSYQSVKL+VVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFLV YF LPHYLCIKSMQPKD E KHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

XP_022923863.1 uncharacterized protein LOC111431454 [Cucurbita moschata]2.5e-13891.86Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FYILILGLPLCLFY+WLSS LL KGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKS+ +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPAVGEEADLGVLVFL VYF LPHYLCIKSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

XP_023519820.1 uncharacterized protein LOC111783152 [Cucurbita pepo subsp. pepo]1.1e-13891.47Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FYILILGLPLCLFY+WLSS LLHKGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+V+GFLCTCLIGGFVYINRVKSAKS+ +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYW +PRRPAVGEEADLGVLVFL VYF LPHYLCIKSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

XP_038893660.1 uncharacterized protein LOC120082528 isoform X1 [Benincasa hispida]1.7e-13993.41Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSG ALTTLTDGRFSPHHTL+YT+NAFFGPDIGSFSDWLSSVLGFS SS+PDAIHHP+FYILILGLPLCLFYSWLSSFLL KGLL+SV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
         GVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINR KSAKS+S+QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFLV YF LPHYLCIKSMQPKDLEIKHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

TrEMBL top hitse value%identityAlignment
A0A0A0LR72 Uncharacterized protein1.7e-14093.8Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMALTTL+DGRFSPHHTL YT+NAFFGPDIGSFSDWLSSVLGFSASS+PD IHHP+FYILILGLPLCLFYSWLSSFLLHKGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKS KS+S+QSYQSVKL+VVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFLV YF LPHYLCIKSMQPKD E KHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

A0A1S3AWH9 uncharacterized protein LOC103483387 isoform X12.5e-13691.86Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTL Y++NAFFGPDIGSFSDWLSSVLGF ASSLPDAIHHP+FYILILGLPLCLFYSWLSSFLL KGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
         GVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV VVGFLC CLIGGFVYINRVKS KS+S+Q +QSVKL+VVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEAD GVLVFLV YF LPHYLCIKSMQPKD E KHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

A0A6J1C7B1 uncharacterized protein LOC1110090543.4e-13389.53Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMAL TL+DGRFSPHH LVYT+NAFFGPDIGSFS+WL+S+LGFS SS+PDAIHHPLFYILILGLPLCLFYS+LSS LL KG LDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGVSLN RQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWW NRAPINPDAV++VGFLC CLIGGFVYINRVKSAKS+ +QSYQSVKLIVVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFL  YF LPHYLCIKSM PKDLEI+ LPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

A0A6J1EAR6 uncharacterized protein LOC1114314541.2e-13891.86Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FYILILGLPLCLFY+WLSS LL KGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKS+ +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPAVGEEADLGVLVFL VYF LPHYLCIKSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

A0A6J1KN89 uncharacterized protein LOC1114950332.3e-13790.7Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FYILILGLPLCLFY+WLSSFLL KGLLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL
        FGV LN+ QCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKS+ +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPA GEEADLGVLVFL VYF LPHYLCIKSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGGACCTGGGCCTCACATGCTCTACGCCATGGGCTCAGGCATGGCTCTGACCACTCTCACCGACGGTCGATTCAGCCCCCACCATACCCTCGTTTACACCCTCAA
CGCCTTCTTCGGCCCCGACATCGGCTCCTTCTCCGATTGGCTCTCCTCCGTTCTCGGTTTCTCAGCATCTTCCCTTCCCGACGCAATCCATCATCCCCTCTTCTACATCC
TCATTCTGGGTCTTCCTCTCTGCCTCTTCTACTCCTGGCTCTCCTCTTTTCTACTCCACAAGGGCCTCCTCGATTCCGTCTTTGGGGTATCTCTTAATAGAAGGCAATGC
TTGTTGTTAATCTCTGCTGGTTCTTTCTCACACTTCTTTCTTGACCATTTGTTTGAGGAAAATGGGCATTCGTCAATGTATACTTGGATATTGAGCACTGGTTGGTGGGA
GAACCGAGCGCCAATTAATCCAGATGCTGTTCTCGTTGTTGGATTCTTGTGCACTTGCTTAATCGGCGGCTTTGTATACATTAATAGAGTGAAGTCTGCAAAGTCGATGT
CAGAACAATCGTATCAGTCGGTAAAGCTTATCGTAGTTGTAGCTACCCTATATTCCATGTGGTGTGCAAGCCAGATATACTGGGCTAGCCCTCGTCGACCAGCTGTCGGT
GAAGAAGCCGACCTTGGAGTTTTAGTGTTTCTGGTTGTCTATTTTTCTCTACCTCATTATCTTTGTATAAAGTCCATGCAACCAAAAGATCTCGAAATCAAACATCTCCC
ATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATTTAAGGATTCTTCTTCATACTTGCCATAAAGTCTCATCAAGTTAATAGCTTCCGTTCCCTTCGACACTACTGTGTATTGAGATCGGAAACTTGGGTTGCGATTGCGAC
TTCGATCGTTGAGACGGAACCATGCCTGGACCTGGGCCTCACATGCTCTACGCCATGGGCTCAGGCATGGCTCTGACCACTCTCACCGACGGTCGATTCAGCCCCCACCA
TACCCTCGTTTACACCCTCAACGCCTTCTTCGGCCCCGACATCGGCTCCTTCTCCGATTGGCTCTCCTCCGTTCTCGGTTTCTCAGCATCTTCCCTTCCCGACGCAATCC
ATCATCCCCTCTTCTACATCCTCATTCTGGGTCTTCCTCTCTGCCTCTTCTACTCCTGGCTCTCCTCTTTTCTACTCCACAAGGGCCTCCTCGATTCCGTCTTTGGGGTA
TCTCTTAATAGAAGGCAATGCTTGTTGTTAATCTCTGCTGGTTCTTTCTCACACTTCTTTCTTGACCATTTGTTTGAGGAAAATGGGCATTCGTCAATGTATACTTGGAT
ATTGAGCACTGGTTGGTGGGAGAACCGAGCGCCAATTAATCCAGATGCTGTTCTCGTTGTTGGATTCTTGTGCACTTGCTTAATCGGCGGCTTTGTATACATTAATAGAG
TGAAGTCTGCAAAGTCGATGTCAGAACAATCGTATCAGTCGGTAAAGCTTATCGTAGTTGTAGCTACCCTATATTCCATGTGGTGTGCAAGCCAGATATACTGGGCTAGC
CCTCGTCGACCAGCTGTCGGTGAAGAAGCCGACCTTGGAGTTTTAGTGTTTCTGGTTGTCTATTTTTCTCTACCTCATTATCTTTGTATAAAGTCCATGCAACCAAAAGA
TCTCGAAATCAAACATCTCCCATTGTGATAGATGAGATATTCATTTATATTAGAAAAGTCTCGATTGACAACAGTTCCCTTTTCCCACTTTGACATTACAGCCTACAGTT
GAGAAACCACTGGATTTGCTAGACTTTAGTCTTGTTCGCATACATATGATTCTGACTTTCTCGTGCATATTTGGAACCCATTGCCAACTTTGCGAATGCAGGAAGACAAA
ACGCAGCAGTATGGATCTAAACAGCAGAGTTAAACAGCCAATTAGTTAGATCATAGAAGGGGCAATAGAGTAGGCTTATGAACATATGTGAGCAAAAGAAAGAAGAAACC
ACCTCAGAGTTGTAGAACTTGGGGGGTCCTTTTGCAACACCAAAATTTTCTGGATCCAACAGGTTCACTGGCTGTTTGAAGTCGACTGATGGACCGTCGGTTTAACAGAG
CATGAACCCATTGCGCCAGTGAGTTTGGATCATTTAGCAGGGGAAAGAAAGGCATTAGAGAGAACTCCAGATTGTGCAGAAGCATAAAAGTAAAGAATTTTGGAAATGAA
ACAAAGCTTTAAATGGTTGTGGTTGGGTTTGCCTTGGATATGAAGGAACAGAGGTCCAAGCATAGTTTACAGAGCCTTTGAATATGTCACGACAGAGAGTTATGGTATCT
TCCATGGCGAAATTGTGGAGCCATATGCTGTCCGCCGGAGTTGACATTACCCCGCCGGGCTTCAAAGCCTTTGCTATTGATTTAAGTAAGGCTTCATTTGAAAGCTCTGT
TGCATAAGCCCCTGATATCCATATTCAATTATATAGGTTTTAACTTCACTTTGTATAGTGAAAACAAATATCATGATTGGTACTGTTCGGTTGTCAGATTTTATTTTTTT
T
Protein sequenceShow/hide protein sequence
MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYILILGLPLCLFYSWLSSFLLHKGLLDSVFGVSLNRRQC
LLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSMSEQSYQSVKLIVVVATLYSMWCASQIYWASPRRPAVG
EEADLGVLVFLVVYFSLPHYLCIKSMQPKDLEIKHLPL