; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG01G013000 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG01G013000
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description1-deoxy-D-xylulose-5-phosphate synthase
Genome locationCG_Chr01:26127088..26131664
RNA-Seq ExpressionClCG01G013000
SyntenyClCG01G013000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019828.1 hypothetical protein SDJN02_18792 [Cucurbita argyrosperma subsp. argyrosperma]9.5e-13891.09Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FY+LILGLPLCLFY+WLSS LL K LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGVSLNRRQCL LISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKSI +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPAVGEEADLGVLVFL VYFFLPHYLC+KSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

XP_004152468.1 uncharacterized protein LOC101221499 isoform X1 [Cucumis sativus]4.6e-14093.41Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMALTTL+DGRFSPHHTL YT+NAFFGPDIGSFSDWLSSVLGFSASS+PD IHHP+FY+LILGLPLCLFYSWLSSFLLHK LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKS KSIS+QSYQSVKL+VVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFLV YFFLPHYLC+KSMQPKD E KHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

XP_022923863.1 uncharacterized protein LOC111431454 [Cucurbita moschata]3.3e-13891.47Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FY+LILGLPLCLFY+WLSS LL K LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKSI +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPAVGEEADLGVLVFL VYFFLPHYLC+KSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

XP_023519820.1 uncharacterized protein LOC111783152 [Cucurbita pepo subsp. pepo]1.5e-13891.09Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FY+LILGLPLCLFY+WLSS LLHK LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+V+GFLCTCLIGGFVYINRVKSAKSI +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYW +PRRPAVGEEADLGVLVFL VYFFLPHYLC+KSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

XP_038893660.1 uncharacterized protein LOC120082528 isoform X1 [Benincasa hispida]2.3e-13993.02Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSG ALTTLTDGRFSPHHTL+YT+NAFFGPDIGSFSDWLSSVLGFS SS+PDAIHHP+FY+LILGLPLCLFYSWLSSFLL K LL+SV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
         GVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINR KSAKSIS+QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFLV YFFLPHYLC+KSMQPKDLEIKHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

TrEMBL top hitse value%identityAlignment
A0A0A0LR72 Uncharacterized protein2.2e-14093.41Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMALTTL+DGRFSPHHTL YT+NAFFGPDIGSFSDWLSSVLGFSASS+PD IHHP+FY+LILGLPLCLFYSWLSSFLLHK LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKS KSIS+QSYQSVKL+VVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFLV YFFLPHYLC+KSMQPKD E KHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

A0A1S3AWH9 uncharacterized protein LOC103483387 isoform X13.3e-13691.47Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTL Y++NAFFGPDIGSFSDWLSSVLGF ASSLPDAIHHP+FY+LILGLPLCLFYSWLSSFLL K LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
         GVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSS YTWILSTGWWENRAPINPDAV VVGFLC CLIGGFVYINRVKS KSIS+Q +QSVKL+VVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEAD GVLVFLV YFFLPHYLC+KSMQPKD E KHLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

A0A6J1C7B1 uncharacterized protein LOC1110090544.5e-13389.15Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMAL TL+DGRFSPHH LVYT+NAFFGPDIGSFS+WL+S+LGFS SS+PDAIHHPLFY+LILGLPLCLFYS+LSS LL K  LDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGVSLN RQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWW NRAPINPDAV++VGFLC CLIGGFVYINRVKSAKSI +QSYQSVKLIVVVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWASPRRPAVGEEADLGVLVFL  YFFLPHYLC+KSM PKDLEI+ LPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

A0A6J1EAR6 uncharacterized protein LOC1114314541.6e-13891.47Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FY+LILGLPLCLFY+WLSS LL K LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKSI +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPAVGEEADLGVLVFL VYFFLPHYLC+KSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

A0A6J1KN89 uncharacterized protein LOC1114950333.0e-13790.31Show/hide
Query:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV
        MPGPGPHMLYAMGSGMAL +L+DGRFSPHHTL+YT+NAFFGPDIGSFS+WLSSVLGFS SS+PDAIHHP+FY+LILGLPLCLFY+WLSSFLL K LLDSV
Subjt:  MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSV

Query:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL
        FGV LN+ QCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAV+VVGFLCTCLIGGFVYINRVKSAKSI +QSYQSVKLI+VVATL
Subjt:  FGVSLNRRQCLLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATL

Query:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL
        YSMWCASQIYWA+PRRPA GEEADLGVLVFL VYFFLPHYLC+KSMQPKDLEI+HLPL
Subjt:  YSMWCASQIYWASPRRPAVGEEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGGACCTGGGCCTCACATGCTCTACGCCATGGGCTCAGGCATGGCTCTGACCACTCTCACCGACGGTCGATTCAGCCCCCACCATACCCTCGTTTACACCCTCAA
CGCCTTCTTCGGCCCCGACATCGGCTCCTTCTCCGATTGGCTCTCCTCCGTTCTCGGTTTCTCAGCATCTTCCCTTCCCGACGCAATCCATCATCCCCTCTTCTACGTCC
TCATTTTGGGTCTTCCTCTCTGCCTCTTCTACTCCTGGCTCTCCTCTTTTCTACTCCACAAGGCCCTCCTCGATTCCGTCTTTGGGGTATCTCTTAATAGAAGGCAATGC
TTGTTGTTAATCTCTGCTGGTTCTTTCTCACACTTCTTTCTTGACCATTTGTTTGAGGAAAATGGGCATTCGTCAATGTACACTTGGATATTGAGCACTGGTTGGTGGGA
GAACCGAGCACCAATTAATCCAGATGCTGTTCTCGTTGTTGGATTCTTGTGCACTTGCTTAATTGGCGGCTTTGTATACATTAACAGAGTGAAGTCTGCAAAGTCGATTT
CAGAACAATCGTATCAGTCGGTAAAGCTTATCGTAGTTGTAGCTACCCTATATTCCATGTGGTGTGCAAGCCAGATATACTGGGCTAGCCCTCGTCGACCAGCTGTCGGT
GAAGAAGCCGACCTTGGAGTTTTAGTGTTTCTGGTTGTCTATTTTTTTCTACCTCATTATCTTTGTATGAAGTCCATGCAACCAAAAGATCTCGAAATCAAACATCTCCC
ATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATCAAGTTAGTGGCTTCCATTCCCTTCGGCACTACTGTGTATTGAGATCGGAAACTTGGGTTGCGATTGCGACTTCGATCGTTGAGACGGAACCATGCCTGGACCTGGGC
CTCACATGCTCTACGCCATGGGCTCAGGCATGGCTCTGACCACTCTCACCGACGGTCGATTCAGCCCCCACCATACCCTCGTTTACACCCTCAACGCCTTCTTCGGCCCC
GACATCGGCTCCTTCTCCGATTGGCTCTCCTCCGTTCTCGGTTTCTCAGCATCTTCCCTTCCCGACGCAATCCATCATCCCCTCTTCTACGTCCTCATTTTGGGTCTTCC
TCTCTGCCTCTTCTACTCCTGGCTCTCCTCTTTTCTACTCCACAAGGCCCTCCTCGATTCCGTCTTTGGGGTATCTCTTAATAGAAGGCAATGCTTGTTGTTAATCTCTG
CTGGTTCTTTCTCACACTTCTTTCTTGACCATTTGTTTGAGGAAAATGGGCATTCGTCAATGTACACTTGGATATTGAGCACTGGTTGGTGGGAGAACCGAGCACCAATT
AATCCAGATGCTGTTCTCGTTGTTGGATTCTTGTGCACTTGCTTAATTGGCGGCTTTGTATACATTAACAGAGTGAAGTCTGCAAAGTCGATTTCAGAACAATCGTATCA
GTCGGTAAAGCTTATCGTAGTTGTAGCTACCCTATATTCCATGTGGTGTGCAAGCCAGATATACTGGGCTAGCCCTCGTCGACCAGCTGTCGGTGAAGAAGCCGACCTTG
GAGTTTTAGTGTTTCTGGTTGTCTATTTTTTTCTACCTCATTATCTTTGTATGAAGTCCATGCAACCAAAAGATCTCGAAATCAAACATCTCCCATTGTGATAGATGAGA
TATTCATTTATATTAGAAAAGTCTCAATTGACAACAGTTCCCTTTTCCCACTTTGACATTACAGCCTACGGTTGAGAAACCACTGAATTTGCTCTAGACTTTAGTCTCGC
ATAATTTGTTCGCATACATATGATTCTGACTTTCTCATGCATATTTGGAACCCATTGCCAACTTTGCGAATGCAGGAAGACAAAACGCAGCAGTATGGATCTAAACAGCA
GAGTTAAACAGCCAATTAGTTAGATCATAGAAGGGGCAAGCATAGAGCATAGGCTTATGAACAAATGTGAGCAAAAGAAAGAAGAAACCACCTCAGAGTTGTAGAACTTG
GGGGGTCCTTTTGCAACACCAAAATTTTCTGGATCCAACAGGTTCACTGGCTGTTTGAAGTCGACTGATGGACCGTCGGTTGAACAGAGCATGAACCCGATTGCCCCACT
GAGTTTGGATCATTTAGCAGGGGAAAGAAAGGCATTAGAGAGAACTCCAGATTGTGCAGAAGCATAAAAGTAGAGAATTTTGGAAATGAAACAAAGCTTTAAATGGTTGT
GGTTGGGTTTGCCTTGGATATGAAGGAACCGAGGTCCAAGCATAGTTTACAGAGCCTTTGAATATGTCACGACAGAGAGTTATGGTATCTTCCATGGCGAAATTGTTGAG
CCATATGCTGTCCGCCGGAGTTGACATTACCCCGCCGGGCTTCAAAGCCTTTGCTATTGATTTAAGTAAGTCTTCATTTGAAAGCTCTGTTGCATAAGCCCCTGATATCC
ATATTCAATTATATAGGTTTTAACTTCACTTTGTATAGTGAAAACAAATATCATGATTGGTACTGTTCGGTTGTCAGATTTTTTTTTTC
Protein sequenceShow/hide protein sequence
MPGPGPHMLYAMGSGMALTTLTDGRFSPHHTLVYTLNAFFGPDIGSFSDWLSSVLGFSASSLPDAIHHPLFYVLILGLPLCLFYSWLSSFLLHKALLDSVFGVSLNRRQC
LLLISAGSFSHFFLDHLFEENGHSSMYTWILSTGWWENRAPINPDAVLVVGFLCTCLIGGFVYINRVKSAKSISEQSYQSVKLIVVVATLYSMWCASQIYWASPRRPAVG
EEADLGVLVFLVVYFFLPHYLCMKSMQPKDLEIKHLPL