; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G016900 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G016900
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr14:13262143..13269461
RNA-Seq ExpressionCmoCh14G016900
SyntenyCmoCh14G016900
Gene Ontology termsGO:0000124 - SAGA complex (cellular component)
InterPro domainsIPR037804 - SAGA-associated factor 73


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582244.1 hypothetical protein SDJN03_22246, partial [Cucurbita argyrosperma subsp. sororia]7.5e-16698.37Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANIS GEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPS KRSKLITG+GLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVD GMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        IIGKK D
Subjt:  IIGKKDD

XP_022955384.1 uncharacterized protein LOC111457430 isoform X1 [Cucurbita moschata]2.1e-16899.67Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        IIGKK D
Subjt:  IIGKKDD

XP_022955387.1 uncharacterized protein LOC111457430 isoform X3 [Cucurbita moschata]2.1e-168100Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKK
        IIGKK
Subjt:  IIGKK

XP_022955388.1 uncharacterized protein LOC111457430 isoform X4 [Cucurbita moschata]1.9e-169100Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        IIGKKDD
Subjt:  IIGKKDD

XP_022979843.1 uncharacterized protein LOC111479416 isoform X5 [Cucurbita maxima]6.0e-16396.42Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQG IMDLDSGMGHRKHSRKEKKKLLPADAN S  EKEGSESTYADYSSA V PISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHS SP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPS+KRSKLITG+GLLLASDLEPSSSKTKI+NDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        +IGKKDD
Subjt:  IIGKKDD

TrEMBL top hitse value%identityAlignment
A0A6J1GTH0 uncharacterized protein LOC111457430 isoform X31.0e-168100Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKK
        IIGKK
Subjt:  IIGKK

A0A6J1GTT5 uncharacterized protein LOC111457430 isoform X49.3e-170100Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        IIGKKDD
Subjt:  IIGKKDD

A0A6J1GW46 uncharacterized protein LOC111457430 isoform X11.0e-16899.67Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        IIGKK D
Subjt:  IIGKKDD

A0A6J1IXG1 uncharacterized protein LOC111479416 isoform X13.2e-16296.09Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQG IMDLDSGMGHRKHSRKEKKKLLPADAN S  EKEGSESTYADYSSA V PISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHS SP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPS+KRSKLITG+GLLLASDLEPSSSKTKI+NDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        +IGKK D
Subjt:  IIGKKDD

A0A6J1IXG6 uncharacterized protein LOC111479416 isoform X52.9e-16396.42Show/hide
Query:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
        MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL
Subjt:  MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSEL

Query:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP
        CRSLGSGQG IMDLDSGMGHRKHSRKEKKKLLPADAN S  EKEGSESTYADYSSA V PISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHS SP
Subjt:  CRSLGSGQGIIMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASP

Query:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
        IHPS+KRSKLITG+GLLLASDLEPSSSKTKI+NDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD
Subjt:  IHPSSKRSKLITGDGLLLASDLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSD

Query:  IIGKKDD
        +IGKKDD
Subjt:  IIGKKDD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTGGATGATTGGAGGAGCTCTCGTGCAGTTGGAAATGGGAGAATGGCAGTGATGACAAGGCTTCTGGCTGCAGGGAGTTTCTCCCGAACTATTGCTGAGGAAGT
TGGTCATCAGAAATTAGCTTCTGAATTTATCTGCAGAGAACTTCGTGATGCAGATGAAGCAAATTTAATTGATGAAGAAGATATGCACGTCTTTGGTTTGAAGCCTATGG
TTGATCCTCTGAACTTGGTTTCCTGCAATATTTGTAAGAAGCCAGTAAAGGCCAGTCAATATATCATTCATTCAGAACTATGCAGGTCACTAGGTTCCGGACAAGGGATT
ATTATGGACCTCGATAGTGGGATGGGCCATAGGAAACACTCAAGGAAGGAGAAGAAAAAGTTGCTACCTGCTGATGCTAATATATCAGGTGGGGAGAAAGAAGGGTCTGA
ATCAACATATGCCGACTATTCTTCTGCACCTGTGCTTCCAATTAGTAACGAATTTGAAATGGTCAAGTTGACAAAAAGAAATTCAACTTGTACTGTGGCGCCTATACTGG
ATGATAGTGCAGGAGTCTGTCATGGTGTTGTAGACCATTCAGCTAGTCCGATACATCCTTCCTCAAAACGATCCAAATTGATAACTGGTGACGGGCTGTTACTGGCATCT
GATTTAGAACCATCGTCATCTAAAACAAAAATTAGAAATGATCCATTTCCCCTTGCAAGTAAAATATACTACTCTCAAAGAAATAATCGTTTGCGCTCGACCCTTAGTTA
TCTTTACTGGGAGGCTGTTTCATCTAGCAAGGAAATTTGTAATATGGTGGATCATGGAATGACTAAGGAAAATATAAAACAATTTCATAGTACTTCCCAGGAGGAGGAGT
CTCAAGAACAATCAAGTGATATTATTGGAAAGAAGGATGACTGA
mRNA sequenceShow/hide mRNA sequence
GAAGGAGAGAGAGACTGAGAGAAAAATTCCCGCTCTCTCTCTCTCTCTCTCTCTCTCTATATTGCTTCCTCCATAGGGTTTCCTCTTTCCGCTTCCACTGTTTTTTCTTC
TCTTCTTCATGTCATTCGGCCCTCACTTCCGCAGCATTTCTCTCAAATTTCTCTAAATTCGTTTCAGCAATTGATCCGAGTTTTCCTGCTTTCTCGCCGGTTCCATGGGT
TTGGATGATTGGAGGAGCTCTCGTGCAGTTGGAAATGGGAGAATGGCAGTGATGACAAGGCTTCTGGCTGCAGGGAGTTTCTCCCGAACTATTGCTGAGGAAGTTGGTCA
TCAGAAATTAGCTTCTGAATTTATCTGCAGAGAACTTCGTGATGCAGATGAAGCAAATTTAATTGATGAAGAAGATATGCACGTCTTTGGTTTGAAGCCTATGGTTGATC
CTCTGAACTTGGTTTCCTGCAATATTTGTAAGAAGCCAGTAAAGGCCAGTCAATATATCATTCATTCAGAACTATGCAGGTCACTAGGTTCCGGACAAGGGATTATTATG
GACCTCGATAGTGGGATGGGCCATAGGAAACACTCAAGGAAGGAGAAGAAAAAGTTGCTACCTGCTGATGCTAATATATCAGGTGGGGAGAAAGAAGGGTCTGAATCAAC
ATATGCCGACTATTCTTCTGCACCTGTGCTTCCAATTAGTAACGAATTTGAAATGGTCAAGTTGACAAAAAGAAATTCAACTTGTACTGTGGCGCCTATACTGGATGATA
GTGCAGGAGTCTGTCATGGTGTTGTAGACCATTCAGCTAGTCCGATACATCCTTCCTCAAAACGATCCAAATTGATAACTGGTGACGGGCTGTTACTGGCATCTGATTTA
GAACCATCGTCATCTAAAACAAAAATTAGAAATGATCCATTTCCCCTTGCAAGTAAAATATACTACTCTCAAAGAAATAATCGTTTGCGCTCGACCCTTAGTTATCTTTA
CTGGGAGGCTGTTTCATCTAGCAAGGAAATTTGTAATATGGTGGATCATGGAATGACTAAGGAAAATATAAAACAATTTCATAGTACTTCCCAGGAGGAGGAGTCTCAAG
AACAATCAAGTGATATTATTGGAAAGAAGGATGACTGATGTAAAACAATCGCTCTTCCTTGGAAAACAGATGGATAGTCATTCCTTAACATCTGCATGGAAATCTGACCG
TAATCTGGCCATATTCTCATCTGGGAAATGTCTCCCTGCTGGTGGTGCCTCAACTAAGTTTGTTACTGGCAGCAGTGTTGCATGGCAACAGATTGCTCCAGTTGAATTGA
CACAAAAGAAACTATCTACCTAGAGGGCTAGATCTAACTATTTTGAAGGAAAACCACTGGAAAGTAGGCAACAGCCAAGAGGAAGTGTTCCTGTTGTACAGAAGTCTAGG
TGGCACTTTTTTGTAGGCACAGTGGTACATATAATTAACCTTAAAACTTCAAGTTTGGACCAAGAATCTGTATTTCGATACTGGCATTGAAAATCATTTAAAATTATTCA
TTCATTTTCTTTGTATTTAATCTCCTCTGTTCACTGCAAAGAGTAAATATGTCCTTCACCCACTTCAAATATGATCATTTTAAGTCCCTGAATGGTACTTTTTTTCTAGA
ACCTCGGGTAGAAATCTTGTGCTG
Protein sequenceShow/hide protein sequence
MGLDDWRSSRAVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKLASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVSCNICKKPVKASQYIIHSELCRSLGSGQGI
IMDLDSGMGHRKHSRKEKKKLLPADANISGGEKEGSESTYADYSSAPVLPISNEFEMVKLTKRNSTCTVAPILDDSAGVCHGVVDHSASPIHPSSKRSKLITGDGLLLAS
DLEPSSSKTKIRNDPFPLASKIYYSQRNNRLRSTLSYLYWEAVSSSKEICNMVDHGMTKENIKQFHSTSQEEESQEQSSDIIGKKDD