; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC05G087780 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC05G087780
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionUnknown protein
Genome locationCicolChr05:5541645..5552217
RNA-Seq ExpressionCcUC05G087780
SyntenyCcUC05G087780
Gene Ontology termsGO:0000124 - SAGA complex (cellular component)
InterPro domainsIPR037804 - SAGA-associated factor 73


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008438795.1 PREDICTED: uncharacterized protein LOC103483792 isoform X2 [Cucumis melo]1.4e-14587.74Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS
        MV SVGNGRMAVMTRL+AAGSFSRTIA    EEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS

Query:  LGFGQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHP
        LGFGQGTIMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSES  AD+SAAPA PINNQFEM+KLTKRNSTC VAPILDDGTG CSGV   AAS +HP
Subjt:  LGFGQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHP

Query:  STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIG
        STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM D Q+TKE+IK FH+TS+EE SQEQTSD+IG
Subjt:  STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIG

Query:  KKDRLDNSQV
         K  +DN  +
Subjt:  KKDRLDNSQV

XP_022979840.1 uncharacterized protein LOC111479416 isoform X3 [Cucurbita maxima]1.4e-14588.93Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
        MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQK ASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLV CNICKKPVKASQYIIHSELCRSLG G
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG

Query:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR
        QGTIMDLD GMGHRKHSRKEKKKLL ADAN SAVEKEGSES YADYS+A  FPI+N+FEMVKLTKRNSTCTVAPILDD  GVC GVVDH+ S +HPSTKR
Subjt:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR

Query:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK
        SKLITGEGLLLASDLEPSS+KTKI+N PFPLASKIYYSQRNNRLRS L YLYWEAV+SSKEICNM DH +TKE+IKQFH+TSQEE SQEQ+SD+IGKK
Subjt:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK

XP_023528790.1 uncharacterized protein LOC111791619 isoform X4 [Cucurbita pepo subsp. pepo]1.7e-14690.27Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
        MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQK ASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLV CNICKKPVKASQYIIHSELCRSLG G
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG

Query:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR
        QGTIMDLD GMGHRKHSRKEKKKLL ADANISAVEKEGSES YADYS+AP FPI+N+FEMVKLTKRNSTCTVAPILDD  GVC GVVDH+AS +HPSTKR
Subjt:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR

Query:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK
        SKLITGEGLLLASDLEPSS+KTKIRN PFPLASKIYYSQRNNRLRS L YLYWEAV+SSKEICNM DH + KE+IKQFH+TSQEE SQEQ+SDIIGKK
Subjt:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK

XP_038900665.1 uncharacterized protein LOC120087804 isoform X2 [Benincasa hispida]8.4e-15493.38Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS
        MVCSVGNGRMAVMTRLLAAGSFSR+IA    EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS

Query:  LGFGQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHP
        LGFGQGTIMDLDGGMGHRKHSRKEKKK+L  DANISAVEKEGSES YA+YS APAFPINNQFEMVKLTKRNSTCTVA ILDD TGVCS VVDHAASL+HP
Subjt:  LGFGQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHP

Query:  STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIG
        STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM DHQ+TKE+IK FHNTSQEESSQEQTSDIIG
Subjt:  STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIG

Query:  KK
         K
Subjt:  KK

XP_038900699.1 uncharacterized protein LOC120087804 isoform X3 [Benincasa hispida]1.5e-15594.63Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
        MVCSVGNGRMAVMTRLLAAGSFSR+IAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG

Query:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR
        QGTIMDLDGGMGHRKHSRKEKKK+L  DANISAVEKEGSES YA+YS APAFPINNQFEMVKLTKRNSTCTVA ILDD TGVCS VVDHAASL+HPSTKR
Subjt:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR

Query:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK
        SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM DHQ+TKE+IK FHNTSQEESSQEQTSDIIG K
Subjt:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK

TrEMBL top hitse value%identityAlignment
A0A1S3AX95 uncharacterized protein LOC103483792 isoform X27.0e-14687.74Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS
        MV SVGNGRMAVMTRL+AAGSFSRTIA    EEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRS

Query:  LGFGQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHP
        LGFGQGTIMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSES  AD+SAAPA PINNQFEM+KLTKRNSTC VAPILDDGTG CSGV   AAS +HP
Subjt:  LGFGQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHP

Query:  STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIG
        STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM D Q+TKE+IK FH+TS+EE SQEQTSD+IG
Subjt:  STKRSKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIG

Query:  KKDRLDNSQV
         K  +DN  +
Subjt:  KKDRLDNSQV

A0A1S3AXZ3 uncharacterized protein LOC103483792 isoform X31.2e-14588.27Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
        MV SVGNGRMAVMTRL+AAGSFSRTIAEEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG

Query:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR
        QGTIMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSES  AD+SAAPA PINNQFEM+KLTKRNSTC VAPILDDGTG CSGV   AAS +HPSTKR
Subjt:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR

Query:  SKLITGEGLLLASDLEPSSAKTKIR-NVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKD
        SKLITGEGLLLASDLEPSSAKTKIR +VPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM D Q+TKE+IK FH+TS+EE SQEQTSD+IG K 
Subjt:  SKLITGEGLLLASDLEPSSAKTKIR-NVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKD

Query:  RLDNSQV
         +DN  +
Subjt:  RLDNSQV

A0A6J1GV02 uncharacterized protein LOC111457430 isoform X22.7e-14588.59Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
        MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQK ASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLV CNICKKPVKASQYIIHSELCRSLG G
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG

Query:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR
        QG IMDLD GMGHRKHSRKEKKKLL ADANIS  EKEGSES YADYS+AP  PI+N+FEMVKLTKRNSTCTVAPILDD  GVC GVVDH+AS +HPS+KR
Subjt:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR

Query:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK
        SKLITG+GLLLASDLEPSS+KTKIRN PFPLASKIYYSQRNNRLRS L YLYWEAV+SSKEICNM DH +TKE+IKQFH+TSQEE SQEQ+SDIIGKK
Subjt:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK

A0A6J1IRX4 uncharacterized protein LOC111479416 isoform X37.0e-14688.93Show/hide
Query:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG
        MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQK ASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLV CNICKKPVKASQYIIHSELCRSLG G
Subjt:  MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFG

Query:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR
        QGTIMDLD GMGHRKHSRKEKKKLL ADAN SAVEKEGSES YADYS+A  FPI+N+FEMVKLTKRNSTCTVAPILDD  GVC GVVDH+ S +HPSTKR
Subjt:  QGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKR

Query:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK
        SKLITGEGLLLASDLEPSS+KTKI+N PFPLASKIYYSQRNNRLRS L YLYWEAV+SSKEICNM DH +TKE+IKQFH+TSQEE SQEQ+SD+IGKK
Subjt:  SKLITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK

A0A6J1IXG6 uncharacterized protein LOC111479416 isoform X53.8e-14488.51Show/hide
Query:  SVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFGQGT
        +VGNGRMAVMTRLLAAGSFSRTIAEEVGHQK ASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLV CNICKKPVKASQYIIHSELCRSLG GQGT
Subjt:  SVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFGQGT

Query:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKRSKL
        IMDLD GMGHRKHSRKEKKKLL ADAN SAVEKEGSES YADYS+A  FPI+N+FEMVKLTKRNSTCTVAPILDD  GVC GVVDH+ S +HPSTKRSKL
Subjt:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKRSKL

Query:  ITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKD
        ITGEGLLLASDLEPSS+KTKI+N PFPLASKIYYSQRNNRLRS L YLYWEAV+SSKEICNM DH +TKE+IKQFH+TSQEE SQEQ+SD+IGKKD
Subjt:  ITGEGLLLASDLEPSSAKTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATGCTCAGTTGGAAATGGGAGAATGGCAGTGATGACAAGGCTTCTGGCTGCTGGGAGTTTCTCTCGTACTATTGCAGAGGAAGTTGGTCACCAGAAATTTGCTTC
TGAATTTATCTGCCGAGAACTTCGTGATGCAGATGAAGCAAATTTAATTGATGAGGAAGATATGCACGTTTTTGGTTTGAAGCCTATGGTTGATCCTCTGAACTTGGTTT
GCTGCAATATTTGTAAGAAGCCAGTAAAGGCCAGTCAATATATCATTCATTCAGAACTTTGCAGGTCATTAGGTTTTGGACAAGGAACTATAATGGACCTTGATGGTGGG
ATGGGTCATAGAAAACACTCAAGGAAGGAGAAGAAAAAGTTACTACTTGCTGATGCTAATATATCAGCTGTGGAGAAAGAAGGGTCTGAATCAATATATGCTGACTATTC
TGCTGCACCTGCATTTCCAATTAATAACCAATTTGAAATGGTCAAGTTGACAAAAAGAAATTCAACTTGTACTGTGGCACCTATACTGGATGATGGTACAGGAGTCTGTT
CTGGTGTTGTAGACCATGCAGCTAGTCTCATGCATCCTTCGACAAAGCGGTCCAAATTGATAACTGGTGAAGGGCTGTTACTGGCATCTGATTTAGAACCATCGTCAGCT
AAAACAAAAATTAGAAATGTTCCGTTTCCCCTTGCAAGTAAAATATATTACTCTCAGAGAAATAATCGTCTGCGCTCGGCTCTTGGTTATCTTTACTGGGAGGCTGTTGC
ATCTAGCAAGGAAATTTGTAATATGGGGGATCATCAAATAACAAAGGAAGATATAAAACAATTTCACAATACTTCCCAGGAGGAGTCGTCTCAAGAACAAACAAGTGACA
TTATTGGAAAGAAGGATAGACTAGACAACAGCCAAGTGGAAGTATTCCTGTTGAATAGAAGTCTTGGTGGCACTTTTTTTGTAGGCACAGTGGTACATCAGCACAGACGT
TTCGCACCAATGTCATGTGCTCCGCGTTACTGGCCAACATTTATGGCTGATATTTCAATGCGGAGGAAAAAGAAGATGAAGAAATATCTGCCATTTTCGTCACTGGGCAA
GCTTTTAAAAGCGACTTGGCTAGGGCATTACCAGTCTGGTTTACTTGGGGAAATACAGCTCGCTGCAAAAGGTTAA
mRNA sequenceShow/hide mRNA sequence
TTTCTTTCTGATTTGGCAGTTGCCAGTGCCACCTCACATTTTCTTCTCTTTGCGCGGCGAGAGAGAAAAGACTCTCTCTTCCGGTTCAAATATTTCAGATAAGAAAACGA
ACCAAAAACACTGCCAGAGAAAGAGAGAGAGAGAGAAATTCCCGCTCTCTTTTCTCTGTATTGCTTCCTCCGTAGGGTTTCTTCTTTCCTCTTTCACTGTTTTTTCTCCT
CTTCTTCATGCTATTCGCCCTTCACTTCCGCAGCATTTCTCTCAAATTTCTCTAAATTCGTCTCAGCAGTTGATTCGAATTTCTCCACTTTCTCGCTGGTTTCATGCGCC
TGGATGATTGGAGGAGCTTTCGTGCTGCGGTGTAGAACCTCTTCCTTCCAGTACATAATACATTTGCTTAGTCTCAATGGTATGCTCAGTTGGAAATGGGAGAATGGCAG
TGATGACAAGGCTTCTGGCTGCTGGGAGTTTCTCTCGTACTATTGCAGAGGAAGTTGGTCACCAGAAATTTGCTTCTGAATTTATCTGCCGAGAACTTCGTGATGCAGAT
GAAGCAAATTTAATTGATGAGGAAGATATGCACGTTTTTGGTTTGAAGCCTATGGTTGATCCTCTGAACTTGGTTTGCTGCAATATTTGTAAGAAGCCAGTAAAGGCCAG
TCAATATATCATTCATTCAGAACTTTGCAGGTCATTAGGTTTTGGACAAGGAACTATAATGGACCTTGATGGTGGGATGGGTCATAGAAAACACTCAAGGAAGGAGAAGA
AAAAGTTACTACTTGCTGATGCTAATATATCAGCTGTGGAGAAAGAAGGGTCTGAATCAATATATGCTGACTATTCTGCTGCACCTGCATTTCCAATTAATAACCAATTT
GAAATGGTCAAGTTGACAAAAAGAAATTCAACTTGTACTGTGGCACCTATACTGGATGATGGTACAGGAGTCTGTTCTGGTGTTGTAGACCATGCAGCTAGTCTCATGCA
TCCTTCGACAAAGCGGTCCAAATTGATAACTGGTGAAGGGCTGTTACTGGCATCTGATTTAGAACCATCGTCAGCTAAAACAAAAATTAGAAATGTTCCGTTTCCCCTTG
CAAGTAAAATATATTACTCTCAGAGAAATAATCGTCTGCGCTCGGCTCTTGGTTATCTTTACTGGGAGGCTGTTGCATCTAGCAAGGAAATTTGTAATATGGGGGATCAT
CAAATAACAAAGGAAGATATAAAACAATTTCACAATACTTCCCAGGAGGAGTCGTCTCAAGAACAAACAAGTGACATTATTGGAAAGAAGGATAGACTAGACAACAGCCA
AGTGGAAGTATTCCTGTTGAATAGAAGTCTTGGTGGCACTTTTTTTGTAGGCACAGTGGTACATCAGCACAGACGTTTCGCACCAATGTCATGTGCTCCGCGTTACTGGC
CAACATTTATGGCTGATATTTCAATGCGGAGGAAAAAGAAGATGAAGAAATATCTGCCATTTTCGTCACTGGGCAAGCTTTTAAAAGCGACTTGGCTAGGGCATTACCAG
TCTGGTTTACTTGGGGAAATACAGCTCGCTGCAAAAGGTTAAAGAATATTCAATCCGTGAATCAAATGCTTGGGTTATAATGACCTAGTTGATTTGGTAAAGACTTGAAT
GCTACGTCCAGTTAAATCGGGGTTAGGATTTGGATCTATAACTAAATTCAAATTCTTATGGTTGGAAAAATATTTAAGGTTAGTCCATGACTTTAAAGTTTGTATTTATT
TGATCTTCGAACTTTAAAAACTATTTAATAGGGCTCTTATTTATAATTGTGTCTAATAGGTCGTTCTTGTTTGATAACTATTTTATTTTTTAAAACTAAATTTATAAATA
CTACTTTTACCTTTAGTTTTCTTGCTTTATTACCTATTTTTTGAAAATTGTTTTAAAAAACCAAGCTAAAAGAAATAAATAGTTGTGAAGCTTTTGATTTTGTTTTTGAA
ATTTGGGTTAGAATTCAGATGTATATTTATGTGAAATGAAAATAAAATGAAAAGCTTCCCAAAAGAAACATCAGAATGTTAAGGAGAAATATGGGTGAATATATTTTTTT
TCTCATCATTTATTTATACTTAAAATGATTTGTTTATTTTAAAATGATGAAATTATGAATGGTTATAGGTATATATATCATATGATAGGTGGGATTCGAGAGTAGGTTGG
TTGAGTTGAGCATGCTGCATTATTTCTTTTACAAATCACTAGCAAATTATCATTAAACATAATCTTTGAAGCATAAAACCACTTTTGTAATAGGACGGGTCAAAGAATCT
CAATTTTTAAAAATTTATTATTTATTTTTATTTTAAAAATCTTTTCTTTCATTTTTTTCCTCGAT
Protein sequenceShow/hide protein sequence
MVCSVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHSELCRSLGFGQGTIMDLDGG
MGHRKHSRKEKKKLLLADANISAVEKEGSESIYADYSAAPAFPINNQFEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLMHPSTKRSKLITGEGLLLASDLEPSSA
KTKIRNVPFPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMGDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKDRLDNSQVEVFLLNRSLGGTFFVGTVVHQHRR
FAPMSCAPRYWPTFMADISMRRKKKMKKYLPFSSLGKLLKATWLGHYQSGLLGEIQLAAKG