; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0072921 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0072921
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionTranscription factor SCREAM2-like isoform X2
Genome locationCMiso1.1chr03:20286748..20288317
RNA-Seq ExpressionCmc03g0072921
SyntenyCmc03g0072921
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138402.1 uncharacterized protein LOC101218918 isoform X2 [Cucumis sativus]1.2e-7799.37Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
        MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEE IDAQTVKEAVVQAIKSWSQNGEQD
Subjt:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

XP_011656425.1 uncharacterized protein LOC101218918 isoform X1 [Cucumis sativus]2.9e-7698.75Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM-VTVERVVKGFSINVFSEKSCQGLLVSIL
        MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM VTVERVVKGFSINVFSEKSCQGLLVSIL
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM-VTVERVVKGFSINVFSEKSCQGLLVSIL

Query:  EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEE IDAQTVKEAVVQAIKSWSQNGEQD
Subjt:  EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

XP_016901983.1 PREDICTED: uncharacterized protein LOC103496601 [Cucumis melo]6.4e-7698.11Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
        MVSREHKKAAALHDNLQLLRSITNS   NKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
Subjt:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]1.4e-6790.8Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNS---NPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLV
        MVSREHKK A LH+ LQLLRSITNSH+ LNKASIIVDASKYIEELKQKVERLNQDISTVQNS   NPLSHQYSPMVTVER+VKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNS---NPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        SILEVFEELGLNVIEARVSCT TFQLQAI EIEEEGEEAIDAQ VKEAVVQAIKSW Q+GEQD
Subjt:  SILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]5.8e-6991.36Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNS---NPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVS
        MVSREHKK A LH+ LQLLRSITNSH+LNKASIIVDASKYIEELKQKVERLNQDISTVQNS   NPLSHQYSPMVTVER+VKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNS---NPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        ILEVFEELGLNVIEARVSCT TFQLQAI EIEEEGEEAIDAQ VKEAVVQAIKSW Q+GEQD
Subjt:  ILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

TrEMBL top hitse value%identityAlignment
A0A0A0K8N7 Uncharacterized protein1.4e-7698.75Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM-VTVERVVKGFSINVFSEKSCQGLLVSIL
        MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM VTVERVVKGFSINVFSEKSCQGLLVSIL
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM-VTVERVVKGFSINVFSEKSCQGLLVSIL

Query:  EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEE IDAQTVKEAVVQAIKSWSQNGEQD
Subjt:  EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

A0A1S4E185 uncharacterized protein LOC1034966013.1e-7698.11Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
        MVSREHKKAAALHDNLQLLRSITNS   NKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
Subjt:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X24.3e-6284.91Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
        MVSREHKK AALH+ LQLLRSITNSH+LNK SIIVDASKYIEELKQKVERLNQDI+TVQNS    H   PMVTVE +VKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
         FEELGLNV+EARVSCT TFQLQA  EIEE+GEEA+DAQ VKEAVV+AIKSWSQNGEQD
Subjt:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X24.3e-6284.91Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
        MVSREHKK AALH+ LQLLRSITNSH+LNK SIIVDASKYIEELKQKVERLNQDI+TVQNS    H   PMVTVE +VKGFSINVFSEKSCQGLLVSILE
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
         FEELGLNV+EARVSCT TFQLQA  EIEE+GEEA+DAQ VKEAVV+AIKSWSQNGEQD
Subjt:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X11.1e-6084.38Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM-VTVERVVKGFSINVFSEKSCQGLLVSIL
        MVSREHKK AALH+ LQLLRSITNSH+LNK SIIVDASKYIEELKQKVERLNQDI+TVQNS    H   PM VTVE +VKGFSINVFSEKSCQGLLVSIL
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPM-VTVERVVKGFSINVFSEKSCQGLLVSIL

Query:  EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD
        E FEELGLNV+EARVSCT TFQLQA  EIEE+GEEA+DAQ VKEAVV+AIKSWSQNGEQD
Subjt:  EVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD

SwissProt top hitse value%identityAlignment
Q10S44 Transcription factor BHLH33.0e-0425.77Show/hide
Query:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDI---------------STVQNSNPLSHQYSPMVTVERVVKGFSINVFSE
        E ++   L+D L +LRSI    S +++ SI+ D   Y++EL ++++ L ++I               S+  N+N +  + S    VE    G   N   E
Subjt:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDI---------------STVQNSNPLSHQYSPMVTVERVVKGFSINVFSE

Query:  KSC---QGLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQA
          C    G+L+S +   E LGL + +  VSC   F +QA    E+   + +    +K+ + ++
Subjt:  KSC---QGLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQA

Q9LPW3 Transcription factor SCREAM25.2e-0427.21Show/hide
Query:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYS---------------------------PMVTVE-
        E ++   L+D L +LRS+    S +++ASI+ DA  Y++EL Q++  L+ ++ +   S+   H  +                           P V V  
Subjt:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYS---------------------------PMVTVE-

Query:  RVVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTHTFQL
        R  K  +I++F  +   GLL+S +   + LGL+V +A +SC + F L
Subjt:  RVVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTHTFQL

Q9LSL1 Transcription factor bHLH931.9e-0627.04Show/hide
Query:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSH-----------------QYSPMVTVERVVKGFSINVF
        E ++   L+D L +LRSI    S +++ SI+ DA  Y++EL  K+ +L  +   + NSN   H                 + SP   ++R  +   +++ 
Subjt:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSH-----------------QYSPMVTVERVVKGFSINVF

Query:  SEKSCQGLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAV
              GLL+S +   E LGL + +  +SC   F LQA      E  + I ++ +K+A+
Subjt:  SEKSCQGLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAV

Q9LXA9 Transcription factor bHLH613.3e-0627.74Show/hide
Query:  EHKKAAALHDNLQLLRSIT-NSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGF--------SINVFSEKSC---Q
        E ++   L+D L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + +++ L    S ++T E +V+           +N   +  C    
Subjt:  EHKKAAALHDNLQLLRSIT-NSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGF--------SINVFSEKSC---Q

Query:  GLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQ
        GL+VS +   E LGL + +  +SC   F LQA      E    + ++  K+A+++
Subjt:  GLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQ

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein1.4e-0933.07Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITN-SHSLNKASIIV-DASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSI
        MV+ E KK A+       L+++T+   S+++ S+++ +A  YI  LK ++E L ++   ++ +   S      V VE++ + F + + S +  +  LV+I
Subjt:  MVSREHKKAAALHDNLQLLRSITN-SHSLNKASIIV-DASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSI

Query:  LEVFEELGLNVIEARVSCTHTFQLQAI
        LE FEE+GLNV +AR SC  +F ++AI
Subjt:  LEVFEELGLNVIEARVSCTHTFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)3.5e-4057.79Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
        MVSRE K+  +L +  QLLRSITNSH+ N  SII+DASKYI++LKQKVER NQD +  Q+S+  +   +PMVTVE + KGF INVFS K+  G+LVS+LE
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQ
         FE++GLNV+EAR SCT +F L A+G +E E  E +DA+ VK+AV  AI+SW +
Subjt:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQ

AT3G56220.1 transcription regulators1.5e-3854.19Show/hide
Query:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE
        MVSREHK+ ++L +   LLRSIT+SH+ ++ SIIVDASKYI++LKQKVE++N + +T + S   S   +PMVTVE + KGF I V S K+  G+LV +LE
Subjt:  MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILE

Query:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQN
         FE+LGL+V+EARVSCT TF L AIG    +  + IDA+ VK+AV +AI++WS +
Subjt:  VFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQN

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.3e-0727.74Show/hide
Query:  EHKKAAALHDNLQLLRSIT-NSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGF--------SINVFSEKSC---Q
        E ++   L+D L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + +++ L    S ++T E +V+           +N   +  C    
Subjt:  EHKKAAALHDNLQLLRSIT-NSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGF--------SINVFSEKSC---Q

Query:  GLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQ
        GL+VS +   E LGL + +  +SC   F LQA      E    + ++  K+A+++
Subjt:  GLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQ

AT5G65640.1 beta HLH protein 931.4e-0727.04Show/hide
Query:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSH-----------------QYSPMVTVERVVKGFSINVF
        E ++   L+D L +LRSI    S +++ SI+ DA  Y++EL  K+ +L  +   + NSN   H                 + SP   ++R  +   +++ 
Subjt:  EHKKAAALHDNLQLLRSITNSHS-LNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSH-----------------QYSPMVTVERVVKGFSINVF

Query:  SEKSCQGLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAV
              GLL+S +   E LGL + +  +SC   F LQA      E  + I ++ +K+A+
Subjt:  SEKSCQGLLVSILEVFEELGLNVIEARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGCTGCAGCTCTTCATGACAACCTCCAATTACTTCGCTCTATTACCAACTCTCATTCTCTAAACAAGGCGTCGATTATAGTGGATGC
ATCGAAATATATCGAGGAGCTAAAACAAAAAGTAGAAAGATTGAATCAAGATATATCCACCGTTCAAAATTCAAATCCACTTTCTCATCAATATTCTCCCATGGTAACAG
TGGAAAGGGTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAAAAGAGTTGTCAAGGTCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTTGGACTTAATGTTATT
GAAGCTAGGGTTTCTTGTACTCATACTTTCCAATTACAAGCTATTGGAGAAATTGAGGAAGAAGGAGAAGAAGCCATTGATGCTCAAACTGTGAAAGAAGCAGTAGTTCA
AGCAATAAAGAGCTGGAGCCAAAATGGTGAACAAGATTAA
mRNA sequenceShow/hide mRNA sequence
TATACATATTGATTATGCAGTGTCATGTGGGAAGCTAACAAACAGAAAAAAGATCATATAAATTTCAGCCTCCTCCTCATCATACCCTTATAAAACCACACCCCTAACTT
TTTTTTTTTTTCATTTTTGTAGATCATATATTACAAAAAAGAAAAAAAGAAAGAAAGAAAAAGAAAGATATAAAGAGAAACAAAATATTGAAAGAGTTGAATCCATGGTT
TCTAGAGAGCACAAGAAGGCTGCAGCTCTTCATGACAACCTCCAATTACTTCGCTCTATTACCAACTCTCATTCTCTAAACAAGGCGTCGATTATAGTGGATGCATCGAA
ATATATCGAGGAGCTAAAACAAAAAGTAGAAAGATTGAATCAAGATATATCCACCGTTCAAAATTCAAATCCACTTTCTCATCAATATTCTCCCATGGTAACAGTGGAAA
GGGTAGTAAAGGGATTTTCTATAAATGTATTTTCAGAAAAGAGTTGTCAAGGTCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTTGGACTTAATGTTATTGAAGCT
AGGGTTTCTTGTACTCATACTTTCCAATTACAAGCTATTGGAGAAATTGAGGAAGAAGGAGAAGAAGCCATTGATGCTCAAACTGTGAAAGAAGCAGTAGTTCAAGCAAT
AAAGAGCTGGAGCCAAAATGGTGAACAAGATTAATAAATCTAATCTCTACAAGGAGAATTAATTATTCCCCCCCTCCAGAAGAAAACCCTTAATTTCTCCAACTTGTTTT
AATCGCGGCGTTTTTTGTTAATTATTATTATTATCTCTATCTGTTTGTAATGTGGTAAAAATTAATCAATGGAAAAACCGATTGATTAATGTTCTTCCAATTAAATGGAA
AGATAATAATAACCAAATATATTCTTTTTTTAAAAGAAAT
Protein sequenceShow/hide protein sequence
MVSREHKKAAALHDNLQLLRSITNSHSLNKASIIVDASKYIEELKQKVERLNQDISTVQNSNPLSHQYSPMVTVERVVKGFSINVFSEKSCQGLLVSILEVFEELGLNVI
EARVSCTHTFQLQAIGEIEEEGEEAIDAQTVKEAVVQAIKSWSQNGEQD