; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC11G209680 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC11G209680
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionTranscription factor bHLH61-like protein
Genome locationCmU531Chr11:6132421..6133534
RNA-Seq ExpressionCmUC11G209680
SyntenyCmUC11G209680
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011656425.1 uncharacterized protein LOC101218918 isoform X1 [Cucumis sativus]6.8e-6589.02Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL
        MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEELKQKVERLNQDISTV+NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLL
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL

Query:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        VSILEVFEELGLNVIEARVSCT +FQLQAIGEIEEEGEE IDAQ+VKEAVVQAIKSWSQ+GEQD
Subjt:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida]6.8e-7393.25Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        SILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]4.8e-7192.64Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        SILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]1.7e-7191.98Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]1.2e-6991.36Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

TrEMBL top hitse value%identityAlignment
A0A0A0K8N7 Uncharacterized protein3.3e-6589.02Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL
        MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEELKQKVERLNQDISTV+NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLL
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL

Query:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        VSILEVFEELGLNVIEARVSCT +FQLQAIGEIEEEGEE IDAQ+VKEAVVQAIKSWSQ+GEQD
Subjt:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1FG89 uncharacterized protein LOC111445214 isoform X16.2e-6485.8Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREH  A+LH  LQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+ SIH        PMQVTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTDSFQLQAI EIEE+GEEAIDAQSVKEAVVQAIK WSQSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X14.3e-6586.42Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEELKQKVERLNQDI+TV+NSIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD+FQLQA  EIEE+GEEA+DAQ+VKEAVV+AIKSWSQ+GEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X14.3e-6586.42Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEELKQKVERLNQDI+TV+NSIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD+FQLQA  EIEE+GEEA+DAQ+VKEAVV+AIKSWSQ+GEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1K1N3 uncharacterized protein LOC111489817 isoform X12.8e-6486.42Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREH  AALH  LQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+ SIH        PMQVTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTDSFQLQAI EIEEEGEEAIDAQ+VKEAVVQAIK WSQSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

SwissProt top hitse value%identityAlignment
Q10S44 Transcription factor BHLH33.1e-0423.31Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVE------NSIHPNPISHHPPMQV---TVERLVKGFSINVFSE
        +++   ++  L+++L +LRSI    S++++ SI+ D   Y++EL ++++ L ++I          N++  +   ++  M V   T   +    S N   E
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVE------NSIHPNPISHHPPMQV---TVERLVKGFSINVFSE

Query:  KSC---QGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQA
          C    G+L+S +   E LGL + +  VSC   F +QA    E+   + +    +K+ + ++
Subjt:  KSC---QGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQA

Q9LPW3 Transcription factor SCREAM21.3e-0522.86Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTV---ENSIH-----------------------PNPISHHPPMQ
        +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ +     +S+H                       P+P    P ++
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTV---ENSIH-----------------------PNPISHHPPMQ

Query:  VTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVV
        V + R  K  +I++F  +   GLL+S +   + LGL+V +A +SC + F L      + + +  +  + +K  ++
Subjt:  VTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVV

Q9LSE2 Transcription factor ICE16.7e-0727.91Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ
        +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ ST   S+ P   S HP                           Q
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ

Query:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ
          VE RL +G ++N+  F  +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Subjt:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ

Q9LSL1 Transcription factor bHLH931.9e-0625.15Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERL---NQDISTVENSIHP------------NPISHHPPMQVTVERLVKGFS
        +++   ++  L+++L +LRSI    S++++ SI+ DA  Y++EL  K+ +L    Q++    NS H              P+  + P +  ++R  +   
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERL---NQDISTVENSIHP------------NPISHHPPMQVTVERLVKGFS

Query:  INVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAV
        +++       GLL+S +   E LGL + +  +SC   F LQA      E  + I ++ +K+A+
Subjt:  INVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAV

Q9LXA9 Transcription factor bHLH613.9e-0725.32Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI    +++++ SI+ DA  Y++EL  K+ +L +D   + ++ H +  I++   ++ +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA      E    + +++ K+A+++
Subjt:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein5.4e-1234.88Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIV-DASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MV+ E KK A   K   L+++T+    +++ S+++ +A  YI  LK ++E L ++   ++ +      S H   +V VE++ + F + + S +  +  LV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIV-DASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAI
        +ILE FEE+GLNV +AR SC DSF ++AI
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)3.6e-4059.87Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSRE K+ +L EK QLLRSITNSH++ N  SII+DASKYI++LKQKVER NQD +  ++S    P     PM VTVE L KGF INVFS K+  G+LVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQ
        +LE FE++GLNV+EAR SCTDSF L A+G +E E  E +DA++VK+AV  AI+SW +
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQ

AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein4.8e-0827.91Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ
        +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ ST   S+ P   S HP                           Q
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ

Query:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ
          VE RL +G ++N+  F  +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Subjt:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ

AT3G56220.1 transcription regulators2.4e-3655.35Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHK+ ++L EK  LLRSIT+SH++ ++ SIIVDASKYI++LKQKVE++N + +T E S      S  P   VTVE L KGF I V S K+  G+LV
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQS
         +LE FE+LGL+V+EARVSCTD+F L AIG    +  + IDA++VK+AV +AI++WS S
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.8e-0825.32Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI    +++++ SI+ DA  Y++EL  K+ +L +D   + ++ H +  I++   ++ +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA      E    + +++ K+A+++
Subjt:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATGC
ATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATCCTCCCATGC
AGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTT
AATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGC
AGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATGC
ATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATCCTCCCATGC
AGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTT
AATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGC
AGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAA
Protein sequenceShow/hide protein sequence
MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGL
NVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD