; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC11G214330 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC11G214330
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionACT domain-containing protein
Genome locationCicolChr11:6424181..6425689
RNA-Seq ExpressionCcUC11G214330
SyntenyCcUC11G214330
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011656425.1 uncharacterized protein LOC101218918 isoform X1 [Cucumis sativus]1.3e-6388.41Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLL
        MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEELKQKVERLNQDISTV+NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLL
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLL

Query:  VSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        VSILEVFEELGLNVIEARVSCT  FQLQAIGEIEEEGEE IDAQ+VKEAV QAIKSWSQ+GEQD
Subjt:  VSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida]1.3e-7192.64Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        SILEVFEELGLNVIEARVSCTD FQLQAI EIEEEGEEAIDAQ+VKEAV QAIKSW QSGEQD
Subjt:  SILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]1.2e-6992.02Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        SILEVFEELGLNVIEARVSCTD FQLQAI EIEEEGEEAIDAQ+VKEAV QAIKSW QSGEQD
Subjt:  SILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]1.5e-7091.36Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        ILEVFEELGLNVIEARVSCTD FQLQAI EIEEEGEEAIDAQ+VKEAV QAIKSW QSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]1.4e-6890.74Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEELKQKVERLNQDISTV+NSIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        ILEVFEELGLNVIEARVSCTD FQLQAI EIEEEGEEAIDAQ+VKEAV QAIKSW QSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

TrEMBL top hitse value%identityAlignment
A0A0A0K8N7 Uncharacterized protein6.5e-6488.41Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLL
        MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEELKQKVERLNQDISTV+NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLL
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISH-HIPMQVTVERLVKGFSINVFSEKSCQGLL

Query:  VSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        VSILEVFEELGLNVIEARVSCT  FQLQAIGEIEEEGEE IDAQ+VKEAV QAIKSWSQ+GEQD
Subjt:  VSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

A0A6J1FG89 uncharacterized protein LOC111445214 isoform X12.7e-6284.57Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREH  A+LH  LQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+ SIH        PMQVTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD FQLQAI EIEE+GEEAIDAQSVKEAV QAIK WSQSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X18.4e-6485.8Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEELKQKVERLNQDI+TV+NSIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD FQLQA  EIEE+GEEA+DAQ+VKEAV +AIKSWSQ+GEQD
Subjt:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X18.4e-6485.8Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEELKQKVERLNQDI+TV+NSIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD FQLQA  EIEE+GEEA+DAQ+VKEAV +AIKSWSQ+GEQD
Subjt:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

A0A6J1K1N3 uncharacterized protein LOC111489817 isoform X11.2e-6285.19Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREH  AALH  LQLLRSITNSH+QLNKASIIVDASKYIEELKQKVERLNQDISTV+ SIH        PMQVTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD FQLQAI EIEEEGEEAIDAQ+VKEAV QAIK WSQSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQD

SwissProt top hitse value%identityAlignment
Q9LPW3 Transcription factor SCREAM25.0e-0526.71Show/hide
Query:  KKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTV---ENSIHP-NPISHHIPMQVTVE-------
        K +KK    +++++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ +     +S+HP  P    +  +V  E       
Subjt:  KKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTV---ENSIHP-NPISHHIPMQVTVE-------

Query:  ------------RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQL
                    RL +G ++N+  F  +   GLL+S +   + LGL+V +A +SC + F L
Subjt:  ------------RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQL

Q9LSE2 Transcription factor ICE12.9e-0527.87Show/hide
Query:  KKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHH------------------
        K +KK    +++++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ ST   S+ P   S H                  
Subjt:  KKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHH------------------

Query:  ---IP----MQVTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQLQAI-GEIEEEGEEAIDAQ
           +P     Q  VE RL +G ++N+  F  +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Subjt:  ---IP----MQVTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQLQAI-GEIEEEGEEAIDAQ

Q9LSL1 Transcription factor bHLH938.5e-0524.29Show/hide
Query:  REKKKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERL---NQDISTVENSIHP------------NPISHHI
        ++K K+ + +  +++++   ++  L+++L +LRSI    S++++ SI+ DA  Y++EL  K+ +L    Q++    NS H              P+  + 
Subjt:  REKKKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERL---NQDISTVENSIHP------------NPISHHI

Query:  PMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAV
        P +  ++R  +   +++       GLL+S +   E LGL + +  +SC   F LQA      E  + I ++ +K+A+
Subjt:  PMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAV

Q9LXA9 Transcription factor bHLH612.2e-0525.29Show/hide
Query:  EKKKEKKRKRVE-----SMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHIPMQVTVERLV
        E  K++  K++E     ++++   ++  L+++L LLRSI    +++++ SI+ DA  Y++EL  K+ +L +D   + ++ H +  I++   ++ +++  V
Subjt:  EKKKEKKRKRVE-----SMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHIPMQVTVERLV

Query:  KGFSINVFSEKSC---QGLLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAV
            +N   +  C    GL+VS +   E LGL + +  +SC   F LQA      E    + +++ K+A+
Subjt:  KGFSINVFSEKSC---QGLLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAV

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein1.1e-1034.11Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIV-DASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLV
        MV+ E KK A   K   L+++T+    +++ S+++ +A  YI  LK ++E L ++   ++ +      S H   +V VE++ + F + + S +  +  LV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIV-DASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDCFQLQAI
        +ILE FEE+GLNV +AR SC D F ++AI
Subjt:  SILEVFEELGLNVIEARVSCTDCFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)5.4e-3959.24Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSRE K+ +L EK QLLRSITNSH++ N  SII+DASKYI++LKQKVER NQD +  ++S    P     PM VTVE L KGF INVFS K+  G+LVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQ
        +LE FE++GLNV+EAR SCTD F L A+G +E E  E +DA++VK+AV  AI+SW +
Subjt:  ILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQ

AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.1e-0627.87Show/hide
Query:  KKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHH------------------
        K +KK    +++++   ++  L+++L +LRS+    S++++ASI+ DA  Y++EL Q++  L+ ++ ST   S+ P   S H                  
Subjt:  KKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDI-STVENSIHPNPISHH------------------

Query:  ---IP----MQVTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQLQAI-GEIEEEGEEAIDAQ
           +P     Q  VE RL +G ++N+  F  +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Subjt:  ---IP----MQVTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQLQAI-GEIEEEGEEAIDAQ

AT3G56220.1 transcription regulators7.3e-3654.32Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN---SIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQG
        MVSREHK+ ++L EK  LLRSIT+SH++ ++ SIIVDASKYI++LKQKVE++N   ++ ++   S  PNP+       VTVE L KGF I V S K+  G
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVEN---SIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQG

Query:  LLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQS
        +LV +LE FE+LGL+V+EARVSCTD F L AIG    +  + IDA++VK+AVA+AI++WS S
Subjt:  LLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.6e-0625.29Show/hide
Query:  EKKKEKKRKRVE-----SMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHIPMQVTVERLV
        E  K++  K++E     ++++   ++  L+++L LLRSI    +++++ SI+ DA  Y++EL  K+ +L +D   + ++ H +  I++   ++ +++  V
Subjt:  EKKKEKKRKRVE-----SMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERLNQDISTVENSIHPNP-ISHHIPMQVTVERLV

Query:  KGFSINVFSEKSC---QGLLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAV
            +N   +  C    GL+VS +   E LGL + +  +SC   F LQA      E    + +++ K+A+
Subjt:  KGFSINVFSEKSC---QGLLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGTCATGTGGGAAGCTAACAAACAGAAAAGATCATATAGATTTGAGGCCTCATCATACCCTTATAAAACACACCTCAAAGTTTTTTTCTTTTTTGCAAAGATATTA
CAAAAGAAAAAATATATATATAAAGAGGGAAAAAAAAAAAGAAAAGAAAAGAAAAAGGGTTGAATCCATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGC
TTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTG
AATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATATTCCCATGCAGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGT
ATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGTACTGATTGTTTCCAAT
TACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGCAGTAGCTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAA
GATTAA
mRNA sequenceShow/hide mRNA sequence
AGTTGGCAAAGTATTTTATATATTTGGTTGTAAAGAAGAGGGTGATGCAGTCATGTGGGAAGCTAACAAACAGAAAAGATCATATAGATTTGAGGCCTCATCATACCCTT
ATAAAACACACCTCAAAGTTTTTTTCTTTTTTGCAAAGATATTACAAAAGAAAAAATATATATATAAAGAGGGAAAAAAAAAAAGAAAAGAAAAGAAAAAGGGTTGAATC
CATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATG
CATCAAAATATATCGAGGAGCTAAAACAGAAAGTAGAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATATTCCCATG
CAGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCT
TAATGTTATTGAAGCTAGGGTTTCCTGTACTGATTGTTTCCAATTACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAG
CAGTAGCTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAATAATTATCAACAATCTAGAAGGAAA
Protein sequenceShow/hide protein sequence
MQSCGKLTNRKDHIDLRPHHTLIKHTSKFFSFLQRYYKRKNIYIKREKKKEKKRKRVESMVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEELKQKVERL
NQDISTVENSIHPNPISHHIPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDCFQLQAIGEIEEEGEEAIDAQSVKEAVAQAIKSWSQSGEQ
D