; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc11G06290 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc11G06290
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionACT domain-containing protein
Genome locationClcChr11:6163681..6164904
RNA-Seq ExpressionClc11G06290
SyntenyClc11G06290
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011656425.1 uncharacterized protein LOC101218918 isoform X1 [Cucumis sativus]3.4e-6488.41Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL
        MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEE KQKVERLNQDISTV+NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLL
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL

Query:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        VSILEVFEELGLNVIEARVSCT +FQLQAIGEIEEEGEE IDAQ+VKEAVVQAIKSWSQ+GEQD
Subjt:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida]3.4e-7292.64Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEE KQKVERLNQDISTV+NSIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        SILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]3.1e-7092.02Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEE KQKVERLNQDISTV+NSIHPNP+SH + PMQVTVERLVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        SILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884498.1 uncharacterized protein LOC120075304 isoform X3 [Benincasa hispida]8.3e-7191.36Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSH+QLNKASIIVDASKYIEE KQKVERLNQDISTV+NSIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]7.7e-6990.74Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSH+ LNKASIIVDASKYIEE KQKVERLNQDISTV+NSIHPNP+SH     VTVERLVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILEVFEELGLNVIEARVSCTD+FQLQAI EIEEEGEEAIDAQ+VKEAVVQAIKSW QSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

TrEMBL top hitse value%identityAlignment
A0A0A0K8N7 Uncharacterized protein1.6e-6488.41Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL
        MVSREHKK AALH+ LQLLRSITNSHS LNKASIIVDASKYIEE KQKVERLNQDISTV+NS   NP+SH + PMQVTVER+VKGFSINVFSEKSCQGLL
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISH-HPPMQVTVERLVKGFSINVFSEKSCQGLL

Query:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        VSILEVFEELGLNVIEARVSCT +FQLQAIGEIEEEGEE IDAQ+VKEAVVQAIKSWSQ+GEQD
Subjt:  VSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1FG89 uncharacterized protein LOC111445214 isoform X14.0e-6385.19Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREH  A+LH  LQLLRSITNSH+QLNKASIIVDASKYIEE KQKVERLNQDISTV+ SIH        PMQVTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTDSFQLQAI EIEE+GEEAIDAQSVKEAVVQAIK WSQSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X12.8e-6485.8Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEE KQKVERLNQDI+TV+NSIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD+FQLQA  EIEE+GEEA+DAQ+VKEAVV+AIKSWSQ+GEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X12.8e-6485.8Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREHKKAALHEKLQLLRSITNSH+ LNK SIIVDASKYIEE KQKVERLNQDI+TV+NSIHPN      PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTD+FQLQA  EIEE+GEEA+DAQ+VKEAVV+AIKSWSQ+GEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

A0A6J1K1N3 uncharacterized protein LOC111489817 isoform X11.8e-6385.8Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSREH  AALH  LQLLRSITNSH+QLNKASIIVDASKYIEE KQKVERLNQDISTV+ SIH        PMQVTVE L KGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD
        ILE FEELGLNV+EARVSCTDSFQLQAI EIEEEGEEAIDAQ+VKEAVVQAIK WSQSGEQD
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD

SwissProt top hitse value%identityAlignment
Q9LPW3 Transcription factor SCREAM28.2e-0522.29Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTV---ENSIH-----------------------PNPISHHPPMQ
        +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++E  Q++  L+ ++ +     +S+H                       P+P    P ++
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTV---ENSIH-----------------------PNPISHHPPMQ

Query:  VTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVV
        V + R  K  +I++F  +   GLL+S +   + LGL+V +A +SC + F L      + + +  +  + +K  ++
Subjt:  VTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVV

Q9LSE2 Transcription factor ICE13.3e-0627.33Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ
        +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++E  Q++  L+ ++ ST   S+ P   S HP                           Q
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ

Query:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ
          VE RL +G ++N+  F  +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Subjt:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ

Q9LSL1 Transcription factor bHLH931.3e-0524.54Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERL---NQDISTVENSIHP------------NPISHHPPMQVTVERLVKGFS
        +++   ++  L+++L +LRSI    S++++ SI+ DA  Y++E   K+ +L    Q++    NS H              P+  + P +  ++R  +   
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERL---NQDISTVENSIHP------------NPISHHPPMQVTVERLVKGFS

Query:  INVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAV
        +++       GLL+S +   E LGL + +  +SC   F LQA      E  + I ++ +K+A+
Subjt:  INVFSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAV

Q9LXA9 Transcription factor bHLH611.9e-0624.68Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI    +++++ SI+ DA  Y++E   K+ +L +D   + ++ H +  I++   ++ +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA      E    + +++ K+A+++
Subjt:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein3.5e-1134.11Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIV-DASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MV+ E KK A   K   L+++T+    +++ S+++ +A  YI   K ++E L ++   ++ +      S H   +V VE++ + F + + S +  +  LV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIV-DASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAI
        +ILE FEE+GLNV +AR SC DSF ++AI
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAI

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)2.3e-3959.24Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS
        MVSRE K+ +L EK QLLRSITNSH++ N  SII+DASKYI++ KQKVER NQD +  ++S    P     PM VTVE L KGF INVFS K+  G+LVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVS

Query:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQ
        +LE FE++GLNV+EAR SCTDSF L A+G +E E  E +DA++VK+AV  AI+SW +
Subjt:  ILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQ

AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.4e-0727.33Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ
        +++   ++  L+++L +LRS+    S++++ASI+ DA  Y++E  Q++  L+ ++ ST   S+ P   S HP                           Q
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDI-STVENSIHPNPISHHP-------------------------PMQ

Query:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ
          VE RL +G ++N+  F  +   GLL++ ++  + LGL+V +A +SC + F L     E  +EG+E +  Q
Subjt:  VTVE-RLVKGFSINV--FSEKSCQGLLVSILEVFEELGLNVIEARVSCTDSFQLQAI-GEIEEEGEEAIDAQ

AT3G56220.1 transcription regulators1.6e-3554.72Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV
        MVSREHK+ ++L EK  LLRSIT+SH++ ++ SIIVDASKYI++ KQKVE++N + +T E S      S  P   VTVE L KGF I V S K+  G+LV
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLV

Query:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQS
         +LE FE+LGL+V+EARVSCTD+F L AIG    +  + IDA++VK+AV +AI++WS S
Subjt:  SILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQS

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.4e-0724.68Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG
        +++   ++  L+++L LLRSI    +++++ SI+ DA  Y++E   K+ +L +D   + ++ H +  I++   ++ +++  V    +N   +  C    G
Subjt:  MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNP-ISHHPPMQVTVERLVKGFSINVFSEKSC---QG

Query:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ
        L+VS +   E LGL + +  +SC   F LQA      E    + +++ K+A+++
Subjt:  LLVSILEVFEELGLNVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGCATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATGC
ATCAAAATATATCGAGGAGCGAAAACAGAAAGTAGAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATCCTCCCATGC
AGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTCTATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTT
AATGTTATTGAAGCTAGGGTTTCCTGTACTGATAGTTTCCAATTACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGC
AGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGCGGTGAACAAGATTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAAAAAGAAAAGATATAAAGAGAAAAAAAAAAAAGAAAAGAAAAGAAAAGAAAAGAAAAAGGGTTGAATCCATGGTTTCTAGAGAGCACAAGAAGGCAGCTCTGC
ATGAGAAGCTTCAATTACTTCGCTCTATTACCAACTCTCATTCACAGCTAAACAAGGCCTCGATTATAGTGGATGCATCAAAATATATCGAGGAGCGAAAACAGAAAGTA
GAAAGATTGAATCAAGATATATCAACAGTTGAAAATTCAATCCACCCAAATCCAATTTCTCATCATCCTCCCATGCAGGTTACAGTGGAAAGGCTTGTAAAGGGATTTTC
TATAAATGTATTTTCGGAAAAGAGTTGTCAAGGCCTTCTTGTCTCCATATTAGAAGTGTTTGAAGAGCTGGGGCTTAATGTTATTGAAGCTAGGGTTTCCTGTACTGATA
GTTTCCAATTACAAGCTATTGGAGAAATTGAAGAAGAAGGAGAAGAAGCCATTGATGCTCAATCTGTGAAAGAAGCAGTAGTTCAAGCTATAAAGAGCTGGAGCCAAAGC
GGTGAACAAGATTAATAATTATCAACTCTCTAGAAGGAAAATTATTCCCC
Protein sequenceShow/hide protein sequence
MVSREHKKAALHEKLQLLRSITNSHSQLNKASIIVDASKYIEERKQKVERLNQDISTVENSIHPNPISHHPPMQVTVERLVKGFSINVFSEKSCQGLLVSILEVFEELGL
NVIEARVSCTDSFQLQAIGEIEEEGEEAIDAQSVKEAVVQAIKSWSQSGEQD