; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg05951 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg05951
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionTranscription factor bHLH61 isoform 2
Genome locationCarg_Chr04:21269886..21271204
RNA-Seq ExpressionCarg05951
SyntenyCarg05951
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022964712.1 uncharacterized protein LOC111464709 isoform X1 [Cucurbita moschata]2.2e-76100Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
        MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE

Query:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
Subjt:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

XP_022964721.1 uncharacterized protein LOC111464709 isoform X2 [Cucurbita moschata]2.0e-7499.36Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
        MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPM VTVEALVKGFSINVFSEKSCQGLLVSILEAFE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE

Query:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
Subjt:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

XP_038884496.1 uncharacterized protein LOC120075304 isoform X1 [Benincasa hispida]2.6e-6687.73Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHA-LNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPN------HPMQVTVEALVKGFSINVFSEKSCQGLLV
        MVSREHKKA LHEKLQLLRSITNSHA LNK SIIVDASKYIEELKQKVERLNQDI+TVQNSIHPN       PMQVTVE LVKGFSINVFSEKSCQGLLV
Subjt:  MVSREHKKAALHEKLQLLRSITNSHA-LNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPN------HPMQVTVEALVKGFSINVFSEKSCQGLLV

Query:  SILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        SILE FEELGLNV+EARVSCTDTFQLQA AEIEE+GEEA+DAQAVKEAVV+AIKSW Q+GEQD
Subjt:  SILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

XP_038884497.1 uncharacterized protein LOC120075304 isoform X2 [Benincasa hispida]1.1e-6788.27Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPN------HPMQVTVEALVKGFSINVFSEKSCQGLLVS
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDI+TVQNSIHPN       PMQVTVE LVKGFSINVFSEKSCQGLLVS
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPN------HPMQVTVEALVKGFSINVFSEKSCQGLLVS

Query:  ILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ILE FEELGLNV+EARVSCTDTFQLQA AEIEE+GEEA+DAQAVKEAVV+AIKSW Q+GEQD
Subjt:  ILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

XP_038884499.1 uncharacterized protein LOC120075304 isoform X4 [Benincasa hispida]7.7e-6686.96Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPN-----HPMQVTVEALVKGFSINVFSEKSCQGLLVSI
        MVSREHKKA LHEKLQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDI+TVQNSIHPN     +   VTVE LVKGFSINVFSEKSCQGLLVSI
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPN-----HPMQVTVEALVKGFSINVFSEKSCQGLLVSI

Query:  LEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        LE FEELGLNV+EARVSCTDTFQLQA AEIEE+GEEA+DAQAVKEAVV+AIKSW Q+GEQD
Subjt:  LEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

TrEMBL top hitse value%identityAlignment
A0A6J1HIH6 uncharacterized protein LOC111464709 isoform X11.0e-76100Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
        MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE

Query:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
Subjt:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

A0A6J1HJR0 uncharacterized protein LOC111464709 isoform X29.8e-7599.36Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
        MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPM VTVEALVKGFSINVFSEKSCQGLLVSILEAFE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE

Query:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
Subjt:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

A0A6J1JTS8 uncharacterized protein LOC111487778 isoform X29.8e-7599.36Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
        MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPM VTVEALVKGFSINVFSEKSCQGLLVSILEAFE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE

Query:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
Subjt:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

A0A6J1JV46 uncharacterized protein LOC111487778 isoform X11.0e-76100Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
        MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE

Query:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
Subjt:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

A0A6J1K013 uncharacterized protein LOC111489817 isoform X23.5e-6487.82Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE
        MVSREH  AALH  LQLLRSITNSHALNK SIIVDASKYIEELKQKVERLNQDI+TVQ SI   HPMQVTVE+L KGFSINVFSEKSCQGLLVSILEAFE
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFE

Query:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD
        ELGLNVLEARVSCTD+FQLQA AEIEE+GEEA+DAQAVKEAVV+AIK WSQ+GEQD
Subjt:  ELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD

SwissProt top hitse value%identityAlignment
Q9LPW3 Transcription factor SCREAM21.8e-0422.86Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATV---QNSIHPNHPMQVTVEALV---------------------
        +++   ++  L+++L +LRS+      +++ SI+ DA  Y++EL Q++  L+ ++ +     +S+HP  P   T+   V                     
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATV---QNSIHPNHPMQVTVEALV---------------------

Query:  ------KGFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVE
              K  +I++F  +   GLL+S + A + LGL+V +A +SC + F L  F   + Q +  +  + +K  +++
Subjt:  ------KGFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVE

Q9LSE2 Transcription factor ICE16.0e-0524.42Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIAT--------VQNSIHPNHPMQVTVEALVK---------------
        +++   ++  L+++L +LRS+      +++ SI+ DA  Y++EL Q++  L+ ++ +          +S HP  P   T+   VK               
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIAT--------VQNSIHPNHPMQVTVEALVK---------------

Query:  -----------GFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCTDTFQLQAF-AEIEEQGEEAMDAQ
                     +I++F  +   GLL++ ++A + LGL+V +A +SC + F L  F AE  ++G+E +  Q
Subjt:  -----------GFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCTDTFQLQAF-AEIEEQGEEAMDAQ

Q9LSL1 Transcription factor bHLH935.1e-0425.47Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQV-------TVEALVKG---FSINVFSEKS--
        +++   ++  L+++L +LRSI      +++ SI+ DA  Y++EL  K+ +L  +   + NS + +H             E LV+    F I+   E +  
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQV-------TVEALVKG---FSINVFSEKS--

Query:  ---CQ---GLLVSILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAV
           C    GLL+S +   E LGL + +  +SC   F LQA      +  + + ++ +K+A+
Subjt:  ---CQ---GLLVSILEAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAV

Q9LXA9 Transcription factor bHLH619.3e-0627.56Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGF--------SINVFSEKSC---
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H +    +T E++V+           +N   +  C   
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGF--------SINVFSEKSC---

Query:  QGLLVSILEAFEELGLNVLEARVSCTDTFQLQAFA-EIEEQGEEAMDAQAVKEAVV
         GL+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A++
Subjt:  QGLLVSILEAFEELGLNVLEARVSCTDTFQLQAFA-EIEEQGEEAMDAQAVKEAVV

Arabidopsis top hitse value%identityAlignment
AT1G29270.1 unknown protein7.5e-1135.43Show/hide
Query:  MVSREHKKAALHEKLQLLRSITN-SHALNKGSIIV-DASKYIEELKQKVERLNQDI----ATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVS
        MV+ E KK A   K   L+++T+   ++++ S+++ +A  YI  LK ++E L ++      T + S+H     +V VE + + F + + S +  +  LV+
Subjt:  MVSREHKKAALHEKLQLLRSITN-SHALNKGSIIV-DASKYIEELKQKVERLNQDI----ATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVS

Query:  ILEAFEELGLNVLEARVSCTDTFQLQA
        ILEAFEE+GLNV +AR SC D+F ++A
Subjt:  ILEAFEELGLNVLEARVSCTDTFQLQA

AT2G40435.1 BEST Arabidopsis thaliana protein match is: transcription regulators (TAIR:AT3G56220.1)3.4e-4363.4Show/hide
Query:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPM--QVTVEALVKGFSINVFSEKSCQGLLVSILEA
        MVSRE K+ +L EK QLLRSITNSHA N  SII+DASKYI++LKQKVER NQD    Q+S  P  P    VTVE L KGF INVFS K+  G+LVS+LEA
Subjt:  MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPM--QVTVEALVKGFSINVFSEKSCQGLLVSILEA

Query:  FEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQ
        FE++GLNVLEAR SCTD+F L A     E GE  MDA+AVK+AV +AI+SW +
Subjt:  FEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQ

AT3G26744.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein4.3e-0624.42Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIAT--------VQNSIHPNHPMQVTVEALVK---------------
        +++   ++  L+++L +LRS+      +++ SI+ DA  Y++EL Q++  L+ ++ +          +S HP  P   T+   VK               
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIAT--------VQNSIHPNHPMQVTVEALVK---------------

Query:  -----------GFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCTDTFQLQAF-AEIEEQGEEAMDAQ
                     +I++F  +   GLL++ ++A + LGL+V +A +SC + F L  F AE  ++G+E +  Q
Subjt:  -----------GFSINVFSEKSCQGLLVSILEAFEELGLNVLEARVSCTDTFQLQAF-AEIEEQGEEAMDAQ

AT3G56220.1 transcription regulators1.4e-3657.05Show/hide
Query:  MVSREHKK-AALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSIL
        MVSREHK+ ++L EK  LLRSIT+SHA ++ SIIVDASKYI++LKQKVE++N    + Q+   S  PN PM VTVE L KGF I V S K+  G+LV +L
Subjt:  MVSREHKK-AALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQN---SIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSIL

Query:  EAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQN
        E FE+LGL+V+EARVSCTDTF L A         + +DA+AVK+AV EAI++WS +
Subjt:  EAFEELGLNVLEARVSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQN

AT5G10570.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein6.6e-0727.56Show/hide
Query:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGF--------SINVFSEKSC---
        +++   ++  L+++L LLRSI      +++ SI+ DA  Y++EL  K+ +L +D   + ++ H +    +T E++V+           +N   +  C   
Subjt:  MVSREHKKAALHEKLQLLRSIT-NSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGF--------SINVFSEKSC---

Query:  QGLLVSILEAFEELGLNVLEARVSCTDTFQLQAFA-EIEEQGEEAMDAQAVKEAVV
         GL+VS +   E LGL + +  +SC   F LQA   E+ EQ    + ++A K+A++
Subjt:  QGLLVSILEAFEELGLNVLEARVSCTDTFQLQAFA-EIEEQGEEAMDAQAVKEAVV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGTCTAGAGAGCACAAGAAGGCCGCTTTGCATGAAAAGCTTCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGGCTCGATTATAGTGGATGCATC
AAAATACATCGAGGAGCTAAAGCAGAAAGTTGAAAGATTGAATCAAGATATAGCAACAGTTCAAAATTCAATCCACCCAAATCATCCCATGCAGGTCACAGTGGAAGCCC
TAGTAAAGGGATTTTCTATAAATGTATTCTCAGAAAAGAGCTGTCAAGGCCTCCTTGTCTCAATATTAGAAGCCTTCGAAGAGCTGGGGCTTAATGTTCTTGAAGCTAGG
GTTTCCTGTACTGATACTTTCCAATTACAAGCTTTTGCAGAAATTGAGGAACAAGGAGAGGAAGCCATGGATGCTCAAGCTGTGAAAGAAGCTGTAGTTGAAGCTATAAA
GAGCTGGAGCCAAAACGGTGAACAAGATTAA
mRNA sequenceShow/hide mRNA sequence
TGAGGGAAAAAACCCCCAAAAAAATAGCAAAAAGAAGAAGAAAAAGAAGGCTGAAAAAGATATAAAGAGAGAGACAGAAACAAAAGGGTTGAATCCATGGTGTCTAGAGA
GCACAAGAAGGCCGCTTTGCATGAAAAGCTTCAATTACTTCGTTCTATTACCAACTCTCATGCTCTAAACAAGGGCTCGATTATAGTGGATGCATCAAAATACATCGAGG
AGCTAAAGCAGAAAGTTGAAAGATTGAATCAAGATATAGCAACAGTTCAAAATTCAATCCACCCAAATCATCCCATGCAGGTCACAGTGGAAGCCCTAGTAAAGGGATTT
TCTATAAATGTATTCTCAGAAAAGAGCTGTCAAGGCCTCCTTGTCTCAATATTAGAAGCCTTCGAAGAGCTGGGGCTTAATGTTCTTGAAGCTAGGGTTTCCTGTACTGA
TACTTTCCAATTACAAGCTTTTGCAGAAATTGAGGAACAAGGAGAGGAAGCCATGGATGCTCAAGCTGTGAAAGAAGCTGTAGTTGAAGCTATAAAGAGCTGGAGCCAAA
ACGGTGAACAAGATTAAAAAAAAACATCCTTAATTTCTCCAACTTTTTTCCGAACCGCGGCGTTTTTTTTTTTTATTTATTGTTCTTAACTTCTTTGTGTTTGTAATGTG
TAAAAATTAATCAATGGAAAAACCGATTGATTAAGGTTCTGGAAAATAAAGGAAAATAAAGTGAATGACAACCATTGCAGCAACAACAAGGAATATTATCGATATCAATC
GGATAAACTAATTTTCTTTTCTTTTTTTGAAATTTGTTTGTAATTTATTATCAGAGTTCATAAGTTTTACCACAAGAATATTGCACATAGGTTTTCGAATATCATGAAAG
GTGTCACTTTTGGAGGCGAAGTACGAAGAGGGTGAGCAAGAAAGTTTAAAGTACCTTAAAAGAAGGAACATTTTTTTTGGTTCTGTGGGTTGTAAATGATGAGGAATTTT
TCATC
Protein sequenceShow/hide protein sequence
MVSREHKKAALHEKLQLLRSITNSHALNKGSIIVDASKYIEELKQKVERLNQDIATVQNSIHPNHPMQVTVEALVKGFSINVFSEKSCQGLLVSILEAFEELGLNVLEAR
VSCTDTFQLQAFAEIEEQGEEAMDAQAVKEAVVEAIKSWSQNGEQD