; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0475 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0475
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiontranscription factor IBH1-like 1
Genome locationMC06:3896829..3897566
RNA-Seq ExpressionMC06g0475
SyntenyMC06g0475
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR044549 - Transcription factor IBH1-like, bHLH domain
IPR044660 - Transcription factor IBH1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149880.1 transcription factor IBH1-like 1 [Cucumis sativus]2.31e-5862.16Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ
        M N ++L++EF KKWMMGLQI +     +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+   + K  AA ++ARRLV+KRTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LATAY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

XP_008443801.1 PREDICTED: uncharacterized protein LOC103487305 [Cucumis melo]4.03e-5962.16Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ
        M NP++L++EFLKKWMMGLQI +     +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+   + K  AA ++ARRLV++RTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LA+AY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

XP_022135325.1 transcription factor IBH1-like 1 [Momordica charantia]7.45e-114100Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
        MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC

Query:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHDHE
        SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHDHE
Subjt:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHDHE

XP_022988593.1 transcription factor IBH1-like 1 [Cucurbita maxima]1.77e-5765.38Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---
        M NPS+L+R+FLKKW+MGLQI +     +MTV++RK+AIKLSADIALASSR+CATRWSRAVIAG              +GR+V C++ K   QS SK   
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---

Query:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT
              KILKRSRRV RRK  C RKPAA E +ARRLV+KRT+ LRGLVPGGEFMDE+SLIEETLDYI ALQ QVDVMRCLAT
Subjt:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT

XP_038879866.1 transcription factor IBH1-like 1 [Benincasa hispida]2.14e-6467.03Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKNKLQS------
        M NPS+L+REFLKKWM GLQI + +   +MTV ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR+V C++ K    S      
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKNKLQS------

Query:  -GSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S  ILKRSRRV RR+  C  K  AAESVA+RLV KRTK+LRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY PS
Subjt:  -GSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

TrEMBL top hitse value%identityAlignment
A0A0A0LWQ3 Uncharacterized protein1.12e-5862.16Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ
        M N ++L++EF KKWMMGLQI +     +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+   + K  AA ++ARRLV+KRTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LATAY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

A0A1S3B8V1 uncharacterized protein LOC1034873051.95e-5962.16Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ
        M NP++L++EFLKKWMMGLQI +     +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKN-------KLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+   + K  AA ++ARRLV++RTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LA+AY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

A0A6J1C0T4 transcription factor IBH1-like 13.61e-114100Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
        MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC

Query:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHDHE
        SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHDHE
Subjt:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHDHE

A0A6J1F0A2 transcription factor IBH1-like 1 isoform X22.69e-5762.86Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSKKIL
        M NPS L+REFLKKWMMGL+I + +   +MTV++RK AIKLSADIALASSR+CATRWS+A+IA               +GR++  +  +    SG  KI+
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSKKIL

Query:  KRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY
        KRSRRV R +  C  K  AAE +A++LV KR K+LRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY
Subjt:  KRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY

A0A6J1JMR3 transcription factor IBH1-like 18.57e-5865.38Show/hide
Query:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---
        M NPS+L+R+FLKKW+MGLQI +     +MTV++RK+AIKLSADIALASSR+CATRWSRAVIAG              +GR+V C++ K   QS SK   
Subjt:  MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---

Query:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT
              KILKRSRRV RRK  C RKPAA E +ARRLV+KRT+ LRGLVPGGEFMDE+SLIEETLDYI ALQ QVDVMRCLAT
Subjt:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT

SwissProt top hitse value%identityAlignment
O80482 Transcription factor bHLH1497.0e-0732.89Show/hide
Query:  EFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES
        E L++        SNN    + VS   + I+ +AD  LA+S    TRWSRA++A  V       +AK          LK+ R+  +   +C  +    E+
Subjt:  EFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES

Query:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA
           +L  VE++ KIL  LVPG   +   +L++E  DYI+AL+ QV  M  LA
Subjt:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA

Q01I07 Transcription factor IBH17.3e-0426.32Show/hide
Query:  VSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLV----------------
        V+ R + I+ +A  ++A +      WSRA++  A  R              S+ +++R+  + RR+   +  P+ A +   R++                
Subjt:  VSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLV----------------

Query:  -----EKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY
               R   LR LVPGG  M+  SL+EET DY+ +L+AQV +M+ L   +
Subjt:  -----EKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY

Q9M0B9 Transcription factor IBH1-like 17.5e-2542.94Show/hide
Query:  STLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKILKRS
        S++  EFLKKW MGLQI       + +V ERKKAIKLSAD+A+AS R   T WSRA+I                 +       K   K     +KI++RS
Subjt:  STLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKILKRS

Query:  RRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI
        +++ RRK+  + + AAA+  A+RLV++RT+ LR +VPGGE M +++ L++ETLDYI +LQ QV+VMR +  A +  I
Subjt:  RRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI

Q9M9L6 Transcription factor bHLH1508.6e-0533.86Show/hide
Query:  SERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM
        S R   ++ +AD  LA++   ATRWSRA++    G S+  ++   K  S     ++ S    RR     RK +A        V  R ++L GLVPG    
Subjt:  SERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM

Query:  DEISLIEETLDYISALQAQVDVMRCLA
            L++ET DYI+AL+ QV  M  L+
Subjt:  DEISLIEETLDYISALQAQVDVMRCLA

Q9SKX1 Transcription factor IBH11.6e-0632.59Show/hide
Query:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG
        R + IK +A +++A +   ++R WSRA++               +      KI++ SRR W    +R+ S  R P   E+  R         LR LVPGG
Subjt:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG

Query:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP
          M+   L+EET  YI  L  QV VM+CL     P
Subjt:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP

Arabidopsis top hitse value%identityAlignment
AT1G09250.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein5.0e-0832.89Show/hide
Query:  EFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES
        E L++        SNN    + VS   + I+ +AD  LA+S    TRWSRA++A  V       +AK          LK+ R+  +   +C  +    E+
Subjt:  EFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES

Query:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA
           +L  VE++ KIL  LVPG   +   +L++E  DYI+AL+ QV  M  LA
Subjt:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA

AT2G43060.1 ILI1 binding bHLH 11.1e-0732.59Show/hide
Query:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG
        R + IK +A +++A +   ++R WSRA++               +      KI++ SRR W    +R+ S  R P   E+  R         LR LVPGG
Subjt:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG

Query:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP
          M+   L+EET  YI  L  QV VM+CL     P
Subjt:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP

AT4G30410.1 sequence-specific DNA binding transcription factors5.3e-2642.94Show/hide
Query:  STLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKILKRS
        S++  EFLKKW MGLQI       + +V ERKKAIKLSAD+A+AS R   T WSRA+I                 +       K   K     +KI++RS
Subjt:  STLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKILKRS

Query:  RRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI
        +++ RRK+  + + AAA+  A+RLV++RT+ LR +VPGGE M +++ L++ETLDYI +LQ QV+VMR +  A +  I
Subjt:  RRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI

AT4G30410.2 sequence-specific DNA binding transcription factors5.3e-2642.94Show/hide
Query:  STLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKILKRS
        S++  EFLKKW MGLQI       + +V ERKKAIKLSAD+A+AS R   T WSRA+I                 +       K   K     +KI++RS
Subjt:  STLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKILKRS

Query:  RRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI
        +++ RRK+  + + AAA+  A+RLV++RT+ LR +VPGGE M +++ L++ETLDYI +LQ QV+VMR +  A +  I
Subjt:  RRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI

AT5G57780.1 EXPRESSED IN: 18 plant structures1.1e-2042.21Show/hide
Query:  LRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAA
        +++EF+KKW+  L +L ++    + V+ERK AI+LS+D+A+A++R+ +T WSRA+I+ +        K  NK    +++ILK++R   R K  C+     
Subjt:  LRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAA

Query:  AESVARRLVEKRTKILRGLVPGGEFMDEIS-LIEETLDYISALQAQVDVMRCLA
            A+  V KRT +L+ LVPGGE +D+   LI ETLDYI  L+AQVDVMR +A
Subjt:  AESVARRLVEKRTKILRGLVPGGEFMDEIS-LIEETLDYISALQAQVDVMRCLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTAATCCGAGCACACTGAGGAGAGAATTCCTCAAGAAATGGATGATGGGTCTCCAAATATTAAGCAATAACGGTTGTGGCTCCATGACGGTCTCCGAGAGAAAAAA
GGCCATCAAGTTATCGGCAGACATAGCGTTGGCCTCCTCTAGAGACTGCGCCACGCGGTGGAGCCGAGCGGTCATCGCCGGCGCGGTCGGGCGGAGCGTGTACTGCCAGA
AGGCCAAGAACAAGCTGCAAAGTGGCTCGAAAAAGATCTTGAAAAGAAGCCGCCGTGTATGGCGGCGGAAAACATCATGCAGCCGGAAGCCAGCGGCGGCGGAGTCGGTT
GCGAGGAGATTGGTGGAGAAAAGAACAAAAATTTTAAGAGGGCTGGTGCCGGGGGGTGAATTTATGGACGAGATTTCGCTGATTGAAGAAACCCTAGATTACATATCGGC
TCTCCAAGCTCAGGTTGATGTAATGCGGTGCCTTGCAACTGCTTACAAGCCATCAATCCATGATCATGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTAATCCGAGCACACTGAGGAGAGAATTCCTCAAGAAATGGATGATGGGTCTCCAAATATTAAGCAATAACGGTTGTGGCTCCATGACGGTCTCCGAGAGAAAAAA
GGCCATCAAGTTATCGGCAGACATAGCGTTGGCCTCCTCTAGAGACTGCGCCACGCGGTGGAGCCGAGCGGTCATCGCCGGCGCGGTCGGGCGGAGCGTGTACTGCCAGA
AGGCCAAGAACAAGCTGCAAAGTGGCTCGAAAAAGATCTTGAAAAGAAGCCGCCGTGTATGGCGGCGGAAAACATCATGCAGCCGGAAGCCAGCGGCGGCGGAGTCGGTT
GCGAGGAGATTGGTGGAGAAAAGAACAAAAATTTTAAGAGGGCTGGTGCCGGGGGGTGAATTTATGGACGAGATTTCGCTGATTGAAGAAACCCTAGATTACATATCGGC
TCTCCAAGCTCAGGTTGATGTAATGCGGTGCCTTGCAACTGCTTACAAGCCATCAATCCATGATCATGAATAAGGGAATTAAATATCTCCCCCCCCCCCCCCCTTTCTTT
TTCTTTTTCTTTTTCTTTTCTGCCCAGCATTCGATCGTATATGATGAATGGGATGTGGAGAAAATCAAGGCCTTTTTGCACATATATAATTTAATTTAAGTTAATGTAGT
CATAGTTTTCAAAACATTGCTGAGCTTATATATTGATCGATGTATATATAATATAAAACTAATTATATGAATTATTAA
Protein sequenceShow/hide protein sequence
MCNPSTLRREFLKKWMMGLQILSNNGCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESV
ARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHDHE