; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002292 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002292
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptiontranscription factor IBH1-like 1
Genome locationscaffold30:3887537..3888040
RNA-Seq ExpressionMS002292
SyntenyMS002292
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR036638 - Helix-loop-helix DNA-binding domain superfamily
IPR044549 - Transcription factor IBH1-like, bHLH domain
IPR044660 - Transcription factor IBH1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004149880.1 transcription factor IBH1-like 1 [Cucumis sativus]1.1e-4462.16Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ
        M+N ++L++EF KKWMMGLQI +  +  +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+     K  AA ++ARRLV+KRTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LATAY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

XP_008443801.1 PREDICTED: uncharacterized protein LOC103487305 [Cucumis melo]2.8e-4562.16Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ
        M+NP++L++EFLKKWMMGLQI +  +  +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+     K  AA ++ARRLV++RTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LA+AY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

XP_022135325.1 transcription factor IBH1-like 1 [Momordica charantia]7.5e-8398.81Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
        M NPSTLRREFLKKWMMGLQILSNN CGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC

Query:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHD
        SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHD
Subjt:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHD

XP_022988593.1 transcription factor IBH1-like 1 [Cucurbita maxima]2.3e-4465.38Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---
        MRNPS+L+R+FLKKW+MGLQI +  +  +MTV++RK+AIKLSADIALASSR+CATRWSRAVIAG              +GR+V C++ K   QS SK   
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---

Query:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT
              KILKRSRRV RRK    RKP AAE +ARRLV+KRT+ LRGLVPGGEFMDE+SLIEETLDYI ALQ QVDVMRCLAT
Subjt:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT

XP_038879866.1 transcription factor IBH1-like 1 [Benincasa hispida]3.7e-5068.11Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKNKLQS------
        MRNPS+L+REFLKKWM GLQI + +S  +MTV ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR+V C++ K    S      
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAKNKLQS------

Query:  -GSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S  ILKRSRRV RR+  C  K  AAESVA+RLV KRTK+LRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY PS
Subjt:  -GSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

TrEMBL top hitse value%identityAlignment
A0A0A0LWQ3 Uncharacterized protein5.1e-4562.16Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ
        M+N ++L++EF KKWMMGLQI +  +  +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+     K  AA ++ARRLV+KRTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LATAY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

A0A1S3B8V1 uncharacterized protein LOC1034873051.3e-4562.16Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ
        M+NP++L++EFLKKWMMGLQI +  +  +MTV+ERK AIKLSADIALASSR+CATRWSRAVIA             G +GR + C++ K        +  
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIA-------------GAVGRSVYCQKAK-------NKLQ

Query:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS
          S KI+KRSRRVWRR+     K  AA ++ARRLV++RTK+LRGLVPGGEFMDEISLIEETLDY+SALQAQVDVMR LA+AY PS
Subjt:  SGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPS

A0A6J1C0T4 transcription factor IBH1-like 13.6e-8398.81Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
        M NPSTLRREFLKKWMMGLQILSNN CGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC

Query:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHD
        SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHD
Subjt:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHD

A0A6J1F0A2 transcription factor IBH1-like 1 isoform X21.9e-4463.43Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSKKIL
        MRNPS L+REFLKKWMMGL+I + ++  +MTV++RK AIKLSADIALASSR+CATRWS+A+IA               +GR++  +  +    SG  KI+
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSKKIL

Query:  KRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY
        KRSRRV R +  C  K  AAE +A++LV KR K+LRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY
Subjt:  KRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAY

A0A6J1JMR3 transcription factor IBH1-like 11.1e-4465.38Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---
        MRNPS+L+R+FLKKW+MGLQI +  +  +MTV++RK+AIKLSADIALASSR+CATRWSRAVIAG              +GR+V C++ K   QS SK   
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGA-------------VGRSVYCQKAKNKLQSGSK---

Query:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT
              KILKRSRRV RRK    RKP AAE +ARRLV+KRT+ LRGLVPGGEFMDE+SLIEETLDYI ALQ QVDVMRCLAT
Subjt:  ------KILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLAT

SwissProt top hitse value%identityAlignment
O80482 Transcription factor bHLH1499.1e-0732.89Show/hide
Query:  EFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES
        E L++        SNN    + VS   + I+ +AD  LA+S    TRWSRA++A  V       +AK          LK+ R+  +   +C  +    E+
Subjt:  EFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES

Query:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA
           +L  VE++ KIL  LVPG   +   +L++E  DYI+AL+ QV  M  LA
Subjt:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA

Q9C8Z9 Transcription factor bHLH1489.4e-0430.38Show/hide
Query:  RREFLKKWMMGLQILSNNSCGSM----TVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVG---RSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
        +R +  K    LQ +  NS  S     T  +R KA++ +AD ALA S    T WSRA++A  +    R     +A   + + +  +   S R  +R+ S 
Subjt:  RREFLKKWMMGLQILSNNSCGSM----TVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVG---RSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC

Query:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCL
         R     +S+    V ++ ++L  LVPG        ++EE  DYI AL+ QV  M  L
Subjt:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCL

Q9M0B9 Transcription factor IBH1-like 12.5e-2543.09Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKI
        M+  S++  EFLKKW MGLQI    S  + +V ERKKAIKLSAD+A+AS R   T WSRA+I                 +       K   K     +KI
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKI

Query:  LKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI
        ++RS+++ RRK+  + + AAA+  A+RLV++RT+ LR +VPGGE M +++ L++ETLDYI +LQ QV+VMR +  A +  I
Subjt:  LKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI

Q9M9L6 Transcription factor bHLH1501.1e-0433.86Show/hide
Query:  SERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM
        S R   ++ +AD  LA++   ATRWSRA++    G S+  ++   K  S     ++ S    RR     RK +A        V  R ++L GLVPG    
Subjt:  SERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM

Query:  DEISLIEETLDYISALQAQVDVMRCLA
            L++ET DYI+AL+ QV  M  L+
Subjt:  DEISLIEETLDYISALQAQVDVMRCLA

Q9SKX1 Transcription factor IBH12.0e-0632.59Show/hide
Query:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG
        R + IK +A +++A +   ++R WSRA++               +      KI++ SRR W    +R+ S  R P   E+  R         LR LVPGG
Subjt:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG

Query:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP
          M+   L+EET  YI  L  QV VM+CL     P
Subjt:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP

Arabidopsis top hitse value%identityAlignment
AT1G09250.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein6.4e-0832.89Show/hide
Query:  EFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES
        E L++        SNN    + VS   + I+ +AD  LA+S    TRWSRA++A  V       +AK          LK+ R+  +   +C  +    E+
Subjt:  EFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAES

Query:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA
           +L  VE++ KIL  LVPG   +   +L++E  DYI+AL+ QV  M  LA
Subjt:  VARRL--VEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLA

AT2G43060.1 ILI1 binding bHLH 11.4e-0732.59Show/hide
Query:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG
        R + IK +A +++A +   ++R WSRA++               +      KI++ SRR W    +R+ S  R P   E+  R         LR LVPGG
Subjt:  RKKAIKLSADIALASSRDCATR-WSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVW----RRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGG

Query:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP
          M+   L+EET  YI  L  QV VM+CL     P
Subjt:  EFMDEISLIEETLDYISALQAQVDVMRCLATAYKP

AT4G30410.1 sequence-specific DNA binding transcription factors1.8e-2643.09Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKI
        M+  S++  EFLKKW MGLQI    S  + +V ERKKAIKLSAD+A+AS R   T WSRA+I                 +       K   K     +KI
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKI

Query:  LKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI
        ++RS+++ RRK+  + + AAA+  A+RLV++RT+ LR +VPGGE M +++ L++ETLDYI +LQ QV+VMR +  A +  I
Subjt:  LKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI

AT4G30410.2 sequence-specific DNA binding transcription factors1.8e-2643.09Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKI
        M+  S++  EFLKKW MGLQI    S  + +V ERKKAIKLSAD+A+AS R   T WSRA+I                 +       K   K     +KI
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVI--------------AGAVGRSVYCQKAKNKLQSGSKKI

Query:  LKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI
        ++RS+++ RRK+  + + AAA+  A+RLV++RT+ LR +VPGGE M +++ L++ETLDYI +LQ QV+VMR +  A +  I
Subjt:  LKRSRRVWRRKTSCSRKPAAAESVARRLVEKRTKILRGLVPGGEFM-DEISLIEETLDYISALQAQVDVMRCLATAYKPSI

AT5G57780.1 EXPRESSED IN: 18 plant structures8.7e-2141.25Show/hide
Query:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC
        M   + +++EF+KKW+  L +L ++    + V+ERK AI+LS+D+A+A++R+ +T WSRA+I+ +        K  NK    +++ILK++R   R K  C
Subjt:  MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSC

Query:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEIS-LIEETLDYISALQAQVDVMRCLA
        +         A+  V KRT +L+ LVPGGE +D+   LI ETLDYI  L+AQVDVMR +A
Subjt:  SRKPAAAESVARRLVEKRTKILRGLVPGGEFMDEIS-LIEETLDYISALQAQVDVMRCLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTAATCCGAGCACACTGAGGAGAGAATTCCTCAAGAAATGGATGATGGGTCTCCAAATATTAAGCAATAACAGTTGTGGCTCCATGACGGTCTCCGAGAGAAAAAA
GGCCATCAAGTTATCGGCAGACATAGCGTTGGCCTCCTCTAGAGACTGCGCCACGCGGTGGAGCCGAGCGGTCATCGCCGGCGCGGTCGGGCGGAGCGTGTACTGCCAGA
AGGCCAAGAACAAGCTGCAAAGTGGCTCGAAGAAGATCTTGAAAAGAAGCCGCCGTGTATGGCGGCGGAAAACATCATGCAGCCGGAAGCCAGCGGCGGCGGAGTCGGTT
GCGAGGAGATTGGTGGAGAAAAGAACAAAAATTTTAAGAGGGCTGGTGCCGGGGGGTGAATTTATGGACGAGATTTCGCTGATTGAAGAAACCCTAGATTACATATCGGC
TCTCCAAGCTCAGGTTGATGTAATGCGGTGCCTTGCAACTGCTTACAAGCCATCAATCCATGAT
mRNA sequenceShow/hide mRNA sequence
ATGCGTAATCCGAGCACACTGAGGAGAGAATTCCTCAAGAAATGGATGATGGGTCTCCAAATATTAAGCAATAACAGTTGTGGCTCCATGACGGTCTCCGAGAGAAAAAA
GGCCATCAAGTTATCGGCAGACATAGCGTTGGCCTCCTCTAGAGACTGCGCCACGCGGTGGAGCCGAGCGGTCATCGCCGGCGCGGTCGGGCGGAGCGTGTACTGCCAGA
AGGCCAAGAACAAGCTGCAAAGTGGCTCGAAGAAGATCTTGAAAAGAAGCCGCCGTGTATGGCGGCGGAAAACATCATGCAGCCGGAAGCCAGCGGCGGCGGAGTCGGTT
GCGAGGAGATTGGTGGAGAAAAGAACAAAAATTTTAAGAGGGCTGGTGCCGGGGGGTGAATTTATGGACGAGATTTCGCTGATTGAAGAAACCCTAGATTACATATCGGC
TCTCCAAGCTCAGGTTGATGTAATGCGGTGCCTTGCAACTGCTTACAAGCCATCAATCCATGAT
Protein sequenceShow/hide protein sequence
MRNPSTLRREFLKKWMMGLQILSNNSCGSMTVSERKKAIKLSADIALASSRDCATRWSRAVIAGAVGRSVYCQKAKNKLQSGSKKILKRSRRVWRRKTSCSRKPAAAESV
ARRLVEKRTKILRGLVPGGEFMDEISLIEETLDYISALQAQVDVMRCLATAYKPSIHD