; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0759 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0759
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptiontranscription factor bHLH128
Genome locationMC05:6216739..6222826
RNA-Seq ExpressionMC05g0759
SyntenyMC05g0759
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7029066.1 Transcription factor bHLH [Cucurbita argyrosperma subsp. argyrosperma]1.29e-15774.8Show/hide
Query:  MYQSTSSSSSSQKSM------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLK
        MYQSTSSSSSS  SM      G  GG GGGGLTRYGSAPGSLL SAVDSVIG  RHPDS A LRTPP SFGGH+FSSA+SS             TS+DLK
Subjt:  MYQSTSSSSSSQKSM------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLK

Query:  SSSSGAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSG-Q
        SS   AAAAAAL RSYGIHDLAL DFS  R+FNSNGGQSSSSSPLVRQRSSPAGFLGH +VA+G G GFS+TRGGG         G+ RLKSQLSFSG Q
Subjt:  SSSSGAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSG-Q

Query:  DSLSQISEMSESVVEGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPH--HSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIR
        DS+SQ+S MSESVVEG +SS++ A  A    F +DSWD+  SNSI F APHPKR    HSD DFF  L+SQFS+PQTS EMATVERLLQIPEDSVPCKIR
Subjt:  DSLSQISEMSESVVEGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPH--HSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIR

Query:  AKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS
        AKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSY+DMLDLAVQHIKGLQNQIQKLNKEVENC+CG K  S
Subjt:  AKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS

XP_022137823.1 transcription factor bHLH128 [Momordica charantia]6.92e-243100Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGIH
        MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGIH
Subjt:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGIH

Query:  DLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVNSS
        DLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVNSS
Subjt:  DLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVNSS

Query:  NANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS
        NANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS
Subjt:  NANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS

Query:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS
        GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS
Subjt:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS

XP_022932581.1 transcription factor bHLH128-like [Cucurbita moschata]2.10e-15775.47Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLKSSSSGA
        MYQSTSSSSSS  SM P+   GGGGLTRYGSAPGSLL SAVDSVIG  RHPDS A LRTPP SFGGH+FSSA+SS             TS+DLKSS+  A
Subjt:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLKSSSSGA

Query:  AAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSG-QDSLSQI
        AAAAAL RSYGIHDLAL DFS  R+FNSNGGQSSSSSPLVRQRSSPAGFLGH +VA+G G GFS+TRGGG         G+ RLKSQLSFSG QDS+SQ+
Subjt:  AAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSG-QDSLSQI

Query:  SEMSESVVEGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPH--HSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCA
        S MSESVVEG +SS++ A  A    F +DSWD+  SNSI F APHPKR    HSD DFF  L+SQFS+PQTS EMATVERLLQIPEDSVPCKIRAKRGCA
Subjt:  SEMSESVVEGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPH--HSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCA

Query:  THPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS
        THPRSIAERERRTRISGKLKKLQELVPNMDKQTSY+DMLDLAVQHIKGLQNQIQKLNKEVENC+CG K  S
Subjt:  THPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS

XP_038899818.1 transcription factor bHLH128-like isoform X1 [Benincasa hispida]1.49e-15773.45Show/hide
Query:  MYQSTSSSSSSQKSM---------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSN
        MYQSTSSSSSSQKSM         G  G GGGGGLTRYGSAPGSLLT+AVDSVIG  R PDS A+LR PP SFGGHYFSS DSS             TSN
Subjt:  MYQSTSSSSSSQKSM---------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSN

Query:  DLKSSSS---GAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQL
        DLKSSSS    AAAAAALNRSYG +DLALGDFS  RNFNSNGGQSSSSSPLVRQRSSPAGFLGH +VA+ NG GFS+T GGG        NG+ RLKSQ+
Subjt:  DLKSSSS---GAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQL

Query:  SFSGQD-SLSQISEMSESVVEGVNS--SNANANIAITHSFGID------SWDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFS-MPQTSLEMATVER
        SF+GQD SLSQISEMSES VE  NS  +   +N   THSFG        SW+++N NSIVF APH KR  HHSD+DFF  LESQ   +PQT+LEMA VER
Subjt:  SFSGQD-SLSQISEMSESVVEGVNS--SNANANIAITHSFGID------SWDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFS-MPQTSLEMATVER

Query:  LLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK
        LLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSY+DMLDLAVQHIKGLQNQIQKLNKEVENC+CG K
Subjt:  LLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK

XP_038899826.1 transcription factor bHLH128-like isoform X2 [Benincasa hispida]3.25e-16174.16Show/hide
Query:  MYQSTSSSSSSQKSM---------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSN
        MYQSTSSSSSSQKSM         G  G GGGGGLTRYGSAPGSLLT+AVDSVIG  R PDS A+LR PP SFGGHYFSS DSS             TSN
Subjt:  MYQSTSSSSSSQKSM---------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSN

Query:  DLKSSSS---GAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQL
        DLKSSSS    AAAAAALNRSYG +DLALGDFS  RNFNSNGGQSSSSSPLVRQRSSPAGFLGH +VA+ NG GFS+T GGG        NG+ RLKSQ+
Subjt:  DLKSSSS---GAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQL

Query:  SFSGQD-SLSQISEMSESVVEGVNS--SNANANIAITHSFGID------SWDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERL
        SF+GQD SLSQISEMSES VE  NS  +   +N   THSFG        SW+++N NSIVF APH KR  HHSD+DFF  LESQFS+PQT+LEMA VERL
Subjt:  SFSGQD-SLSQISEMSESVVEGVNS--SNANANIAITHSFGID------SWDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERL

Query:  LQIPEDSVPCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK
        LQIPEDSVPCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSY+DMLDLAVQHIKGLQNQIQKLNKEVENC+CG K
Subjt:  LQIPEDSVPCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK

TrEMBL top hitse value%identityAlignment
A0A0A0KZ44 BHLH domain-containing protein1.68e-15674.41Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGG---GGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLKSSS
        MYQSTSSSSSSQKSM  A GG     GGLTRYGSAPGS LT+AVDSVIG  R PDS A+LR PP SFG HYFSSADSS             TSNDLKSSS
Subjt:  MYQSTSSSSSSQKSMGPAGGGG---GGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLKSSS

Query:  SGAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQD-SL
           A AAALNRSYG +DLALGDFS  RNFNSNGGQSSSSSPLVRQRSSPAGFLGH +VA+ NG GFS+T GGG  G     NG  RLKSQ+SF+GQD SL
Subjt:  SGAAAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQD-SL

Query:  SQISEMSESVVEGVNS------SNANA--NIAITHSFGIDS-WDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSV
        SQISE+SES VE  NS      SN N+  + A + +F +DS WD++ SNSIVF APH KR  HHSD+DFF  LESQFS+PQT+LEMA VERLLQIPEDSV
Subjt:  SQISEMSESVVEGVNS------SNANA--NIAITHSFGIDS-WDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSV

Query:  PCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK
        PCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSY+DMLDLAVQHIKGLQNQIQKLNKEVENC+CG K
Subjt:  PCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK

A0A6J1CBF5 transcription factor bHLH1283.35e-243100Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGIH
        MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGIH
Subjt:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGIH

Query:  DLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVNSS
        DLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVNSS
Subjt:  DLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVNSS

Query:  NANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS
        NANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS
Subjt:  NANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS

Query:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS
        GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS
Subjt:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS

A0A6J1EWR4 transcription factor bHLH128-like1.02e-15775.47Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLKSSSSGA
        MYQSTSSSSSS  SM P+   GGGGLTRYGSAPGSLL SAVDSVIG  RHPDS A LRTPP SFGGH+FSSA+SS             TS+DLKSS+  A
Subjt:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSS-------------TSNDLKSSSSGA

Query:  AAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSG-QDSLSQI
        AAAAAL RSYGIHDLAL DFS  R+FNSNGGQSSSSSPLVRQRSSPAGFLGH +VA+G G GFS+TRGGG         G+ RLKSQLSFSG QDS+SQ+
Subjt:  AAAAALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSG-QDSLSQI

Query:  SEMSESVVEGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPH--HSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCA
        S MSESVVEG +SS++ A  A    F +DSWD+  SNSI F APHPKR    HSD DFF  L+SQFS+PQTS EMATVERLLQIPEDSVPCKIRAKRGCA
Subjt:  SEMSESVVEGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPH--HSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCA

Query:  THPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS
        THPRSIAERERRTRISGKLKKLQELVPNMDKQTSY+DMLDLAVQHIKGLQNQIQKLNKEVENC+CG K  S
Subjt:  THPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCKGSS

A0A6J1GUX2 transcription factor bHLH128-like isoform X22.42e-15373.55Show/hide
Query:  MYQSTSSSSSSQKSM---------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAA
        MYQS SSSSSSQK M         G  GGG GGGLTRYGSAPGSLLTSAVDSVIG  RHPDS +SLRTPP SF GHYFSSADSS    LK+SSS AAAA 
Subjt:  MYQSTSSSSSSQKSM---------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAA

Query:  ALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSE
           RSYGIHDLALGDFS  RNFNSNGGQSS SSPLVRQ+SSPAGFLGH +VA+ NG GFS+T GGG        NG+ RLKSQ+  +GQDSLSQISEM+E
Subjt:  ALNRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSE

Query:  SVVEGVNSSNANANIAITHSFGIDS-WDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRS
        S +E  NSS   AN     +FG+DS WD++N NSIVF APH K   HH D++FF  LESQFSMPQT+LEMATVERLL IPE S PCKIRAKRGCATHPRS
Subjt:  SVVEGVNSSNANANIAITHSFGIDS-WDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRS

Query:  IAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK
        IAERERRTRISGKLKKLQ+LVPNMDKQTSY+DMLDLAVQHIKGLQ+QIQ LNKE E+C+CG K
Subjt:  IAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK

A0A6J1IXR0 transcription factor bHLH128-like isoform X14.81e-15574.52Show/hide
Query:  MYQSTSSSSSSQKSM-------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAAL
        MYQS SSSSSSQK M       G  GGGGGGGLTRYGSAPGSLLTSAVDSVIG  RHPDS +SLRTPP SF GHYFSSADSS    LKSSSS AAA    
Subjt:  MYQSTSSSSSSQKSM-------GPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAAL

Query:  NRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESV
         RSYGIHDLALGDFS  RNFNSNGGQSSSSSPLVRQ+SSPAGFLGH +VA+ NG GFS+T GGG        N + RLKSQ+  +GQDSLSQISEMSE  
Subjt:  NRSYGIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESV

Query:  VEGVNSSNANANIAITHSFGIDS-WDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIA
        +E  NSS   AN     +FG+DS WD++N NSIVF APH K   HH D++FF  LESQFSMPQT+LEMATVERLLQIPE S PCKIRAKRGCATHPRSIA
Subjt:  VEGVNSSNANANIAITHSFGIDS-WDSANSNSIVFTAPHPKRP-HHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIA

Query:  ERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK
        ERERRTRISGKLKKLQ+LVPNMDKQTSY+DMLDLAVQHIKGLQ+QIQ LNKE E+C+CG K
Subjt:  ERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGCK

SwissProt top hitse value%identityAlignment
Q66GR3 Transcription factor bHLH1302.6e-2734.7Show/hide
Query:  GGGLTRYGSAPGSLLTSAVDS--------------VIGGGRHPD---------STASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGI
        G GL R+ SAP S+L + VD               V   G + D         S  SL     S+      +A       L+ SS          +S GI
Subjt:  GGGLTRYGSAPGSLLTSAVDS--------------VIGGGRHPD---------STASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGI

Query:  -HDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVN
         + + L  F    N ++   +S+    L+RQ SSPAG   + +  +G GS  ++     +  S +N+NG+ R    LS     SL  +S++ E       
Subjt:  -HDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVN

Query:  SSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFF------------NCLESQFSMPQ---TSLEMATVERLLQIPEDSVPCKIRAKRGC
               IA   +F    W+  + +S +      KR    D   F              L    S+P+   T+ +M +V++ LQ+ +DSVPCKIRAKRGC
Subjt:  SSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFF------------NCLESQFSMPQ---TSLEMATVERLLQIPEDSVPCKIRAKRGC

Query:  ATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC
        ATHPRSIAER RRTRIS +++KLQELVPNMDKQT+ +DMLDLAV +IK LQ Q + LN    NC C
Subjt:  ATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC

Q8H102 Transcription factor bHLH1284.8e-6650.54Show/hide
Query:  MYQSTSS-SSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGG-RHPDSTASLRTPPPSFGGHYFSSA-------DSSTSNDLKSSSSGAAAAA
        MYQS+SS SSSSQ+S  P    GGGGL RYGSAPGS L S VD VIGGG  +       +    +F G++F+ A         ST+  + +SS G     
Subjt:  MYQSTSS-SSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGG-RHPDSTASLRTPPPSFGGHYFSSA-------DSSTSNDLKSSSSGAAAAA

Query:  ALNRSYGIHDLAL----GDFS-IPRNFNSN---GGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSL
          N +    D+ L    G F+ I +   SN   GG SS S  L RQRSSPA F  + A    N S    T      G  N   G  RLKSQLSF+  DSL
Subjt:  ALNRSYGIHDLAL----GDFS-IPRNFNSN---GGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSL

Query:  SQISEMSESVV---EGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHP-KRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRA
        ++I+E++E+ V    G + S A+   A T     DSWD   S SI FT   P KR    DS  F    SQ+S+P +   M  ++  +Q+PEDSVPCKIRA
Subjt:  SQISEMSESVV---EGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHP-KRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRA

Query:  KRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGC
        KRGCATHPRSIAERERRTRISGKLKKLQ+LVPNMDKQTSY+DMLDLAVQHIKGLQ+Q+Q L K+ ENC+CGC
Subjt:  KRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGC

Q9C690 Transcription factor bHLH1221.6e-2437.75Show/hide
Query:  LVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQN----NNNGVCRL--KSQLSFSGQDSLSQISEMS-----------ESVVEGVNSSNANAN---
        L R  SSPAG    F+  D   +  +V +  G +G  N    +N     L  +S+L      ++S ISE+             ++  G N S  N     
Subjt:  LVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQN----NNNGVCRL--KSQLSFSGQDSLSQISEMS-----------ESVVEGVNSSNANAN---

Query:  -----IAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS
             +A T S G+D + + + +S    +  P   HH             S+P++   ++ +E+LL    DS+PCKIRAKRGCATHPRSIAER RRT+IS
Subjt:  -----IAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS

Query:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC
         +++KLQ+LVPNMD QT+ ADMLDLAVQ+IK LQ Q++ L +    C C
Subjt:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC

Q9M0R0 Transcription factor bHLH811.6e-2445.81Show/hide
Query:  SFGID-SWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIP---------------EDSVPCKIRAKRGCATHPRSIAERE
        SFGI  ++D  + N  +  +P  KR           +E+ FS P+ + +M   +   Q+P               EDSV  ++RAKRGCATHPRSIAER 
Subjt:  SFGID-SWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIP---------------EDSVPCKIRAKRGCATHPRSIAERE

Query:  RRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC
        RRTRIS +++KLQELVPNMDKQT+ ADML+ AV+++K LQ QIQ+L +E + C+C
Subjt:  RRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC

Q9ZW81 Transcription factor bHLH1291.5e-4844.64Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRH---PDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSY
        MY   SS S++        GGG     +Y SA G+   +   S +G   H   P          P+  GHY     SS                      
Subjt:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRH---PDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSY

Query:  GIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGV
                       F+SN   +SSSS L R RSSPAGF       D NG+GFS+ R  G YG      G  RLKS+L FS   S  Q       + E  
Subjt:  GIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGV

Query:  NSSNANANIAITH-SFG---IDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAER
         ++ A   +A +  SFG    ++WD+++S+ I FT   P +    +SDFF  LE+Q+SMPQT+LEMAT+E L+ IPEDSVPC+ RAKRG ATHPRSIAER
Subjt:  NSSNANANIAITH-SFG---IDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAER

Query:  ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQ
        ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAV+HIKGLQ+Q++
Subjt:  ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQ

Arabidopsis top hitse value%identityAlignment
AT1G05805.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.4e-6750.54Show/hide
Query:  MYQSTSS-SSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGG-RHPDSTASLRTPPPSFGGHYFSSA-------DSSTSNDLKSSSSGAAAAA
        MYQS+SS SSSSQ+S  P    GGGGL RYGSAPGS L S VD VIGGG  +       +    +F G++F+ A         ST+  + +SS G     
Subjt:  MYQSTSS-SSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGG-RHPDSTASLRTPPPSFGGHYFSSA-------DSSTSNDLKSSSSGAAAAA

Query:  ALNRSYGIHDLAL----GDFS-IPRNFNSN---GGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSL
          N +    D+ L    G F+ I +   SN   GG SS S  L RQRSSPA F  + A    N S    T      G  N   G  RLKSQLSF+  DSL
Subjt:  ALNRSYGIHDLAL----GDFS-IPRNFNSN---GGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSL

Query:  SQISEMSESVV---EGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHP-KRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRA
        ++I+E++E+ V    G + S A+   A T     DSWD   S SI FT   P KR    DS  F    SQ+S+P +   M  ++  +Q+PEDSVPCKIRA
Subjt:  SQISEMSESVV---EGVNSSNANANIAITHSFGIDSWDSANSNSIVFTAPHP-KRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRA

Query:  KRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGC
        KRGCATHPRSIAERERRTRISGKLKKLQ+LVPNMDKQTSY+DMLDLAVQHIKGLQ+Q+Q L K+ ENC+CGC
Subjt:  KRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCGC

AT1G51140.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.1e-2537.75Show/hide
Query:  LVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQN----NNNGVCRL--KSQLSFSGQDSLSQISEMS-----------ESVVEGVNSSNANAN---
        L R  SSPAG    F+  D   +  +V +  G +G  N    +N     L  +S+L      ++S ISE+             ++  G N S  N     
Subjt:  LVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQN----NNNGVCRL--KSQLSFSGQDSLSQISEMS-----------ESVVEGVNSSNANAN---

Query:  -----IAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS
             +A T S G+D + + + +S    +  P   HH             S+P++   ++ +E+LL    DS+PCKIRAKRGCATHPRSIAER RRT+IS
Subjt:  -----IAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRIS

Query:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC
         +++KLQ+LVPNMD QT+ ADMLDLAVQ+IK LQ Q++ L +    C C
Subjt:  GKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC

AT2G42280.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.8e-2834.7Show/hide
Query:  GGGLTRYGSAPGSLLTSAVDS--------------VIGGGRHPD---------STASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGI
        G GL R+ SAP S+L + VD               V   G + D         S  SL     S+      +A       L+ SS          +S GI
Subjt:  GGGLTRYGSAPGSLLTSAVDS--------------VIGGGRHPD---------STASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGI

Query:  -HDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVN
         + + L  F    N ++   +S+    L+RQ SSPAG   + +  +G GS  ++     +  S +N+NG+ R    LS     SL  +S++ E       
Subjt:  -HDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVN

Query:  SSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFF------------NCLESQFSMPQ---TSLEMATVERLLQIPEDSVPCKIRAKRGC
               IA   +F    W+  + +S +      KR    D   F              L    S+P+   T+ +M +V++ LQ+ +DSVPCKIRAKRGC
Subjt:  SSNANANIAITHSFGIDSWDSANSNSIVFTAPHPKRPHHSDSDFF------------NCLESQFSMPQ---TSLEMATVERLLQIPEDSVPCKIRAKRGC

Query:  ATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC
        ATHPRSIAER RRTRIS +++KLQELVPNMDKQT+ +DMLDLAV +IK LQ Q + LN    NC C
Subjt:  ATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSC

AT2G43140.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.1e-4944.64Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRH---PDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSY
        MY   SS S++        GGG     +Y SA G+   +   S +G   H   P          P+  GHY     SS                      
Subjt:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRH---PDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSY

Query:  GIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGV
                       F+SN   +SSSS L R RSSPAGF       D NG+GFS+ R  G YG      G  RLKS+L FS   S  Q       + E  
Subjt:  GIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGV

Query:  NSSNANANIAITH-SFG---IDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAER
         ++ A   +A +  SFG    ++WD+++S+ I FT   P +    +SDFF  LE+Q+SMPQT+LEMAT+E L+ IPEDSVPC+ RAKRG ATHPRSIAER
Subjt:  NSSNANANIAITH-SFG---IDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAER

Query:  ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQ
        ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAV+HIKGLQ+Q++
Subjt:  ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQ

AT2G43140.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein5.3e-5244.72Show/hide
Query:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRH---PDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSY
        MY   SS S++        GGG     +Y SA G+   +   S +G   H   P          P+  GHY     SS                      
Subjt:  MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRH---PDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSY

Query:  GIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGV
                       F+SN   +SSSS L R RSSPAGF       D N  GFS+ R  G YG      G  RLKS+L FS   S  Q       + E  
Subjt:  GIHDLALGDFSIPRNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGV

Query:  NSSNANANIAITH-SFG---IDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAER
         ++ A   +A +  SFG    ++WD+++S+ I FT   P +    +SDFF  LE+Q+SMPQT+LEMAT+E L+ IPEDSVPC+ RAKRG ATHPRSIAER
Subjt:  NSSNANANIAITH-SFG---IDSWDSANSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAER

Query:  ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCG-CK
        ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAV+HIKGLQ+Q++ L K +E C+CG CK
Subjt:  ERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHIKGLQNQIQKLNKEVENCSCG-CK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACCAATCAACCTCTTCCTCTTCATCTTCTCAGAAATCCATGGGCCCCGCCGGAGGAGGCGGCGGTGGAGGCCTTACTCGCTACGGCTCCGCCCCTGGCTCTCTCCT
GACCTCGGCCGTGGACTCTGTAATCGGCGGAGGGCGCCATCCGGACTCCACCGCGTCCCTCCGTACGCCGCCGCCGTCGTTCGGCGGCCATTACTTCTCCTCCGCCGACT
CCTCCACATCCAACGATCTCAAATCCTCCTCCTCCGGGGCCGCCGCCGCCGCTGCCCTGAATCGCTCCTACGGAATCCACGATTTAGCCCTAGGCGATTTCTCGATACCG
AGAAATTTCAATAGCAATGGCGGCCAATCGTCTTCCTCCTCGCCGTTGGTTCGCCAGAGAAGCTCGCCGGCGGGATTTCTAGGCCACTTCGCCGTCGCCGATGGAAACGG
ATCAGGGTTTTCAGTGACAAGGGGAGGTGGGAACTATGGTTCACAAAATAATAACAATGGAGTTTGCAGGCTTAAGTCTCAGCTGAGCTTCAGTGGGCAGGATTCTCTGT
CTCAGATATCAGAAATGAGTGAGAGTGTGGTGGAGGGAGTCAACTCGAGCAATGCCAATGCCAATATCGCCATCACCCATTCCTTCGGAATCGACTCTTGGGACAGCGCC
AACTCCAACTCCATTGTTTTCACTGCCCCTCACCCCAAACGCCCCCACCATTCTGATTCCGACTTCTTCAATTGCCTCGAGTCTCAGTTTAGCATGCCACAGACAAGCCT
AGAAATGGCAACTGTGGAGAGGCTGCTCCAAATTCCAGAGGACTCTGTTCCTTGTAAGATCCGAGCCAAGCGTGGCTGTGCCACTCATCCTCGCAGCATTGCTGAAAGGG
AGAGAAGAACTAGAATCAGTGGAAAATTGAAGAAACTTCAAGAGCTTGTCCCCAACATGGATAAGCAAACAAGCTATGCAGACATGCTGGATTTGGCAGTGCAGCACATC
AAGGGCCTTCAAAATCAGATTCAGAAGCTTAACAAAGAAGTTGAAAATTGTAGCTGTGGATGCAAAGGATCCTCATAA
mRNA sequenceShow/hide mRNA sequence
CAAGGGTAAAACACATATGTCTCTTATGATATCCACATTATGTGGAAGCATCTACGTAATTTCATAGAATCATTCGATCGGAATGGTACACGAAAATGTACATACATAAA
AGAGCTAAAATGTAGAAGATGGTAACAAGAACGACGACTTGTAATTGAATATAATATATACAATTTTACAAAGAGACACGAGGGGGGCAAATGTGATTTTGGAGTTGGAG
AATGGTAATATGAAGGTGGAGTAAGGTTATCAAAGCCCTTAGGCTTTGCCTCAACAAGCAAAAGTATCTATTTTAATTTATAATTTTTATTATAAAATTATAAATTATAA
AGTCCTTTAATTTGTTATTGGTGTTTCTCTATTTCCTCCCTTCTCTCTCTCTCTTCGAAGTTCGTCCGCTCCTCTGACAAAAAACCGCTCCACTCCAATTCCGGCATATG
TACCAATCAACCTCTTCCTCTTCATCTTCTCAGAAATCCATGGGCCCCGCCGGAGGAGGCGGCGGTGGAGGCCTTACTCGCTACGGCTCCGCCCCTGGCTCTCTCCTGAC
CTCGGCCGTGGACTCTGTAATCGGCGGAGGGCGCCATCCGGACTCCACCGCGTCCCTCCGTACGCCGCCGCCGTCGTTCGGCGGCCATTACTTCTCCTCCGCCGACTCCT
CCACATCCAACGATCTCAAATCCTCCTCCTCCGGGGCCGCCGCCGCCGCTGCCCTGAATCGCTCCTACGGAATCCACGATTTAGCCCTAGGCGATTTCTCGATACCGAGA
AATTTCAATAGCAATGGCGGCCAATCGTCTTCCTCCTCGCCGTTGGTTCGCCAGAGAAGCTCGCCGGCGGGATTTCTAGGCCACTTCGCCGTCGCCGATGGAAACGGATC
AGGGTTTTCAGTGACAAGGGGAGGTGGGAACTATGGTTCACAAAATAATAACAATGGAGTTTGCAGGCTTAAGTCTCAGCTGAGCTTCAGTGGGCAGGATTCTCTGTCTC
AGATATCAGAAATGAGTGAGAGTGTGGTGGAGGGAGTCAACTCGAGCAATGCCAATGCCAATATCGCCATCACCCATTCCTTCGGAATCGACTCTTGGGACAGCGCCAAC
TCCAACTCCATTGTTTTCACTGCCCCTCACCCCAAACGCCCCCACCATTCTGATTCCGACTTCTTCAATTGCCTCGAGTCTCAGTTTAGCATGCCACAGACAAGCCTAGA
AATGGCAACTGTGGAGAGGCTGCTCCAAATTCCAGAGGACTCTGTTCCTTGTAAGATCCGAGCCAAGCGTGGCTGTGCCACTCATCCTCGCAGCATTGCTGAAAGGGAGA
GAAGAACTAGAATCAGTGGAAAATTGAAGAAACTTCAAGAGCTTGTCCCCAACATGGATAAGCAAACAAGCTATGCAGACATGCTGGATTTGGCAGTGCAGCACATCAAG
GGCCTTCAAAATCAGATTCAGAAGCTTAACAAAGAAGTTGAAAATTGTAGCTGTGGATGCAAAGGATCCTCATAACATATGGAATTTTGGGAGTTGGAGAATGATACACA
TCAGAATTTTGCCTTTAATTTCCTACTAAATTTGTTCTTTCAACTAACCCTCAAAAAAATTCAAAAAATTAAAATTAAAATTAAAAGGAAGAACAAAGAAATTAAAACCA
CAGGGCAAGTGTACATAAAAGAATTAGCTCTTCTCTAATTAATTCCCAAATTTCATTAGAATAATAAACTTATTCTTCTTCTTATTATTATTTTTTTCATGTAAAATAAT
TTTGAATATTTGTTTCTGGTTCCCCACTATCCTTGTTTCTTTGCTTTCTAATGTTAGAGAAACCCATATCTTTGACTCTCTTTTTTGTTAAATTTACATTCAATGTTGT
Protein sequenceShow/hide protein sequence
MYQSTSSSSSSQKSMGPAGGGGGGGLTRYGSAPGSLLTSAVDSVIGGGRHPDSTASLRTPPPSFGGHYFSSADSSTSNDLKSSSSGAAAAAALNRSYGIHDLALGDFSIP
RNFNSNGGQSSSSSPLVRQRSSPAGFLGHFAVADGNGSGFSVTRGGGNYGSQNNNNGVCRLKSQLSFSGQDSLSQISEMSESVVEGVNSSNANANIAITHSFGIDSWDSA
NSNSIVFTAPHPKRPHHSDSDFFNCLESQFSMPQTSLEMATVERLLQIPEDSVPCKIRAKRGCATHPRSIAERERRTRISGKLKKLQELVPNMDKQTSYADMLDLAVQHI
KGLQNQIQKLNKEVENCSCGCKGSS