CuGenDBv2

Sgr029436 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029436
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: basic helix-loop-helix (bHLH) DNA-binding superfamily protein
Genome location: tig00153348:958339..960545
RNA-Seq Expression: Sgr029436
Synteny: Sgr029436
Gene Ontology terms: GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR036638 - Helix-loop-helix DNA-binding domain superfamily
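
The genome location in this record follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention), the span covered by the gene can be computed as below. The function name parse_location is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end, span)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so both endpoints are counted.
    return contig, start, end, end - start + 1

contig, start, end, span = parse_location("tig00153348:958339..960545")
print(contig, start, end, span)
```

Under that assumption the gene spans 2207 bp on contig tig00153348.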


Homology
GenBank top hits (e value, % identity, alignment)
KAG6584048.1 Transcription factor basic helix-loop-helix 123, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.4e-69; identity: 46.32%)
Query:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        M +EFP GICGR+  D+TK  F P  A  SPCS+                                   D TFLIDPTVRT S+ NQ  LGD D  R DT
Subjt:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD
        TL  +Y+ QET+  +E +N  SIF AD D V LM +   + +NFYD++ TTT QG  M +P   ISNGY  TLL +S DQPKQFLVNNR I         
Subjt:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD

Query:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKN
          +SC+PMFET     L F+ +                                                            LISC EVV+DS RV+ KN
Subjt:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKN

Query:  RSSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM--------QQVEEEL
        +S+E T IKR R+DMPS LPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQVRVLSTPYM++G+  Q+QQ         Q V+EEL
Subjt:  RSSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM--------QQVEEEL

Query:  KENKKEDLRSRGLCLVAISSN
        +ENK++DL SRGLCLV I S+
Subjt:  KENKKEDLRSRGLCLVAISSN

XP_023001290.1 transcription factor bHLH112-like isoform X1 [Cucurbita maxima] (e value: 3.1e-70; identity: 47.71%)
Query:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        M +EFP GICGR   D+TKT+F P  AN SPCS+                                  GD TFLIDPTVRT S+ NQ  LGD D  R DT
Subjt:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD
        TL  +YD QET+  +E +N  SIF AD D + LM Q   + +NFYD++ TTT QG  M +P   ISNGY  TLL +S DQPKQFLVNNR       TN+ 
Subjt:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD

Query:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR
          SSC+PMFET                                                    N T KL             +   EVV+DS RV+ KN+
Subjt:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR

Query:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN
        S+E T IKRPR+DM S LPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQVRVLSTPYM++G+  Q+QQ+      Q ++EEL+EN
Subjt:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN

Query:  KKEDLRSRGLCLVAI
        KK+DL SRGLCLV I
Subjt:  KKEDLRSRGLCLVAI

XP_023001291.1 transcription factor bHLH112-like isoform X2 [Cucurbita maxima] (e value: 1.4e-67; identity: 47.23%)
Query:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        M +EFP GICGR   D+TKT+F P  AN SPCS+                                  GD TFLIDPTVRT S+ NQ  LGD D  R DT
Subjt:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD
        TL  +YD QET+  +E +N  SIF AD D + LM Q   + +NFYD++ TTT QG  M +P   ISNGY  TLL +S DQPKQFLVNNR       TN+ 
Subjt:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD

Query:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR
          SSC+PMFET                                                    N T KL             +   EVV+DS RV+ KN+
Subjt:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR

Query:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN
        S+E T IKRPR+DM S LPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQ  VLSTPYM++G+  Q+QQ+      Q ++EEL+EN
Subjt:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN

Query:  KKEDLRSRGLCLVAI
        KK+DL SRGLCLV I
Subjt:  KKEDLRSRGLCLVAI

XP_023519613.1 transcription factor bHLH112-like [Cucurbita pepo subsp. pepo] (e value: 3.1e-70; identity: 47.46%)
Query:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        M +EFP GICGR+  D+TK  F P  A  SPCS+                                   D TFLIDPTVRT S+ NQ  LGD D  R DT
Subjt:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD
        TL  +YD QET+  +E +N  SIF AD D + LM Q   + +NFYD++ TTT QG  M +P   ISNGY  TLL +S DQPKQFLVNNR I         
Subjt:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD

Query:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKN
          +SCRPMFET     L F+ +                                                            LISC EVV+DS RV+ KN
Subjt:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKN

Query:  RSSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELKENKKEDL
        +S+E T IKR R+DMPS LPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQVRVLSTPYM++G+   D  +Q V+EEL+ENKK+DL
Subjt:  RSSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELKENKKEDL

Query:  RSRGLCLVAISSN
         SRGLCLV I S+
Subjt:  RSRGLCLVAISSN

XP_038895926.1 transcription factor bHLH112-like [Benincasa hispida] (e value: 1.1e-67; identity: 45.69%)
Query:  MAEEFPAGICGRRWSDSTKTVFMPVA-NRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        MA+EFPAG           T+F P A N SPCS+                                  G+ TFLID T+ + S+ NQ  LG  D  R D+
Subjt:  MAEEFPAGICGRRWSDSTKTVFMPVA-NRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLC--------SNYDRQETSGFDEQENHCSIFKAD---DFVLLMDQP-NLSSANFYD-SLTTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNR
        TL          NYD QE +   E++ +  IFK     D +  M+QP N +   FYD ++TTT QG  M FP  TIS GY  TLL +S DQPKQFLVNNR
Subjt:  TLC--------SNYDRQETSGFDEQENHCSIFKAD---DFVLLMDQP-NLSSANFYD-SLTTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNR

Query:  AIHCSSS-TNSDELSSCRPMFETHL--PLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC
         I+ S S TNS+EL SCRPMF+THL  P  L FSN +R                                   PKC  LTK               L+ C
Subjt:  AIHCSSS-TNSDELSSCRPMFETHL--PLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC

Query:  EVVQDSGRVQKKNRSSE-TLIKRPRSD-MPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM-
        + V+D  RV+++N S E  +IKRPRSD MPSPLPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQ+RVLSTPYM+I D+NQ+ ++ 
Subjt:  EVVQDSGRVQKKNRSSE-TLIKRPRSD-MPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM-

Query:  --QQVEEELKENKKEDLRSRGLCLVAISS
          +++++  KEN KEDLRSRGLCLV + S
Subjt:  --QQVEEELKENKKEDLRSRGLCLVAISS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3B4S8 transcription factor bHLH112-like isoform X1 (e value: 7.9e-64; identity: 50.14%)
Query:  GDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDTTLC--------SNYDRQE-TSGFDEQENHCSIFKA----DDFVLLMDQP-NLSSA-NFYDSL--TTTC
        G+ TFLID T+ +TS+ NQ+  G  D  R DTT+          NYD QE TSG + Q+N  S FK      D + LM+QP N+ +A NFY ++  TTT 
Subjt:  GDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDTTLC--------SNYDRQE-TSGFDEQENHCSIFKA----DDFVLLMDQP-NLSSA-NFYDSL--TTTC

Query:  QGSSMGFPVRTISNGYCSTLLQ-NSWDQPKQFLVNNRAI-HCSSSTNSDELSSCRPMFETHL--PLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSK
        QG  M FP   IS GY  TLL  +S +QP QF+VNNR I +C S TNSDEL SC PMF+THL  P  L FSN +                          
Subjt:  QGSSMGFPVRTISNGYCSTLLQ-NSWDQPKQFLVNNRAI-HCSSSTNSDELSSCRPMFETHL--PLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSK

Query:  SSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKNRSSE-TLIKRPRSD-MPSPLPTFKVRKEKLG----ALQQLVSPFGKTDT
                  PKCP                  +L+ C EVV+D  RV+++N S E  +IKRPRSD   SPLPTFKVRKEKLG    ALQQLVSPFGKTDT
Subjt:  SSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKNRSSE-TLIKRPRSD-MPSPLPTFKVRKEKLG----ALQQLVSPFGKTDT

Query:  ASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELK---ENKKEDLRSRGLCLVAISS
        ASVL EA EYIKFLH+Q+RVLSTPYM+IGD+NQ+ ++  +EEELK   EN KEDLRSRGLCLV I S
Subjt:  ASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELK---ENKKEDLRSRGLCLVAISS

A0A5A7UQK7 Transcription factor bHLH112-like isoform X1 (e value: 4.6e-64; identity: 50.14%)
Query:  GDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDTTLC--------SNYDRQE-TSGFDEQENHCSIFKA----DDFVLLMDQP-NLSSA-NFYDSL--TTTC
        G+ TFLID T+ +TS+ NQ+  G  D  R DTT+          NYD QE TSG + Q+N  S FK      D + LM+QP N+ +A NFY ++  TTT 
Subjt:  GDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDTTLC--------SNYDRQE-TSGFDEQENHCSIFKA----DDFVLLMDQP-NLSSA-NFYDSL--TTTC

Query:  QGSSMGFPVRTISNGYCSTLLQ-NSWDQPKQFLVNNRAI-HCSSSTNSDELSSCRPMFETHL--PLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSK
        QG  M FP   IS GY  TLL  +S +QP QF+VNNR I +C S TNSDEL SC PMF+THL  P  L FSN +                          
Subjt:  QGSSMGFPVRTISNGYCSTLLQ-NSWDQPKQFLVNNRAI-HCSSSTNSDELSSCRPMFETHL--PLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSK

Query:  SSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKNRSSE-TLIKRPRSD-MPSPLPTFKVRKEKLG----ALQQLVSPFGKTDT
                  PKCP                  +L+ C EVV+D  RV+++N S E  +IKRPRSD   SPLPTFKVRKEKLG    ALQQLVSPFGKTDT
Subjt:  SSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISC-EVVQDSGRVQKKNRSSE-TLIKRPRSD-MPSPLPTFKVRKEKLG----ALQQLVSPFGKTDT

Query:  ASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELK---ENKKEDLRSRGLCLVAISS
        ASVL EA EYIKFLH+Q+RVLSTPYM+IGD+NQ+ ++  +EEELK   EN KEDLRSRGLCLV I S
Subjt:  ASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELK---ENKKEDLRSRGLCLVAISS

A0A6J1EG97 transcription factor bHLH112-like (e value: 2.0e-67; identity: 45.95%)
Query:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        M +EFP GICGR+  D+TK  F P  A  S CS+                                   D TFLIDPTVRT S+ NQ  LGD D  R DT
Subjt:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD
        TL  +YD QET+  +E +N  SIF AD D + LM Q   + +NFYD++ TTT Q   M +P   ISNGY  TLL +S DQPKQFLVNNR I+        
Subjt:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD

Query:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR
          SSC PMFET                                                    N T KL             +   EVV+DS RV+ KN+
Subjt:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR

Query:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM--------QQVEEELK
        S+E T IKR R+DMPS LPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQVRVLSTPYM++G+  Q+Q+         Q V+EEL+
Subjt:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM--------QQVEEELK

Query:  ENKKEDLRSRGLCLVAISSN
        ENKK+DL SRGLCLV I S+
Subjt:  ENKKEDLRSRGLCLVAISSN

A0A6J1KI75 transcription factor bHLH112-like isoform X2 (e value: 6.9e-68; identity: 47.23%)
Query:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        M +EFP GICGR   D+TKT+F P  AN SPCS+                                  GD TFLIDPTVRT S+ NQ  LGD D  R DT
Subjt:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD
        TL  +YD QET+  +E +N  SIF AD D + LM Q   + +NFYD++ TTT QG  M +P   ISNGY  TLL +S DQPKQFLVNNR       TN+ 
Subjt:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD

Query:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR
          SSC+PMFET                                                    N T KL             +   EVV+DS RV+ KN+
Subjt:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR

Query:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN
        S+E T IKRPR+DM S LPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQ  VLSTPYM++G+  Q+QQ+      Q ++EEL+EN
Subjt:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN

Query:  KKEDLRSRGLCLVAI
        KK+DL SRGLCLV I
Subjt:  KKEDLRSRGLCLVAI

A0A6J1KKS6 transcription factor bHLH112-like isoform X1 (e value: 1.5e-70; identity: 47.71%)
Query:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT
        M +EFP GICGR   D+TKT+F P  AN SPCS+                                  GD TFLIDPTVRT S+ NQ  LGD D  R DT
Subjt:  MAEEFPAGICGRRWSDSTKTVFMP-VANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDT

Query:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD
        TL  +YD QET+  +E +N  SIF AD D + LM Q   + +NFYD++ TTT QG  M +P   ISNGY  TLL +S DQPKQFLVNNR       TN+ 
Subjt:  TLCSNYDRQETSGFDEQENHCSIFKAD-DFVLLMDQPNLSSANFYDSL-TTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSD

Query:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR
          SSC+PMFET                                                    N T KL             +   EVV+DS RV+ KN+
Subjt:  ELSSCRPMFETHLPLSLQFSNRVRPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNR

Query:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN
        S+E T IKRPR+DM S LPTFKVRKEKLG    ALQQLVSPFGKTDTASVL EA EYIKFLHDQVRVLSTPYM++G+  Q+QQ+      Q ++EEL+EN
Subjt:  SSE-TLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQM------QQVEEELKEN

Query:  KKEDLRSRGLCLVAI
        KK+DL SRGLCLV I
Subjt:  KKEDLRSRGLCLVAI

SwissProt top hits (e value, % identity, alignment)
Q8GXT3 Transcription factor bHLH123 (e value: 1.1e-25; identity: 36.75%)
Query:  SSANFYDSLTTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSDELSSCRPMF---------ETHLPLSLQFSNRVR----PIL
        +S+N   + TTT   SS G      + G+ S+  Q S +  +  L  ++    SS+ N D+++S  P           + H P  L+FSN          
Subjt:  SSANFYDSLTTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSDELSSCRPMF---------ETHLPLSLQFSNRVR----PIL

Query:  NNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNRSSETLIKRPRSDMPSPLPTFKVRKEKL
         NA A   H   S FFP+ Q       + + +PK  N+++                        S  V++     +   KR +S+  SP P FK RKEK+
Subjt:  NNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNRSSETLIKRPRSDMPSPLPTFKVRKEKL

Query:  G----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELKENKKEDLRSRGLCLVAISS
        G    ALQQLVSPFGKTD ASVL EA EYIKFLH QV  LS PYMK G + Q QQ      EL+ +++ DLRSRGLCLV +SS
Subjt:  G----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELKENKKEDLRSRGLCLVAISS

Q8VZ22 Transcription factor bHLH103 (e value: 2.0e-19; identity: 37.88%)
Query:  LHAFQSGFFPSSQSKSSPASTLEGKPKC-----------PNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNRSSETL-----IKRPRSDMPSPLP
        +++F  G F SS+    P      KP+            PN ++ + +    +P    Q  +C+ +    R  +K+   E +     +KRPR + PS  P
Subjt:  LHAFQSGFFPSSQSKSSPASTLEGKPKC-----------PNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNRSSETL-----IKRPRSDMPSPLP

Query:  TFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQV--RVLSTPYM-KIGDNNQDQQMQQVEEELKE---NKKEDLRSRGLCLVAISS
        +FKVRKEKLG    ALQQLVSPFGKTDTASVL +A +YIKFL +Q+  +V ++P++  IG   Q Q   +          + ++DLRSRGLCL+ ISS
Subjt:  TFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQV--RVLSTPYM-KIGDNNQDQQMQQVEEELKE---NKKEDLRSRGLCLVAISS

Q94JL3 Transcription factor bHLH112 (e value: 4.3e-27; identity: 40.52%)
Query:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE
        CSSS ++   SS    F    P    F +        P L+ A+    H   +    +S S ++ +          NL      + +T P    Q+IS  
Subjt:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE

Query:  VVQDSGRVQ--------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIG-DNN
        +   +  ++        K+ + +E+  K+PR   PSPLPTFKVRKE L     +LQQLVSPFGKTDTASVL EA EYIKFLHDQV VLSTPYMK G  N 
Subjt:  VVQDSGRVQ--------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIG-DNN

Query:  QDQQMQQVEEELKENKKEDLRSRGLCLVAISS
        Q QQ+    +   EN+  +LR  GLCLV ISS
Subjt:  QDQQMQQVEEELKENKKEDLRSRGLCLVAISS

Q9M0X8 Transcription factor bHLH114 (e value: 9.4e-22; identity: 53.97%)
Query:  QKKNRSSETLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMK-IGDNNQDQ---------QMQQ
        +  +  S  L+KRPR +  SPLP+FKVRKEKLG    ALQQLVSPFGKTDTASVL+EA EYIKFL +QV VLS P    IG   Q Q         Q + 
Subjt:  QKKNRSSETLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMK-IGDNNQDQ---------QMQQ

Query:  VEEELKENKKEDLRSRGLCLVAISSN
         E+E    +  DL SRGLCL+ IS++
Subjt:  VEEELKENKKEDLRSRGLCLVAISSN

Q9SFZ3 Transcription factor bHLH110 (e value: 5.5e-22; identity: 46.85%)
Query:  HILTRPLNHFQLISCEVVQDSGRVQK---KNRSSETLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLS
        H+ + P +  ++ S E     G+        ++ E   K+PR +  S  P FKVRKEKLG    ALQQLVSPFGKTDTASVL EA  YIKFL  Q+  LS
Subjt:  HILTRPLNHFQLISCEVVQDSGRVQK---KNRSSETLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLS

Query:  TPYMKIGDNNQDQQMQQV--EEELKENKKEDLRSRGLCLVAIS
         PYM+   N   +  Q V   +E  E +  DLRSRGLCLV +S
Subjt:  TPYMKIGDNNQDQQMQQV--EEELKENKKEDLRSRGLCLVAIS

Arabidopsis top hits (e value, % identity, alignment)
AT1G27660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 3.9e-23; identity: 46.85%)
Query:  HILTRPLNHFQLISCEVVQDSGRVQK---KNRSSETLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLS
        H+ + P +  ++ S E     G+        ++ E   K+PR +  S  P FKVRKEKLG    ALQQLVSPFGKTDTASVL EA  YIKFL  Q+  LS
Subjt:  HILTRPLNHFQLISCEVVQDSGRVQK---KNRSSETLIKRPRSDMPSPLPTFKVRKEKLG----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLS

Query:  TPYMKIGDNNQDQQMQQV--EEELKENKKEDLRSRGLCLVAIS
         PYM+   N   +  Q V   +E  E +  DLRSRGLCLV +S
Subjt:  TPYMKIGDNNQDQQMQQV--EEELKENKKEDLRSRGLCLVAIS

AT1G61660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 3.1e-28; identity: 40.52%)
Query:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE
        CSSS ++   SS    F    P    F +        P L+ A+    H   +    +S S ++ +          NL      + +T P    Q+IS  
Subjt:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE

Query:  VVQDSGRVQ--------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIG-DNN
        +   +  ++        K+ + +E+  K+PR   PSPLPTFKVRKE L     +LQQLVSPFGKTDTASVL EA EYIKFLHDQV VLSTPYMK G  N 
Subjt:  VVQDSGRVQ--------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIG-DNN

Query:  QDQQMQQVEEELKENKKEDLRSRGLCLVAISS
        Q QQ+    +   EN+  +LR  GLCLV ISS
Subjt:  QDQQMQQVEEELKENKKEDLRSRGLCLVAISS

AT1G61660.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 7.9e-24; identity: 40.29%)
Query:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE
        CSSS ++   SS    F    P    F +        P L+ A+    H   +    +S S ++ +          NL      + +T P    Q+IS  
Subjt:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE

Query:  VVQDSGRVQ--------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQ
        +   +  ++        K+ + +E+  K+PR   PSPLPTFKVRKE L     +LQQLVSPFGKTDTASVL EA EYIKFLHDQV VLSTPYMK G +NQ
Subjt:  VVQDSGRVQ--------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQ

Query:  DQQMQQ
         QQ  Q
Subjt:  DQQMQQ

AT1G61660.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 2.4e-28; identity: 41.3%)
Query:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE
        CSSS ++   SS    F    P    F +        P L+ A+    H   +    +S S ++ +          NL      + +T P    Q+IS  
Subjt:  CSSSTNSDELSSCRPMFETHLPLSLQFSNRVR-----PILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCE

Query:  VVQDSGRVQ------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIG-DNNQD
         ++D  + +      K+ + +E+  K+PR   PSPLPTFKVRKE L     +LQQLVSPFGKTDTASVL EA EYIKFLHDQV VLSTPYMK G  N Q 
Subjt:  VVQDSGRVQ------KKNRSSETLIKRPRSDMPSPLPTFKVRKEKL----GALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIG-DNNQD

Query:  QQMQQVEEELKENKKEDLRSRGLCLVAISS
        QQ+    +   EN+  +LR  GLCLV ISS
Subjt:  QQMQQVEEELKENKKEDLRSRGLCLVAISS

AT3G20640.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 7.6e-27; identity: 36.75%)
Query:  SSANFYDSLTTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSDELSSCRPMF---------ETHLPLSLQFSNRVR----PIL
        +S+N   + TTT   SS G      + G+ S+  Q S +  +  L  ++    SS+ N D+++S  P           + H P  L+FSN          
Subjt:  SSANFYDSLTTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSDELSSCRPMF---------ETHLPLSLQFSNRVR----PIL

Query:  NNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNRSSETLIKRPRSDMPSPLPTFKVRKEKL
         NA A   H   S FFP+ Q       + + +PK  N+++                        S  V++     +   KR +S+  SP P FK RKEK+
Subjt:  NNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNRSSETLIKRPRSDMPSPLPTFKVRKEKL

Query:  G----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELKENKKEDLRSRGLCLVAISS
        G    ALQQLVSPFGKTD ASVL EA EYIKFLH QV  LS PYMK G + Q QQ      EL+ +++ DLRSRGLCLV +SS
Subjt:  G----ALQQLVSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELKENKKEDLRSRGLCLVAISS


Sequences
CDS sequence
ATGGCCGAGGAATTTCCAGCCGGGATTTGCGGCCGGAGGTGGTCGGATTCAACCAAAACCGTGTTCATGCCGGTGGCCAATCGGTCGCCGTGTTCCATGGAGCTTATTGA
TATGATCAGAAGCAACAGTTGTCCGACTGGTTTCGTAGACAGGAACGTCAAGTTCTGTAACGAGAAAGCTCTTGACTCGGATTCTGTAAGCGGTGATGGCACCTTCTTGA
TTGATCCCACCGTTCGAACGACTTCTGATTGCAACCAAACTTTGCTTGGAGATGGTGATACTGGAAGAGGTGACACCACTTTGTGTTCAAATTATGATCGCCAAGAAACT
AGTGGATTTGATGAGCAAGAAAATCATTGTTCCATTTTCAAGGCCGACGACTTTGTGCTTTTAATGGACCAACCTAATCTAAGCTCTGCCAACTTTTACGATAGTCTTAC
AACTACATGCCAAGGGTCGTCCATGGGTTTCCCTGTTAGGACCATTTCCAATGGGTACTGTTCAACGTTGTTGCAGAACTCATGGGATCAACCAAAACAATTTCTTGTCA
ACAACCGGGCCATTCATTGTAGTTCGTCCACAAATTCAGACGAGTTATCGTCTTGTCGGCCTATGTTTGAAACCCACCTGCCACTCTCATTGCAGTTCTCCAACAGAGTT
AGACCCATTTTGAATAATGCCTCAGCAGCAGCACTCCATGCCTTTCAATCTGGTTTCTTCCCTTCGTCTCAATCAAAGTCTTCTCCAGCATCGACATTAGAAGGGAAACC
CAAATGCCCAAACCTTACAAAAAAGTTGCAAATTCATATTTTGACTCGACCCTTGAACCATTTTCAGCTAATTAGTTGTGAAGTTGTTCAAGACTCGGGAAGGGTTCAGA
AGAAAAACAGAAGTAGTGAAACATTAATCAAAAGGCCAAGAAGTGACATGCCATCTCCATTACCCACTTTTAAGGTTCGAAAAGAGAAGCTAGGCGCGCTCCAACAACTC
GTTTCACCTTTTGGAAAGACCGATACTGCATCAGTTCTAGATGAAGCTAATGAGTACATCAAGTTTCTTCACGATCAAGTCCGCGTTTTGAGTACTCCATATATGAAAAT
CGGAGACAACAATCAAGACCAACAGATGCAGCAGGTTGAGGAAGAATTGAAGGAGAATAAGAAAGAAGATCTAAGAAGTCGAGGACTTTGTCTGGTAGCAATTTCAAGTA
ATGGAACTGAAATTTTCCTCAATTATAAGCTACATGAAAAGGCAGCCAAATGGGTGATGCTCCAATTCGCCATGGCTGTAGCAGTAGTCTTTGGTCAGCGCCAGAGAGAA
AGAGAAAGAGAGAGGAAGACGAGTGAGAAAGATGGGCATTGTTGGGCCAACAAACTGTAG
mRNA sequence
ATGGCCGAGGAATTTCCAGCCGGGATTTGCGGCCGGAGGTGGTCGGATTCAACCAAAACCGTGTTCATGCCGGTGGCCAATCGGTCGCCGTGTTCCATGGAGCTTATTGA
TATGATCAGAAGCAACAGTTGTCCGACTGGTTTCGTAGACAGGAACGTCAAGTTCTGTAACGAGAAAGCTCTTGACTCGGATTCTGTAAGCGGTGATGGCACCTTCTTGA
TTGATCCCACCGTTCGAACGACTTCTGATTGCAACCAAACTTTGCTTGGAGATGGTGATACTGGAAGAGGTGACACCACTTTGTGTTCAAATTATGATCGCCAAGAAACT
AGTGGATTTGATGAGCAAGAAAATCATTGTTCCATTTTCAAGGCCGACGACTTTGTGCTTTTAATGGACCAACCTAATCTAAGCTCTGCCAACTTTTACGATAGTCTTAC
AACTACATGCCAAGGGTCGTCCATGGGTTTCCCTGTTAGGACCATTTCCAATGGGTACTGTTCAACGTTGTTGCAGAACTCATGGGATCAACCAAAACAATTTCTTGTCA
ACAACCGGGCCATTCATTGTAGTTCGTCCACAAATTCAGACGAGTTATCGTCTTGTCGGCCTATGTTTGAAACCCACCTGCCACTCTCATTGCAGTTCTCCAACAGAGTT
AGACCCATTTTGAATAATGCCTCAGCAGCAGCACTCCATGCCTTTCAATCTGGTTTCTTCCCTTCGTCTCAATCAAAGTCTTCTCCAGCATCGACATTAGAAGGGAAACC
CAAATGCCCAAACCTTACAAAAAAGTTGCAAATTCATATTTTGACTCGACCCTTGAACCATTTTCAGCTAATTAGTTGTGAAGTTGTTCAAGACTCGGGAAGGGTTCAGA
AGAAAAACAGAAGTAGTGAAACATTAATCAAAAGGCCAAGAAGTGACATGCCATCTCCATTACCCACTTTTAAGGTTCGAAAAGAGAAGCTAGGCGCGCTCCAACAACTC
GTTTCACCTTTTGGAAAGACCGATACTGCATCAGTTCTAGATGAAGCTAATGAGTACATCAAGTTTCTTCACGATCAAGTCCGCGTTTTGAGTACTCCATATATGAAAAT
CGGAGACAACAATCAAGACCAACAGATGCAGCAGGTTGAGGAAGAATTGAAGGAGAATAAGAAAGAAGATCTAAGAAGTCGAGGACTTTGTCTGGTAGCAATTTCAAGTA
ATGGAACTGAAATTTTCCTCAATTATAAGCTACATGAAAAGGCAGCCAAATGGGTGATGCTCCAATTCGCCATGGCTGTAGCAGTAGTCTTTGGTCAGCGCCAGAGAGAA
AGAGAAAGAGAGAGGAAGACGAGTGAGAAAGATGGGCATTGTTGGGCCAACAAACTGTAG
Protein sequence
MAEEFPAGICGRRWSDSTKTVFMPVANRSPCSMELIDMIRSNSCPTGFVDRNVKFCNEKALDSDSVSGDGTFLIDPTVRTTSDCNQTLLGDGDTGRGDTTLCSNYDRQET
SGFDEQENHCSIFKADDFVLLMDQPNLSSANFYDSLTTTCQGSSMGFPVRTISNGYCSTLLQNSWDQPKQFLVNNRAIHCSSSTNSDELSSCRPMFETHLPLSLQFSNRV
RPILNNASAAALHAFQSGFFPSSQSKSSPASTLEGKPKCPNLTKKLQIHILTRPLNHFQLISCEVVQDSGRVQKKNRSSETLIKRPRSDMPSPLPTFKVRKEKLGALQQL
VSPFGKTDTASVLDEANEYIKFLHDQVRVLSTPYMKIGDNNQDQQMQQVEEELKENKKEDLRSRGLCLVAISSNGTEIFLNYKLHEKAAKWVMLQFAMAVAVVFGQRQRE
RERERKTSEKDGHCWANKL
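
The CDS and protein sequences listed above are related by codon translation. The following is a minimal sketch of that relationship, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the helper name translate is hypothetical, and the two short inputs are the first 21 and last 9 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G
# at each position with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGCCGAGGAATTTCCAGCC"))  # start of the CDS -> MAEEFPA
print(translate("AAACTGTAG"))              # end of the CDS   -> KL*
```

The first call reproduces the opening residues of the protein sequence (MAEEFPA) and the second its final residues followed by the stop codon, which the protein listing omits.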