; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh01G001080 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh01G001080
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptiontranscription factor bHLH68-like isoform X1
Genome locationCmo_Chr01:485292..488651
RNA-Seq ExpressionCmoCh01G001080
SyntenyCmoCh01G001080
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606759.1 Transcription factor basic helix-loop-helix 68, partial [Cucurbita argyrosperma subsp. sororia]3.1e-18998.24Show/hide
Query:  MMGGNPSNWWNMFPPNSL--PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPR
        MMGGNPSNWWNMFPPNSL  PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPR
Subjt:  MMGGNPSNWWNMFPPNSL--PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPR

Query:  FGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVC
        FGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVC
Subjt:  FGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVC

Query:  KKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKD-
        KKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKD 
Subjt:  KKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKD-

Query:  -EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
         EEAN LRSRGLCLVPVSCTQHVQSD+NGADYWAQAYNGSF
Subjt:  -EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

KAG7036474.1 Transcription factor bHLH68 [Cucurbita argyrosperma subsp. argyrosperma]5.9e-18897.09Show/hide
Query:  MMGGNPSNWWNMFPPNSL-----PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIP
        MMGGNPSNWWNMFPPNSL     PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIP
Subjt:  MMGGNPSNWWNMFPPNSL-----PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIP

Query:  FPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATG
        FPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPP TTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATG
Subjt:  FPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATG

Query:  GVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQE
        GVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQE
Subjt:  GVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQE

Query:  KD--EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        KD  EEAN LRSRGLCLVPVSCTQHVQSD+NGADYWAQAYNGSF
Subjt:  KD--EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

XP_022948766.1 transcription factor bHLH68-like isoform X1 [Cucurbita moschata]6.1e-193100Show/hide
Query:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
        MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
Subjt:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG

Query:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
        VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
Subjt:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK

Query:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
        PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
Subjt:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA

Query:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
Subjt:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

XP_022948768.1 transcription factor bHLH68-like isoform X2 [Cucurbita moschata]3.7e-19099.41Show/hide
Query:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
        MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
Subjt:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG

Query:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
        VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
Subjt:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK

Query:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
        PRVQPVSGQPPIK  VRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
Subjt:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA

Query:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
Subjt:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

XP_023524873.1 transcription factor bHLH68-like isoform X1 [Cucurbita pepo subsp. pepo]6.7e-18494.52Show/hide
Query:  MMGGNPSNWWNMFPPNSL----PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPF
        MMGGNPSNWWNMFPPNSL    PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKL+NLEGRILIPF
Subjt:  MMGGNPSNWWNMFPPNSL----PPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPF

Query:  PRFGVGDDD-GDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTR-----PPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVS
        PRFGVGDDD  DDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSIS+KGSQTR     PPPTTSSSPKSSVNSNAILEFSFN+LDSNNQFPDHTYSSECVS
Subjt:  PRFGVGDDD-GDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTR-----PPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVS

Query:  TAATGGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKL
        TAATGGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTD+LLFNDS+LKRKL
Subjt:  TAATGGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKL

Query:  PPTQEKDEEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        PPTQEK+EEAN LRSRGLCLVPVSCTQHVQSD+NGADYWAQAYNGSF
Subjt:  PPTQEKDEEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

TrEMBL top hitse value%identityAlignment
A0A6J1DKZ3 transcription factor bHLH68-like isoform X46.7e-11368.97Show/hide
Query:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEH---PNSQSWSQLLLGGLQE-GD-GDGLVLNSNYNHFQPKKLDNLEGRILIP
        MMGGNPSNWW+M PPNSL        PQFV+GSS LP +SMADH N  H   PNSQSWSQLL+GGLQE GD  + LVLNS  N+F+ KK + LEGRILIP
Subjt:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEH---PNSQSWSQLLLGGLQE-GD-GDGLVLNSNYNHFQPKKLDNLEGRILIP

Query:  FPRFGVGDDDGDDDHVLKQQTCSQS-GKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAAT
        FPRFGVG   G D  VLKQ++C +S  K+LSFLWN  E  SSSS +   S +    + S+S  S + SNAIL+FSF+K+DS NQ PDH YSSEC STAAT
Subjt:  FPRFGVGDDDGDDDHVLKQQTCSQS-GKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAAT

Query:  -GGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPT
         GGVCKK RVQP SGQPP+K  VRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFL +QIEALISPYLG NSS+PT+ +Q L ND++LKRKLPP 
Subjt:  -GGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPT

Query:  Q----EKDEEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        Q    +++E    LRSRGLCLVPVSCTQ VQSD+NGADYWAQAYNGSF
Subjt:  Q----EKDEEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

A0A6J1GA41 transcription factor bHLH68-like isoform X12.9e-193100Show/hide
Query:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
        MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
Subjt:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG

Query:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
        VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
Subjt:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK

Query:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
        PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
Subjt:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA

Query:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
Subjt:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

A0A6J1GA87 transcription factor bHLH68-like isoform X21.8e-19099.41Show/hide
Query:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
        MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
Subjt:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG

Query:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
        VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
Subjt:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK

Query:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
        PRVQPVSGQPPIK  VRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
Subjt:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA

Query:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
Subjt:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

A0A6J1K6J4 transcription factor bHLH68-like isoform X23.0e-17794.96Show/hide
Query:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
        MMGGNPSNWWNMFPPNSL  PPPPPPPQFVVGSSS PFTSMADHPNQEHPNSQSWSQLLLGGLQEG GDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
Subjt:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG

Query:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
        VGDDD DDDHVLKQQTCSQSGKSLSFLW+EKESCSSSSIS+KGSQTRPP  TSSSPKSSVNSNAILEFSFNKLDSNNQ PDHTYSSECVSTAATGGVCKK
Subjt:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK

Query:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
        PRVQPVSGQPPIK  VRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTR DQLLFNDS+LKRKLPPTQEK+EEA
Subjt:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA

Query:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        N LRSRGLCLVPVSCTQHVQSD+NGADYWAQAYNGSF
Subjt:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

A0A6J1K8Z1 transcription factor bHLH68-like isoform X14.9e-18095.55Show/hide
Query:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
        MMGGNPSNWWNMFPPNSL  PPPPPPPQFVVGSSS PFTSMADHPNQEHPNSQSWSQLLLGGLQEG GDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG
Subjt:  MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFG

Query:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK
        VGDDD DDDHVLKQQTCSQSGKSLSFLW+EKESCSSSSIS+KGSQTRPP  TSSSPKSSVNSNAILEFSFNKLDSNNQ PDHTYSSECVSTAATGGVCKK
Subjt:  VGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKK

Query:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA
        PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTR DQLLFNDS+LKRKLPPTQEK+EEA
Subjt:  PRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEA

Query:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
        N LRSRGLCLVPVSCTQHVQSD+NGADYWAQAYNGSF
Subjt:  NALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

SwissProt top hitse value%identityAlignment
Q7XHI5 Transcription factor bHLH1333.4e-3734.87Show/hide
Query:  GNPSNWWN-----MFPPNSLPPPPPPPP--------PQFVVGSSSLPFTSMADHPNQEHPNSQSW--------------SQLLLGGLQEGDGDGLVLNSN
        GNP NWWN     + PP  L    PP          P F    +S   +S +  P   +PN  SW              SQLLLGGL  G+ + + + ++
Subjt:  GNPSNWWN-----MFPPNSLPPPPPPPP--------PQFVVGSSSLPFTSMADHPNQEHPNSQSW--------------SQLLLGGLQEGDGDGLVLNSN

Query:  YNH------FQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSF
        ++H      +Q K++ N E                   + VL+ Q              ++ES +++S  +  S   PP   + S  + +N+N       
Subjt:  YNH------FQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSF

Query:  NKLDSNNQFPDHTYSSECVSTAATGG--VCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGN
           D+NN        SEC S+   G     KKP++Q  S Q  +K  VRKEK+G RI +LHQLVSPFGKTDTASVLSEAIGY+RFLH+QIEAL  PY G 
Subjt:  NKLDSNNQFPDHTYSSECVSTAATGG--VCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGN

Query:  NSSEPTRTDQL------------------LFNDSTLKRKLPPTQEKDEEANA-------LRSRGLCLVPVSCTQHVQSDVNGADYWAQAY
            P+R + +                  L N+  +KR +  +   ++++N        LRSRGLCLVP+SCT  V SD NGADYWA A+
Subjt:  NSSEPTRTDQL------------------LFNDSTLKRKLPPTQEKDEEANA-------LRSRGLCLVPVSCTQHVQSDVNGADYWAQAY

Q8GXT3 Transcription factor bHLH1233.6e-2347.48Show/hide
Query:  KKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDE
        K+ + +  S  P  K   RKEK+GDRI AL QLVSPFGKTD ASVLSEAI Y++FLH Q+ AL +PY+ + +S       L    S    +L  ++E D 
Subjt:  KKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDE

Query:  EANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
            LRSRGLCLVPVS T  V  D    D+W   + G+F
Subjt:  EANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

Q8S3D1 Transcription factor bHLH685.1e-4138.25Show/hide
Query:  MMGGNPSNWWN----MFPPNSL------PPPP--------------PPPPPQFV-----VGSSSLPFTSMADHPNQEH--------PNSQSWSQLLLGGL
        M  GNP NWWN    M PP  L      P PP              P P P F+       SSS    S+ ++PN           P S S SQLLLGGL
Subjt:  MMGGNPSNWWN----MFPPNSL------PPPP--------------PPPPPQFV-----VGSSSLPFTSMADHPNQEH--------PNSQSWSQLLLGGL

Query:  QEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKES-----CSSSSISMKGSQTRPPPTTSSSPKS
          G+ + L + +++NH   ++    +G+I +          +  ++ VL  Q  S     +    N   +      S +S   K   T    T+ +S   
Subjt:  QEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKES-----CSSSSISMKGSQTRPPPTTSSSPKS

Query:  SV-NSNAILEFSFNKLDSNNQFPDHT---YSSECVSTAATGGVCKKPRVQP-VSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRF
        ++ N+N +L+FS N    +     HT    SSEC S    G   KKPR+QP  S Q  +K  VRKEK+G RI ALHQLVSPFGKTDTASVLSEAIGY+RF
Subjt:  SV-NSNAILEFSFNKLDSNNQFPDHT---YSSECVSTAATGGVCKKPRVQP-VSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRF

Query:  LHNQIEALISPYLGNNSSEPTRTDQ---------------LLFNDSTLKRKLPPTQEKD------EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQA
        L +QIEAL  PY G  +S   R  Q                L ND  +KR+   +   D      E    LRSRGLCLVP+SCT  V SD NGADYWA A
Subjt:  LHNQIEALISPYLGNNSSEPTRTDQ---------------LLFNDSTLKRKLPPTQEKD------EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQA

Q9LT67 Transcription factor bHLH1131.1e-2251.24Show/hide
Query:  QVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEANALRSRGLCLVPVS
        +VRKE++G+RI AL QLVSP+GKTD ASVL EA+GY++FL +QI+ L SPYL N+S +      ++  D     K          A  LRSRGLCLVPVS
Subjt:  QVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEANALRSRGLCLVPVS

Query:  CTQHVQSDVNGADYWAQAYNG
         T HV++  NGAD+W+ A  G
Subjt:  CTQHVQSDVNGADYWAQAYNG

Q9SFZ3 Transcription factor bHLH1102.4e-2753.33Show/hide
Query:  VSTAATGGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKR
        ++T A     KKPRV+  S  PP K  VRKEK+GDRI AL QLVSPFGKTDTASVL EAIGY++FL +QIE L  PY+  + + P +  QL         
Subjt:  VSTAATGGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKR

Query:  KLPPTQEKD-EEANALRSRGLCLVPVSCTQHVQSD
         +  +QE D EE   LRSRGLCLVP+SC  +V  D
Subjt:  KLPPTQEKD-EEANALRSRGLCLVPVSCTQHVQSD

Arabidopsis top hitse value%identityAlignment
AT1G27660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.7e-2853.33Show/hide
Query:  VSTAATGGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKR
        ++T A     KKPRV+  S  PP K  VRKEK+GDRI AL QLVSPFGKTDTASVL EAIGY++FL +QIE L  PY+  + + P +  QL         
Subjt:  VSTAATGGVCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKR

Query:  KLPPTQEKD-EEANALRSRGLCLVPVSCTQHVQSD
         +  +QE D EE   LRSRGLCLVP+SC  +V  D
Subjt:  KLPPTQEKD-EEANALRSRGLCLVPVSCTQHVQSD

AT2G20100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.4e-3834.87Show/hide
Query:  GNPSNWWN-----MFPPNSLPPPPPPPP--------PQFVVGSSSLPFTSMADHPNQEHPNSQSW--------------SQLLLGGLQEGDGDGLVLNSN
        GNP NWWN     + PP  L    PP          P F    +S   +S +  P   +PN  SW              SQLLLGGL  G+ + + + ++
Subjt:  GNPSNWWN-----MFPPNSLPPPPPPPP--------PQFVVGSSSLPFTSMADHPNQEHPNSQSW--------------SQLLLGGLQEGDGDGLVLNSN

Query:  YNH------FQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSF
        ++H      +Q K++ N E                   + VL+ Q              ++ES +++S  +  S   PP   + S  + +N+N       
Subjt:  YNH------FQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSF

Query:  NKLDSNNQFPDHTYSSECVSTAATGG--VCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGN
           D+NN        SEC S+   G     KKP++Q  S Q  +K  VRKEK+G RI +LHQLVSPFGKTDTASVLSEAIGY+RFLH+QIEAL  PY G 
Subjt:  NKLDSNNQFPDHTYSSECVSTAATGG--VCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGN

Query:  NSSEPTRTDQL------------------LFNDSTLKRKLPPTQEKDEEANA-------LRSRGLCLVPVSCTQHVQSDVNGADYWAQAY
            P+R + +                  L N+  +KR +  +   ++++N        LRSRGLCLVP+SCT  V SD NGADYWA A+
Subjt:  NSSEPTRTDQL------------------LFNDSTLKRKLPPTQEKDEEANA-------LRSRGLCLVPVSCTQHVQSDVNGADYWAQAY

AT2G20100.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.3e-2735.1Show/hide
Query:  GNPSNWWN-----MFPPNSLPPPPPPPP--------PQFVVGSSSLPFTSMADHPNQEHPNSQSW--------------SQLLLGGLQEGDGDGLVLNSN
        GNP NWWN     + PP  L    PP          P F    +S   +S +  P   +PN  SW              SQLLLGGL  G+ + + + ++
Subjt:  GNPSNWWN-----MFPPNSLPPPPPPPP--------PQFVVGSSSLPFTSMADHPNQEHPNSQSW--------------SQLLLGGLQEGDGDGLVLNSN

Query:  YNH------FQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSF
        ++H      +Q K++ N E                   + VL+ Q              ++ES +++S  +  S   PP   + S  + +N+N       
Subjt:  YNH------FQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSF

Query:  NKLDSNNQFPDHTYSSECVSTAATGG--VCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGN
           D+NN        SEC S+   G     KKP++Q  S Q  +K  VRKEK+G RI +LHQLVSPFGKTDTASVLSEAIGY+RFLH+QIEAL  PY G 
Subjt:  NKLDSNNQFPDHTYSSECVSTAATGG--VCKKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGN

Query:  NS
         S
Subjt:  NS

AT3G20640.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.6e-2447.48Show/hide
Query:  KKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDE
        K+ + +  S  P  K   RKEK+GDRI AL QLVSPFGKTD ASVLSEAI Y++FLH Q+ AL +PY+ + +S       L    S    +L  ++E D 
Subjt:  KKPRVQPVSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDE

Query:  EANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF
            LRSRGLCLVPVS T  V  D    D+W   + G+F
Subjt:  EANALRSRGLCLVPVSCTQHVQSDVNGADYWAQAYNGSF

AT4G29100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein3.6e-4238.25Show/hide
Query:  MMGGNPSNWWN----MFPPNSL------PPPP--------------PPPPPQFV-----VGSSSLPFTSMADHPNQEH--------PNSQSWSQLLLGGL
        M  GNP NWWN    M PP  L      P PP              P P P F+       SSS    S+ ++PN           P S S SQLLLGGL
Subjt:  MMGGNPSNWWN----MFPPNSL------PPPP--------------PPPPPQFV-----VGSSSLPFTSMADHPNQEH--------PNSQSWSQLLLGGL

Query:  QEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKES-----CSSSSISMKGSQTRPPPTTSSSPKS
          G+ + L + +++NH   ++    +G+I +          +  ++ VL  Q  S     +    N   +      S +S   K   T    T+ +S   
Subjt:  QEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFGVGDDDGDDDHVLKQQTCSQSGKSLSFLWNEKES-----CSSSSISMKGSQTRPPPTTSSSPKS

Query:  SV-NSNAILEFSFNKLDSNNQFPDHT---YSSECVSTAATGGVCKKPRVQP-VSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRF
        ++ N+N +L+FS N    +     HT    SSEC S    G   KKPR+QP  S Q  +K  VRKEK+G RI ALHQLVSPFGKTDTASVLSEAIGY+RF
Subjt:  SV-NSNAILEFSFNKLDSNNQFPDHT---YSSECVSTAATGGVCKKPRVQP-VSGQPPIKNQVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRF

Query:  LHNQIEALISPYLGNNSSEPTRTDQ---------------LLFNDSTLKRKLPPTQEKD------EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQA
        L +QIEAL  PY G  +S   R  Q                L ND  +KR+   +   D      E    LRSRGLCLVP+SCT  V SD NGADYWA A
Subjt:  LHNQIEALISPYLGNNSSEPTRTDQ---------------LLFNDSTLKRKLPPTQEKD------EEANALRSRGLCLVPVSCTQHVQSDVNGADYWAQA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGGTGGAAACCCTAGTAATTGGTGGAACATGTTTCCACCTAATTCTCTTCCTCCTCCTCCTCCTCCTCCTCCTCCTCAGTTTGTTGTTGGATCTTCTTCGCTTCC
TTTCACTTCCATGGCTGATCATCCCAATCAAGAGCATCCCAATTCACAGTCATGGAGCCAACTACTTTTAGGTGGATTGCAAGAAGGGGATGGAGATGGGTTGGTTTTGA
ATAGTAATTATAATCATTTTCAACCAAAGAAGTTGGATAATTTGGAGGGGAGAATTTTGATTCCGTTTCCAAGATTTGGAGTGGGGGATGATGATGGTGATGATGATCAT
GTTTTGAAGCAACAAACTTGTTCTCAAAGTGGTAAGAGCTTATCATTTTTATGGAATGAAAAGGAATCATGTTCATCATCATCAATATCAATGAAAGGCTCTCAAACAAG
ACCACCACCAACTACTTCTTCTTCTCCTAAATCTTCTGTCAATAGTAATGCCATCTTGGAATTCTCTTTCAACAAACTTGATTCCAACAATCAATTCCCAGATCACACTT
ATTCATCTGAGTGTGTTAGCACAGCTGCCACTGGTGGAGTGTGCAAGAAGCCTAGGGTTCAGCCCGTCTCCGGCCAGCCTCCGATAAAGAATCAGGTGAGAAAGGAGAAG
GTAGGGGACAGAATCACAGCTCTCCACCAGCTGGTTTCTCCATTTGGAAAGACTGACACTGCTTCTGTCTTGTCAGAGGCTATTGGGTATGTTAGATTCCTTCACAATCA
AATTGAGGCTCTCATCTCTCCATATTTAGGCAATAATTCATCAGAACCCACAAGGACGGATCAACTTCTCTTCAACGACAGCACCCTGAAAAGAAAACTACCTCCTACCC
AGGAAAAGGATGAAGAAGCAAATGCGTTGAGGAGTAGAGGGCTTTGTTTGGTACCTGTATCTTGTACACAACATGTACAAAGTGACGTAAACGGAGCTGATTATTGGGCT
CAAGCTTACAATGGCAGCTTCTAA
mRNA sequenceShow/hide mRNA sequence
AATACATCACAAAGTGTTTGATATAATAAAGCAACTCACAAAGTAAAGTTGTTCCCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTTCTTTCCAAGTATTGTTTTTG
TTTCAGGTGGGTTTGATTCTCTTTCAAAGCATAAACTCAAGAAAAAGCTGATATTGTGAAGAGGGAATTTTTGTTTGAAAGCAAATGATGGGTGGAAACCCTAGTAATTG
GTGGAACATGTTTCCACCTAATTCTCTTCCTCCTCCTCCTCCTCCTCCTCCTCCTCAGTTTGTTGTTGGATCTTCTTCGCTTCCTTTCACTTCCATGGCTGATCATCCCA
ATCAAGAGCATCCCAATTCACAGTCATGGAGCCAACTACTTTTAGGTGGATTGCAAGAAGGGGATGGAGATGGGTTGGTTTTGAATAGTAATTATAATCATTTTCAACCA
AAGAAGTTGGATAATTTGGAGGGGAGAATTTTGATTCCGTTTCCAAGATTTGGAGTGGGGGATGATGATGGTGATGATGATCATGTTTTGAAGCAACAAACTTGTTCTCA
AAGTGGTAAGAGCTTATCATTTTTATGGAATGAAAAGGAATCATGTTCATCATCATCAATATCAATGAAAGGCTCTCAAACAAGACCACCACCAACTACTTCTTCTTCTC
CTAAATCTTCTGTCAATAGTAATGCCATCTTGGAATTCTCTTTCAACAAACTTGATTCCAACAATCAATTCCCAGATCACACTTATTCATCTGAGTGTGTTAGCACAGCT
GCCACTGGTGGAGTGTGCAAGAAGCCTAGGGTTCAGCCCGTCTCCGGCCAGCCTCCGATAAAGAATCAGGTGAGAAAGGAGAAGGTAGGGGACAGAATCACAGCTCTCCA
CCAGCTGGTTTCTCCATTTGGAAAGACTGACACTGCTTCTGTCTTGTCAGAGGCTATTGGGTATGTTAGATTCCTTCACAATCAAATTGAGGCTCTCATCTCTCCATATT
TAGGCAATAATTCATCAGAACCCACAAGGACGGATCAACTTCTCTTCAACGACAGCACCCTGAAAAGAAAACTACCTCCTACCCAGGAAAAGGATGAAGAAGCAAATGCG
TTGAGGAGTAGAGGGCTTTGTTTGGTACCTGTATCTTGTACACAACATGTACAAAGTGACGTAAACGGAGCTGATTATTGGGCTCAAGCTTACAATGGCAGCTTCTAAGT
CCCTCGCTATCTGCACCCATACAAATCAAACAACTATGCTAATTCCTGCTATCATTTTCTCTATATTTATTATTAGGAATTAATGCTAACTTTTTTCCCCTTCTTTTCAA
TAAACGTCCTTTAAATTCTTCTTCATTATCTTCCCAATTCCTGTGATAAAATTGTAGTAGCAGAACCAATTTAATTTTAGTCTCCATTTCCTTTGCTCATTGTACGTAAT
GATTGAATTATTTTCTTTCTTTACAG
Protein sequenceShow/hide protein sequence
MMGGNPSNWWNMFPPNSLPPPPPPPPPQFVVGSSSLPFTSMADHPNQEHPNSQSWSQLLLGGLQEGDGDGLVLNSNYNHFQPKKLDNLEGRILIPFPRFGVGDDDGDDDH
VLKQQTCSQSGKSLSFLWNEKESCSSSSISMKGSQTRPPPTTSSSPKSSVNSNAILEFSFNKLDSNNQFPDHTYSSECVSTAATGGVCKKPRVQPVSGQPPIKNQVRKEK
VGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLHNQIEALISPYLGNNSSEPTRTDQLLFNDSTLKRKLPPTQEKDEEANALRSRGLCLVPVSCTQHVQSDVNGADYWA
QAYNGSF