; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh07G007100 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh07G007100
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Descriptiontranscription factor SPATULA
Genome locationCmo_Chr07:3207130..3209315
RNA-Seq ExpressionCmoCh07G007100
SyntenyCmoCh07G007100
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR031066 - Basic helix-loop-helix (bHLH) transcription factors ALC-like, plant
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595026.1 Transcription factor SPATULA, partial [Cucurbita argyrosperma subsp. sororia]6.8e-21596.05Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
        MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDL+SNVRPITP+TQNIPPSYT PNALPADRISAVDSSEQFANSSSSAGLHDPFR
Subjt:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR

Query:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT
        SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNK            T
Subjt:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT

Query:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
        DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
Subjt:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV

Query:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
        PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGR TE
Subjt:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE

Query:  TTGPQ
        TTGPQ
Subjt:  TTGPQ

KAG7027051.1 putative serine/threonine-protein kinase, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-16684.76Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
        MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDL+SNVRPITP+TQNIPPSYT PNALPADRISAVDSSEQFANSSSSAGLHDPFR
Subjt:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR

Query:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT
        SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEK                                  T
Subjt:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT

Query:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
        DKASMLDEAIEYLKQLQLQV                    QYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
Subjt:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV

Query:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSG
        PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSG
Subjt:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSG

XP_022962654.1 uncharacterized protein LOC111463073 [Cucurbita moschata]3.6e-21697.03Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
        MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
Subjt:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR

Query:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT
        SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNK            T
Subjt:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT

Query:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
        DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
Subjt:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV

Query:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
        PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
Subjt:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE

Query:  TTGP
        TTGP
Subjt:  TTGP

XP_023003972.1 uncharacterized protein LOC111497422 [Cucurbita maxima]1.3e-21094.55Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
        MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSN RPITPL QNIPPSYT PNALPA RISAVDSSEQFANS+SSAGLHDPFR
Subjt:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR

Query:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT
        SCPTPTPPNASS SVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNK            T
Subjt:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT

Query:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
        DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
Subjt:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV

Query:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
        PPVQ HLLPFY  ESSKRLKEIGTDV+RDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
Subjt:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE

Query:  TTGP
        TTGP
Subjt:  TTGP

XP_023518121.1 uncharacterized protein LOC111781667 [Cucurbita pepo subsp. pepo]9.8e-21496.04Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
        MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDL+SNVRPITPLTQNIPPSYT PNALPA RISAVDSSEQFANSSSSAGLHDPFR
Subjt:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR

Query:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT
        SCPTPTPPNASSASVGASD+HENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNK            T
Subjt:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT

Query:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
        DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
Subjt:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV

Query:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
        PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
Subjt:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE

Query:  TTGP
        TTGP
Subjt:  TTGP

TrEMBL top hitse value%identityAlignment
A0A0A0KLD3 BHLH domain-containing protein6.8e-15273.33Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVS--------SDEISLFLQHILPRS------------SPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAV
        MDHS+ N CD NAC SSTWTLPA +         D+ISLFLQ IL RS            SPSIFS+L+ N+R  TP T N+PP Y  PNA+P D ISAV
Subjt:  MDHSISNKCDPNACISSTWTLPAVS--------SDEISLFLQHILPRS------------SPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAV

Query:  DSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN
        DSSEQFANS SS  LHDP R+ PT  PPNASS SVGASDH+ENDEFDCESEEGLEAL+EE+P+KP PRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN
Subjt:  DSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN

Query:  LIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAAS
        LIPNSNK            TDKASMLDEAIEYLKQLQLQVQMLSM +GLSLHPMNLPG+LQY QLSHMRMDF EENRS+SSDQERPN++FLSLPDQKAAS
Subjt:  LIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAAS

Query:  IHPFMPDIGRSN-AETPFELVPPVQAHLLPFYLSESSKRLKEIGTD-VRRDHQ---VNATQSETTPFVSLPYNI-AAPDLQELQNCVSVEPCIIEGNRSG
        IHPFM DIGR+N AETPFELVPP+QAHL+PFYLSESSK  +    D +R DHQ   VN +QSETTP VS PYNI   PDLQ LQN +SVEPCIIEGNRSG
Subjt:  IHPFMPDIGRSN-AETPFELVPPVQAHLLPFYLSESSKRLKEIGTD-VRRDHQ---VNATQSETTPFVSLPYNI-AAPDLQELQNCVSVEPCIIEGNRSG

Query:  VVLN-TPEHSHVFPSLFNGI
        V+LN TPEHS VFPS FNGI
Subjt:  VVLN-TPEHSHVFPSLFNGI

A0A1S3B0R8 transcription factor SPATULA1.0e-14771.43Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSS--------DEISLFLQHILPRS------------SPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAV
        MDHSI N CD NAC SSTWTLPA ++        D+ISLFLQ I+ RS            SPSIFS+L+ N+R  TP T NIPP Y   NA+P D ISAV
Subjt:  MDHSISNKCDPNACISSTWTLPAVSS--------DEISLFLQHILPRS------------SPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAV

Query:  DSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN
        DSSEQFANS SS  LHDP R CPT  PPNASS SVGASDH+ENDEFDCESEEGLEAL+EE+P+KP PRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN
Subjt:  DSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN

Query:  LIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAAS
        LIPNSNK            TDKASMLDEAIEYLKQLQLQVQMLSM +GLSLHPMN PG+LQY QLSHMRMDF EENRS+SSDQ+RPN++FLSLPDQK A 
Subjt:  LIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAAS

Query:  IHPFMPDIGRSN-AETPFELVPPVQAHLLPFYLSESSKRLKEIGTD-VRRDHQ---VNATQSETTPFVSLPYNI-AAPDLQELQNCVSVEPCIIEGNRSG
        IHPFM DIGR+N AETPF+LVPP+QAHL+PFYLSESSK  ++   D +R DHQ   VN +QSE TP  S PYNI   PDLQ LQN VSVEPCIIE NRS 
Subjt:  IHPFMPDIGRSN-AETPFELVPPVQAHLLPFYLSESSKRLKEIGTD-VRRDHQ---VNATQSETTPFVSLPYNI-AAPDLQELQNCVSVEPCIIEGNRSG

Query:  VVLN-TPEHSHVFPSLFNGI
        V+LN TPEHS V+PS FNGI
Subjt:  VVLN-TPEHSHVFPSLFNGI

A0A5D3CR49 Transcription factor SPATULA3.8e-14771.36Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSS--------DEISLFLQHILPRS------------SPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAV
        MDHSI N CD NAC SSTWTLPA ++        D+ISLFLQ I+ RS            SPSIFS+L+ N+R  TP T NIPP Y   NA+P D ISAV
Subjt:  MDHSISNKCDPNACISSTWTLPAVSS--------DEISLFLQHILPRS------------SPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAV

Query:  DSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN
        DSSEQFANS SS  LHDP R CPT  PPNASS SVGASDH+ENDEFDCESEEGLEAL+EE+P+KP PRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN
Subjt:  DSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQN

Query:  LIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAAS
        LIPNSNK            TDKASMLDEAIEYLKQLQLQVQMLSM +GLSLHPMN PG+LQY QLSHMRMDF EENRS+SSDQ+RPN++FLSLPDQK A 
Subjt:  LIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAAS

Query:  IHPFMPDIGRSN-AETPFELVPPVQAHLLPFYLSESSKRLKEIGTD-VRRDHQ---VNATQSETTPFVSLPYNI-AAPDLQELQNCVSVEPCIIEGNRSG
        IHPFM DIGR+N AETPF+LVPP+QAHL+PFYLSESSK  ++   D +R DHQ   VN +QSE TP  S PYNI   PDLQ LQN VSVEPCIIE NRS 
Subjt:  IHPFMPDIGRSN-AETPFELVPPVQAHLLPFYLSESSKRLKEIGTD-VRRDHQ---VNATQSETTPFVSLPYNI-AAPDLQELQNCVSVEPCIIEGNRSG

Query:  VVLN-TPEHSHVFPSLFNG
        V+LN TPEHS V+PS FNG
Subjt:  VVLN-TPEHSHVFPSLFNG

A0A6J1HHP8 uncharacterized protein LOC1114630731.7e-21697.03Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
        MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
Subjt:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR

Query:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT
        SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNK            T
Subjt:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT

Query:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
        DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
Subjt:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV

Query:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
        PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
Subjt:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE

Query:  TTGP
        TTGP
Subjt:  TTGP

A0A6J1KP34 uncharacterized protein LOC1114974226.4e-21194.55Show/hide
Query:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR
        MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSN RPITPL QNIPPSYT PNALPA RISAVDSSEQFANS+SSAGLHDPFR
Subjt:  MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFR

Query:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT
        SCPTPTPPNASS SVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNK            T
Subjt:  SCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLT

Query:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
        DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV
Subjt:  DKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELV

Query:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
        PPVQ HLLPFY  ESSKRLKEIGTDV+RDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE
Subjt:  PPVQAHLLPFYLSESSKRLKEIGTDVRRDHQVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTE

Query:  TTGP
        TTGP
Subjt:  TTGP

SwissProt top hitse value%identityAlignment
Q0JNI9 Transcription factor PHYTOCHROME INTERACTING FACTOR-LIKE 151.4e-2163.64Show/hide
Query:  SSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHM
        S+KRSR AEVHNLSE+RRR RINEKM+ALQ LIPN NK             DKASMLDEAIEYLK LQLQVQM+SM  GL + PM LP  +Q+ Q+  M
Subjt:  SSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHM

Q6AT90 Transcription factor APG1.0e-1943.81Show/hide
Query:  LTQNIPPSYT-APNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRA
        + Q +PPS   AP+A P        ++   A +SS    +   R        N  SA   AS     DE D + E  L  +     ++     SSKRSR 
Subjt:  LTQNIPPSYT-APNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRA

Query:  AEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRM
        AEVHNLSE+RRR RINEKM+ALQ LIPN NK             DKASML+EAIEYLK LQLQVQM+SM  G+ + PM LP      Q  HM+M
Subjt:  AEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQYRQLSHMRM

Q8GZM7 Transcription factor PIF12.2e-1936.71Show/hide
Query:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP
        VR  T ++ +  PS  A  +    R    DSS      + +        + P       SS+ V  S         D  +  E +  + +  E+  EE  
Subjt:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP

Query:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY
              +S+KRSRAAEVHNLSE++RR RINE+MKALQ LIP  NK            +DKASMLDEAIEY+K LQLQ+QM+SM  G  + PM  PG  QY
Subjt:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY

Query:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA
          + HM M        +  +Q  P   F+  P+  AA
Subjt:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA

Q9FHA2 Transcription factor ALC7.7e-2039.74Show/hide
Query:  PAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHH
        P  SSDE+S FL+ IL R                TP  Q        P++ P  + + V S+E F  S S   +               SS   G S+  
Subjt:  PAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHH

Query:  ENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQ
        ++           +   E   S    R+S KR+  A+ HNLSEK+RRS+INEKMKALQ LIPNSNK            TDKASMLDEAIEYLKQLQLQVQ
Subjt:  ENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQ

Query:  MLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEE
         L++ +GL L+PM LP   Q    +H R++ T E
Subjt:  MLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEE

Q9FUA4 Transcription factor SPATULA1.5e-3942.99Show/hide
Query:  PAVSSDEISLFLQHILPRSS--PSIFSDLSSNVRP---------------ITPLTQNIPPSYTAPNALPADRISAV-----DSSEQFANSSSSAGLHDPF
        P  SSDEIS FL+HI  RSS  PS +S  ++                      L  + PPS +   +      S V       S       S  G ++  
Subjt:  PAVSSDEISLFLQHILPRSS--PSIFSDLSSNVRP---------------ITPLTQNIPPSYTAPNALPADRISAV-----DSSEQFANSSSSAGLHDPF

Query:  RSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSK---PYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFN
        +   + T    SS+SVGAS  +E DE+DCESEEG EA+++E PS    P  RSSSKR RAAEVHNLSEKRRRSRINEKMKALQ+LIPNSNK         
Subjt:  RSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSK---PYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFN

Query:  CRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPG-TLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAET
           TDKASMLDEAIEYLKQLQLQVQML+M +G++LHP+ LPG TL   QLS +R              E  N   L+  +Q A++ +   P++  + A +
Subjt:  CRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPG-TLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAET

Query:  PFELVPPVQAHLLPFYLSESSKRLKEIG
         + L P +++H  PF L  S   +   G
Subjt:  PFELVPPVQAHLLPFYLSESSKRLKEIG

Arabidopsis top hitse value%identityAlignment
AT2G20180.1 phytochrome interacting factor 3-like 51.6e-2036.71Show/hide
Query:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP
        VR  T ++ +  PS  A  +    R    DSS      + +        + P       SS+ V  S         D  +  E +  + +  E+  EE  
Subjt:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP

Query:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY
              +S+KRSRAAEVHNLSE++RR RINE+MKALQ LIP  NK            +DKASMLDEAIEY+K LQLQ+QM+SM  G  + PM  PG  QY
Subjt:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY

Query:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA
          + HM M        +  +Q  P   F+  P+  AA
Subjt:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA

AT2G20180.2 phytochrome interacting factor 3-like 51.6e-2036.71Show/hide
Query:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP
        VR  T ++ +  PS  A  +    R    DSS      + +        + P       SS+ V  S         D  +  E +  + +  E+  EE  
Subjt:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP

Query:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY
              +S+KRSRAAEVHNLSE++RR RINE+MKALQ LIP  NK            +DKASMLDEAIEY+K LQLQ+QM+SM  G  + PM  PG  QY
Subjt:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY

Query:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA
          + HM M        +  +Q  P   F+  P+  AA
Subjt:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA

AT2G20180.3 phytochrome interacting factor 3-like 51.6e-2036.71Show/hide
Query:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP
        VR  T ++ +  PS  A  +    R    DSS      + +        + P       SS+ V  S         D  +  E +  + +  E+  EE  
Subjt:  VRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGAS---------DHHENDEFDCESEEGLEALIEEMP

Query:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY
              +S+KRSRAAEVHNLSE++RR RINE+MKALQ LIP  NK            +DKASMLDEAIEY+K LQLQ+QM+SM  G  + PM  PG  QY
Subjt:  SKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPGTLQY

Query:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA
          + HM M        +  +Q  P   F+  P+  AA
Subjt:  RQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAA

AT4G36930.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein1.1e-4042.99Show/hide
Query:  PAVSSDEISLFLQHILPRSS--PSIFSDLSSNVRP---------------ITPLTQNIPPSYTAPNALPADRISAV-----DSSEQFANSSSSAGLHDPF
        P  SSDEIS FL+HI  RSS  PS +S  ++                      L  + PPS +   +      S V       S       S  G ++  
Subjt:  PAVSSDEISLFLQHILPRSS--PSIFSDLSSNVRP---------------ITPLTQNIPPSYTAPNALPADRISAV-----DSSEQFANSSSSAGLHDPF

Query:  RSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSK---PYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFN
        +   + T    SS+SVGAS  +E DE+DCESEEG EA+++E PS    P  RSSSKR RAAEVHNLSEKRRRSRINEKMKALQ+LIPNSNK         
Subjt:  RSCPTPTPPNASSASVGASDHHENDEFDCESEEGLEALIEEMPSK---PYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFN

Query:  CRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPG-TLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAET
           TDKASMLDEAIEYLKQLQLQVQML+M +G++LHP+ LPG TL   QLS +R              E  N   L+  +Q A++ +   P++  + A +
Subjt:  CRLTDKASMLDEAIEYLKQLQLQVQMLSMNHGLSLHPMNLPG-TLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAET

Query:  PFELVPPVQAHLLPFYLSESSKRLKEIG
         + L P +++H  PF L  S   +   G
Subjt:  PFELVPPVQAHLLPFYLSESSKRLKEIG

AT5G67110.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein5.5e-2139.74Show/hide
Query:  PAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHH
        P  SSDE+S FL+ IL R                TP  Q        P++ P  + + V S+E F  S S   +               SS   G S+  
Subjt:  PAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNASSASVGASDHH

Query:  ENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQ
        ++           +   E   S    R+S KR+  A+ HNLSEK+RRS+INEKMKALQ LIPNSNK            TDKASMLDEAIEYLKQLQLQVQ
Subjt:  ENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQVQ

Query:  MLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEE
         L++ +GL L+PM LP   Q    +H R++ T E
Subjt:  MLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCATTCCATTTCCAACAAATGCGATCCAAATGCATGTATTTCTTCCACGTGGACTCTCCCCGCCGTCTCCTCCGACGAAATCTCTCTCTTTCTGCAACACATTTT
GCCACGTTCCTCTCCCTCCATCTTCTCCGACCTCTCCTCCAACGTTAGACCCATCACCCCCCTAACTCAAAACATTCCCCCTTCCTACACTGCGCCTAATGCACTACCCG
CCGACCGGATCTCCGCCGTCGATTCCTCTGAACAGTTCGCGAATTCCTCATCATCCGCTGGATTACATGATCCCTTCCGCTCTTGTCCCACACCCACACCTCCAAATGCG
TCCTCTGCATCAGTTGGGGCTAGTGATCACCACGAAAATGATGAGTTTGACTGCGAAAGTGAGGAGGGTCTGGAGGCATTGATTGAAGAGATGCCTTCAAAGCCTTATCC
ACGCAGTTCTTCCAAGAGAAGCAGAGCCGCTGAAGTTCATAATCTCTCCGAAAAGAGAAGGAGGAGCAGAATTAATGAGAAAATGAAGGCCCTACAAAATCTGATCCCCA
ATTCTAACAAGGCAACACAATTCATCTCAAAATTTAATTGCCGTTTGACAGACAAGGCCTCGATGCTGGACGAGGCAATTGAATATCTGAAGCAGCTTCAACTTCAAGTT
CAGATGCTGTCAATGAATCATGGCTTGAGTTTGCATCCTATGAACTTGCCTGGAACTTTGCAGTATCGCCAGCTTTCCCACATGAGAATGGACTTTACTGAAGAAAACAG
ATCACTTTCCTCTGACCAAGAAAGACCAAACCGGGTTTTTCTCAGCTTGCCTGACCAAAAGGCTGCCTCCATCCATCCTTTCATGCCAGACATTGGAAGAAGTAATGCAG
AAACTCCATTCGAATTGGTCCCTCCCGTCCAGGCTCATCTACTCCCCTTTTACTTGAGTGAATCCTCCAAGCGTTTGAAGGAGATTGGCACAGATGTAAGGCGAGATCAC
CAAGTGAATGCGACTCAATCGGAGACGACTCCCTTTGTGTCATTACCCTATAACATAGCAGCGCCTGATTTGCAAGAATTGCAGAATTGCGTTTCAGTGGAACCGTGCAT
CATTGAAGGCAACCGGTCAGGTGTGGTTCTAAACACTCCAGAGCACAGCCATGTATTCCCTTCCCTTTTCAATGGCATTGACACAGGAAGACCAACTGAAACAACAGGGC
CGCAATGA
mRNA sequenceShow/hide mRNA sequence
GAGAGATTAGAGGAGCGGAGACGACGATGCTGAGATACAGAATACAGTGTCATTTCAGAAGTAATTTAAAATGAGAAGTTAGAACAACACTCCTCTGCTTCGCTCTAAAA
TGGATCATTCCATTTCCAACAAATGCGATCCAAATGCATGTATTTCTTCCACGTGGACTCTCCCCGCCGTCTCCTCCGACGAAATCTCTCTCTTTCTGCAACACATTTTG
CCACGTTCCTCTCCCTCCATCTTCTCCGACCTCTCCTCCAACGTTAGACCCATCACCCCCCTAACTCAAAACATTCCCCCTTCCTACACTGCGCCTAATGCACTACCCGC
CGACCGGATCTCCGCCGTCGATTCCTCTGAACAGTTCGCGAATTCCTCATCATCCGCTGGATTACATGATCCCTTCCGCTCTTGTCCCACACCCACACCTCCAAATGCGT
CCTCTGCATCAGTTGGGGCTAGTGATCACCACGAAAATGATGAGTTTGACTGCGAAAGTGAGGAGGGTCTGGAGGCATTGATTGAAGAGATGCCTTCAAAGCCTTATCCA
CGCAGTTCTTCCAAGAGAAGCAGAGCCGCTGAAGTTCATAATCTCTCCGAAAAGAGAAGGAGGAGCAGAATTAATGAGAAAATGAAGGCCCTACAAAATCTGATCCCCAA
TTCTAACAAGGCAACACAATTCATCTCAAAATTTAATTGCCGTTTGACAGACAAGGCCTCGATGCTGGACGAGGCAATTGAATATCTGAAGCAGCTTCAACTTCAAGTTC
AGATGCTGTCAATGAATCATGGCTTGAGTTTGCATCCTATGAACTTGCCTGGAACTTTGCAGTATCGCCAGCTTTCCCACATGAGAATGGACTTTACTGAAGAAAACAGA
TCACTTTCCTCTGACCAAGAAAGACCAAACCGGGTTTTTCTCAGCTTGCCTGACCAAAAGGCTGCCTCCATCCATCCTTTCATGCCAGACATTGGAAGAAGTAATGCAGA
AACTCCATTCGAATTGGTCCCTCCCGTCCAGGCTCATCTACTCCCCTTTTACTTGAGTGAATCCTCCAAGCGTTTGAAGGAGATTGGCACAGATGTAAGGCGAGATCACC
AAGTGAATGCGACTCAATCGGAGACGACTCCCTTTGTGTCATTACCCTATAACATAGCAGCGCCTGATTTGCAAGAATTGCAGAATTGCGTTTCAGTGGAACCGTGCATC
ATTGAAGGCAACCGGTCAGGTGTGGTTCTAAACACTCCAGAGCACAGCCATGTATTCCCTTCCCTTTTCAATGGCATTGACACAGGAAGACCAACTGAAACAACAGGGCC
GCAATGATAAAGATTGTTGACATTCTCTGTGGCTACGTTCTGTTAAAACTATTCGCAAGCATTGAGCTGGTTAATCATGGAGAGGATCTGCACAGTTTTCCCCTCTTGTA
CAATTCTGCATTCAAGTTCTGAGTGAATTTTCTTCATCATTACTTCATGGGTTGAATCGTTTTTCTCATCCATCTTCTGGGTTTACCCATGTTTCCTGTTTCTAATATAT
ATATATATTTCGATCATCTGTGATACTTTAAAGAAAGTGCTCCTTCGACAACTCTTCATTCCAACCCATCTATTTTGAATCAACATGTAAAGGG
Protein sequenceShow/hide protein sequence
MDHSISNKCDPNACISSTWTLPAVSSDEISLFLQHILPRSSPSIFSDLSSNVRPITPLTQNIPPSYTAPNALPADRISAVDSSEQFANSSSSAGLHDPFRSCPTPTPPNA
SSASVGASDHHENDEFDCESEEGLEALIEEMPSKPYPRSSSKRSRAAEVHNLSEKRRRSRINEKMKALQNLIPNSNKATQFISKFNCRLTDKASMLDEAIEYLKQLQLQV
QMLSMNHGLSLHPMNLPGTLQYRQLSHMRMDFTEENRSLSSDQERPNRVFLSLPDQKAASIHPFMPDIGRSNAETPFELVPPVQAHLLPFYLSESSKRLKEIGTDVRRDH
QVNATQSETTPFVSLPYNIAAPDLQELQNCVSVEPCIIEGNRSGVVLNTPEHSHVFPSLFNGIDTGRPTETTGPQ