CuGenDBv2

Sgr020650 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr020650
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: homeobox-leucine zipper protein ATHB-6-like
Genome location: tig00153552:842253..843597
RNA-Seq Expression: Sgr020650
Synteny: Sgr020650
Gene Ontology terms:
    GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
    GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
    GO:0005634 - nucleus (cellular component)
    GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
    GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
    IPR000047 - Helix-turn-helix motif
    IPR001356 - Homeobox domain
    IPR003106 - Leucine zipper, homeobox-associated
    IPR009057 - Homeobox-like domain superfamily
    IPR017970 - Homeobox, conserved site
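
The genome location field packs a contig name and a coordinate range into one string. The following is a minimal sketch of how such a string could be split and the span length computed. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names `Location` and `parse_location` are hypothetical and not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A contig name plus a coordinate range (assumed 1-based, inclusive)."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive range, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'contig:start..end' string into a Location."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(contig, start, end)


loc = parse_location("tig00153552:842253..843597")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1345 bases on contig tig00153552.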


Homology
GenBank top hits (e value, %identity, alignment)
KAG7031308.1 Homeobox-leucine zipper protein ATHB-6 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.1e-133; %identity: 80.84)
Query:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP    +DS+GALMSI PT+ +E SPRNN V GTE+QSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE
        VAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YE LQHDNQALLKEI+ELK KLQEDNSESNLSVEEE  VPADSENALIEQ KPE TD FS P A E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE

Query:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q
        S+DFNY S +NNGGEGEEV     SLFPDFKDGSSDSDSSAILNEDY PT AISS  VLQ NH HFM  AASPSPS  VK NC++TALNYLQ+QK Y  Q
Subjt:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q

Query:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
          ++ KMEEHNFFSGEE CNFFS+EQAPTLHWWS
Subjt:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

XP_022136962.1 homeobox-leucine zipper protein ATHB-6-like [Momordica charantia] (e value: 2.2e-147; %identity: 86.23)
Query:  MKRPPGSSSDSLGALMSICPTSE-EHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP   SSDSLGALMSICPTS+ E SPRNNHVYG E+QSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
Subjt:  MKRPPGSSSDSLGALMSICPTSE-EHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVV-PADSENALIEQTKPETDHFSHPPAAE
        VAVWFQNRRARWKTKQLERDYGVLKTNYE LKL+YE LQ DN ALLKEIRELK+KLQEDNSESN+SVEEEMV+  ADSENALIE+T+PET  FS PPA E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVV-PADSENALIEQTKPETDHFSHPPAAE

Query:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQN-HHHFMAAA-SPSPSATVKFNCSSTALNYLQFQKAY-QP
         KD NYESFNNNGGEGEEV TEEASLFPDFKDGSSDSDSSAILNEDYS TAAISS GVLQN H+HFMAA+ SPSPSA VKFNCS+ ALNY QFQKAY Q 
Subjt:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQN-HHHFMAAA-SPSPSATVKFNCSSTALNYLQFQKAY-QP

Query:  HLYSKMEEHNFFSGEE-ACNFFSEEQAPTLHWWS
         +Y KMEEHNFF+GEE  CNFFSEEQAP+LHWWS
Subjt:  HLYSKMEEHNFFSGEE-ACNFFSEEQAPTLHWWS

XP_022999792.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita maxima] (e value: 5.6e-135; %identity: 81.14)
Query:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP    +DS+GALMSI PT+ +E SPRNN V GTE+QSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE
        VAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YE LQHDNQALLKEI+ELK KLQEDNS+SNLSVEEEM VPADSENALIEQ KPE TD FS P A E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE

Query:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q
        S+DFNYES +NNGGEGEEV     SLFPDFKDGSSDSDSSAILNEDY PT AISS+ VLQ NH HFM  AASPSPS  VK NC++TALNYLQ+QK Y  Q
Subjt:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q

Query:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
          ++ KMEEHNFFSGEE CNFFS+EQAPTLHWWS
Subjt:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

XP_023547219.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita pepo subsp. pepo] (e value: 3.7e-134; %identity: 80.84)
Query:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP    +DS+GALMSI PT+ +E SPRNN V GTE+QSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE
        VAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YE LQHDNQALLKEI+ELK KLQEDNSESNLSVEEEM VPADSENALIEQ KPE TD FS P A E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE

Query:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q
        ++DFNYES ++NGGEGEEV     SLFPDFKDGSSDSDSSAILNEDY PT AISS  VLQ NH HFM  AASPSPS  VK NC++TALNYLQ+QK Y  Q
Subjt:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q

Query:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
          ++ KMEEHNFFSGEE CNFFS+EQAPTLHWWS
Subjt:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

XP_038905018.1 homeobox-leucine zipper protein ATHB-6-like [Benincasa hispida] (e value: 3.5e-137; %identity: 81.42)
Query:  MKRPPGSSSDSLGALMSICPTSE-EHSPRN---NHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
        MKR P   SDSLGAL+SICP+S+ E SPRN   NHVY TE+QSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
Subjt:  MKRPPGSSSDSLGALMSICPTSE-EHSPRN---NHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ

Query:  PRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPET--DHFSHP
        PRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLS+E LQ+DNQ LLK+IRELK KLQEDNSESNLSVEEEMVV A+SENA IEQTKPE   D FS P
Subjt:  PRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPET--DHFSHP

Query:  PAAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNH--HHFMAAA-SPSPSATVKFNCSSTALNYLQFQK
        PA+ES+DFNYESFN+NGGEGEE   EEA+LFPDFKDGSSDSDSSAILNEDYSPTA ISS GVLQNH  +H M  A SP PSA VK       LNYLQFQK
Subjt:  PAAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNH--HHFMAAA-SPSPSATVKFNCSSTALNYLQFQK

Query:  AY--QPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
         Y  Q  ++ KMEEHNFFSGEE CNFFS+EQAPTLHWWS
Subjt:  AY--QPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

TrEMBL top hits (e value, %identity, alignment)
A0A6J1C6W4 homeobox-leucine zipper protein ATHB-6-like (e value: 1.1e-147; %identity: 86.23)
Query:  MKRPPGSSSDSLGALMSICPTSE-EHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP   SSDSLGALMSICPTS+ E SPRNNHVYG E+QSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
Subjt:  MKRPPGSSSDSLGALMSICPTSE-EHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVV-PADSENALIEQTKPETDHFSHPPAAE
        VAVWFQNRRARWKTKQLERDYGVLKTNYE LKL+YE LQ DN ALLKEIRELK+KLQEDNSESN+SVEEEMV+  ADSENALIE+T+PET  FS PPA E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVV-PADSENALIEQTKPETDHFSHPPAAE

Query:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQN-HHHFMAAA-SPSPSATVKFNCSSTALNYLQFQKAY-QP
         KD NYESFNNNGGEGEEV TEEASLFPDFKDGSSDSDSSAILNEDYS TAAISS GVLQN H+HFMAA+ SPSPSA VKFNCS+ ALNY QFQKAY Q 
Subjt:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQN-HHHFMAAA-SPSPSATVKFNCSSTALNYLQFQKAY-QP

Query:  HLYSKMEEHNFFSGEE-ACNFFSEEQAPTLHWWS
         +Y KMEEHNFF+GEE  CNFFSEEQAP+LHWWS
Subjt:  HLYSKMEEHNFFSGEE-ACNFFSEEQAPTLHWWS

A0A6J1ENE6 homeobox-leucine zipper protein ATHB-6-like (e value: 5.9e-114; %identity: 73.8)
Query:  SDSLGALMSICPTS-EEHSPRN---NHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF
        SDS+ AL+SI PTS +E SPRN   NHVY  E+Q MLDGFDE    EE GHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWF
Subjt:  SDSLGALMSICPTS-EEHSPRN---NHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAESKDFN
        QNRRARWKTKQLERDYGVLKTNY+NLKLS+EALQ+DNQALLKEIRELKAK+QEDNS        EM+VPADSENALIEQTKPE TD FS PPA       
Subjt:  QNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAESKDFN

Query:  YESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASP--SPS---ATVKFNCSSTALNYLQFQKAYQ--PH
          SFNNNGGEG+E         P  KDGSSDSDSSAILNEDYSPTA +SS GVLQN++HFM  A P  SPS   A VK N ++TALNYLQFQK YQ    
Subjt:  YESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASP--SPS---ATVKFNCSSTALNYLQFQKAYQ--PH

Query:  LYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
        ++ KMEEHNFF GEEACNFFS+EQAPTLHWWS
Subjt:  LYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

A0A6J1FNC9 homeobox-leucine zipper protein ATHB-6-like (e value: 2.0e-133; %identity: 80.54)
Query:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP    +DS+GALMSI PT+ +E SPRNN V GTE+QSMLDGF E+G VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE
        VAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YE LQHDNQALLKEI+ELK KLQEDNSESNLSVEEE  VPADSENALIEQ KPE TD FS P A E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE

Query:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q
        S+DFNY S +NNGGEGEEV     SLFPDFKDGSSDSDSSAILNEDY PT AISS  VLQ NH HFM  AASPSPS  VK NC++TALNYLQ+QK Y  Q
Subjt:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q

Query:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
          ++ KMEEHNFFSGEE CNFFS+EQAPTLHWWS
Subjt:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

A0A6J1JAW1 homeobox-leucine zipper protein ATHB-6-like (e value: 5.9e-114; %identity: 73.19)
Query:  SDSLGALMSICPTS-EEHSPRN---NHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF
        SDS+ AL+SI PTS +E SPRN   NHVY  E+Q MLDGFDE    EE GHVSEKKRRL VEQVK+LEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWF
Subjt:  SDSLGALMSICPTS-EEHSPRN---NHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAESKDFN
        QNRRARWKTKQLERDYGVLKTNY+NLKLS+EALQ+DNQALLKEIRELKAK+QEDNS        EM+ PADSENALIEQTKPE TD FS PPA       
Subjt:  QNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAESKDFN

Query:  YESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASP--SPS---ATVKFNCSSTALNYLQFQKAYQ--PH
          SFNNNGGEG+E         P  KDGSSDSDSSAILNEDYSPTA +SS GVLQN++HFM    P  SPS   A VK NC++TALNYLQFQK YQ    
Subjt:  YESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASP--SPS---ATVKFNCSSTALNYLQFQKAYQ--PH

Query:  LYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
        ++ KMEEHNFF GEEACNFFS+EQAPTLHWWS
Subjt:  LYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

A0A6J1KBS4 homeobox-leucine zipper protein ATHB-6-like (e value: 2.7e-135; %identity: 81.14)
Query:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP    +DS+GALMSI PT+ +E SPRNN V GTE+QSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPPGSSSDSLGALMSICPTS-EEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE
        VAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YE LQHDNQALLKEI+ELK KLQEDNS+SNLSVEEEM VPADSENALIEQ KPE TD FS P A E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPE-TDHFSHPPAAE

Query:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q
        S+DFNYES +NNGGEGEEV     SLFPDFKDGSSDSDSSAILNEDY PT AISS+ VLQ NH HFM  AASPSPS  VK NC++TALNYLQ+QK Y  Q
Subjt:  SKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQ-NHHHFM-AAASPSPSATVKFNCSSTALNYLQFQKAY--Q

Query:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
          ++ KMEEHNFFSGEE CNFFS+EQAPTLHWWS
Subjt:  PHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

SwissProt top hits (e value, %identity, alignment)
P46667 Homeobox-leucine zipper protein ATHB-5 (e value: 4.2e-48; %identity: 44.02)
Query:  MKRPPGSSSDSLGALMSI--CPTSEEHSPR---NNHVY--GTEYQSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK
        MKR  G SSDSL   + I    T ++ SPR      +Y    +Y  M D  +++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERK
Subjt:  MKRPPGSSSDSLGALMSI--CPTSEEHSPR---NNHVY--GTEYQSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK

Query:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKP
        VKLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL +I+ELKAKL   N E    +EE   + A   N  +     
Subjt:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKP

Query:  ETDHFSH----PPAAESKDFNYESFNNNGGEGEEVRTEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFN
        E    SH    PP     D              E+  E  S+FP   +F+D  +D SDSSA+LNE+YSP   + +AG +        AA+    +T+   
Subjt:  ETDHFSH----PPAAESKDFNYESFNNNGGEGEEVRTEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFN

Query:  CSSTALNYLQFQKAYQPHLYSKMEEH-NFFSGEEACNFFSEEQ
        C S                + KMEEH + FSGEEAC  F++ +
Subjt:  CSSTALNYLQFQKAYQPHLYSKMEEH-NFFSGEEACNFFSEEQ

P46668 Homeobox-leucine zipper protein ATHB-6 (e value: 3.6e-60; %identity: 48.37)
Query:  SSSDSLGALMSICPT--SEEHSPRNNHVYGTEYQSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV
        SSSDS+G L+S+CPT  ++E SPR     G E+QSML+G+  +EE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELGLQPRQV
Subjt:  SSSDSLGALMSICPT--SEEHSPRNNHVYGTEYQSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV

Query:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKL-----QEDNSESNLSVEEEMVVPADSENALIEQTKPE--TDHFSH
        AVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL+EI +LK KL     +E+  E+N +V  E  +    E    E + PE  T+  S 
Subjt:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKL-----QEDNSESNLSVEEEMVVPADSENALIEQTKPE--TDHFSH

Query:  PP--AAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQK
        PP     S   NY SF +            AS F      S  SDSSA+LNE+ S    +             AA    P             N+ QF K
Subjt:  PP--AAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQK

Query:  AYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
          Q       +  +F SGEEAC FFS+EQ P+LHW+S
Subjt:  AYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

Q6K498 Homeobox-leucine zipper protein HOX4 (e value: 1.0e-38; %identity: 42.71)
Query:  GFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQ
        G + EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++AL+
Subjt:  GFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQ

Query:  HDNQALLKEIRELKAKL-QEDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPPAAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDS
         D  ALL EI+ELKAKL  E+ + S  SV+EE   PA S+                PPAA                                 GSSDSDS
Subjt:  HDNQALLKEIRELKAKL-QEDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPPAAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDS

Query:  SAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQKAYQPHLYSKMEEH--NFFSGEEAC-NFFSEEQAPTL-HWWS
        SA+LN+  +  AA ++   L         A P+  A       + A      ++ +    + K+EE    F   +E C  FF+++Q P L  WW+
Subjt:  SAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQKAYQPHLYSKMEEH--NFFSGEEAC-NFFSEEQAPTL-HWWS

Q940J1 Homeobox-leucine zipper protein ATHB-16 (e value: 1.9e-53; %identity: 45.86)
Query:  SSSDSLGALMSICPTSEEHSPRNNHVYGTEYQSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        SSSDS+  L+S   +++E SPR    YG+ YQSML+G+DE+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+ELGLQPRQ
Subjt:  SSSDSLGALMSICPTSEEHSPRNNHVYGTEYQSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQ-EDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPP---
        VAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L+ DN +LL+EI ++KAK+  E+++ +N ++ E           + E+   +TD     P   
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQ-EDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPP---

Query:  AAESKDFNY-ESFNNNGGEGEEVRTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQ
           S  FNY  SF           T+   L P+    + GSSDS DSSA+LN++ S     S  G L            +P  TV      T  ++LQF 
Subjt:  AAESKDFNY-ESFNNNGGEGEEVRTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQ

Query:  KAYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
        K  Q       +  +F SGEEAC FFS+EQ P+LHW+S
Subjt:  KAYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

Q9XH37 Homeobox-leucine zipper protein HOX4 (e value: 1.0e-38; %identity: 42.71)
Query:  GFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQ
        G + EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++AL+
Subjt:  GFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQ

Query:  HDNQALLKEIRELKAKL-QEDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPPAAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDS
         D  ALL EI+ELKAKL  E+ + S  SV+EE   PA S+                PPAA                                 GSSDSDS
Subjt:  HDNQALLKEIRELKAKL-QEDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPPAAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDS

Query:  SAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQKAYQPHLYSKMEEH--NFFSGEEAC-NFFSEEQAPTL-HWWS
        SA+LN+  +  AA ++   L         A P+  A       + A      ++ +    + K+EE    F   +E C  FF+++Q P L  WW+
Subjt:  SAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQKAYQPHLYSKMEEH--NFFSGEEAC-NFFSEEQAPTL-HWWS

Arabidopsis top hits (e value, %identity, alignment)
AT1G69780.1 Homeobox-leucine zipper protein family (e value: 4.0e-30; %identity: 44.5)
Query:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALL
        EE   ++   + EKKRRL++EQVK LEKNFE+ NKLEPERK++LAR LGLQPRQ+A+WFQNRRARWKTKQLE+DY  LK  ++ LK   + LQ  NQ L 
Subjt:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALL

Query:  KEIRELKAKLQ-----------------EDNSESNLSVEEEMVVPA-DSENALIEQTKPET--DHFSHP-PAAESKDFNYESFNNNGGEGEEVRTEEASL
         EI  LK + Q                  DNS  NL ++     P+ DS         P+T   HF  P PA  +       F  N   G+ +  EE S+
Subjt:  KEIRELKAKLQ-----------------EDNSESNLSVEEEMVVPA-DSENALIEQTKPET--DHFSHP-PAAESKDFNYESFNNNGGEGEEVRTEEASL

AT2G22430.1 homeobox protein 6 (e value: 2.6e-61; %identity: 48.37)
Query:  SSSDSLGALMSICPT--SEEHSPRNNHVYGTEYQSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV
        SSSDS+G L+S+CPT  ++E SPR     G E+QSML+G+  +EE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELGLQPRQV
Subjt:  SSSDSLGALMSICPT--SEEHSPRNNHVYGTEYQSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQV

Query:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKL-----QEDNSESNLSVEEEMVVPADSENALIEQTKPE--TDHFSH
        AVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL+EI +LK KL     +E+  E+N +V  E  +    E    E + PE  T+  S 
Subjt:  AVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKL-----QEDNSESNLSVEEEMVVPADSENALIEQTKPE--TDHFSH

Query:  PP--AAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQK
        PP     S   NY SF +            AS F      S  SDSSA+LNE+ S    +             AA    P             N+ QF K
Subjt:  PP--AAESKDFNYESFNNNGGEGEEVRTEEASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQK

Query:  AYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
          Q       +  +F SGEEAC FFS+EQ P+LHW+S
Subjt:  AYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

AT4G40060.1 homeobox protein 16 (e value: 1.4e-54; %identity: 45.86)
Query:  SSSDSLGALMSICPTSEEHSPRNNHVYGTEYQSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        SSSDS+  L+S   +++E SPR    YG+ YQSML+G+DE+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+ELGLQPRQ
Subjt:  SSSDSLGALMSICPTSEEHSPRNNHVYGTEYQSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQ-EDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPP---
        VAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L+ DN +LL+EI ++KAK+  E+++ +N ++ E           + E+   +TD     P   
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQ-EDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPP---

Query:  AAESKDFNY-ESFNNNGGEGEEVRTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQ
           S  FNY  SF           T+   L P+    + GSSDS DSSA+LN++ S     S  G L            +P  TV      T  ++LQF 
Subjt:  AAESKDFNY-ESFNNNGGEGEEVRTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQ

Query:  KAYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
        K  Q       +  +F SGEEAC FFS+EQ P+LHW+S
Subjt:  KAYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS

AT5G65310.1 homeobox protein 5 (e value: 3.0e-49; %identity: 44.02)
Query:  MKRPPGSSSDSLGALMSI--CPTSEEHSPR---NNHVY--GTEYQSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK
        MKR  G SSDSL   + I    T ++ SPR      +Y    +Y  M D  +++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERK
Subjt:  MKRPPGSSSDSLGALMSI--CPTSEEHSPR---NNHVY--GTEYQSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK

Query:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKP
        VKLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL +I+ELKAKL   N E    +EE   + A   N  +     
Subjt:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKP

Query:  ETDHFSH----PPAAESKDFNYESFNNNGGEGEEVRTEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFN
        E    SH    PP     D              E+  E  S+FP   +F+D  +D SDSSA+LNE+YSP   + +AG +        AA+    +T+   
Subjt:  ETDHFSH----PPAAESKDFNYESFNNNGGEGEEVRTEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFN

Query:  CSSTALNYLQFQKAYQPHLYSKMEEH-NFFSGEEACNFFSEEQ
        C S                + KMEEH + FSGEEAC  F++ +
Subjt:  CSSTALNYLQFQKAYQPHLYSKMEEH-NFFSGEEACNFFSEEQ

AT5G65310.2 homeobox protein 5 (e value: 2.1e-47; %identity: 45.18)
Query:  EYQSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYE
        +Y  M D  +++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++
Subjt:  EYQSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYE

Query:  NLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSH----PPAAESKDFNYESFNNNGGEGEEVRTEEASL
         LK + ++LQ DN +LL +I+ELKAKL   N E    +EE   + A   N  +     E    SH    PP     D              E+  E  S+
Subjt:  NLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSH----PPAAESKDFNYESFNNNGGEGEEVRTEEASL

Query:  FP---DFKDGSSD-SDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQKAYQPHLYSKMEEH-NFFSGEEACNFFSEE
        FP   +F+D  +D SDSSA+LNE+YSP   + +AG +        AA+    +T+   C S                + KMEEH + FSGEEAC  F++ 
Subjt:  FP---DFKDGSSD-SDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQKAYQPHLYSKMEEH-NFFSGEEACNFFSEE

Query:  Q
        +
Subjt:  Q
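
The alignments above use the usual BLAST text layout, in which the unlabeled middle line shows a residue letter where query and subject are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The following sketch shows how a percent identity could be derived from such a middle line. The function name `percent_identity` is hypothetical, and the example covers only the first 18 columns of the first GenBank alignment, so its result is not expected to match the full-length %identity reported for that hit.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style middle line.

    Letters in the middle line mark identical positions; '+' and spaces
    do not count as identities.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / alignment_length


# First 18 alignment columns of the first GenBank hit (excerpt only).
query = "MKRPPGSSSDSLGALMSI"
midline = "MKRP" + " " * 4 + "+DS+GALMSI"

print(round(percent_identity(midline, len(query)), 2))
```

In this excerpt 12 of the 18 columns are identities. The alignment length, including any gap columns, is passed in separately because trailing spaces in a middle line are easily lost when text is copied.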


Sequences
CDS sequence
ATGAAGAGGCCTCCAGGCAGCAGCTCGGATTCCTTGGGTGCTCTCATGTCCATTTGCCCAACTTCAGAAGAACACAGTCCGAGAAACAACCATGTTTATGGCACGGAATA
CCAGTCTATGCTTGATGGGTTTGATGAAGAAGGCTGCGTTGAAGAATCAGGACATGTTTCAGAGAAGAAGAGGCGACTCAGTGTGGAGCAAGTTAAGGCTTTAGAGAAGA
ATTTCGAAGTTGAAAACAAGCTCGAACCAGAGAGGAAGGTGAAGCTTGCTCGAGAACTTGGTTTGCAACCTCGACAAGTAGCTGTTTGGTTCCAAAATCGTCGAGCCAGA
TGGAAAACCAAGCAATTGGAAAGAGATTATGGCGTCCTCAAAACCAATTATGAGAATCTCAAGCTCAGTTATGAAGCTCTCCAACATGACAATCAAGCTCTTCTCAAAGA
GATTCGGGAACTGAAAGCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAGATGGTGGTACCGGCCGACTCTGAGAATGCTCTGATTGAACAAA
CAAAGCCGGAAACCGATCACTTCTCTCATCCTCCGGCGGCTGAGTCCAAAGACTTCAATTACGAGAGCTTCAACAACAATGGCGGAGAAGGAGAAGAGGTACGAACAGAA
GAAGCCTCATTGTTTCCCGATTTTAAAGACGGTTCATCCGATAGCGATTCAAGCGCAATTTTAAACGAAGATTACAGCCCCACGGCCGCCATTTCTTCAGCGGGGGTGCT
CCAGAATCACCACCACTTCATGGCGGCGGCATCTCCGTCTCCCTCCGCCACCGTAAAATTCAACTGCTCATCGACGGCCCTCAATTACTTACAGTTTCAGAAGGCGTATC
AGCCCCATCTGTACTCGAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTGCAACTTCTTCTCCGAGGAGCAAGCTCCAACTCTGCACTGGTGGAGCTAA
mRNA sequence
ATGAAGAGGCCTCCAGGCAGCAGCTCGGATTCCTTGGGTGCTCTCATGTCCATTTGCCCAACTTCAGAAGAACACAGTCCGAGAAACAACCATGTTTATGGCACGGAATA
CCAGTCTATGCTTGATGGGTTTGATGAAGAAGGCTGCGTTGAAGAATCAGGACATGTTTCAGAGAAGAAGAGGCGACTCAGTGTGGAGCAAGTTAAGGCTTTAGAGAAGA
ATTTCGAAGTTGAAAACAAGCTCGAACCAGAGAGGAAGGTGAAGCTTGCTCGAGAACTTGGTTTGCAACCTCGACAAGTAGCTGTTTGGTTCCAAAATCGTCGAGCCAGA
TGGAAAACCAAGCAATTGGAAAGAGATTATGGCGTCCTCAAAACCAATTATGAGAATCTCAAGCTCAGTTATGAAGCTCTCCAACATGACAATCAAGCTCTTCTCAAAGA
GATTCGGGAACTGAAAGCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAGATGGTGGTACCGGCCGACTCTGAGAATGCTCTGATTGAACAAA
CAAAGCCGGAAACCGATCACTTCTCTCATCCTCCGGCGGCTGAGTCCAAAGACTTCAATTACGAGAGCTTCAACAACAATGGCGGAGAAGGAGAAGAGGTACGAACAGAA
GAAGCCTCATTGTTTCCCGATTTTAAAGACGGTTCATCCGATAGCGATTCAAGCGCAATTTTAAACGAAGATTACAGCCCCACGGCCGCCATTTCTTCAGCGGGGGTGCT
CCAGAATCACCACCACTTCATGGCGGCGGCATCTCCGTCTCCCTCCGCCACCGTAAAATTCAACTGCTCATCGACGGCCCTCAATTACTTACAGTTTCAGAAGGCGTATC
AGCCCCATCTGTACTCGAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTGCAACTTCTTCTCCGAGGAGCAAGCTCCAACTCTGCACTGGTGGAGCTAA
Protein sequence
MKRPPGSSSDSLGALMSICPTSEEHSPRNNHVYGTEYQSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRAR
WKTKQLERDYGVLKTNYENLKLSYEALQHDNQALLKEIRELKAKLQEDNSESNLSVEEEMVVPADSENALIEQTKPETDHFSHPPAAESKDFNYESFNNNGGEGEEVRTE
EASLFPDFKDGSSDSDSSAILNEDYSPTAAISSAGVLQNHHHFMAAASPSPSATVKFNCSSTALNYLQFQKAYQPHLYSKMEEHNFFSGEEACNFFSEEQAPTLHWWS
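
The protein sequence corresponds to the CDS read codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code; the `translate` helper is written here for illustration and is not taken from any library. Only the first 30 and last 15 bases of the CDS listed above are used as inputs.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAAGAGGCCTCCAGGCAGCAGCTCGGAT"  # first 30 bases of the CDS
cds_end = "CACTGGTGGAGCTAA"                   # last 15 bases, including the stop codon

print(translate(cds_start))
print(translate(cds_end))
```

The two excerpts translate to MKRPPGSSSD and HWWS, which match the start and end of the protein sequence listed above.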