CuGenDBv2

Spg029700 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029700
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: homeobox-leucine zipper protein ATHB-6-like
Genome location: scaffold6:11539906..11541207
RNA-Seq Expression: Spg029700
Synteny: Spg029700
Gene Ontology terms:
  GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
  GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
  IPR000047 - Helix-turn-helix motif
  IPR001356 - Homeobox domain
  IPR003106 - Leucine zipper, homeobox-associated
  IPR009057 - Homeobox-like domain superfamily
  IPR017970 - Homeobox, conserved site
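
The genome location field packs the scaffold name and both coordinates into one string. A minimal Python sketch of splitting it and computing the span follows. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


scaffold, start, end = parse_location("scaffold6:11539906..11541207")

# Assumption: 1-based inclusive coordinates, so both end points count.
span = end - start + 1
print(scaffold, start, end, span)
```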


Homology
GenBank top hits (e value, %identity, alignment)
KAG7031308.1 Homeobox-leucine zipper protein ATHB-6 [Cucurbita argyrosperma subsp. argyrosperma] (e value: 6.7e-144, %identity: 84.78)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRPA    DS+GALMSI PT+DQE SPRNN V GTEF SMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE
        VAVWFQNRRARWKTKQLERDYGVLK NY+NLKL+YETLQ DNQALLKEI+ELK KLQEDNSESNLS+EEE  V ADSENALIEQIKPEI +QFSVP A+E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE

Query:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGYQ-
        SQDFNY S +NNGGEG     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISSP VLQ H+H HFMT AASPSPS  VKLNCATTAL+YLQYQKGYQ 
Subjt:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGYQ-

Query:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        QTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

XP_022136962.1 homeobox-leucine zipper protein ATHB-6-like [Momordica charantia] (e value: 3.0e-152, %identity: 85.07)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP AGSSDSLGALMSICPTSD+E SPRNNHVYG EF SMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVV-AADSENALIEQIKPEIAEQFSVPPAS
        VAVWFQNRRARWKTKQLERDYGVLK NYE LKL+YETLQQDN ALLKEIRELK+KLQEDNSESN+S+EEEMV+ AADSENALIE+ +PE  + FSVPPA+
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVV-AADSENALIEQIKPEIAEQFSVPPAS

Query:  ESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQKGYQQ
        E +D NYESFNNNGGEGEE PTEEASLFPDFKDGSSDSDSSAILNEDYS TA ISSPGVLQN H+H    + SPSPSAAVK NC+T AL+Y Q+QK YQQ
Subjt:  ESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQKGYQQ

Query:  TQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS
        TQ++PKMEEHNFF+GEE  CNFFS+EQAP+LHWWS
Subjt:  TQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS

XP_022999792.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita maxima] (e value: 1.2e-143, %identity: 84.48)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRPA    DS+GALMSI PT+DQE SPRNN V GTEF SMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE
        VAVWFQNRRARWKTKQLERDYGVLK NY+NLKL+YETLQ DNQALLKEI+ELK KLQEDNS+SNLS+EEEM V ADSENALIEQ+KPEI +QFSVP A+E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE

Query:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q
        SQDFNYES +NNGGEG     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISS  VLQ H+H HFMT AASPSPS  VKLNCATTAL+YLQYQKGY Q
Subjt:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q

Query:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        QTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

XP_023547219.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita pepo subsp. pepo] (e value: 2.3e-144, %identity: 84.78)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRPA    DS+GALMSI PT+DQE SPRNN V GTEF SMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE
        VAVWFQNRRARWKTKQLERDYGVLK NY+NLKL+YETLQ DNQALLKEI+ELK KLQEDNSESNLS+EEEM V ADSENALIEQIKPEI +QFSVP A+E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE

Query:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q
        +QDFNYES ++NGGEG     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISSP VLQ H+H HFMT AASPSPS  VKLNCATTAL+YLQYQKGY Q
Subjt:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q

Query:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        QTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

XP_038905018.1 homeobox-leucine zipper protein ATHB-6-like [Benincasa hispida] (e value: 6.7e-152, %identity: 87.02)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRN---NHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
        MKRPA G SDSLGAL+SICP+SD E SPRN   NHVY TEF SMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRN---NHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ

Query:  PRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEI-AEQFSVP
        PRQVAVWFQNRRARWKTKQLERDYGVLK NYENLKLS+ETLQ DNQ LLK+IRELK KLQEDNSESNLS+EEEMVVAA+SENA IEQ KPEI  +QFSVP
Subjt:  PRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEI-AEQFSVP

Query:  PASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAA-SPSPSAAVKLNCATTALSYLQYQK
        PASESQDFNYESFN+NGGEGEEAP EEA+LFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNH  +H MT A SP PSAAVKLN       YLQ+QK
Subjt:  PASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAA-SPSPSAAVKLNCATTALSYLQYQK

Query:  GY-QQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        GY QQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  GY-QQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
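
The %identity figures in these hit lists can be related to the alignments themselves. By the usual BLAST convention (assumed here, since this page does not name the search tool), the middle line of each block carries a letter where query and subject are identical, `+` for a conservative substitution, and a space for a mismatch or gap; %identity is identical columns divided by aligned columns. A minimal Python sketch follows. The function name `midline_stats` is hypothetical, and the example uses only the last block of the first GenBank alignment, so it does not reproduce the reported 84.78 for the full alignment.

```python
def midline_stats(midline: str, width: int):
    """Count identities and positives in one alignment block.

    midline: the match line printed between the Query and Subjt rows
    width:   number of aligned columns in the block; passed explicitly
             because trailing spaces in the midline may have been lost
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, width


# Last block of the KAG7031308.1 alignment, copied from this page.
query = "QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS"
midline = "QTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWWS"

identities, positives, width = midline_stats(midline, len(query))
percent_identity = 100.0 * identities / width
print(identities, positives, width, round(percent_identity, 2))
```

Summing identities and widths over every block of an alignment, then dividing once at the end, gives the figure for the whole hit.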

TrEMBL top hits (e value, %identity, alignment)
A0A6J1C6W4 homeobox-leucine zipper protein ATHB-6-like (e value: 1.5e-152, %identity: 85.07)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRP AGSSDSLGALMSICPTSD+E SPRNNHVYG EF SMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVV-AADSENALIEQIKPEIAEQFSVPPAS
        VAVWFQNRRARWKTKQLERDYGVLK NYE LKL+YETLQQDN ALLKEIRELK+KLQEDNSESN+S+EEEMV+ AADSENALIE+ +PE  + FSVPPA+
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVV-AADSENALIEQIKPEIAEQFSVPPAS

Query:  ESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQKGYQQ
        E +D NYESFNNNGGEGEE PTEEASLFPDFKDGSSDSDSSAILNEDYS TA ISSPGVLQN H+H    + SPSPSAAVK NC+T AL+Y Q+QK YQQ
Subjt:  ESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQKGYQQ

Query:  TQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS
        TQ++PKMEEHNFF+GEE  CNFFS+EQAP+LHWWS
Subjt:  TQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS

A0A6J1ENE6 homeobox-leucine zipper protein ATHB-6-like (e value: 2.7e-122, %identity: 76.65)
Query:  SDSLGALMSICPTSDQEHSPRN---NHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF
        SDS+ AL+SI PTSDQE SPRN   NHVY  EF  MLDGFDE    EE GHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWF
Subjt:  SDSLGALMSICPTSDQEHSPRN---NHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASESQDFN
        QNRRARWKTKQLERDYGVLK NY+NLKLS+E LQ DNQALLKEIRELKAK+QEDNS        EM+V ADSENALIEQ KPEI + FSVPPA       
Subjt:  QNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASESQDFN

Query:  YESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASP--SPS---AAVKLNCATTALSYLQYQKGYQQT
          SFNNNGGEG+E PT         KDGSSDSDSSAILNEDYSPTAG+SSPGVLQN  ++HFMT A P  SPS   A VKLN ATTAL+YLQ+QKGYQQT
Subjt:  YESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASP--SPS---AAVKLNCATTALSYLQYQKGYQQT

Query:  Q-MFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        Q MFPKMEEHNFF GEEACNFFSDEQAPTLHWWS
Subjt:  Q-MFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

A0A6J1FNC9 homeobox-leucine zipper protein ATHB-6-like (e value: 3.6e-143, %identity: 84.18)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRPA    DS+GALMSI PT+DQE SPRNN V GTEF SMLDGF E+G VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE
        VAVWFQNRRARWKTKQLERDYGVLK NY+NLKL+YETLQ DNQALLKEI+ELK KLQEDNSESNLS+EEE  V ADSENALIEQIKPEI +QFSVP A+E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE

Query:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q
        SQDFNY S +NNGGEG     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISSP VLQ H+H HFMT AASPSPS  VKLNCATTAL+YLQYQKGY Q
Subjt:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q

Query:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        Q+QMFPKMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

A0A6J1JAW1 homeobox-leucine zipper protein ATHB-6-like (e value: 2.7e-122, %identity: 76.05)
Query:  SDSLGALMSICPTSDQEHSPRN---NHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF
        SDS+ AL+SI PTSDQE SPRN   NHVY  EF  MLDGFDE    EE GHVSEKKRRL VEQVK+LEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWF
Subjt:  SDSLGALMSICPTSDQEHSPRN---NHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASESQDFN
        QNRRARWKTKQLERDYGVLK NY+NLKLS+E LQ DNQALLKEIRELKAK+QEDNS        EM+  ADSENALIEQ KPEI + FSVPPA       
Subjt:  QNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASESQDFN

Query:  YESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASP--SPS---AAVKLNCATTALSYLQYQKGYQQT
          SFNNNGGEG+E PT         KDGSSDSDSSAILNEDYSPTAG+SSPGVLQN  ++HFMT   P  SPS   A VKLNCATTAL+YLQ+QKGYQQT
Subjt:  YESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASP--SPS---AAVKLNCATTALSYLQYQKGYQQT

Query:  Q-MFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        Q MFPKMEEHNFF GEEACNFFSDEQAPTLHWWS
Subjt:  Q-MFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

A0A6J1KBS4 homeobox-leucine zipper protein ATHB-6-like (e value: 5.6e-144, %identity: 84.48)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ
        MKRPA    DS+GALMSI PT+DQE SPRNN V GTEF SMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ RQ
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQ

Query:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE
        VAVWFQNRRARWKTKQLERDYGVLK NY+NLKL+YETLQ DNQALLKEI+ELK KLQEDNS+SNLS+EEEM V ADSENALIEQ+KPEI +QFSVP A+E
Subjt:  VAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASE

Query:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q
        SQDFNYES +NNGGEG     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISS  VLQ H+H HFMT AASPSPS  VKLNCATTAL+YLQYQKGY Q
Subjt:  SQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMT-AASPSPSAAVKLNCATTALSYLQYQKGY-Q

Query:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        QTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

SwissProt top hits (e value, %identity, alignment)
P46667 Homeobox-leucine zipper protein ATHB-5 (e value: 1.5e-48, %identity: 43.1)
Query:  MKRPAAGSSDSLGALMSI-CPTSDQEHSPR---NNHVY--GTEFHSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK
        MKR + GSSDSL   + I   T+D++ SPR      +Y    ++  M D  +++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERK
Subjt:  MKRPAAGSSDSLGALMSI-CPTSDQEHSPR---NNHVY--GTEFHSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK

Query:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKP
        VKLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ+DN +LL +I+ELKAKL   N E    +EE   + A   N        
Subjt:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKP

Query:  EIAEQFSVPPASESQDFNYESFNNNGGEGEEAPTEEA-----SLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAA
              SV   +E  + ++ S +       +APT E      S+FP   +F+D  +D SDSSA+LNE+YSP                   T  +    AA
Subjt:  EIAEQFSVPPASESQDFNYESFNNNGGEGEEAPTEEA-----SLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAA

Query:  VKLNCATTALSYLQYQKGYQQTQMFPKMEEH-NFFSGEEACNFFSDEQ
          +  +T                 F KMEEH + FSGEEAC  F+D +
Subjt:  VKLNCATTALSYLQYQKGYQQTQMFPKMEEH-NFFSGEEACNFFSDEQ

P46668 Homeobox-leucine zipper protein ATHB-6 (e value: 2.2e-60, %identity: 47.11)
Query:  MKRPAAGSSDSLGALMSICP-TSDQEHSPRNNHVYGTEFHSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
        MKR    SSDS+G L+S+CP TS  E SPR     G EF SML+G+  +EE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELG
Subjt:  MKRPAAGSSDSLGALMSICP-TSDQEHSPRNNHVYGTEFHSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKL-----QEDNSESNLSMEEEMVVAADSENALIEQIKPEIA
        LQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L++DN++LL+EI +LK KL     +E+  E+N ++  E  ++   E   + +   +I 
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKL-----QEDNSESNLSMEEEMVVAADSENALIEQIKPEIA

Query:  EQFSVPP--ASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTA
        E  S PP     S   NY SF +     +  P + A+       GSSD SDSSA+LNE+ S    +++P  +   +   F           VK+      
Subjt:  EQFSVPP--ASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTA

Query:  LSYLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
                  +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  LSYLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

Q6K498 Homeobox-leucine zipper protein HOX4 (e value: 1.8e-38, %identity: 38.6)
Query:  MKRP-AAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
        MKRP  AG      +L+++  +SD          YG        G + EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LG
Subjt:  MKRP-AAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSV
        LQPRQVAVWFQNRRARWKTKQLERDY  L+++Y++L+L ++ L++D  ALL EI+ELKAKL ++ + ++ +  +E   A+D                   
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSV

Query:  PPASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQK
        PPA+                                 GSSDSDSSA+LN+  +  A  ++   L         T     P+A      A  A     +  
Subjt:  PPASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQK

Query:  GYQQTQMFPKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS
        G      F K+EE    F   +E C  FF+D+Q P L  WW+
Subjt:  GYQQTQMFPKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS

Q940J1 Homeobox-leucine zipper protein ATHB-16 (e value: 1.1e-53, %identity: 44.48)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE
        MKR    SSDS+  L+S   TS  E SPR    YG+ + SML+G+DE+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+E
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE

Query:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQF
        LGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L++DN +LL+EI ++KAK+  +   +N               A+ E +K E   + 
Subjt:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQF

Query:  SVPPASESQDFNYESFNNNGGEGEEAPTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALS
           P+S  Q   + S    G     + T+   L P+    + GSSDS DSSA+LN++ S   G  +P V                           T  S
Subjt:  SVPPASESQDFNYESFNNNGGEGEEAPTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALS

Query:  YLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        +LQ+ K  +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  YLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

Q9XH37 Homeobox-leucine zipper protein HOX4 (e value: 1.8e-38, %identity: 38.6)
Query:  MKRP-AAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
        MKRP  AG      +L+++  +SD          YG        G + EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LG
Subjt:  MKRP-AAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSV
        LQPRQVAVWFQNRRARWKTKQLERDY  L+++Y++L+L ++ L++D  ALL EI+ELKAKL ++ + ++ +  +E   A+D                   
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSV

Query:  PPASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQK
        PPA+                                 GSSDSDSSA+LN+  +  A  ++   L         T     P+A      A  A     +  
Subjt:  PPASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQK

Query:  GYQQTQMFPKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS
        G      F K+EE    F   +E C  FF+D+Q P L  WW+
Subjt:  GYQQTQMFPKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS

Arabidopsis top hits (e value, %identity, alignment)
AT1G69780.1 Homeobox-leucine zipper protein family (e value: 1.8e-30, %identity: 50.32)
Query:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALL
        EE   ++   + EKKRRL++EQVK LEKNFE+ NKLEPERK++LAR LGLQPRQ+A+WFQNRRARWKTKQLE+DY  LK  ++ LK   + LQ  NQ L 
Subjt:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALL

Query:  KEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASES
         EI  LK +  E     NL+ E E   +  S+N+  + ++ +I+   + PP+++S
Subjt:  KEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASES

AT2G22430.1 homeobox protein 6 (e value: 1.5e-61, %identity: 47.11)
Query:  MKRPAAGSSDSLGALMSICP-TSDQEHSPRNNHVYGTEFHSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
        MKR    SSDS+G L+S+CP TS  E SPR     G EF SML+G+  +EE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELG
Subjt:  MKRPAAGSSDSLGALMSICP-TSDQEHSPRNNHVYGTEFHSMLDGF--DEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKL-----QEDNSESNLSMEEEMVVAADSENALIEQIKPEIA
        LQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L++DN++LL+EI +LK KL     +E+  E+N ++  E  ++   E   + +   +I 
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKL-----QEDNSESNLSMEEEMVVAADSENALIEQIKPEIA

Query:  EQFSVPP--ASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTA
        E  S PP     S   NY SF +     +  P + A+       GSSD SDSSA+LNE+ S    +++P  +   +   F           VK+      
Subjt:  EQFSVPP--ASESQDFNYESFNNNGGEGEEAPTEEASLFPDFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTA

Query:  LSYLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
                  +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  LSYLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

AT4G40060.1 homeobox protein 16 (e value: 8.2e-55, %identity: 44.48)
Query:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE
        MKR    SSDS+  L+S   TS  E SPR    YG+ + SML+G+DE+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+E
Subjt:  MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE

Query:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQF
        LGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L++DN +LL+EI ++KAK+  +   +N               A+ E +K E   + 
Subjt:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQF

Query:  SVPPASESQDFNYESFNNNGGEGEEAPTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALS
           P+S  Q   + S    G     + T+   L P+    + GSSDS DSSA+LN++ S   G  +P V                           T  S
Subjt:  SVPPASESQDFNYESFNNNGGEGEEAPTEEASLFPD---FKDGSSDS-DSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALS

Query:  YLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        +LQ+ K  +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  YLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWS

AT5G65310.1 homeobox protein 5 (e value: 1.0e-49, %identity: 43.1)
Query:  MKRPAAGSSDSLGALMSI-CPTSDQEHSPR---NNHVY--GTEFHSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK
        MKR + GSSDSL   + I   T+D++ SPR      +Y    ++  M D  +++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERK
Subjt:  MKRPAAGSSDSLGALMSI-CPTSDQEHSPR---NNHVY--GTEFHSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERK

Query:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKP
        VKLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ+DN +LL +I+ELKAKL   N E    +EE   + A   N        
Subjt:  VKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKP

Query:  EIAEQFSVPPASESQDFNYESFNNNGGEGEEAPTEEA-----SLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAA
              SV   +E  + ++ S +       +APT E      S+FP   +F+D  +D SDSSA+LNE+YSP                   T  +    AA
Subjt:  EIAEQFSVPPASESQDFNYESFNNNGGEGEEAPTEEA-----SLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAA

Query:  VKLNCATTALSYLQYQKGYQQTQMFPKMEEH-NFFSGEEACNFFSDEQ
          +  +T                 F KMEEH + FSGEEAC  F+D +
Subjt:  VKLNCATTALSYLQYQKGYQQTQMFPKMEEH-NFFSGEEACNFFSDEQ

AT5G65310.2 homeobox protein 5 (e value: 6.3e-47, %identity: 42.64)
Query:  SDQEHSPR---NNHVY--GTEFHSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR
        +D++ SPR      +Y    ++  M D  +++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQNRR
Subjt:  SDQEHSPR---NNHVY--GTEFHSMLDGFDEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR

Query:  ARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASESQDFNYESF
        ARWKTKQLERDYGVLK+N++ LK + ++LQ+DN +LL +I+ELKAKL   N E    +EE   + A   N              SV   +E  + ++ S 
Subjt:  ARWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASESQDFNYESF

Query:  NNNGGEGEEAPTEEA-----SLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQKGYQQT
        +       +APT E      S+FP   +F+D  +D SDSSA+LNE+YSP                   T  +    AA  +  +T               
Subjt:  NNNGGEGEEAPTEEA-----SLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQKGYQQT

Query:  QMFPKMEEH-NFFSGEEACNFFSDEQ
          F KMEEH + FSGEEAC  F+D +
Subjt:  QMFPKMEEH-NFFSGEEACNFFSDEQ


Sequences
CDS sequence
ATGAAGAGGCCTGCAGCTGGCAGCTCAGATTCCTTGGGTGCACTCATGTCCATTTGCCCAACTTCAGATCAAGAACACAGTCCGAGGAACAACCACGTTTATGGCACGGA
ATTCCATTCGATGCTGGATGGGTTTGATGAGGAAGGTTGCGTTGAAGAATCTGGACATGTTTCAGAGAAGAAGAGACGACTCAGTGTGGAGCAAGTTAAGGCTCTAGAGA
AGAATTTCGAAGTTGAAAACAAGCTCGAACCAGAGAGGAAAGTGAAGCTGGCTCGAGAACTTGGGTTGCAACCTCGACAAGTCGCTGTTTGGTTCCAAAATCGTCGAGCC
AGATGGAAAACCAAGCAATTAGAAAGAGATTATGGCGTTCTCAAGAACAATTACGAGAATCTCAAACTCAGTTATGAAACTCTCCAACAGGACAATCAAGCCCTCCTCAA
AGAGATTCGGGAACTGAAAGCGAAGCTTCAGGAAGATAACTCGGAGAGCAATCTTTCGATGGAGGAAGAGATGGTGGTGGCGGCCGATTCTGAGAATGCTCTGATCGAAC
AAATAAAGCCGGAAATTGCCGAACAGTTCTCTGTTCCTCCGGCGAGTGAATCCCAAGACTTCAATTACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCGCCG
ACGGAAGAGGCGTCATTGTTCCCCGATTTCAAAGATGGGTCATCGGATAGCGATTCGAGCGCAATTTTGAACGAAGATTACAGCCCCACGGCGGGCATTTCTTCACCGGG
GGTGCTGCAGAACCACCACCATCACCACTTCATGACGGCGGCATCTCCGTCTCCGTCCGCCGCCGTGAAACTGAACTGCGCAACGACGGCGCTGAGTTACTTGCAGTATC
AGAAGGGGTATCAACAAACCCAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGCAAGCTCCGACTCTGCAC
TGGTGGAGCTAA
mRNA sequence
ATGAAGAGGCCTGCAGCTGGCAGCTCAGATTCCTTGGGTGCACTCATGTCCATTTGCCCAACTTCAGATCAAGAACACAGTCCGAGGAACAACCACGTTTATGGCACGGA
ATTCCATTCGATGCTGGATGGGTTTGATGAGGAAGGTTGCGTTGAAGAATCTGGACATGTTTCAGAGAAGAAGAGACGACTCAGTGTGGAGCAAGTTAAGGCTCTAGAGA
AGAATTTCGAAGTTGAAAACAAGCTCGAACCAGAGAGGAAAGTGAAGCTGGCTCGAGAACTTGGGTTGCAACCTCGACAAGTCGCTGTTTGGTTCCAAAATCGTCGAGCC
AGATGGAAAACCAAGCAATTAGAAAGAGATTATGGCGTTCTCAAGAACAATTACGAGAATCTCAAACTCAGTTATGAAACTCTCCAACAGGACAATCAAGCCCTCCTCAA
AGAGATTCGGGAACTGAAAGCGAAGCTTCAGGAAGATAACTCGGAGAGCAATCTTTCGATGGAGGAAGAGATGGTGGTGGCGGCCGATTCTGAGAATGCTCTGATCGAAC
AAATAAAGCCGGAAATTGCCGAACAGTTCTCTGTTCCTCCGGCGAGTGAATCCCAAGACTTCAATTACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCGCCG
ACGGAAGAGGCGTCATTGTTCCCCGATTTCAAAGATGGGTCATCGGATAGCGATTCGAGCGCAATTTTGAACGAAGATTACAGCCCCACGGCGGGCATTTCTTCACCGGG
GGTGCTGCAGAACCACCACCATCACCACTTCATGACGGCGGCATCTCCGTCTCCGTCCGCCGCCGTGAAACTGAACTGCGCAACGACGGCGCTGAGTTACTTGCAGTATC
AGAAGGGGTATCAACAAACCCAGATGTTTCCGAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGCAAGCTCCGACTCTGCAC
TGGTGGAGCTAA
Protein sequence
MKRPAAGSSDSLGALMSICPTSDQEHSPRNNHVYGTEFHSMLDGFDEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRA
RWKTKQLERDYGVLKNNYENLKLSYETLQQDNQALLKEIRELKAKLQEDNSESNLSMEEEMVVAADSENALIEQIKPEIAEQFSVPPASESQDFNYESFNNNGGEGEEAP
TEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHHHHHFMTAASPSPSAAVKLNCATTALSYLQYQKGYQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLH
WWS
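
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. A minimal Python sketch follows, assuming the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI table 1), written in the conventional
# TCAG order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS sequence listed on this page.
cds_head = "ATGAAGAGGCCTGCAGCTGGCAGCTCAGAT"
cds_tail = "CTGCACTGGTGGAGCTAA"

print(translate(cds_head))  # start of the protein
print(translate(cds_tail))  # end of the protein, before the TAA stop
```

Passing the full CDS (with line breaks removed) to `translate` should, under the same assumption, yield the protein sequence listed above.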