CuGenDBv2

CcUC10G193710 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC10G193710
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: homeobox-leucine zipper protein ATHB-6-like
Genome location: CicolChr10:15445831..15447943
RNA-Seq Expression: CcUC10G193710
Synteny: CcUC10G193710
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site
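
The genome location in this record has the form `seqid:start..end`. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. The helper names (`Region`, `parse_region`) are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED to be 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string into a Region (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a 'seqid:start..end' location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(seqid, start, end)


region = parse_region("CicolChr10:15445831..15447943")
print(region.seqid, region.start, region.end, region.length)
```

Under the inclusive-coordinate assumption the gene spans 2113 bp on CicolChr10.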


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7031308.1 Homeobox-leucine zipper protein ATHB-6 [Cucurbita argyrosperma subsp. argyrosperma]; e-value 1.4e-136; identity 80.65%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP
        MKRPA   DS+GAL+SI PT+D EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ 
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP

Query:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA
        RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNSESN+SVEEE  V ADSENAL++Q KPEI DQFSVP A
Subjt:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA

Query:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY
        +ESQDFN+ S +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTGA SP+PS  VK       LNYLQ+QKGY
Subjt:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY

Query:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        Q QTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

XP_022136962.1 homeobox-leucine zipper protein ATHB-6-like [Momordica charantia]; e-value 1.1e-141; identity 82.3%
Query:  MKRPAVG-SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
        MKRP  G SDSLGAL+SICPTSD EQSPRN   NHVYG EFQSMLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
Subjt:  MKRPAVG-SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ

Query:  PRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVV-AADSENALMKQTKPEIGDQFSVP
        PRQVAVWFQNRRARWKTKQLERDYGVLKTNYE LKL+YETLQ DN ALLK+IRELKSKLQEDNSESN+SVEEEMV+ AADSENAL+++T+PE GD FSVP
Subjt:  PRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVV-AADSENALMKQTKPEIGDQFSVP

Query:  PASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQK
        PA+E +D N+ESFNNNGGEGEE   EE SLF DFKDGSSDSDSSAILNEDYS TA ISSPGVLQN Q YHFM  + SP+PSAAVK       LNY QFQK
Subjt:  PASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQK

Query:  GYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW
         Y QQTQ++PKMEEHNFF+GEE  CNFFS+EQAP+LHWW
Subjt:  GYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW

XP_022999792.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita maxima]; e-value 2.7e-137; identity 80.95%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP
        MKRPA   DS+GAL+SI PT+D EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ 
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP

Query:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA
        RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNS+SN+SVEEEM V ADSENAL++Q KPEI DQFSVP A
Subjt:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA

Query:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY
        +ESQDFN+ES +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISS  VLQ H   HFMTGA SP+PS  VK       LNYLQ+QKGY
Subjt:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY

Query:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        QQQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

XP_023547219.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita pepo subsp. pepo]; e-value 1.2e-137; identity 80.95%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP
        MKRPA   DS+GAL+SI PT+D EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ 
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP

Query:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA
        RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNSESN+SVEEEM V ADSENAL++Q KPEI DQFSVP A
Subjt:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA

Query:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY
        +E+QDFN+ES ++NGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTGA SP+PS  VK       LNYLQ+QKGY
Subjt:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY

Query:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        QQQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

XP_038905018.1 homeobox-leucine zipper protein ATHB-6-like [Benincasa hispida]; e-value 3.0e-168; identity 93.03%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP
        MKRPAVGSDSLGALISICP+SDHEQSPRNKNSNHVY TEFQSMLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP

Query:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEI-GDQFSVPP
        RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLS+ETLQNDNQ LLKQIRELK+KLQEDNSESN+SVEEEMVVAA+SENA ++QTKPEI GDQFSVPP
Subjt:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEI-GDQFSVPP

Query:  ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQM
        ASESQDFN+ESFN+NGGEGEEA +EE +LF DFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYH MTGA+SP PSAAVKLNYLQFQKGYQQQTQM
Subjt:  ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQM

Query:  FPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        FPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  FPKMEEHNFFSGEEACNFFSDEQAPTLHWW

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1C6W4 homeobox-leucine zipper protein ATHB-6-like; e-value 5.2e-142; identity 82.3%
Query:  MKRPAVG-SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
        MKRP  G SDSLGAL+SICPTSD EQSPRN   NHVYG EFQSMLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ
Subjt:  MKRPAVG-SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ

Query:  PRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVV-AADSENALMKQTKPEIGDQFSVP
        PRQVAVWFQNRRARWKTKQLERDYGVLKTNYE LKL+YETLQ DN ALLK+IRELKSKLQEDNSESN+SVEEEMV+ AADSENAL+++T+PE GD FSVP
Subjt:  PRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVV-AADSENALMKQTKPEIGDQFSVP

Query:  PASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQK
        PA+E +D N+ESFNNNGGEGEE   EE SLF DFKDGSSDSDSSAILNEDYS TA ISSPGVLQN Q YHFM  + SP+PSAAVK       LNY QFQK
Subjt:  PASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQK

Query:  GYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW
         Y QQTQ++PKMEEHNFF+GEE  CNFFS+EQAP+LHWW
Subjt:  GYQQQTQMFPKMEEHNFFSGEE-ACNFFSDEQAPTLHWW

A0A6J1ENE6 homeobox-leucine zipper protein ATHB-6-like; e-value 1.6e-122; identity 74.77%
Query:  SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF
        SDS+ ALISI PTSD EQSPRNKNSNHVY  EFQ MLDGF+E    EE GHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWF
Subjt:  SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFN
        QNRRARWKTKQLERDYGVLKTNY+NLKLS+E LQNDNQALLK+IRELK+K+QEDNS        EM+V ADSENAL++QTKPEI D FSVPPA       
Subjt:  QNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFN

Query:  HESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS----------AAVKLNYLQFQKGYQQQ
          SFNNNGGEG+E            KDGSSDSDSSAILNEDYSPTAG+SSPGVLQN+   HFMTGA+ P +PS          A   LNYLQFQKGYQQ 
Subjt:  HESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS----------AAVKLNYLQFQKGYQQQ

Query:  TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
          MFPKMEEHNFF GEEACNFFSDEQAPTLHWW
Subjt:  TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

A0A6J1FNC9 homeobox-leucine zipper protein ATHB-6-like; e-value 1.9e-136; identity 80.36%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP
        MKRPA   DS+GAL+SI PT+D EQSPRN   N V GTEFQSMLDGF E+G VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ 
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP

Query:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA
        RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNSESN+SVEEE  V ADSENAL++Q KPEI DQFSVP A
Subjt:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA

Query:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY
        +ESQDFN+ S +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTGA SP+PS  VK       LNYLQ+QKGY
Subjt:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY

Query:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        QQQ+QMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

A0A6J1JAW1 homeobox-leucine zipper protein ATHB-6-like; e-value 3.8e-121; identity 73.87%
Query:  SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF
        SDS+ ALISI PTSD EQSPRNKNSNHVY  EFQ MLDGF+E    EE GHVSEKKRRL VEQVK+LEKNFEVENKLEPERK+KLA+ELGLQPRQVAVWF
Subjt:  SDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWF

Query:  QNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFN
        QNRRARWKTKQLERDYGVLKTNY+NLKLS+E LQNDNQALLK+IRELK+K+QEDNS        EM+  ADSENAL++QTKPEI D FSVPPA       
Subjt:  QNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFN

Query:  HESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS----------AAVKLNYLQFQKGYQQQ
          SFNNNGGEG+E            KDGSSDSDSSAILNEDYSPTAG+SSPGVLQN+   HFMTG + P +PS          A   LNYLQFQKGYQQ 
Subjt:  HESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP-APS----------AAVKLNYLQFQKGYQQQ

Query:  TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
          MFPKMEEHNFF GEEACNFFSDEQAPTLHWW
Subjt:  TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

A0A6J1KBS4 homeobox-leucine zipper protein ATHB-6-like; e-value 1.3e-137; identity 80.95%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP
        MKRPA   DS+GAL+SI PT+D EQSPRN   N V GTEFQSMLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQ 
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQP

Query:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA
        RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNS+SN+SVEEEM V ADSENAL++Q KPEI DQFSVP A
Subjt:  RQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPA

Query:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY
        +ESQDFN+ES +NNGGEG     EEVSLF DFKDGSSDSDSSAILNEDY PT  ISS  VLQ H   HFMTGA SP+PS  VK       LNYLQ+QKGY
Subjt:  SESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVK-------LNYLQFQKGY

Query:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        QQQTQMFPKMEEHNFFSGEE CNFFSDEQAPTLHWW
Subjt:  QQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

SwissProt top hits (e-value, % identity, alignment)
P46667 Homeobox-leucine zipper protein ATHB-5; e-value 2.0e-50; identity 43.36%
Query:  MKRPAVGSDSLGALISI-CPTSDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV
        MKR    SDSL   + I   T+D + SPR   +  +Y    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKV
Subjt:  MKRPAVGSDSLGALISI-CPTSDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV

Query:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEE-MVVAADSENALMKQTKP
        KLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL QI+ELK+KL   N E    +EE   + A ++  ++M   + 
Subjt:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEE-MVVAADSENALMKQTKP

Query:  -EIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKL
         E+  +   PP     D              E A E  S+F    +F+D  +D SDSSA+LNE+YSP                      V  A + A   
Subjt:  -EIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKL

Query:  NYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ
          +     + Q    F KMEEH + FSGEEAC  F+D +
Subjt:  NYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ

P46668 Homeobox-leucine zipper protein ATHB-6; e-value 3.9e-62; identity 47.51%
Query:  MKRPAVGSDSLGALISICP-TSDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE
        M +    SDS+G LIS+CP TS  EQSPR        G EFQSML+G+  EEE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+E
Subjt:  MKRPAVGSDSLGALISICP-TSDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE

Query:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL-----QEDNSESNISVEEEMVVAADSENALMKQTKPE
        LGLQPRQVAVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL++I +LK+KL     +E+  E+N +V  E  ++   E   + +   +
Subjt:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL-----QEDNSESNISVEEEMVVAADSENALMKQTKPE

Query:  IGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQ
        I +  S PP     S   N+ SF +        A    S FA     S  SDSSA+LNE+ S    +++P  +                       N+ Q
Subjt:  IGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQ

Query:  FQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        F K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  FQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

Q6K498 Homeobox-leucine zipper protein HOX4; e-value 1.5e-37; identity 40.34%
Query:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ
        G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+
Subjt:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ

Query:  NDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDS
         D  ALL +I+ELK+KL ++ + ++ +  +E   A+D                   PPA                             A F  GSSDSDS
Subjt:  NDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDS

Query:  SAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEAC-NFFSDEQAPTL-HWW
        SA+LN+  +  A  ++   L    +     GA   A + A        ++ +     +  + +E  F   +E C  FF+D+Q P L  WW
Subjt:  SAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEAC-NFFSDEQAPTL-HWW

Q940J1 Homeobox-leucine zipper protein ATHB-16; e-value 6.2e-52; identity 44.76%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA
        MKR +  SDS+  LIS   TS  EQSPR       YG+ +QSML+G++E+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA

Query:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL--QEDNSESNI---SVEEEMVVAADSENALMKQTK
        +ELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L+ DN +LL++I ++K+K+  +EDN+ +      V+EE V   DS         
Subjt:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL--QEDNSESNI---SVEEEMVVAADSENALMKQTK

Query:  PEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP
                  P+S  Q   H S FN                F D +D          GSSDS DSSA+LN++ S   G  +P V         +TG    
Subjt:  PEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP

Query:  APSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
                ++LQF K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  APSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

Q9XH37 Homeobox-leucine zipper protein HOX4; e-value 1.5e-37; identity 40.34%
Query:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ
        G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+
Subjt:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ

Query:  NDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDS
         D  ALL +I+ELK+KL ++ + ++ +  +E   A+D                   PPA                             A F  GSSDSDS
Subjt:  NDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDS

Query:  SAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEAC-NFFSDEQAPTL-HWW
        SA+LN+  +  A  ++   L    +     GA   A + A        ++ +     +  + +E  F   +E C  FF+D+Q P L  WW
Subjt:  SAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEAC-NFFSDEQAPTL-HWW

Arabidopsis top hits (e-value, % identity, alignment)
AT1G69780.1 Homeobox-leucine zipper protein family; e-value 5.3e-30; identity 47.74%
Query:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALL
        EE   ++   + EKKRRL++EQVK LEKNFE+ NKLEPERK++LAR LGLQPRQ+A+WFQNRRARWKTKQLE+DY  LK  ++ LK   + LQ  NQ L 
Subjt:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALL

Query:  KQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASES
         +I  LK++ Q ++   N   E      +D+ +  ++       D  + PP+++S
Subjt:  KQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASES

AT2G22430.1 homeobox protein 6; e-value 2.8e-63; identity 47.51%
Query:  MKRPAVGSDSLGALISICP-TSDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE
        M +    SDS+G LIS+CP TS  EQSPR        G EFQSML+G+  EEE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+E
Subjt:  MKRPAVGSDSLGALISICP-TSDHEQSPRNKNSNHVYGTEFQSMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARE

Query:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL-----QEDNSESNISVEEEMVVAADSENALMKQTKPE
        LGLQPRQVAVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL++I +LK+KL     +E+  E+N +V  E  ++   E   + +   +
Subjt:  LGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL-----QEDNSESNISVEEEMVVAADSENALMKQTKPE

Query:  IGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQ
        I +  S PP     S   N+ SF +        A    S FA     S  SDSSA+LNE+ S    +++P  +                       N+ Q
Subjt:  IGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQ

Query:  FQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
        F K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  FQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

AT4G40060.1 homeobox protein 16; e-value 4.4e-53; identity 44.76%
Query:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA
        MKR +  SDS+  LIS   TS  EQSPR       YG+ +QSML+G++E+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA
Subjt:  MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA

Query:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL--QEDNSESNI---SVEEEMVVAADSENALMKQTK
        +ELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L+ DN +LL++I ++K+K+  +EDN+ +      V+EE V   DS         
Subjt:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL--QEDNSESNI---SVEEEMVVAADSENALMKQTK

Query:  PEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP
                  P+S  Q   H S FN                F D +D          GSSDS DSSA+LN++ S   G  +P V         +TG    
Subjt:  PEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVSLFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP

Query:  APSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW
                ++LQF K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Subjt:  APSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW

AT5G65310.1 homeobox protein 5; e-value 1.4e-51; identity 43.36%
Query:  MKRPAVGSDSLGALISI-CPTSDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV
        MKR    SDSL   + I   T+D + SPR   +  +Y    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKV
Subjt:  MKRPAVGSDSLGALISI-CPTSDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV

Query:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEE-MVVAADSENALMKQTKP
        KLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL QI+ELK+KL   N E    +EE   + A ++  ++M   + 
Subjt:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEE-MVVAADSENALMKQTKP

Query:  -EIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKL
         E+  +   PP     D              E A E  S+F    +F+D  +D SDSSA+LNE+YSP                      V  A + A   
Subjt:  -EIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKL

Query:  NYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ
          +     + Q    F KMEEH + FSGEEAC  F+D +
Subjt:  NYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ

AT5G65310.2 homeobox protein 5; e-value 1.1e-48; identity 43.4%
Query:  SDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR
        +D + SPR   +  +Y    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQNRR
Subjt:  SDHEQSPRNKNSNHVY--GTEFQSMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR

Query:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEE-MVVAADSENALMKQTKP-EIGDQFSVPPASESQDFNHE
        ARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL QI+ELK+KL   N E    +EE   + A ++  ++M   +  E+  +   PP     D    
Subjt:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEE-MVVAADSENALMKQTKP-EIGDQFSVPPASESQDFNHE

Query:  SFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEE
                  E A E  S+F    +F+D  +D SDSSA+LNE+YSP                      V  A + A     +     + Q    F KMEE
Subjt:  SFNNNGGEGEEAAIEEVSLF---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEE

Query:  H-NFFSGEEACNFFSDEQ
        H + FSGEEAC  F+D +
Subjt:  H-NFFSGEEACNFFSDEQ
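
In the alignments above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identity and positive counts for one alignment block can be derived from that line. The function name `alignment_stats` is hypothetical, and the example uses only the first 14 columns of the KAG7031308.1 alignment, so its percentage is not the whole-hit identity reported in the hit table.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style midline.

    Assumes the midline has been padded to the same length as the query line.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())  # letters = identical residues
    positive = identical + midline.count("+")             # '+' = conservative substitution
    length = len(query)                                    # alignment columns, gaps included
    return {
        "length": length,
        "identical": identical,
        "positive": positive,
        "pct_identity": round(100.0 * identical / length, 2),
    }


# First 14 alignment columns of the KAG7031308.1 hit (a fragment, not the full hit).
stats = alignment_stats("MKRPAVGSDSLGAL", "MKRPA   DS+GAL")
print(stats)
```

For this fragment the sketch counts 10 identical and 11 positive positions over 14 columns.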


Sequences
CDS sequence
ATGAAGAGACCTGCAGTCGGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAACTTCAGATCATGAACAGAGTCCGAGAAACAAGAACAGTAACCATGTTTATGG
CACAGAATTCCAGTCTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCGGGGCATGTATCAGAGAAGAAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTC
TAGAGAAGAATTTCGAAGTTGAAAACAAGCTCGAGCCAGAAAGGAAAGTGAAGCTTGCTCGAGAACTTGGATTGCAGCCTCGACAAGTGGCTGTTTGGTTTCAAAATCGT
CGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACAAATTATGAGAATCTCAAACTCAGTTATGAAACTCTCCAAAATGACAATCAAGCTCT
CCTCAAACAGATTCGGGAACTGAAATCAAAGCTTCAAGAAGATAACTCAGAGAGCAATATTTCGGTGGAGGAAGAAATGGTGGTGGCGGCCGATTCTGAAAATGCTCTGA
TGAAACAAACTAAGCCGGAAATTGGTGATCAGTTCTCTGTTCCGCCGGCGAGTGAGTCCCAAGACTTCAATCACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAG
GCAGCAATAGAAGAAGTGTCATTGTTCGCCGATTTCAAAGATGGGTCATCTGATAGCGATTCGAGCGCAATTTTAAACGAAGATTACAGCCCGACGGCGGGCATTTCTTC
ACCCGGGGTGCTGCAGAATCACCAGCAGTACCATTTCATGACGGGAGCGGTATCTCCGGCGCCTTCCGCCGCCGTGAAACTCAACTACTTGCAGTTTCAAAAGGGGTATC
AACAACAAACCCAGATGTTTCCAAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGGGC
TGA
mRNA sequence
TTTTTTTTTTCACTTAATTTGATTCCTCTTTCTTCTGGGTTTTGCGTATAAGCTTCCTATTTTGGAGTTGGGGGATTCTAAATGGGTTCCGAGGGAGAAGAAATTTTGAA
GAACAATGATGATGAATCAGGAAAAAGGAAGTGTTTGTGCAGTCATTGTGAACAACACTGTACAAATTGCTCTTCTTTTTCTTTCACTACACATCGGACTTTTGCCTTCA
TTGCTTCCTGAGTGATATCATAATTTCAAATTTCTTTTCCCATTCAGACACTTAAAACAAACAAACAAACAATCATTAAACAGCAACAAAAAACTAACCCATCATCATCA
TCATCTAAGTAGAAGTGTGTTTAATTTTTTGCTGATTCCCAATCATGAAGAGACCTGCAGTCGGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAACTTCAGAT
CATGAACAGAGTCCGAGAAACAAGAACAGTAACCATGTTTATGGCACAGAATTCCAGTCTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCGGGGCATGT
ATCAGAGAAGAAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTCTAGAGAAGAATTTCGAAGTTGAAAACAAGCTCGAGCCAGAAAGGAAAGTGAAGCTTGCTCGAGAAC
TTGGATTGCAGCCTCGACAAGTGGCTGTTTGGTTTCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACAAATTATGAGAAT
CTCAAACTCAGTTATGAAACTCTCCAAAATGACAATCAAGCTCTCCTCAAACAGATTCGGGAACTGAAATCAAAGCTTCAAGAAGATAACTCAGAGAGCAATATTTCGGT
GGAGGAAGAAATGGTGGTGGCGGCCGATTCTGAAAATGCTCTGATGAAACAAACTAAGCCGGAAATTGGTGATCAGTTCTCTGTTCCGCCGGCGAGTGAGTCCCAAGACT
TCAATCACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCAGCAATAGAAGAAGTGTCATTGTTCGCCGATTTCAAAGATGGGTCATCTGATAGCGATTCGAGC
GCAATTTTAAACGAAGATTACAGCCCGACGGCGGGCATTTCTTCACCCGGGGTGCTGCAGAATCACCAGCAGTACCATTTCATGACGGGAGCGGTATCTCCGGCGCCTTC
CGCCGCCGTGAAACTCAACTACTTGCAGTTTCAAAAGGGGTATCAACAACAAACCCAGATGTTTCCAAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGGCTTGTA
ACTTCTTCTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGGGCTGAATCCATGGCGGCATAAAGGCGAAATTTAAGAGAGAATAAAAACGGAAGTAGATTTGGAAACTT
GGGTGTGTAGTAATTTTGAAGATGGTCGGAATGAAGGAAGAAGATGATTGAATTGGGGAAGAGTAATGGAGAAATTGAGGTGGGATTCTCCATGAAAATAATTTGGGCTA
ATTTTTCCCTTTTTCCTTTGATTCTTCCATGAGTTCGTGAGGGGAAAATGATGAGATGGAAACTGAAATCTAAGATGTAAAAATTAAAACCAGAAAAGAAAAAAAAAAAA
AAAAATAGAGAGAAAAATCAAATAGGGTTAGGAAAAAAACTCATGTAATGGAGAAATAATCATAGTGTTCGCTCTTTTGCCGTTGTCCAAACAAAACTTTACAACT
Protein sequence
MKRPAVGSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNR
RARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNISVEEEMVVAADSENALMKQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEE
AAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWG
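
The protein sequence corresponds to the CDS read codon by codon. The sketch below shows one way to check that relationship, assuming the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper, and the two inputs are the first seven codons and the last five codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code (ASSUMED), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


start_fragment = translate("ATGAAGAGACCTGCAGTCGGC")  # first 7 codons of the CDS
end_fragment = translate("CACTGGTGGGGCTGA")          # last 5 codons, including the stop
print(start_fragment, end_fragment)
```

The two fragments translate to MKRPAVG and HWWG, which match the start and end of the protein sequence; passing the full CDS to the same function should give the complete protein under the same assumption.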