; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi05G008180 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi05G008180
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionhomeobox-leucine zipper protein ATHB-6-like
Genome locationchr05:12772727..12774724
RNA-Seq ExpressionLsi05G008180
SyntenyLsi05G008180
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000047 - Helix-turn-helix motif
IPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031308.1 Homeobox-leucine zipper protein ATHB-6 [Cucurbita argyrosperma subsp. argyrosperma]9.1e-13378.76Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
        MKRPA   DS+GAL+SI P +D EQSPR     NN V GTEFQ MLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP
        Q RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNSESNLSVEEE  V ADSENALI+Q K EI D FSVP
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP

Query:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK
         A+ESQDFNY S +NNGG G     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTG  SP+PS  VK       LNYL +QK
Subjt:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK

Query:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        GYQ QTQMF KMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

XP_022136962.1 homeobox-leucine zipper protein ATHB-6-like [Momordica charantia]1.5e-13880.99Show/hide
Query:  MKRP-AVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
        MKRP A SSDSLGAL+SICP SD EQSPR     NNHVYG EFQ MLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
Subjt:  MKRP-AVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVV-AADSENALIQQTKTEIGDHFS
        LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYE LKL+YETLQ DN ALLK+IRELK+KLQEDNSESN+SVEEEMV+ AADSENALI++T+ E GD FS
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVV-AADSENALIQQTKTEIGDHFS

Query:  VPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPF
        VPPA+E +D NYESFNNNGG GEE   EEASLFPDFKDGSSDSDSSAILNEDYS TA ISSPGVLQ +Q YHFM    SP+PSAAVK       LNY  F
Subjt:  VPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPF

Query:  QKGYQQQTQMFTKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS
        QK Y QQTQ++ KMEEHNFF+GEE  CNFFS+EQAP+LHWWS
Subjt:  QKGYQQQTQMFTKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS

XP_022999792.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita maxima]1.8e-13379.06Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
        MKRPA   DS+GAL+SI P +D EQSPR     NN V GTEFQ MLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP
        Q RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNS+SNLSVEEEM V ADSENALI+Q K EI D FSVP
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP

Query:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK
         A+ESQDFNYES +NNGG G     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISS  VLQ H   HFMTG  SP+PS  VK       LNYL +QK
Subjt:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK

Query:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        GYQQQTQMF KMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

XP_023547219.1 homeobox-leucine zipper protein ATHB-6-like [Cucurbita pepo subsp. pepo]8.2e-13479.06Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
        MKRPA   DS+GAL+SI P +D EQSPR     NN V GTEFQ MLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP
        Q RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNSESNLSVEEEM V ADSENALI+Q K EI D FSVP
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP

Query:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK
         A+E+QDFNYES ++NGG G     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTG  SP+PS  VK       LNYL +QK
Subjt:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK

Query:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        GYQQQTQMF KMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

XP_038905018.1 homeobox-leucine zipper protein ATHB-6-like [Benincasa hispida]6.5e-16391.29Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
        MKRPAV SDSLGALISICP+SDHEQSPRNKNS  NHVY TEFQ MLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEI-GDHFSV
        QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLS+ETLQNDNQ LLKQIRELK KLQEDNSESNLSVEEEMVVAA+SENA I+QTK EI GD FSV
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEI-GDHFSV

Query:  PPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNYLPFQKGYQQQT
        PPASESQDFNYESFN+NGG GEEA +EEA+LFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQ HQQYH MTG  SP PSAAVKLNYL FQKGYQQQT
Subjt:  PPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNYLPFQKGYQQQT

Query:  QMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        QMF KMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  QMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

TrEMBL top hitse value%identityAlignment
A0A6J1C6W4 homeobox-leucine zipper protein ATHB-6-like7.0e-13980.99Show/hide
Query:  MKRP-AVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
        MKRP A SSDSLGAL+SICP SD EQSPR     NNHVYG EFQ MLDGF+EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG
Subjt:  MKRP-AVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELG

Query:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVV-AADSENALIQQTKTEIGDHFS
        LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYE LKL+YETLQ DN ALLK+IRELK+KLQEDNSESN+SVEEEMV+ AADSENALI++T+ E GD FS
Subjt:  LQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVV-AADSENALIQQTKTEIGDHFS

Query:  VPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPF
        VPPA+E +D NYESFNNNGG GEE   EEASLFPDFKDGSSDSDSSAILNEDYS TA ISSPGVLQ +Q YHFM    SP+PSAAVK       LNY  F
Subjt:  VPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPF

Query:  QKGYQQQTQMFTKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS
        QK Y QQTQ++ KMEEHNFF+GEE  CNFFS+EQAP+LHWWS
Subjt:  QKGYQQQTQMFTKMEEHNFFSGEE-ACNFFSDEQAPTLHWWS

A0A6J1ENE6 homeobox-leucine zipper protein ATHB-6-like1.4e-11874.11Show/hide
Query:  SDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAV
        SDS+ ALISI P SD EQSPRNKNS  NHVY  EFQCMLDGF+E    EE GHVSEKKRRL VEQVKALEKNFEVENKLEPERK+KLA+ELGLQPRQVAV
Subjt:  SDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAV

Query:  WFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQD
        WFQNRRARWKTKQLERDYGVLKTNY+NLKLS+E LQNDNQALLK+IRELKAK+QEDNS        EM+V ADSENALI+QTK EI D FSVPPA     
Subjt:  WFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQD

Query:  FNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTG---PESPA-PSAAVK-------LNYLPFQKGYQ
            SFNNNGG G+E         P  KDGSSDSDSSAILNEDYSPTAG+SSPGVLQ +   HFMTG   P SP+  +A VK       LNYL FQKGYQ
Subjt:  FNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTG---PESPA-PSAAVK-------LNYLPFQKGYQ

Query:  QQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        Q   MF KMEEHNFF GEEACNFFSDEQAPTLHWWS
Subjt:  QQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

A0A6J1FNC9 homeobox-leucine zipper protein ATHB-6-like1.3e-13278.47Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
        MKRPA   DS+GAL+SI P +D EQSPR     NN V GTEFQ MLDGF E+G VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP
        Q RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNSESNLSVEEE  V ADSENALI+Q K EI D FSVP
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP

Query:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK
         A+ESQDFNY S +NNGG G     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISSP VLQ H   HFMTG  SP+PS  VK       LNYL +QK
Subjt:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK

Query:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        GYQQQ+QMF KMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

A0A6J1JAW1 homeobox-leucine zipper protein ATHB-6-like8.9e-11873.51Show/hide
Query:  SDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAV
        SDS+ ALISI P SD EQSPRNKNS  NHVY  EFQCMLDGF+E    EE GHVSEKKRRL VEQVK+LEKNFEVENKLEPERK+KLA+ELGLQPRQVAV
Subjt:  SDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAV

Query:  WFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQD
        WFQNRRARWKTKQLERDYGVLKTNY+NLKLS+E LQNDNQALLK+IRELKAK+QEDNS        EM+  ADSENALI+QTK EI D FSVPPA     
Subjt:  WFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQD

Query:  FNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTG---PESPA-PSAAVK-------LNYLPFQKGYQ
            SFNNNGG G+E         P  KDGSSDSDSSAILNEDYSPTAG+SSPGVLQ +   HFMTG   P SP+  +A VK       LNYL FQKGYQ
Subjt:  FNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTG---PESPA-PSAAVK-------LNYLPFQKGYQ

Query:  QQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        Q   MF KMEEHNFF GEEACNFFSDEQAPTLHWWS
Subjt:  QQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

A0A6J1KBS4 homeobox-leucine zipper protein ATHB-6-like8.9e-13479.06Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
        MKRPA   DS+GAL+SI P +D EQSPR     NN V GTEFQ MLDGF EEG VEE GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGL

Query:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP
        Q RQVAVWFQNRRARWKTKQLERDYGVLKTNY+NLKL+YETLQ+DNQALLK+I+ELK KLQEDNS+SNLSVEEEM V ADSENALI+Q K EI D FSVP
Subjt:  QPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVP

Query:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK
         A+ESQDFNYES +NNGG G     EE SLFPDFKDGSSDSDSSAILNEDY PT  ISS  VLQ H   HFMTG  SP+PS  VK       LNYL +QK
Subjt:  PASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVK-------LNYLPFQK

Query:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        GYQQQTQMF KMEEHNFFSGEE CNFFSDEQAPTLHWWS
Subjt:  GYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

SwissProt top hitse value%identityAlignment
P46667 Homeobox-leucine zipper protein ATHB-51.5e-5043.66Show/hide
Query:  MKRPAVSSDSLGALISI-CPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV
        MKR   SSDSL   + I    +D + SPR   +   +    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKV
Subjt:  MKRPAVSSDSLGALISI-CPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV

Query:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTK--
        KLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL QI+ELKAKL   N E    +EE   + A   N  +      
Subjt:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTK--

Query:  TEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL
         E+      PP     D              E A E  S+FP   +F+D  +D SDSSA+LNE+YSP   + + G +                 +  V++
Subjt:  TEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL

Query:  NYLPFQKGYQQQTQMFTKMEEH-NFFSGEEACNFFSDEQ
        + +    G   Q   F KMEEH + FSGEEAC  F+D +
Subjt:  NYLPFQKGYQQQTQMFTKMEEH-NFFSGEEACNFFSDEQ

P46668 Homeobox-leucine zipper protein ATHB-68.7e-6247.97Show/hide
Query:  MKRPAVSSDSLGALISICP-NSDHEQSPRNKNSTNNHVYGTEFQCMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA
        M +   SSDS+G LIS+CP  S  EQSPR          G EFQ ML+G+  EEE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA
Subjt:  MKRPAVSSDSLGALISICP-NSDHEQSPRNKNSTNNHVYGTEFQCMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA

Query:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL-----QEDNSESNLSVEEEMVVAADSENALIQQTK
        +ELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL++I +LK KL     +E+  E+N +V  E  ++   E   + +  
Subjt:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL-----QEDNSESNLSVEEEMVVAADSENALIQQTK

Query:  TEIGDHFSVPP--ASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNY
        TE     S PP     S   NY SF +        A   AS F      S  SDSSA+LNE+ S    +++P           +T P           N+
Subjt:  TEIGDHFSVPP--ASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNY

Query:  LPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
          F K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  LPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

Q6K498 Homeobox-leucine zipper protein HOX41.8e-3841.22Show/hide
Query:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ
        G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+
Subjt:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ

Query:  NDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDS
         D  ALL +I+ELKAKL ++ + ++ +  +E   A+D                   PPA+                                 GSSDSDS
Subjt:  NDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDS

Query:  SAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESP---APSAAVKLNYLPFQKGYQQQTQMFTKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS
        SA+LN+  +  A  ++   L   +   F+  P +    A +AA   +   F  G       F K+EE    F   +E C  FF+D+Q P L  WW+
Subjt:  SAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESP---APSAAVKLNYLPFQKGYQQQTQMFTKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS

Q940J1 Homeobox-leucine zipper protein ATHB-164.8e-5245.66Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVK
        MKR + SSDS+  LIS    S  EQSPR         YG+ +Q ML+G++E+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK K
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVK

Query:  LARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL--QEDNSESNL---SVEEEMVVAADSENALIQQ
        LA+ELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L+ DN +LL++I ++KAK+  +EDN+ +      V+EE V   DS    I  
Subjt:  LARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL--QEDNSESNL---SVEEEMVVAADSENALIQQ

Query:  TKTEIGDHFSVPPASESQDFNY-ESFNNNGGGGEEAAIEEASLFPDFKDGSSDS-DSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL
        +  +  +H        S  FNY  SF +       + + EA        GSSDS DSSA+LN++ S   G  +P V         +TG            
Subjt:  TKTEIGDHFSVPPASESQDFNY-ESFNNNGGGGEEAAIEEASLFPDFKDGSSDS-DSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL

Query:  NYLPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        ++L F K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  NYLPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

Q9XH37 Homeobox-leucine zipper protein HOX41.8e-3841.22Show/hide
Query:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ
        G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPRQVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+
Subjt:  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQ

Query:  NDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDS
         D  ALL +I+ELKAKL ++ + ++ +  +E   A+D                   PPA+                                 GSSDSDS
Subjt:  NDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDS

Query:  SAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESP---APSAAVKLNYLPFQKGYQQQTQMFTKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS
        SA+LN+  +  A  ++   L   +   F+  P +    A +AA   +   F  G       F K+EE    F   +E C  FF+D+Q P L  WW+
Subjt:  SAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESP---APSAAVKLNYLPFQKGYQQQTQMFTKMEEH--NFFSGEEAC-NFFSDEQAPTL-HWWS

Arabidopsis top hitse value%identityAlignment
AT1G69780.1 Homeobox-leucine zipper protein family4.1e-3042.57Show/hide
Query:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALL
        EE   ++   + EKKRRL++EQVK LEKNFE+ NKLEPERK++LAR LGLQPRQ+A+WFQNRRARWKTKQLE+DY  LK  ++ LK   + LQ  NQ L 
Subjt:  EEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALL

Query:  KQIRELKAKLQ-----------------EDNSESNLSVEEEMVVAADSENALI----QQTKTEIGDHFSVP-PASESQDFNYESFNNNGGGGEEAAIEEA
         +I  LK + Q                  DNS  NL +  ++  A  S ++ +          +G HF  P PA+ +       F  N   G+    EE 
Subjt:  KQIRELKAKLQ-----------------EDNSESNLSVEEEMVVAADSENALI----QQTKTEIGDHFSVP-PASESQDFNYESFNNNGGGGEEAAIEEA

Query:  SL
        S+
Subjt:  SL

AT2G22430.1 homeobox protein 66.2e-6347.97Show/hide
Query:  MKRPAVSSDSLGALISICP-NSDHEQSPRNKNSTNNHVYGTEFQCMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA
        M +   SSDS+G LIS+CP  S  EQSPR          G EFQ ML+G+  EEE  VEE GHV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA
Subjt:  MKRPAVSSDSLGALISICP-NSDHEQSPRNKNSTNNHVYGTEFQCMLDGF--EEEGCVEESGHV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLA

Query:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL-----QEDNSESNLSVEEEMVVAADSENALIQQTK
        +ELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLKT Y++L+ ++++L+ DN++LL++I +LK KL     +E+  E+N +V  E  ++   E   + +  
Subjt:  RELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL-----QEDNSESNLSVEEEMVVAADSENALIQQTK

Query:  TEIGDHFSVPP--ASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNY
        TE     S PP     S   NY SF +        A   AS F      S  SDSSA+LNE+ S    +++P           +T P           N+
Subjt:  TEIGDHFSVPP--ASESQDFNYESFNNNGGGGEEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNY

Query:  LPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
          F K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  LPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

AT4G40060.1 homeobox protein 163.4e-5345.66Show/hide
Query:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVK
        MKR + SSDS+  LIS    S  EQSPR         YG+ +Q ML+G++E+  +  E SG+     +SEKKRRL V+QVKALEKNFE+ENKLEPERK K
Subjt:  MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCV--EESGH-----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVK

Query:  LARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL--QEDNSESNL---SVEEEMVVAADSENALIQQ
        LA+ELGLQPRQVAVWFQNRRARWKTKQLE+DYGVLK  Y++L+ ++++L+ DN +LL++I ++KAK+  +EDN+ +      V+EE V   DS    I  
Subjt:  LARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKL--QEDNSESNL---SVEEEMVVAADSENALIQQ

Query:  TKTEIGDHFSVPPASESQDFNY-ESFNNNGGGGEEAAIEEASLFPDFKDGSSDS-DSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL
        +  +  +H        S  FNY  SF +       + + EA        GSSDS DSSA+LN++ S   G  +P V         +TG            
Subjt:  TKTEIGDHFSVPPASESQDFNY-ESFNNNGGGGEEAAIEEASLFPDFKDGSSDS-DSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL

Query:  NYLPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS
        ++L F K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+S
Subjt:  NYLPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHWWS

AT5G65310.1 homeobox protein 51.1e-5143.66Show/hide
Query:  MKRPAVSSDSLGALISI-CPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV
        MKR   SSDSL   + I    +D + SPR   +   +    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKV
Subjt:  MKRPAVSSDSLGALISI-CPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKV

Query:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTK--
        KLA+ELGLQPRQVA+WFQNRRARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL QI+ELKAKL   N E    +EE   + A   N  +      
Subjt:  KLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTK--

Query:  TEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL
         E+      PP     D              E A E  S+FP   +F+D  +D SDSSA+LNE+YSP   + + G +                 +  V++
Subjt:  TEIGDHFSVPPASESQDFNYESFNNNGGGGEEAAIEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKL

Query:  NYLPFQKGYQQQTQMFTKMEEH-NFFSGEEACNFFSDEQ
        + +    G   Q   F KMEEH + FSGEEAC  F+D +
Subjt:  NYLPFQKGYQQQTQMFTKMEEH-NFFSGEEACNFFSDEQ

AT5G65310.2 homeobox protein 53.0e-4943.71Show/hide
Query:  SDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR
        +D + SPR   +   +    ++  M D  E++G +E+ G V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQNRR
Subjt:  SDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRR

Query:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTK--TEIGDHFSVPPASESQDFNYE
        ARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL QI+ELKAKL   N E    +EE   + A   N  +       E+      PP     D    
Subjt:  ARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTK--TEIGDHFSVPPASESQDFNYE

Query:  SFNNNGGGGEEAAIEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNYLPFQKGYQQQTQMFTKMEE
                  E A E  S+FP   +F+D  +D SDSSA+LNE+YSP   + + G +                 +  V+++ +    G   Q   F KMEE
Subjt:  SFNNNGGGGEEAAIEEASLFP---DFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNYLPFQKGYQQQTQMFTKMEE

Query:  H-NFFSGEEACNFFSDEQ
        H + FSGEEAC  F+D +
Subjt:  H-NFFSGEEACNFFSDEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAGACCGGCAGTCAGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAAATTCAGATCATGAACAGAGTCCGAGAAACAAGAACAGTACTAACAACCATGT
TTACGGCACAGAATTCCAGTGTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCAGGGCATGTTTCAGAGAAGAAAAGGCGACTTAGTGTGGAGCAAGTGA
AGGCTTTAGAGAAGAATTTCGAAGTTGAAAACAAGCTCGAACCAGAAAGGAAAGTGAAGCTTGCTCGAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTCCAA
AATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACTAATTATGAGAATCTCAAACTCAGTTATGAAACTCTTCAAAATGACAATCA
AGCTCTCCTCAAACAGATTCGGGAACTGAAAGCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAAATGGTGGTGGCCGCCGATTCTGAAAATG
CTCTGATCCAACAAACTAAGACGGAAATTGGCGATCACTTCTCTGTTCCGCCGGCGAGTGAGTCCCAAGACTTCAATTACGAGAGCTTCAACAACAATGGCGGAGGAGGG
GAAGAGGCAGCAATAGAAGAAGCGTCATTGTTCCCCGATTTCAAAGATGGGTCATCCGATAGCGATTCGAGCGCAATTTTAAACGAAGATTACAGCCCGACGGCGGGCAT
TTCTTCTCCCGGGGTGCTGCAGAAACACCAACAGTACCATTTCATGACGGGACCGGAATCTCCGGCGCCCTCCGCCGCCGTGAAACTCAACTACTTACCGTTTCAGAAGG
GATATCAACAACAAACCCAGATGTTTACAAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTGTAACTTTTTCTCCGATGAGCAAGCTCCGACTCTGCACTGG
TGGAGCTAA
mRNA sequenceShow/hide mRNA sequence
CCCTTAATTTGATTCCTCTTTCTTCTGGGTTTTTCCTATAAGCTTCCTATTTAGTGTTTTGGAGTGGGGGGATTCTAAATGGGTTCTGAGGGAGAAGAAATTTTGAAGAA
CAATGATTATGAATCAGGAAAAAGGAAGTGTTTGTACAGTCATTGCGAACAACACTGTACAAATTGTTCTTCTTTTTCTTTCACTACAAATCGGACTTTTGCTTTCATTG
CTTCCTTCCAGTGATATCATAATTTCAAAGTTCTTCCCATTCAGACACTTAAAACAAACAAACAACAATCATTAAACAGCAAGAAAAACTAACCCATCATCATCATCATC
TAAGTTTTCTGTGTGTTTAATTTTGTGCTGATTCTCAATCATGAAGAGACCGGCAGTCAGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAAATTCAGATCATG
AACAGAGTCCGAGAAACAAGAACAGTACTAACAACCATGTTTACGGCACAGAATTCCAGTGTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCAGGGCAT
GTTTCAGAGAAGAAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTTTAGAGAAGAATTTCGAAGTTGAAAACAAGCTCGAACCAGAAAGGAAAGTGAAGCTTGCTCGAGA
ACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTCCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACTAATTATGAGA
ATCTCAAACTCAGTTATGAAACTCTTCAAAATGACAATCAAGCTCTCCTCAAACAGATTCGGGAACTGAAAGCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCG
GTGGAGGAAGAAATGGTGGTGGCCGCCGATTCTGAAAATGCTCTGATCCAACAAACTAAGACGGAAATTGGCGATCACTTCTCTGTTCCGCCGGCGAGTGAGTCCCAAGA
CTTCAATTACGAGAGCTTCAACAACAATGGCGGAGGAGGGGAAGAGGCAGCAATAGAAGAAGCGTCATTGTTCCCCGATTTCAAAGATGGGTCATCCGATAGCGATTCGA
GCGCAATTTTAAACGAAGATTACAGCCCGACGGCGGGCATTTCTTCTCCCGGGGTGCTGCAGAAACACCAACAGTACCATTTCATGACGGGACCGGAATCTCCGGCGCCC
TCCGCCGCCGTGAAACTCAACTACTTACCGTTTCAGAAGGGATATCAACAACAAACCCAGATGTTTACAAAAATGGAGGAGCATAATTTCTTCAGCGGAGAGGAGGCTTG
TAACTTTTTCTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGAGCTAAATCCATGGCGGCATAAAGGGGAAATTTTAAGAGAATAAAAACGGGAAGTAGAATTCGAAAC
TTGGGTGTGTAGTAGTTTTGAAGATGGGGTGGAATGAAGGAAGAAGATGATTGAATTGGGGAAGATTAATGGAGAAATTGAGGTGGGATTCTCCATGAAAATAATTTAGG
CTAATTTTTCCCTTTTTCCTTTGATTCTTCTATAACTTCGTGAGGGAAAATGATGAGATCGAAACTTAAATCTAAGCTGT
Protein sequenceShow/hide protein sequence
MKRPAVSSDSLGALISICPNSDHEQSPRNKNSTNNHVYGTEFQCMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQ
NRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKAKLQEDNSESNLSVEEEMVVAADSENALIQQTKTEIGDHFSVPPASESQDFNYESFNNNGGGG
EEAAIEEASLFPDFKDGSSDSDSSAILNEDYSPTAGISSPGVLQKHQQYHFMTGPESPAPSAAVKLNYLPFQKGYQQQTQMFTKMEEHNFFSGEEACNFFSDEQAPTLHW
WS