; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022135 (gene) of Chayote v1 genome

Gene IDSed0022135
OrganismSechium edule (Chayote v1)
Descriptionhomeobox-leucine zipper protein HAT9-like
Genome locationLG13:23345957..23348255
RNA-Seq ExpressionSed0022135
SyntenySed0022135
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591449.1 Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia]3.0e-9475Show/hide
Query:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESS--TEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG
        MD D   GLLLGLGRG   ++ + P   DV   KKKLQ LKFDD+ PSLTLGLS++VEKSG ESS    EELI QGSSGS P SSFSN SSG KRER  G
Subjt:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESS--TEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG

Query:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE
          E+ +EAE  M    MK+  EEEEDGSPRKKLRL+KEQSA+LED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK CE
Subjt:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE

Query:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC---------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
         LKEENT+LQKEL+ELKSLKLTAPP CMQLQAATLTVCPSC         GGG G DASPA TFSI  KPHFLKFPFNHPSAAC
Subjt:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC---------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

KAG7024329.1 Homeobox-leucine zipper protein HAT9 [Cucurbita argyrosperma subsp. argyrosperma]3.0e-9474.56Show/hide
Query:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESS--TEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG
        MD D   GLLLGLGRG   ++ + P   DV   KKKLQ LKFDD+ PSLTLGLS++VEKSG ESS    EELI QGSSGS P SSFSN SSG KRER  G
Subjt:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESS--TEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG

Query:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE
          E+ +EAE  M    MK+  EEEEDGSPRKKLRLSKEQSA+LED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK CE
Subjt:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE

Query:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC------------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
         LKEENT+LQKEL+ELKSLKLTAPP CMQLQAATLTVCPSC            GGG G DASPA TFSI  KPHFLKFPFNHPSAAC
Subjt:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC------------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

XP_022935802.1 homeobox-leucine zipper protein HAT9-like [Cucurbita moschata]1.6e-9273.43Show/hide
Query:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVES--STEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG
        MD D   GLLLGLGRG   ++ + P   DV   KKKLQ LKFDD+ PSLTLGLS++V+KSG ES  +  +ELIQQGSSGS P SSFS+ SSG KRER  G
Subjt:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVES--STEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG

Query:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE
          E+  EAE  M    MK+  EEEEDGSPRKKLRL+KEQSA+LED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK CE
Subjt:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE

Query:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC-----------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
         LKEENT+LQKEL+ELKSLKLTAPP CMQLQAATLTVCPSC           GGG G DASPA TFSI  KPHFLKFPFNHPSAAC
Subjt:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC-----------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

XP_022977087.1 homeobox-leucine zipper protein HAT9-like [Cucurbita maxima]4.3e-9374.73Show/hide
Query:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGE-
        MD D   GLLLGLGRG   ++ + P   DV   KKKLQ LKFDD+ PSLTLGLS++VEKSG ES+  ++LI QGSSGS P SSFSN SSG KRER  G  
Subjt:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGE-

Query:  EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENL
        E+  EAE  M    MK+  EEEEDGSPRKKLRL+KEQSA+LED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK CE L
Subjt:  EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENL

Query:  KEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC--------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
        KEENT+LQKEL+ELKSLKLTAPP CMQLQAATLTVCPSC        GGG G DASPA TFSI  KPHFLKFPFNHPSAAC
Subjt:  KEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC--------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

XP_023536403.1 homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo]4.3e-9373.78Show/hide
Query:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVES--STEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG
        MD D   GLLLGLGRG   ++ + P   DV   KKKLQ LKFDD+ PSLTLGLS++V+KSG ES  +  +ELIQQGSSGS P SSFSN SSG KRER DG
Subjt:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVES--STEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG

Query:  EEKAIEAETE-----MCMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFC
              AETE     + MK+  EEEEDGSPRKKLRL+KEQSA+LED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK C
Subjt:  EEKAIEAETE-----MCMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFC

Query:  ENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC----------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
        E LKEENT+LQKEL+ELKSLKLTAPP CMQLQAATLTVCPSC          GGG G DASPA TFSI  KPHFLKFPFNHPSAAC
Subjt:  ENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC----------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

TrEMBL top hitse value%identityAlignment
A0A0A0L3X3 Homeobox domain-containing protein1.6e-8271.84Show/hide
Query:  MDGD---GLLLGLGRGS----NDRLHPEAADVKKKKL-QALKF-DDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG
        MD D   GLLLGLGR S    N  +  E   + KKKL Q LKF DD+ PSLTLGLS +     V+++TE+     G SGS P SSFSN SSG KRER  G
Subjt:  MDGD---GLLLGLGRGS----NDRLHPEAADVKKKKL-QALKF-DDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG

Query:  EEKAIEAETEMCMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKE
        EE    AETE CMK+G EE+E+GSPRKKLRL+K QSAILED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK CE LKE
Subjt:  EEKAIEAETEMCMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKE

Query:  ENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
        ENTRLQKEL+ELKSLKLT PP CMQLQAATLTVCPSC      G   GGDASPA  FSI  KP FLKFPFNHPSAAC
Subjt:  ENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

A0A2C9VXJ5 Homeobox domain-containing protein1.9e-6260.54Show/hide
Query:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER-FDGEEKAIEAETEMC
        GL LGL    N + + ++  +K+KK   LK+D  FPSLTLG     E     +  E +L  Q SS S   SSFSNSS  +K+ER F GE    E E E  
Subjt:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER-FDGEEKAIEAETEMC

Query:  MKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELREL
             +E+E+GSPRKKLRLSK+QSAILED FKEHS+L+PKQKQ LA QLNLRPRQVEVWFQNRRARTKLKQTE+DCE+L+K CE L EEN RLQKEL+EL
Subjt:  MKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELREL

Query:  KSLKLTAPPLCMQLQAATLTVCPSCG--GGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
        +SLK+ A PL MQL AATLTVCPSC   G  GGDA+  +T ++ PKPHF   PF HPSAAC
Subjt:  KSLKLTAPPLCMQLQAATLTVCPSCG--GGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

A0A6J1C954 homeobox-leucine zipper protein HAT9-like7.9e-8567.71Show/hide
Query:  MDGD-----GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLS-------IIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER
        MDGD     GLLLGLGR   + L     +V  KK   LKFDD+ P LTLGLS        + E    +++  EEL+QQGSS S P SSFSN SSG KR+R
Subjt:  MDGD-----GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLS-------IIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER

Query:  FD-----GEEKAIEAE----TEMCMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDC
         D     GEE+  EA       +  K+G +E+EDGSPRKKLRL+KEQSAILED FKEHSSL+PKQK DLARQL+LRPRQVEVWFQNRRARTKLKQTEMDC
Subjt:  FD-----GEEKAIEAE----TEMCMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDC

Query:  ELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPS-----CGGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
        ELLKK CE LKEENTRLQKEL+ELKSLKLTAPP CMQLQAATLTVCPS     CGGG GGDASP T FSI  KP FLKFPFNHPSAAC
Subjt:  ELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPS-----CGGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

A0A6J1F6L5 homeobox-leucine zipper protein HAT9-like7.8e-9373.43Show/hide
Query:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVES--STEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG
        MD D   GLLLGLGRG   ++ + P   DV   KKKLQ LKFDD+ PSLTLGLS++V+KSG ES  +  +ELIQQGSSGS P SSFS+ SSG KRER  G
Subjt:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVES--STEEELIQQGSSGSTPGSSFSNSSSGLKRERFDG

Query:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE
          E+  EAE  M    MK+  EEEEDGSPRKKLRL+KEQSA+LED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK CE
Subjt:  E-EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCE

Query:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC-----------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
         LKEENT+LQKEL+ELKSLKLTAPP CMQLQAATLTVCPSC           GGG G DASPA TFSI  KPHFLKFPFNHPSAAC
Subjt:  NLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC-----------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

A0A6J1IHG6 homeobox-leucine zipper protein HAT9-like2.1e-9374.73Show/hide
Query:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGE-
        MD D   GLLLGLGRG   ++ + P   DV   KKKLQ LKFDD+ PSLTLGLS++VEKSG ES+  ++LI QGSSGS P SSFSN SSG KRER  G  
Subjt:  MDGD---GLLLGLGRG--SNDRLHPEAADVK--KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGE-

Query:  EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENL
        E+  EAE  M    MK+  EEEEDGSPRKKLRL+KEQSA+LED+FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKK CE L
Subjt:  EKAIEAETEM---CMKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENL

Query:  KEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC--------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC
        KEENT+LQKEL+ELKSLKLTAPP CMQLQAATLTVCPSC        GGG G DASPA TFSI  KPHFLKFPFNHPSAAC
Subjt:  KEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC--------GGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX195.5e-4344.95Show/hide
Query:  GLLLGLGRGSNDRLHPEAADVK----KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAET
        GL LGL  G       +AA       ++   + +   + PSLTL L          ++T        S G  P  S S+ S G         E+A EA+ 
Subjt:  GLLLGLGRGSNDRLHPEAADVK----KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAET

Query:  EMCMK--IGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQK
        E       G ++++DGS RKKLRL+KEQSA+LED F+EHS+L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+ CE L EEN RLQ+
Subjt:  EMCMK--IGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQK

Query:  ELRELKSLKLTAP---------------PLCMQLQAATLTVCPSCGGGHGGDASPATTFSIS--------PKPHFLKFPFNHPSAAC
        EL+EL++LK   P               P  MQL AATLT+CPSC    GG AS A   +             H    PF H SAAC
Subjt:  ELRELKSLKLTAP---------------PLCMQLQAATLTVCPSCGGGHGGDASPATTFSIS--------PKPHFLKFPFNHPSAAC

P46603 Homeobox-leucine zipper protein HAT91.4e-5152.52Show/hide
Query:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAE-TEMC
        GL+LGLG       +   + +++  +  L+     PSLTL LS   + S    +  ++L +Q SS S  G S  +S   +KRER  GEE   E E TE  
Subjt:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAE-TEMC

Query:  MKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELREL
        +   HE+EE  S RKKLRL+K+QSA+LE+ FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKK CE L +EN RLQKE++EL
Subjt:  MKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELREL

Query:  KSLKLTAPPLCMQLQAATLTVCPSC------GGGHGG------------DASPAT-TFSISPKPHFLKFPFNHPSAAC
        K+LKLT  P  M + A+TLT CPSC      GGG+GG            D S A   FSIS KPHF   PF +PSAAC
Subjt:  KSLKLTAPPLCMQLQAATLTVCPSC------GGGHGG------------DASPAT-TFSISPKPHFLKFPFNHPSAAC

P46604 Homeobox-leucine zipper protein HAT223.8e-5253.09Show/hide
Query:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER----FDGEEKAIE-AE
        GL+LGLG       +  A   K       +F  + PSLTL LS    K    +   +++ +Q SS S  G S S SS  +KRER     DGEE+A E  E
Subjt:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER----FDGEEKAIE-AE

Query:  TEMCMKIG--HEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQ
          +C ++   H++EE  S RKKLRL+K+QSA+LED+FK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKK CE L +EN RLQ
Subjt:  TEMCMKIG--HEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQ

Query:  KELRELKSLKLTAPPLCMQLQAATLTVCPSC----GGGHGGDASPAT------TFSISPKPHFLKFPFNHPSAAC
        KEL++LK+LKL + P  M + AATLT+CPSC    GGG GGD +          FSI  KP F   PF +PSAAC
Subjt:  KELRELKSLKLTAPPLCMQLQAATLTVCPSC----GGGHGGDASPAT------TFSISPKPHFLKFPFNHPSAAC

P46665 Homeobox-leucine zipper protein HAT141.3e-3952.4Show/hide
Query:  LTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNS---SSGLKRERFD--GEEKAIEAETEMCMKIGHEE---EEDGSPRKKLRLSKEQSAILEDH
        + LG + +VE    E   EEE +   S   +P  S ++S     G+K   ++    ++ I+ E E        E   +E+GS RKKLRLSK+QSA LED 
Subjt:  LTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNS---SSGLKRERFD--GEEKAIEAETEMCMKIGHEE---EEDGSPRKKLRLSKEQSAILEDH

Query:  FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSCGGGHG
        FKEHS+L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+ CE+L EEN RLQKE++EL++LK T+ P  MQL A TLT+CPSC     
Subjt:  FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSCGGGHG

Query:  GDASPATT
          A P+T+
Subjt:  GDASPATT

Q8GRL4 Homeobox-leucine zipper protein HOX195.5e-4344.95Show/hide
Query:  GLLLGLGRGSNDRLHPEAADVK----KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAET
        GL LGL  G       +AA       ++   + +   + PSLTL L          ++T        S G  P  S S+ S G         E+A EA+ 
Subjt:  GLLLGLGRGSNDRLHPEAADVK----KKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAET

Query:  EMCMK--IGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQK
        E       G ++++DGS RKKLRL+KEQSA+LED F+EHS+L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+ CE L EEN RLQ+
Subjt:  EMCMK--IGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQK

Query:  ELRELKSLKLTAP---------------PLCMQLQAATLTVCPSCGGGHGGDASPATTFSIS--------PKPHFLKFPFNHPSAAC
        EL+EL++LK   P               P  MQL AATLT+CPSC    GG AS A   +             H    PF H SAAC
Subjt:  ELRELKSLKLTAP---------------PLCMQLQAATLTVCPSCGGGHGGDASPATTFSIS--------PKPHFLKFPFNHPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family1.0e-5252.52Show/hide
Query:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAE-TEMC
        GL+LGLG       +   + +++  +  L+     PSLTL LS   + S    +  ++L +Q SS S  G S  +S   +KRER  GEE   E E TE  
Subjt:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAE-TEMC

Query:  MKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELREL
        +   HE+EE  S RKKLRL+K+QSA+LE+ FK+HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKK CE L +EN RLQKE++EL
Subjt:  MKIGHEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELREL

Query:  KSLKLTAPPLCMQLQAATLTVCPSC------GGGHGG------------DASPAT-TFSISPKPHFLKFPFNHPSAAC
        K+LKLT  P  M + A+TLT CPSC      GGG+GG            D S A   FSIS KPHF   PF +PSAAC
Subjt:  KSLKLTAPPLCMQLQAATLTVCPSC------GGGHGG------------DASPAT-TFSISPKPHFLKFPFNHPSAAC

AT3G60390.1 homeobox-leucine zipper protein 37.1e-3851.28Show/hide
Query:  VEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER---------FDGEEKAIEAETEMCMKIGHEEEEDG------SPRKKLRLSKEQSAILEDHF
        ++ +   S+   ++  +G+  S+P S+ S+  SG K ER           G  +  E E   C   G  ++EDG      S RKKLRLSKEQ+ +LE+ F
Subjt:  VEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER---------FDGEEKAIEAETEMCMKIGHEEEEDG------SPRKKLRLSKEQSAILEDHF

Query:  KEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQ-AATLTVCPSC
        KEHS+L+PKQK  LA+QLNLR RQVEVWFQNRRARTKLKQTE+DCE LK+ CENL +EN RLQKE+ EL++LKL +P L M ++   TLT+CPSC
Subjt:  KEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQ-AATLTVCPSC

AT4G16780.1 homeobox protein 27.9e-3758.49Show/hide
Query:  STPGSSFSNSSSGLKRERFDGEEKAIEAETEMCMKIGHEEEEDG-SPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRAR
        S+P S+ S SS+G + ER        E +T+     G  ++EDG + RKKLRLSK+QSAILE+ FK+HS+L+PKQKQ LA+QL LR RQVEVWFQNRRAR
Subjt:  STPGSSFSNSSSGLKRERFDGEEKAIEAETEMCMKIGHEEEEDG-SPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRAR

Query:  TKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC
        TKLKQTE+DCE L++ CENL EEN RLQKE+ EL++LKL+           TLT+CPSC
Subjt:  TKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSC

AT4G37790.1 Homeobox-leucine zipper protein family2.7e-5353.09Show/hide
Query:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER----FDGEEKAIE-AE
        GL+LGLG       +  A   K       +F  + PSLTL LS    K    +   +++ +Q SS S  G S S SS  +KRER     DGEE+A E  E
Subjt:  GLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRER----FDGEEKAIE-AE

Query:  TEMCMKIG--HEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQ
          +C ++   H++EE  S RKKLRL+K+QSA+LED+FK HS+L+PKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTE+DCE LKK CE L +EN RLQ
Subjt:  TEMCMKIG--HEEEEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQ

Query:  KELRELKSLKLTAPPLCMQLQAATLTVCPSC----GGGHGGDASPAT------TFSISPKPHFLKFPFNHPSAAC
        KEL++LK+LKL + P  M + AATLT+CPSC    GGG GGD +          FSI  KP F   PF +PSAAC
Subjt:  KELRELKSLKLTAPPLCMQLQAATLTVCPSC----GGGHGGDASPAT------TFSISPKPHFLKFPFNHPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana9.0e-4152.4Show/hide
Query:  LTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNS---SSGLKRERFD--GEEKAIEAETEMCMKIGHEE---EEDGSPRKKLRLSKEQSAILEDH
        + LG + +VE    E   EEE +   S   +P  S ++S     G+K   ++    ++ I+ E E        E   +E+GS RKKLRLSK+QSA LED 
Subjt:  LTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNS---SSGLKRERFD--GEEKAIEAETEMCMKIGHEE---EEDGSPRKKLRLSKEQSAILEDH

Query:  FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSCGGGHG
        FKEHS+L+PKQK  LA+QLNLRPRQVEVWFQNRRARTKLKQTE+DCE LK+ CE+L EEN RLQKE++EL++LK T+ P  MQL A TLT+CPSC     
Subjt:  FKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAATLTVCPSCGGGHG

Query:  GDASPATT
          A P+T+
Subjt:  GDASPATT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGGCGATGGGCTTCTTCTCGGGCTGGGGCGGGGATCGAATGATCGATTGCACCCGGAAGCAGCCGATGTCAAGAAGAAGAAGCTGCAGGCTTTGAAGTTTGATGA
CATGTTCCCTTCGTTGACGTTGGGATTATCCATCATCGTCGAGAAGAGCGGTGTCGAATCATCGACGGAGGAGGAGTTGATTCAGCAGGGGTCTTCAGGGAGTACTCCGG
GATCGTCGTTTTCGAATTCGTCGTCGGGGTTAAAGAGGGAGAGATTCGACGGTGAAGAGAAGGCGATCGAGGCGGAGACGGAGATGTGTATGAAAATTGGTCATGAGGAA
GAGGAAGATGGAAGTCCGAGGAAGAAACTGAGATTAAGTAAAGAACAATCAGCCATTTTGGAAGATCATTTCAAAGAACATTCAAGCCTCAGCCCTAAGCAAAAGCAGGA
TTTGGCAAGACAGTTAAACCTAAGACCAAGACAAGTGGAAGTTTGGTTTCAAAACAGAAGAGCCAGAACCAAGCTGAAACAAACAGAAATGGATTGTGAATTACTGAAAA
AATTCTGTGAAAATCTGAAAGAAGAAAACACAAGGCTTCAAAAAGAGCTTCGAGAACTCAAATCACTCAAATTAACTGCGCCGCCGCTGTGCATGCAGCTACAAGCCGCC
ACCCTCACCGTTTGCCCGTCCTGCGGAGGCGGCCACGGCGGCGATGCATCTCCGGCCACCACTTTCTCCATCAGCCCAAAGCCTCACTTTCTTAAATTCCCCTTTAACCA
CCCATCGGCGGCTTGTTAG
mRNA sequenceShow/hide mRNA sequence
TTGAACTTTCTCAAGAGTCCCCACAATCCCATTTGCAAGTAAAGAAGCTTCCTCTCTCTTCCTCCTCCATCTCCTTTTTTCCCTTTATATTTCACCCCCTCAAGTTTACA
GCCTTTTCATTTTTGGCCATTGTCAAGTCTTTCCATTTTTCCATTCTTCAATTAAAACAAAAATACAAAAAAACTAATGGATGGCGATGGGCTTCTTCTCGGGCTGGGGC
GGGGATCGAATGATCGATTGCACCCGGAAGCAGCCGATGTCAAGAAGAAGAAGCTGCAGGCTTTGAAGTTTGATGACATGTTCCCTTCGTTGACGTTGGGATTATCCATC
ATCGTCGAGAAGAGCGGTGTCGAATCATCGACGGAGGAGGAGTTGATTCAGCAGGGGTCTTCAGGGAGTACTCCGGGATCGTCGTTTTCGAATTCGTCGTCGGGGTTAAA
GAGGGAGAGATTCGACGGTGAAGAGAAGGCGATCGAGGCGGAGACGGAGATGTGTATGAAAATTGGTCATGAGGAAGAGGAAGATGGAAGTCCGAGGAAGAAACTGAGAT
TAAGTAAAGAACAATCAGCCATTTTGGAAGATCATTTCAAAGAACATTCAAGCCTCAGCCCTAAGCAAAAGCAGGATTTGGCAAGACAGTTAAACCTAAGACCAAGACAA
GTGGAAGTTTGGTTTCAAAACAGAAGAGCCAGAACCAAGCTGAAACAAACAGAAATGGATTGTGAATTACTGAAAAAATTCTGTGAAAATCTGAAAGAAGAAAACACAAG
GCTTCAAAAAGAGCTTCGAGAACTCAAATCACTCAAATTAACTGCGCCGCCGCTGTGCATGCAGCTACAAGCCGCCACCCTCACCGTTTGCCCGTCCTGCGGAGGCGGCC
ACGGCGGCGATGCATCTCCGGCCACCACTTTCTCCATCAGCCCAAAGCCTCACTTTCTTAAATTCCCCTTTAACCACCCATCGGCGGCTTGTTAGGCTGCCTAATTTTGA
TTACTATATCGTTAATTAAAATGACCAGAAGACTTCTATTTTTGCTTACGACTTCTGAGTCAATTTATGAAATATATATATATAGCTATATATACCACTGATATTTTATC
ATTATAG
Protein sequenceShow/hide protein sequence
MDGDGLLLGLGRGSNDRLHPEAADVKKKKLQALKFDDMFPSLTLGLSIIVEKSGVESSTEEELIQQGSSGSTPGSSFSNSSSGLKRERFDGEEKAIEAETEMCMKIGHEE
EEDGSPRKKLRLSKEQSAILEDHFKEHSSLSPKQKQDLARQLNLRPRQVEVWFQNRRARTKLKQTEMDCELLKKFCENLKEENTRLQKELRELKSLKLTAPPLCMQLQAA
TLTVCPSCGGGHGGDASPATTFSISPKPHFLKFPFNHPSAAC