; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G017650 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G017650
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionhomeobox-leucine zipper protein HAT9-like
Genome locationCG_Chr05:30009259..30012384
RNA-Seq ExpressionClCG05G017650
SyntenyClCG05G017650
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591449.1 Homeobox-leucine zipper protein HAT9, partial [Cucurbita argyrosperma subsp. sororia]1.8e-11082.39Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS
        M+ DCNTGLLLGLGR  G D   S+ P  P+VA VKKKLQVLKFDDILPSLTLGLS VVVEKS       A +ELI QG SGSP SSFSNSSGFKRERD 
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS

Query:  G-GEEK-EAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
        G GEE  EAEV+     MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLA+QL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
Subjt:  G-GEEK-EAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE

Query:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG--DASPPNTFSIGSKPHFLKFPFKHPSAAC
        KLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE+SIC GGGGGGG  DASP NTFSIGSKPHFLKFPF HPSAAC
Subjt:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG--DASPPNTFSIGSKPHFLKFPFKHPSAAC

XP_022935802.1 homeobox-leucine zipper protein HAT9-like [Cucurbita moschata]2.3e-11081.82Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS
        M+ DCNTGLLLGLGR  G D   S+ P  P+VA VKKKLQVLKFDDILPSLTLGLS VVV+KS       A DELIQQG SGSP SSFS+SSGFKRERD 
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS

Query:  G-GEE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
        G GEE  EAEV+     MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLA+QL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
Subjt:  G-GEE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE

Query:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG----DASPPNTFSIGSKPHFLKFPFKHPSAAC
        KLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE+SIC GGGGGGG    DASP NTFSIGSKPHFLKFPF HPSAAC
Subjt:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG----DASPPNTFSIGSKPHFLKFPFKHPSAAC

XP_022977087.1 homeobox-leucine zipper protein HAT9-like [Cucurbita maxima]4.6e-11183.57Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS---AATDELIQQGCSGSPASSFSNSSGFKRERDSG-G
        M+ DCN GLLLGLGR  G D   S+ P  P+VA VKKKLQVLKFDDILPSLTLGLS VVVEKS   +A D+LI QG SGSPASSFSNSSGFKRERD G G
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS---AATDELIQQGCSGSPASSFSNSSGFKRERDSG-G

Query:  EE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
        EE  EAEV+     MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLA+QL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
Subjt:  EE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK

Query:  EENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG-DASPPNTFSIGSKPHFLKFPFKHPSAAC
        EENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE+SIC GGGGGGG DASP NTFSIGSKPHFLKFPF HPSAAC
Subjt:  EENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG-DASPPNTFSIGSKPHFLKFPFKHPSAAC

XP_023536403.1 homeobox-leucine zipper protein HAT9-like [Cucurbita pepo subsp. pepo]6.0e-11181.05Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS
        M+ DCNTGLLLGLGR  G D   S+ P  P+VA VKKKLQVLKFDDILPSLTLGLS VVV+KS       A DELIQQG SGSP SSFSNSSGFKRERD 
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS

Query:  GGEEKEAE-------VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
        G  E+ AE       + MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLA+QL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
Subjt:  GGEEKEAE-------VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE

Query:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG---DASPPNTFSIGSKPHFLKFPFKHPSAAC
        KLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE+SIC GGGGGGG   DASP NTFSIGSKPHFLKFPF HPSAAC
Subjt:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG---DASPPNTFSIGSKPHFLKFPFKHPSAAC

XP_038898886.1 homeobox-leucine zipper protein HAT22-like [Benincasa hispida]3.5e-11987.05Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS--AATDELIQQGCSGSPASSFSNSSGFKRERD----SGG
        M RDCNTGLLLGLGRVS DDS+ S++PPEVA VKKKLQVLKFDDILPSLTLGLS +VVEKS   AT+EL Q+GCSGSP SSFSNSSGFKRERD     GG
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS--AATDELIQQGCSGSPASSFSNSSGFKRERD----SGG

Query:  EEKEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKE
        EE EAE Y     MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKE
Subjt:  EEKEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKE

Query:  ENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC
        ENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE+SIC  GGGGGGDASP NTFSI SKP FLKFPF HPSAAC
Subjt:  ENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC

TrEMBL top hitse value%identityAlignment
A0A0A0L3X3 Homeobox domain-containing protein1.6e-10180.3Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKL-QVLKF-DDILPSLTLGLSTVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDSGGEEKE
        M+ DCNTGLLLGLGRVSG +   S+     A  KKKL QVLKF DDILPSLTLGLS VV       D   + GCSGSP SSFSNSSGFKRER +G E  E
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKL-QVLKF-DDILPSLTLGLSTVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDSGGEEKE

Query:  AEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL
         E  MKVGEEDE+GSPRKKLRLTK QSAILEDNFKEHSSLSPKQKQDLA+QL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL
Subjt:  AEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL

Query:  QELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC
        QELKSLKLT PPFCMQLQAATLTVCPSCE+SIC GG   GGDASP N FSIGSKP FLKFPF HPSAAC
Subjt:  QELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC

A0A6J1C954 homeobox-leucine zipper protein HAT9-like5.4e-9774.32Show/hide
Query:  MNRDCN--TGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLS----------TVVVEKSAATDELIQQGCSGSPASSFSNSSGFKR
        M+ DC+  TGLLLGLGR S  + L SL+P    DVKKKL VLKFDDILP LTLGLS           ++V K+ A +EL+QQG S SP SSFSNSSG KR
Subjt:  MNRDCN--TGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLS----------TVVVEKSAATDELIQQGCSGSPASSFSNSSGFKR

Query:  ERD----SGGEEKEAE--------VYMKVG-EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEM
        +R      GGEE+EAE        V  KVG +EDEDGSPRKKLRLTK+QSAILED+FKEHSSL+PKQK DLA+QL LRPRQVEVWFQNRRARTKLKQTEM
Subjt:  ERD----SGGEEKEAE--------VYMKVG-EEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEM

Query:  DCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC
        DCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE SIC  GGGGGGDASP   FSIGSKP FLKFPF HPSAAC
Subjt:  DCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC

A0A6J1F6L5 homeobox-leucine zipper protein HAT9-like1.1e-11081.82Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS
        M+ DCNTGLLLGLGR  G D   S+ P  P+VA VKKKLQVLKFDDILPSLTLGLS VVV+KS       A DELIQQG SGSP SSFS+SSGFKRERD 
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS------AATDELIQQGCSGSPASSFSNSSGFKRERDS

Query:  G-GEE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
        G GEE  EAEV+     MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLA+QL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE
Subjt:  G-GEE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCE

Query:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG----DASPPNTFSIGSKPHFLKFPFKHPSAAC
        KLKEENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE+SIC GGGGGGG    DASP NTFSIGSKPHFLKFPF HPSAAC
Subjt:  KLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG----DASPPNTFSIGSKPHFLKFPFKHPSAAC

A0A6J1IHG6 homeobox-leucine zipper protein HAT9-like2.2e-11183.57Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS---AATDELIQQGCSGSPASSFSNSSGFKRERDSG-G
        M+ DCN GLLLGLGR  G D   S+ P  P+VA VKKKLQVLKFDDILPSLTLGLS VVVEKS   +A D+LI QG SGSPASSFSNSSGFKRERD G G
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLP--PEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKS---AATDELIQQGCSGSPASSFSNSSGFKRERDSG-G

Query:  EE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
        EE  EAEV+     MKV EE+EDGSPRKKLRLTK+QSA+LEDNFKEHSSLSPKQKQDLA+QL LRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK
Subjt:  EE-KEAEVY-----MKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLK

Query:  EENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG-DASPPNTFSIGSKPHFLKFPFKHPSAAC
        EENT+LQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE+SIC GGGGGGG DASP NTFSIGSKPHFLKFPF HPSAAC
Subjt:  EENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG-DASPPNTFSIGSKPHFLKFPFKHPSAAC

B9R6T5 Homeobox protein, putative8.1e-6962.45Show/hide
Query:  CNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGL-STVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDS----GGEEKEA
        CNTGL LGL     +++            KKK   LK+D + PSLTLGL        +    +L  Q  S S  SSFSNSS  K+ERDS    GGEE + 
Subjt:  CNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGL-STVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDS----GGEEKEA

Query:  E-VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL
        E V  +V +EDE+GSPRKKLRLTKQQSAILEDNFKEHS+L+PKQKQ LA+QL LRPRQVEVWFQNRRARTKLKQTE+DCE+LKKCCE L EEN RLQKEL
Subjt:  E-VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL

Query:  QELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC
        QELKSLKL A PF MQL AATLT+CPSCE     GGGG G  ++   T  +GSKPHF   PF HPSAAC
Subjt:  QELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX194.6e-4545.67Show/hide
Query:  NTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKSAATDELIQQGCSGSPASSFSN-------SSGFKRERDSGGEEKE
        + GL LGL    G               ++     +   + PSLTL L       +AAT      G  G PA S S+       ++  KRER    + + 
Subjt:  NTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKSAATDELIQQGCSGSPASSFSN-------SSGFKRERDSGGEEKE

Query:  AEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL
                ++D+DGS RKKLRLTK+QSA+LED F+EHS+L+PKQK  LAKQL LRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+EL
Subjt:  AEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL

Query:  QELKSLKLTAP---------------PFCMQLQAATLTVCPSCE-----TSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC
        QEL++LK   P               PF MQL AATLT+CPSCE      S        G  A P  T    +  HF   PF H SAAC
Subjt:  QELKSLKLTAP---------------PFCMQLQAATLTVCPSCE-----TSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC

P46603 Homeobox-leucine zipper protein HAT91.6e-5854.42Show/hide
Query:  CNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLS---TVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDSGGEEKEAE-
        CNTGL+LGLG         S +P       ++  V K +   PSLTL LS   +V V   A  D+L +Q  S S  SSFS+    KRERD G E  E E 
Subjt:  CNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLS---TVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDSGGEEKEAE-

Query:  ----VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQK
            V     E++E  S RKKLRLTKQQSA+LE++FK+HS+L+PKQKQ LA+QL LRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN RLQK
Subjt:  ----VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQK

Query:  ELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG------------DASPPNTFSIGSKPHFLKFPFKHPSAAC
        E+QELK+LKLT  PF M + A+TLT CPSCE     GGG GGG             ++    FSI SKPHF   PF +PSAAC
Subjt:  ELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG------------DASPPNTFSIGSKPHFLKFPFKHPSAAC

P46604 Homeobox-leucine zipper protein HAT227.5e-5654.01Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGL---STVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERD-SGGE-
        ++  CNTGL+LGLG     ++    +    + V       +F  + PSLTL L   S  +   + A D++ +Q  S S  SSFS S   KRER+ SGG+ 
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGL---STVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERD-SGGE-

Query:  EKEAE------VYMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK
        E+EAE      V  +V +  +DE+G S RKKLRLTKQQSA+LEDNFK HS+L+PKQKQ LA+QL LRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE 
Subjt:  EKEAE------VYMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK

Query:  LKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDA------SPPNTFSIGSKPHFLKFPFKHPSAAC
        L +EN RLQKELQ+LK+LKL + PF M + AATLT+CPSCE     GGGG GGD       +    FSI +KP F   PF +PSAAC
Subjt:  LKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDA------SPPNTFSIGSKPHFLKFPFKHPSAAC

P46665 Homeobox-leucine zipper protein HAT143.6e-4249.8Show/hide
Query:  GDDSLGSLLPPEVADVKKK--------------------LQV-LKFDDILPS---------LTLGLSTVVVEKSAATDELIQQGCS--GSPASSFSNSSG
        G  SL S   P V D KKK                    LQ+ L F + LP          + LG +TVV E+    + +     S   S  SSF    G
Subjt:  GDDSLGSLLPPEVADVKKK--------------------LQV-LKFDDILPS---------LTLGLSTVVVEKSAATDELIQQGCS--GSPASSFSNSSG

Query:  FK--------RERDSGGE-EKEAEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMD
         K         +RD   E E+ A        +DE+GS RKKLRL+K QSA LED+FKEHS+L+PKQK  LAKQL LRPRQVEVWFQNRRARTKLKQTE+D
Subjt:  FK--------RERDSGGE-EKEAEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMD

Query:  CELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE
        CE LK+CCE L EEN RLQKE++EL++LK T+ PF MQL A TLT+CPSCE
Subjt:  CELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE

Q8GRL4 Homeobox-leucine zipper protein HOX194.6e-4545.67Show/hide
Query:  NTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKSAATDELIQQGCSGSPASSFSN-------SSGFKRERDSGGEEKE
        + GL LGL    G               ++     +   + PSLTL L       +AAT      G  G PA S S+       ++  KRER    + + 
Subjt:  NTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKSAATDELIQQGCSGSPASSFSN-------SSGFKRERDSGGEEKE

Query:  AEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL
                ++D+DGS RKKLRLTK+QSA+LED F+EHS+L+PKQK  LAKQL LRPRQVEVWFQNRRARTKLKQTE+DCE LK+CCE L EEN RLQ+EL
Subjt:  AEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKEL

Query:  QELKSLKLTAP---------------PFCMQLQAATLTVCPSCE-----TSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC
        QEL++LK   P               PF MQL AATLT+CPSCE      S        G  A P  T    +  HF   PF H SAAC
Subjt:  QELKSLKLTAP---------------PFCMQLQAATLTVCPSCE-----TSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family1.2e-5954.42Show/hide
Query:  CNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLS---TVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDSGGEEKEAE-
        CNTGL+LGLG         S +P       ++  V K +   PSLTL LS   +V V   A  D+L +Q  S S  SSFS+    KRERD G E  E E 
Subjt:  CNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLS---TVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDSGGEEKEAE-

Query:  ----VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQK
            V     E++E  S RKKLRLTKQQSA+LE++FK+HS+L+PKQKQ LA+QL LRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE L +EN RLQK
Subjt:  ----VYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQK

Query:  ELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG------------DASPPNTFSIGSKPHFLKFPFKHPSAAC
        E+QELK+LKLT  PF M + A+TLT CPSCE     GGG GGG             ++    FSI SKPHF   PF +PSAAC
Subjt:  ELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGG------------DASPPNTFSIGSKPHFLKFPFKHPSAAC

AT2G44910.1 homeobox-leucine zipper protein 41.6e-3751.81Show/hide
Query:  SLTLGLSTVVVEKSAATDELIQQGC-SGSPASSFSNSSGFKRER--DSGGEEKEAEVYM------KVGEEDEDG----SPRKKLRLTKQQSAILEDNFKE
        S   G +    + S A  +L ++     SP S+ S+ SG KR+     GG+E EAE           G +DEDG      RKKLRL+K Q+ +LE+ FKE
Subjt:  SLTLGLSTVVVEKSAATDELIQQGC-SGSPASSFSNSSGFKRER--DSGGEEKEAEVYM------KVGEEDEDG----SPRKKLRLTKQQSAILEDNFKE

Query:  HSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE
        HS+L+PKQK  LAKQL LR RQVEVWFQNRRARTKLKQTE+DCE LK+CC+ L EEN RLQKE+ EL++LKL+   +       TLT+CPSCE
Subjt:  HSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE

AT4G16780.1 homeobox protein 26.6e-3959.09Show/hide
Query:  SPASSFSNSSGFKRERDSGGEEKEAEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTE
        SP S+ S+S+G + ER+   E+ + +    + ++++  + RKKLRL+K QSAILE+ FK+HS+L+PKQKQ LAKQL LR RQVEVWFQNRRARTKLKQTE
Subjt:  SPASSFSNSSGFKRERDSGGEEKEAEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTE

Query:  MDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQ-AATLTVCPSCE
        +DCE L++CCE L EEN RLQKE+ EL++LKL +P F M +    TLT+CPSCE
Subjt:  MDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQ-AATLTVCPSCE

AT4G37790.1 Homeobox-leucine zipper protein family5.3e-5754.01Show/hide
Query:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGL---STVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERD-SGGE-
        ++  CNTGL+LGLG     ++    +    + V       +F  + PSLTL L   S  +   + A D++ +Q  S S  SSFS S   KRER+ SGG+ 
Subjt:  MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGL---STVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERD-SGGE-

Query:  EKEAE------VYMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK
        E+EAE      V  +V +  +DE+G S RKKLRLTKQQSA+LEDNFK HS+L+PKQKQ LA+QL LRPRQVEVWFQNRRARTKLKQTE+DCE LKKCCE 
Subjt:  EKEAE------VYMKVGE--EDEDG-SPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEK

Query:  LKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDA------SPPNTFSIGSKPHFLKFPFKHPSAAC
        L +EN RLQKELQ+LK+LKL + PF M + AATLT+CPSCE     GGGG GGD       +    FSI +KP F   PF +PSAAC
Subjt:  LKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCETSICSGGGGGGGDA------SPPNTFSIGSKPHFLKFPFKHPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana2.6e-4349.8Show/hide
Query:  GDDSLGSLLPPEVADVKKK--------------------LQV-LKFDDILPS---------LTLGLSTVVVEKSAATDELIQQGCS--GSPASSFSNSSG
        G  SL S   P V D KKK                    LQ+ L F + LP          + LG +TVV E+    + +     S   S  SSF    G
Subjt:  GDDSLGSLLPPEVADVKKK--------------------LQV-LKFDDILPS---------LTLGLSTVVVEKSAATDELIQQGCS--GSPASSFSNSSG

Query:  FK--------RERDSGGE-EKEAEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMD
         K         +RD   E E+ A        +DE+GS RKKLRL+K QSA LED+FKEHS+L+PKQK  LAKQL LRPRQVEVWFQNRRARTKLKQTE+D
Subjt:  FK--------RERDSGGE-EKEAEVYMKVGEEDEDGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMD

Query:  CELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE
        CE LK+CCE L EEN RLQKE++EL++LK T+ PF MQL A TLT+CPSCE
Subjt:  CELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATLTVCPSCE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGGGATTGTAATACGGGGCTTCTCCTTGGCCTGGGGCGGGTTTCGGGTGATGATTCCCTGGGTTCCCTCCTACCACCCGAAGTAGCCGATGTGAAGAAGAAGCT
CCAGGTTTTGAAGTTTGATGATATTTTGCCTTCTTTGACGCTTGGATTGTCGACTGTTGTAGTAGAGAAGAGCGCCGCCACCGACGAATTAATTCAGCAGGGGTGTTCGG
GTAGTCCGGCGTCGTCGTTTTCAAACTCATCGGGATTTAAAAGAGAGCGAGACAGCGGCGGTGAAGAGAAGGAGGCGGAGGTGTATATGAAAGTGGGTGAGGAAGATGAA
GATGGAAGTCCCAGGAAGAAACTTAGATTAACTAAACAACAATCCGCCATTTTAGAGGATAACTTCAAAGAACACTCAAGTCTCAGTCCTAAACAAAAGCAGGATTTGGC
CAAACAGTTAAAGCTAAGGCCAAGACAAGTGGAAGTATGGTTTCAAAACAGAAGAGCCAGGACCAAGCTGAAGCAAACAGAAATGGACTGTGAATTACTGAAGAAATGTT
GTGAAAAGCTGAAAGAAGAAAACACAAGGCTTCAAAAGGAACTTCAAGAGCTTAAATCACTCAAATTAACGGCTCCACCGTTCTGCATGCAGCTACAAGCCGCCACTCTC
ACTGTTTGCCCTTCATGTGAGACCTCAATTTGCAGCGGCGGCGGAGGAGGTGGCGGTGATGCATCTCCGCCCAATACTTTCTCAATTGGATCAAAGCCTCACTTTCTCAA
ATTTCCCTTTAAACACCCATCGGCGGCTTGTTAG
mRNA sequenceShow/hide mRNA sequence
CTTCCGCTCTCTTTCTCCTCCATTCTCCCCCTTTATATTTCCCCCCCACAAGGTTATTGGGTTTTCAAATTTGGGCATTTACAATTTGTTGTTTATTCAAATCATGAATA
GGGATTGTAATACGGGGCTTCTCCTTGGCCTGGGGCGGGTTTCGGGTGATGATTCCCTGGGTTCCCTCCTACCACCCGAAGTAGCCGATGTGAAGAAGAAGCTCCAGGTT
TTGAAGTTTGATGATATTTTGCCTTCTTTGACGCTTGGATTGTCGACTGTTGTAGTAGAGAAGAGCGCCGCCACCGACGAATTAATTCAGCAGGGGTGTTCGGGTAGTCC
GGCGTCGTCGTTTTCAAACTCATCGGGATTTAAAAGAGAGCGAGACAGCGGCGGTGAAGAGAAGGAGGCGGAGGTGTATATGAAAGTGGGTGAGGAAGATGAAGATGGAA
GTCCCAGGAAGAAACTTAGATTAACTAAACAACAATCCGCCATTTTAGAGGATAACTTCAAAGAACACTCAAGTCTCAGTCCTAAACAAAAGCAGGATTTGGCCAAACAG
TTAAAGCTAAGGCCAAGACAAGTGGAAGTATGGTTTCAAAACAGAAGAGCCAGGACCAAGCTGAAGCAAACAGAAATGGACTGTGAATTACTGAAGAAATGTTGTGAAAA
GCTGAAAGAAGAAAACACAAGGCTTCAAAAGGAACTTCAAGAGCTTAAATCACTCAAATTAACGGCTCCACCGTTCTGCATGCAGCTACAAGCCGCCACTCTCACTGTTT
GCCCTTCATGTGAGACCTCAATTTGCAGCGGCGGCGGAGGAGGTGGCGGTGATGCATCTCCGCCCAATACTTTCTCAATTGGATCAAAGCCTCACTTTCTCAAATTTCCC
TTTAAACACCCATCGGCGGCTTGTTAGGCACCAGCCTAATTTTAATTATTATATATAATCATTAATTAATTAAAATGACGAGAAGACTGCTATTTTTCCTTGGGGCTTCT
TGGTCAAAATATATATCGTATATAACACTGATATTTATATTTTTAATATGTTATGTGTACAAATATATTGTTGTGTTTTGAACACATTGGAAATTAGGCTTTGTAGAAAT
GGATCTTATGATGACACACATGTATTTAGGTTTCCATTAATTTGACTTGATGGTC
Protein sequenceShow/hide protein sequence
MNRDCNTGLLLGLGRVSGDDSLGSLLPPEVADVKKKLQVLKFDDILPSLTLGLSTVVVEKSAATDELIQQGCSGSPASSFSNSSGFKRERDSGGEEKEAEVYMKVGEEDE
DGSPRKKLRLTKQQSAILEDNFKEHSSLSPKQKQDLAKQLKLRPRQVEVWFQNRRARTKLKQTEMDCELLKKCCEKLKEENTRLQKELQELKSLKLTAPPFCMQLQAATL
TVCPSCETSICSGGGGGGGDASPPNTFSIGSKPHFLKFPFKHPSAAC