; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0006873 (gene) of Chayote v1 genome

Gene IDSed0006873
OrganismSechium edule (Chayote v1)
DescriptionHomeobox-leucine zipper protein family
Genome locationLG13:8970251..8971845
RNA-Seq ExpressionSed0006873
SyntenySed0006873
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600675.1 Homeobox-leucine zipper protein HAT22, partial [Cucurbita argyrosperma subsp. sororia]5.7e-10176.36Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSS---TSLDFEPCALSLGFLGDVSTRLKAVDANKIV---DADSSSVVCSSFSGGGGGRVKRERD
        MG DD+SQT LVLGLG+SE  ASPIIN LK K    ++   TSL+FEPCAL+LGF GDV         +  +    +  SS VCSSFSGGGG  +KRERD
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSS---TSLDFEPCALSLGFLGDVSTRLKAVDANKIV---DADSSSVVCSSFSGGGGGRVKRERD

Query:  LSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLT
        LSSEEVELERV C +VSDEDEDG NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT
Subjt:  LSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLT

Query:  DENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC
        +ENRRLQKELQELKALKLA PLYMHMPAATL MCPSCERV GVG+G  DG SK KFSMAPKPHFYNPFSN SAAC
Subjt:  DENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC

KAG7031313.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma]6.7e-10277.09Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKL---KKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIV---DADSSSVVCSSFSGGGGGRVKRERD
        MG DD+SQT LVLGLG+SE  ASPIIN L   KKK    + TSL+FEPCAL+LGF GDV         +  +    +  SS VCSSFSGGGGG +KRERD
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKL---KKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIV---DADSSSVVCSSFSGGGGGRVKRERD

Query:  LSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLT
        LSSEEVELERV C +VSDEDEDG NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT
Subjt:  LSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLT

Query:  DENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC
        +ENRRLQKELQELKALKLA PLYMHMPAATL MCPSCERV GVG+G  DG SK KFSMAPKPHFYNPFSN SAAC
Subjt:  DENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia]1.9e-10179.63Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINK--LKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGG-RVKRERDLSS
        MGFDDLS TGLVLGLGLSE    P  ++  L KKPAPCSS SLDF+PC L+LGF G+ + R K  D +    A   S   SSFSGGGGG RVKRERDLSS
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINK--LKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGG-RVKRERDLSS

Query:  EEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
        EEV+LERV  S++SDEDEDGSNTRKKLRLS+EQSA LEESF QNST+NPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
Subjt:  EEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN

Query:  RRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC
        RRLQKELQELKALKLAQPLYMHMPAATL MCPSCERV G   G   SKAKFSMAPKPHFYNPFSN SAAC
Subjt:  RRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC

XP_022943239.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata]2.3e-10277.74Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKL---KKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADS--SSVVCSSFSGGGGGRVKRERDL
        MG DD+SQT LVLGLG+SE  ASPIIN L   KKK    + TSL+FEPCAL+LGF GDV         + +    S  SS VCSSFSGGGGG +KRERDL
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKL---KKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADS--SSVVCSSFSGGGGGRVKRERDL

Query:  SSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD
        SSEEVELERV C +VSDEDEDG NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+
Subjt:  SSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD

Query:  ENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC
        ENRRLQKELQELKALKLA PLYMHMPAATL MCPSCERV GVG+G  DG SK KFSMAPKPHFYNPFSN SAAC
Subjt:  ENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC

XP_023000993.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima]1.0e-10275.91Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVD-------ADSSSVVCSSFSGGGGGRVKRER
        MG DD+SQT LVLGLG+SE  ASPIIN  K        TSLDFEPCAL+LGF GDV         +           +  SS VCSSFSGGGGG +KRER
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVD-------ADSSSVVCSSFSGGGGGRVKRER

Query:  DLSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL
        DLSSEEVELERV C +VSDEDEDG NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETL
Subjt:  DLSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL

Query:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC
        T+ENRRLQKELQELKALKLA PLYMHMPAATL MCPSCERV GVG+G+  SK KFSMAPKPHFYNPFSN SAAC
Subjt:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC

TrEMBL top hitse value%identityAlignment
A0A1S3BNL6 homeobox-leucine zipper protein HAT22-like1.0e-9271.33Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFL-GDVSTRLKAVDANKI-----VDADSSSVVCSSFSGGGGGRVKRERD
        MGFDD S+TGLVLGLGLSEL A      LKKKPAPCSS+SLDFEPC L+LGF  G      K +D   +       +  SS VCSSFS    G+VKRERD
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFL-GDVSTRLKAVDANKI-----VDADSSSVVCSSFSGGGGGRVKRERD

Query:  LSSEEVELERVICSKVSDEDEDG-SNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL
        LSSEEVELERV C +VSDED+DG +NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTEVDCE LKKCCETL
Subjt:  LSSEEVELERVICSKVSDEDEDG-SNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL

Query:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERV-----SGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC
        TDENRRLQKE+QELKA+KLA+P+YM M  ATL +CPSCERV      GV  GN  SK KFSM P P FYNPFSN SAAC
Subjt:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERV-----SGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC

A0A5D3DEZ1 Homeobox-leucine zipper protein HAT22-like2.3e-9270.97Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFL-GDVSTRLKAVDANKI-----VDADSSSVVCSSFSGGGGGRVKRERD
        MGFDD S+TGLVLGLGLSEL A      LKKKPAPCSS+SLDFEPC L+LGF  G      K +D   +       +  SS VCSSFS    G+VKRERD
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFL-GDVSTRLKAVDANKI-----VDADSSSVVCSSFSGGGGGRVKRERD

Query:  LSSEEVELERVICSKVSDEDEDG-SNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL
        LSSEEVELERV C +V+DED+DG +NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTEVDCE LKKCCETL
Subjt:  LSSEEVELERVICSKVSDEDEDG-SNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL

Query:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERV-----SGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC
        TDENRRLQKE+QELKA+KLA+P+YM M  ATL +CPSCERV      GV  GN  SK KFSM P P FYNPFSN SAAC
Subjt:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERV-----SGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like9.4e-10279.63Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINK--LKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGG-RVKRERDLSS
        MGFDDLS TGLVLGLGLSE    P  ++  L KKPAPCSS SLDF+PC L+LGF G+ + R K  D +    A   S   SSFSGGGGG RVKRERDLSS
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINK--LKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGG-RVKRERDLSS

Query:  EEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
        EEV+LERV  S++SDEDEDGSNTRKKLRLS+EQSA LEESF QNST+NPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN
Subjt:  EEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDEN

Query:  RRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC
        RRLQKELQELKALKLAQPLYMHMPAATL MCPSCERV G   G   SKAKFSMAPKPHFYNPFSN SAAC
Subjt:  RRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like1.1e-10277.74Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKL---KKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADS--SSVVCSSFSGGGGGRVKRERDL
        MG DD+SQT LVLGLG+SE  ASPIIN L   KKK    + TSL+FEPCAL+LGF GDV         + +    S  SS VCSSFSGGGGG +KRERDL
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKL---KKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADS--SSVVCSSFSGGGGGRVKRERDL

Query:  SSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD
        SSEEVELERV C +VSDEDEDG NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETLT+
Subjt:  SSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTD

Query:  ENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC
        ENRRLQKELQELKALKLA PLYMHMPAATL MCPSCERV GVG+G  DG SK KFSMAPKPHFYNPFSN SAAC
Subjt:  ENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLG-NDG-SKAKFSMAPKPHFYNPFSNSSAAC

A0A6J1KP81 homeobox-leucine zipper protein HAT22-like5.0e-10375.91Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVD-------ADSSSVVCSSFSGGGGGRVKRER
        MG DD+SQT LVLGLG+SE  ASPIIN  K        TSLDFEPCAL+LGF GDV         +           +  SS VCSSFSGGGGG +KRER
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVD-------ADSSSVVCSSFSGGGGGRVKRER

Query:  DLSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL
        DLSSEEVELERV C +VSDEDEDG NTRKKLRLSK+QSA LEESF QNST+NPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKKCCETL
Subjt:  DLSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL

Query:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC
        T+ENRRLQKELQELKALKLA PLYMHMPAATL MCPSCERV GVG+G+  SK KFSMAPKPHFYNPFSN SAAC
Subjt:  TDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX198.3e-5548.96Show/hide
Query:  LSQTGLVLGLGL-----SELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGG--GGGRVKRERDLSSE
        LS  GL LGL L         A+       ++P+P SS     EP +L+L    D +    A            +   SS S G      VKRER   +E
Subjt:  LSQTGLVLGLGL-----SELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGG--GGGRVKRERDLSSE

Query:  EVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR
        E + ERV  +    +D+D  +TRKKLRL+KEQSA LE+ F ++ST+NPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT+ENR
Subjt:  EVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR

Query:  RLQKELQELKALKLA----------------QPLYMHMPAATLAMCPSCERVSGVG-----LGNDGSKAKFSMAPKPHFYNPFSNSSA
        RLQ+ELQEL+ALK A                 P YM +PAATL +CPSCERV G       +  DG+KA        HF+NPF++S+A
Subjt:  RLQKELQELKALKLA----------------QPLYMHMPAATLAMCPSCERVSGVG-----LGNDGSKAKFSMAPKPHFYNPFSNSSA

A2Z1U1 Homeobox-leucine zipper protein HOX117.8e-5372.97Show/hide
Query:  CSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQE
        CS+ SDED DG + RKKLRLSKEQSAFLEESF ++ST+NPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE+LK+CCETLT+ENRRLQKEL E
Subjt:  CSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQE

Query:  LKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKP
        L+ALK   P YMH+PA TL+MCPSCERV+        S A  S    P
Subjt:  LKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKP

P46603 Homeobox-leucine zipper protein HAT92.8e-7158.25Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLSSEEV
        MGFDD   TGLVLGLG S +P +   N   ++     S+    EP +L+L   GD S  +    A+++    SS    SSFS   G  VKRERD   E  
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLSSEEV

Query:  ELERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR
        E E +    +SD  EDE+G + RKKLRL+K+QSA LEESF  +ST+NPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN 
Subjt:  ELERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR

Query:  RLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGN--------------DGSKAK--FSMAPKPHFYNPFSNSSAAC
        RLQKE+QELK LKL QP YMHMPA+TL  CPSCER+ G G GN              DGS AK  FS++ KPHF+NPF+N SAAC
Subjt:  RLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGN--------------DGSKAK--FSMAPKPHFYNPFSNSSAAC

P46604 Homeobox-leucine zipper protein HAT221.0e-7660.56Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLK--AVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLS--
        MG DD   TGLVLGLGLS  P +   + +KK  +      +  +P +L+L   G+ S ++K  A   ++I    SS    SSFS    GRVKRER++S  
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLK--AVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLS--

Query:  --SEEVE--LERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
           EE E   ERV+CS+VSD  +DE+G + RKKLRL+K+QSA LE++F  +ST+NPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
Subjt:  --SEEVE--LERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC

Query:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGND-------GSKAKFSMAPKPHFYNPFSNSSAAC
        ETLTDENRRLQKELQ+LKALKL+QP YMHMPAATL MCPSCER+ G G+G D        +K  FS+  KP FYNPF+N SAAC
Subjt:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGND-------GSKAKFSMAPKPHFYNPFSNSSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX198.3e-5548.96Show/hide
Query:  LSQTGLVLGLGL-----SELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGG--GGGRVKRERDLSSE
        LS  GL LGL L         A+       ++P+P SS     EP +L+L    D +    A            +   SS S G      VKRER   +E
Subjt:  LSQTGLVLGLGL-----SELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGG--GGGRVKRERDLSSE

Query:  EVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR
        E + ERV  +    +D+D  +TRKKLRL+KEQSA LE+ F ++ST+NPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT+ENR
Subjt:  EVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR

Query:  RLQKELQELKALKLA----------------QPLYMHMPAATLAMCPSCERVSGVG-----LGNDGSKAKFSMAPKPHFYNPFSNSSA
        RLQ+ELQEL+ALK A                 P YM +PAATL +CPSCERV G       +  DG+KA        HF+NPF++S+A
Subjt:  RLQKELQELKALKLA----------------QPLYMHMPAATLAMCPSCERVSGVG-----LGNDGSKAKFSMAPKPHFYNPFSNSSA

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family2.0e-7258.25Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLSSEEV
        MGFDD   TGLVLGLG S +P +   N   ++     S+    EP +L+L   GD S  +    A+++    SS    SSFS   G  VKRERD   E  
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLSSEEV

Query:  ELERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR
        E E +    +SD  EDE+G + RKKLRL+K+QSA LEESF  +ST+NPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN 
Subjt:  ELERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENR

Query:  RLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGN--------------DGSKAK--FSMAPKPHFYNPFSNSSAAC
        RLQKE+QELK LKL QP YMHMPA+TL  CPSCER+ G G GN              DGS AK  FS++ KPHF+NPF+N SAAC
Subjt:  RLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGN--------------DGSKAK--FSMAPKPHFYNPFSNSSAAC

AT2G44910.1 homeobox-leucine zipper protein 42.7e-4858.29Show/hide
Query:  KAVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLS----SEEVELERVICSK----VSDEDEDGSN---TRKKLRLSKEQSAFLEESFNQNSTINPKQK
        +A  +  +VD +  + V SS +         +RDL+     +E E ER  CS+       +DEDG N   +RKKLRLSK+Q+  LEE+F ++ST+NPKQK
Subjt:  KAVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLS----SEEVELERVICSK----VSDEDEDGSN---TRKKLRLSKEQSAFLEESFNQNSTINPKQK

Query:  QALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLAMCPSCERVS
         ALA+QLNLR RQVEVWFQNRRARTKLKQTEVDCE+LK+CC+ LT+ENRRLQKE+ EL+ALKL+  LYMHM P  TL MCPSCERVS
Subjt:  QALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAATLAMCPSCERVS

AT3G60390.1 homeobox-leucine zipper protein 32.0e-4855.14Show/hide
Query:  DVSTRLKAVDANK-----IVDADS--------SSVVCSSFSGG-------------GGGRVKRERDLSSEEVELERVICS-KVSDEDEDGS-----NTRK
        D+ + L+ +D N+     +VD +         +S V S  SG              GGGRV        E+ E+ER  CS     +DEDGS     ++RK
Subjt:  DVSTRLKAVDANK-----IVDADS--------SSVVCSSFSGG-------------GGGRVKRERDLSSEEVELERVICS-KVSDEDEDGS-----NTRK

Query:  KLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-P
        KLRLSKEQ+  LEE+F ++ST+NPKQK ALA+QLNLR RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE LTDENRRLQKE+ EL+ALKL+  LYMHM P
Subjt:  KLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-P

Query:  AATLAMCPSCERVS
          TL MCPSCERV+
Subjt:  AATLAMCPSCERVS

AT4G37790.1 Homeobox-leucine zipper protein family7.2e-7860.56Show/hide
Query:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLK--AVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLS--
        MG DD   TGLVLGLGLS  P +   + +KK  +      +  +P +L+L   G+ S ++K  A   ++I    SS    SSFS    GRVKRER++S  
Subjt:  MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLK--AVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLS--

Query:  --SEEVE--LERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
           EE E   ERV+CS+VSD  +DE+G + RKKLRL+K+QSA LE++F  +ST+NPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC
Subjt:  --SEEVE--LERVICSKVSD--EDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCC

Query:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGND-------GSKAKFSMAPKPHFYNPFSNSSAAC
        ETLTDENRRLQKELQ+LKALKL+QP YMHMPAATL MCPSCER+ G G+G D        +K  FS+  KP FYNPF+N SAAC
Subjt:  ETLTDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVSGVGLGND-------GSKAKFSMAPKPHFYNPFSNSSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana6.3e-5068.21Show/hide
Query:  GGGRVKRERDLSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE
        G  R   +RD+  +EVE      S   ++DE+GS TRKKLRLSK+QSAFLE+SF ++ST+NPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE
Subjt:  GGGRVKRERDLSSEEVELERVICSKVSDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE

Query:  FLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVS
        +LK+CCE+LT+ENRRLQKE++EL+ LK + P YM +PA TL MCPSCERV+
Subjt:  FLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLAMCPSCERVS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGATGATCTTTCCCAAACAGGCTTGGTTTTGGGCTTGGGGCTCTCAGAATTACCAGCTTCTCCAATTATTAACAAGTTGAAGAAGAAGCCTGCTCCTTGCTC
CTCTACTTCTCTTGATTTTGAGCCTTGTGCTTTGTCTTTGGGCTTTTTGGGTGACGTCTCGACTCGCTTGAAAGCTGTTGATGCTAACAAGATTGTTGATGCCGATAGCA
GCAGCGTGGTGTGTTCTTCGTTCTCGGGAGGCGGCGGAGGTAGGGTTAAAAGGGAGAGAGATCTCAGCAGTGAAGAGGTTGAATTGGAGAGGGTAATTTGTTCCAAAGTT
AGTGATGAAGATGAAGATGGTTCTAATACTAGAAAGAAGCTTAGGCTTTCTAAAGAACAATCCGCTTTTTTGGAAGAGAGTTTCAACCAAAATAGCACCATCAACCCGAA
GCAAAAACAAGCCTTAGCAAGACAGCTAAATCTACGGCCACGACAAGTCGAAGTATGGTTTCAGAATAGGAGAGCTCGAACTAAACTGAAACAAACAGAGGTAGATTGTG
AGTTCTTGAAGAAATGTTGCGAGACGTTGACCGACGAAAATAGAAGACTACAAAAGGAGCTTCAAGAACTTAAGGCGCTAAAGCTAGCGCAGCCGCTGTACATGCACATG
CCAGCAGCAACACTAGCGATGTGCCCGTCGTGCGAAAGGGTCAGTGGTGTCGGCCTAGGCAATGACGGTTCGAAAGCTAAATTCTCAATGGCTCCCAAGCCACACTTTTA
CAACCCCTTCTCCAACTCTTCAGCCGCATGTTAG
mRNA sequenceShow/hide mRNA sequence
TGCTAAGATTCTTTTGTCTTCTCAATTTTTTTCTTCTGGATTAATAATCATTCCACCCACCCACATACAAACACAGCCATACCATATCTCTTTTTTCCCTTCTTTTTAAA
CCCTCCATTTTGGTTTGTTGTTCTTCACAAGTTCTTCTTTTTCCACTCCCCATCCACACAATTCCCTCTAACTCCAAATGGGTTTTGATGATCTTTCCCAAACAGGCTTG
GTTTTGGGCTTGGGGCTCTCAGAATTACCAGCTTCTCCAATTATTAACAAGTTGAAGAAGAAGCCTGCTCCTTGCTCCTCTACTTCTCTTGATTTTGAGCCTTGTGCTTT
GTCTTTGGGCTTTTTGGGTGACGTCTCGACTCGCTTGAAAGCTGTTGATGCTAACAAGATTGTTGATGCCGATAGCAGCAGCGTGGTGTGTTCTTCGTTCTCGGGAGGCG
GCGGAGGTAGGGTTAAAAGGGAGAGAGATCTCAGCAGTGAAGAGGTTGAATTGGAGAGGGTAATTTGTTCCAAAGTTAGTGATGAAGATGAAGATGGTTCTAATACTAGA
AAGAAGCTTAGGCTTTCTAAAGAACAATCCGCTTTTTTGGAAGAGAGTTTCAACCAAAATAGCACCATCAACCCGAAGCAAAAACAAGCCTTAGCAAGACAGCTAAATCT
ACGGCCACGACAAGTCGAAGTATGGTTTCAGAATAGGAGAGCTCGAACTAAACTGAAACAAACAGAGGTAGATTGTGAGTTCTTGAAGAAATGTTGCGAGACGTTGACCG
ACGAAAATAGAAGACTACAAAAGGAGCTTCAAGAACTTAAGGCGCTAAAGCTAGCGCAGCCGCTGTACATGCACATGCCAGCAGCAACACTAGCGATGTGCCCGTCGTGC
GAAAGGGTCAGTGGTGTCGGCCTAGGCAATGACGGTTCGAAAGCTAAATTCTCAATGGCTCCCAAGCCACACTTTTACAACCCCTTCTCCAACTCTTCAGCCGCATGTTA
GTTCATTCATCTAGAGAACAATACAAGATTAAGCTAGACACAAAATAGACGACCCACATAACATTATAATCATATGTTTTTTCATTTTCTTCTTGAAGAAAAGGCAATCT
ATTGTAGAAATTTACCCTTAATGGTCAAATTATTAGTTTCTTATTTTGGGGGGAAGGGGTTGGATTAGATGTTGTAAAGGTTTTTAAGTTATTTTGAAATGAAAGATGGA
TGGGTATATATAATTTG
Protein sequenceShow/hide protein sequence
MGFDDLSQTGLVLGLGLSELPASPIINKLKKKPAPCSSTSLDFEPCALSLGFLGDVSTRLKAVDANKIVDADSSSVVCSSFSGGGGGRVKRERDLSSEEVELERVICSKV
SDEDEDGSNTRKKLRLSKEQSAFLEESFNQNSTINPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM
PAATLAMCPSCERVSGVGLGNDGSKAKFSMAPKPHFYNPFSNSSAAC