; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr020657 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr020657
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHomeobox-leucine zipper protein family
Genome locationtig00153552:928527..929536
RNA-Seq ExpressionSgr020657
SyntenySgr020657
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAB1202014.1 Homeobox-leucine zipper protein HAT22 [Morella rubra]8.9e-10371.48Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY---RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRV
        M FDD+  TGLVLGLG+AA A   P+    +  K +CS+F   ++ +FEP LTLG+SGETY   +KIDVN+  +   +  LYRQ SPH SAVSSFS GRV
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY---RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRV

Query:  KRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP
        KRERDLSSEEVE ERVSSR+SDEDEDG NARKKLRL+KEQSALLEESFKQ+STLNP                              KQKQALARQLNLRP
Subjt:  KRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP

Query:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSN
        RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GGVG+ ASK+ FSMAPKPHFYNPF+N
Subjt:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSN

Query:  PSAAC
        PSAAC
Subjt:  PSAAC

XP_018857289.2 homeobox-leucine zipper protein HAT22-like [Juglans regia]1.4e-10372.26Show/hide
Query:  MGFDDLSQTGLVLGLGLAA--EASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY-----RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFS
        MGFDD+  TGLVLGLGL A  E +  P+      KKPACS+F   +  +FE  LTLG+SGE+Y     +KIDVNK  +  VD  LYRQASPH SAVSSFS
Subjt:  MGFDDLSQTGLVLGLGLAA--EASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY-----RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFS

Query:  GGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQL
         GRVKRERDLSSEEVE ERVSSRVSDEDEDG NARKKLRL+KEQSALLEESFKQ+STLNP                              KQKQALARQL
Subjt:  GGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG-ASKAKFSMAPKPHFY
        NLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GGVG+   SK+ FSMAPKPHFY
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG-ASKAKFSMAPKPHFY

Query:  NPFSNPSAAC
        NPF+NPSAAC
Subjt:  NPFSNPSAAC

XP_021273528.1 homeobox-leucine zipper protein HAT22 [Herrania umbratica]8.9e-10368.57Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSF--AASSSVDFEPCLTLGISGETYRKIDVNKTSDV---GVDHH--------LYRQASPHSS
        MG DD   TGLVLGLG+++       A+NQ  KK +C  F   A+++  FEP LTLG+SGE+Y+ +  +K  DV   G  HH        LYRQASPH S
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSF--AASSSVDFEPCLTLGISGETYRKIDVNKTSDV---GVDHH--------LYRQASPHSS

Query:  AVSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQ
        AVSSFS GRVKRERDLSSEEVE+E+ SSRVSDEDEDG NARKKLRL+K+QSALLEESFKQ+STLNP                              KQKQ
Subjt:  AVSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQ

Query:  ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAP
        ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQP YMHMPAATLTMCPSCER+GGVGDG SK+ FSMA 
Subjt:  ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAP

Query:  KPHFYNPFSNPSAAC
        KPHFYNPF+NPSAAC
Subjt:  KPHFYNPFSNPSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia]6.6e-11477.6Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRL-KKPACSSFAASSSVDFEPCLTLGISGET-YRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGG---
        MGFDDLS TGLVLGLGL +EASP+PA D   L KKPA      SSS+DF+PCLTLG SGE+ YRKID         DHHLYRQASPHSSAVSSFSGG   
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRL-KKPACSSFAASSSVDFEPCLTLGISGET-YRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGG---

Query:  -RVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLN
         RVKRERDLSSEEV+LERVSSR+SDEDEDGSN RKKLRLS+EQSALLEESFKQNSTLNP                              KQKQALARQLN
Subjt:  -RVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLN

Query:  LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNP
        LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGG  DGASKAKFSMAPKPHFYNP
Subjt:  LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNP

Query:  FSNPSAAC
        FSNPSAAC
Subjt:  FSNPSAAC

XP_041018876.1 homeobox-leucine zipper protein HAT22-like [Juglans microcarpa x Juglans regia]4.0e-10372.17Show/hide
Query:  MGFDDLSQTGLVLGLGL-AAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY-----RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSG
        MGFDD+  TGLVLGLGL A++    P+      KKPACS+F   +  +FE  LTLG+SGE+Y     +KIDVNK  +  VD  LYRQASPH SAVSSFS 
Subjt:  MGFDDLSQTGLVLGLGL-AAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY-----RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSG

Query:  GRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLN
        GRVKRERDLSSEEVE ERVSSRVSDEDEDG NARKKLRL+KEQSALLEESFKQ+STLNP                              KQKQALARQLN
Subjt:  GRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLN

Query:  LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG-ASKAKFSMAPKPHFYN
        LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GGVG+   SK+ FSMAPKPHFYN
Subjt:  LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG-ASKAKFSMAPKPHFYN

Query:  PFSNPSAAC
        PF+NPSAAC
Subjt:  PFSNPSAAC

TrEMBL top hitse value%identityAlignment
A0A2I4HM96 homeobox-leucine zipper protein HAT22-like6.7e-10472.26Show/hide
Query:  MGFDDLSQTGLVLGLGLAA--EASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY-----RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFS
        MGFDD+  TGLVLGLGL A  E +  P+      KKPACS+F   +  +FE  LTLG+SGE+Y     +KIDVNK  +  VD  LYRQASPH SAVSSFS
Subjt:  MGFDDLSQTGLVLGLGLAA--EASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY-----RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFS

Query:  GGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQL
         GRVKRERDLSSEEVE ERVSSRVSDEDEDG NARKKLRL+KEQSALLEESFKQ+STLNP                              KQKQALARQL
Subjt:  GGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQL

Query:  NLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG-ASKAKFSMAPKPHFY
        NLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GGVG+   SK+ FSMAPKPHFY
Subjt:  NLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG-ASKAKFSMAPKPHFY

Query:  NPFSNPSAAC
        NPF+NPSAAC
Subjt:  NPFSNPSAAC

A0A2N9J7A3 Homeobox domain-containing protein1.0e-10471.9Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAAS-SSVDFEPCLTLGISGETY---RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGR
        M F+D+  TGLVLGLGL A         ++   K  CS+F  S SS  FEP LTLG+ GE+Y   +KIDVNK  +  VD  LYRQASP +SAVSSFS GR
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAAS-SSVDFEPCLTLGISGETY---RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGR

Query:  VKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLR
        VKRERDLSSEEVE ERVSSRVSDEDEDG NARKKLRL+KEQSALLEESFKQ+STLNP                              KQKQALARQLNLR
Subjt:  VKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLR

Query:  PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFS
        PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GGVG+ ASK+ FSMAPKPHFYNPF+
Subjt:  PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFS

Query:  NPSAAC
        NPSAAC
Subjt:  NPSAAC

A0A6A1UND8 Homeobox-leucine zipper protein HAT224.3e-10371.48Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY---RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRV
        M FDD+  TGLVLGLG+AA A   P+    +  K +CS+F   ++ +FEP LTLG+SGETY   +KIDVN+  +   +  LYRQ SPH SAVSSFS GRV
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETY---RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRV

Query:  KRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP
        KRERDLSSEEVE ERVSSR+SDEDEDG NARKKLRL+KEQSALLEESFKQ+STLNP                              KQKQALARQLNLRP
Subjt:  KRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP

Query:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSN
        RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCER+GGVG+ ASK+ FSMAPKPHFYNPF+N
Subjt:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSN

Query:  PSAAC
        PSAAC
Subjt:  PSAAC

A0A6J0ZEZ9 homeobox-leucine zipper protein HAT224.3e-10368.57Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSF--AASSSVDFEPCLTLGISGETYRKIDVNKTSDV---GVDHH--------LYRQASPHSS
        MG DD   TGLVLGLG+++       A+NQ  KK +C  F   A+++  FEP LTLG+SGE+Y+ +  +K  DV   G  HH        LYRQASPH S
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSF--AASSSVDFEPCLTLGISGETYRKIDVNKTSDV---GVDHH--------LYRQASPHSS

Query:  AVSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQ
        AVSSFS GRVKRERDLSSEEVE+E+ SSRVSDEDEDG NARKKLRL+K+QSALLEESFKQ+STLNP                              KQKQ
Subjt:  AVSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQ

Query:  ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAP
        ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQP YMHMPAATLTMCPSCER+GGVGDG SK+ FSMA 
Subjt:  ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAP

Query:  KPHFYNPFSNPSAAC
        KPHFYNPF+NPSAAC
Subjt:  KPHFYNPFSNPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like3.2e-11477.6Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRL-KKPACSSFAASSSVDFEPCLTLGISGET-YRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGG---
        MGFDDLS TGLVLGLGL +EASP+PA D   L KKPA      SSS+DF+PCLTLG SGE+ YRKID         DHHLYRQASPHSSAVSSFSGG   
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRL-KKPACSSFAASSSVDFEPCLTLGISGET-YRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGG---

Query:  -RVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLN
         RVKRERDLSSEEV+LERVSSR+SDEDEDGSN RKKLRLS+EQSALLEESFKQNSTLNP                              KQKQALARQLN
Subjt:  -RVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLN

Query:  LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNP
        LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGG  DGASKAKFSMAPKPHFYNP
Subjt:  LRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNP

Query:  FSNPSAAC
        FSNPSAAC
Subjt:  FSNPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX199.7e-5246.34Show/hide
Query:  LSQTGLVLGLGL---AAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKT---SDVGVDHHLYRQASPHSSAVSSFSGGRVKR
        LS  GL LGL L       +   AA     ++P+ S    S     EP LTL +  +         T   S  G   H     S  S +V + +   VKR
Subjt:  LSQTGLVLGLGL---AAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKT---SDVGVDHHLYRQASPHSSAVSSFSGGRVKR

Query:  ERDLSSEEVELERVSSRVS--DEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP
        ER   +EE + ERVSS  +  D+D+DGS  RKKLRL+KEQSALLE+ F+++STLNP                              KQK ALA+QLNLRP
Subjt:  ERDLSSEEVELERVSSRVS--DEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP

Query:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGGVGDGA---
        RQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT+ENRRLQ+ELQEL+ALK A                 P YM +PAATLT+CPSCERVGG    A   
Subjt:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGGVGDGA---

Query:  ----SKAKFSMAPKPHFYNPFSNPSAAC
            +KA        HF+NPF++ SAAC
Subjt:  ----SKAKFSMAPKPHFYNPFSNPSAAC

P46603 Homeobox-leucine zipper protein HAT99.1e-7455.86Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGR-VKR
        MGFDD   TGLVLGLG      P P  +N        S+   SS    EP LTL +SG      D + T   G D  L RQ S H S VSSFS GR VKR
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGR-VKR

Query:  ERDLSSEEVELERVSSRV-SD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLR
        ERD   E  E E ++ RV SD  EDE+G +ARKKLRL+K+QSALLEESFK +STLNP                              KQKQ LARQLNLR
Subjt:  ERDLSSEEVELERVSSRV-SD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLR

Query:  PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG------------------A
        PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG G G                   
Subjt:  PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG------------------A

Query:  SKAKFSMAPKPHFYNPFSNPSAAC
        +K  FS++ KPHF+NPF+NPSAAC
Subjt:  SKAKFSMAPKPHFYNPFSNPSAAC

P46604 Homeobox-leucine zipper protein HAT224.2e-7957.19Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRVKRE
        MG DD   TGLVLGLGL    SP P   N  +KK   SS      +  +P LTL +SGE+Y+       +  G    + RQ S H S +SSFS GRVKRE
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRVKRE

Query:  RDLS-------SEEVELERVSSRVSD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALAR
        R++S       +EE     V SRVSD  +DE+G +ARKKLRL+K+QSALLE++FK +STLNP                              KQKQALAR
Subjt:  RDLS-------SEEVELERVSSRVSD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALAR

Query:  QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG---------ASKAK
        QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQ+LKALKL+QP YMHMPAATLTMCPSCER+GG G G          +K  
Subjt:  QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG---------ASKAK

Query:  FSMAPKPHFYNPFSNPSAAC
        FS+  KP FYNPF+NPSAAC
Subjt:  FSMAPKPHFYNPFSNPSAAC

Q6YPD0 Homeobox-leucine zipper protein HOX272.0e-4945.11Show/hide
Query:  LVLGLGLAAEASPRPAADNQRLKKP---ACSSFAASSSVDFEPCLTL--------------GISGETYRKIDVNKTSDVGV------DHHLYRQASPHSS
        LVLGLG+   A  R   + +R ++        +AA ++   EP + L              G S    R  DVN+   V        D      A+P  S
Subjt:  LVLGLGLAAEASPRPAADNQRLKKP---ACSSFAASSSVDFEPCLTL--------------GISGETYRKIDVNKTSDVGV------DHHLYRQASPHSS

Query:  AVSSFSGGRVKRERDLSSEEVE---------------LERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQIL
        +  + SGG      DLS + +                 ER SSR SD+DE G++ARKKLRLSKEQSA LEESFK++STLNP                   
Subjt:  AVSSFSGGRVKRERDLSSEEVE---------------LERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQIL

Query:  TDTSLIGTFNQKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG
                   KQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE+LK+CCETLT+ENRRL KEL EL+ALK A+P YMH+PA TL+MCPSCERV 
Subjt:  TDTSLIGTFNQKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVG

Query:  GVGDGASKAKFSMAPKP
             AS +  + A  P
Subjt:  GVGDGASKAKFSMAPKP

Q8GRL4 Homeobox-leucine zipper protein HOX199.7e-5246.34Show/hide
Query:  LSQTGLVLGLGL---AAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKT---SDVGVDHHLYRQASPHSSAVSSFSGGRVKR
        LS  GL LGL L       +   AA     ++P+ S    S     EP LTL +  +         T   S  G   H     S  S +V + +   VKR
Subjt:  LSQTGLVLGLGL---AAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKT---SDVGVDHHLYRQASPHSSAVSSFSGGRVKR

Query:  ERDLSSEEVELERVSSRVS--DEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP
        ER   +EE + ERVSS  +  D+D+DGS  RKKLRL+KEQSALLE+ F+++STLNP                              KQK ALA+QLNLRP
Subjt:  ERDLSSEEVELERVSSRVS--DEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRP

Query:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGGVGDGA---
        RQVEVWFQNRRARTKLKQTEVDCEFLK+CCETLT+ENRRLQ+ELQEL+ALK A                 P YM +PAATLT+CPSCERVGG    A   
Subjt:  RQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGGVGDGA---

Query:  ----SKAKFSMAPKPHFYNPFSNPSAAC
            +KA        HF+NPF++ SAAC
Subjt:  ----SKAKFSMAPKPHFYNPFSNPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family6.4e-7555.86Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGR-VKR
        MGFDD   TGLVLGLG      P P  +N        S+   SS    EP LTL +SG      D + T   G D  L RQ S H S VSSFS GR VKR
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGR-VKR

Query:  ERDLSSEEVELERVSSRV-SD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLR
        ERD   E  E E ++ RV SD  EDE+G +ARKKLRL+K+QSALLEESFK +STLNP                              KQKQ LARQLNLR
Subjt:  ERDLSSEEVELERVSSRV-SD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLR

Query:  PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG------------------A
        PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETL DEN RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG G G                   
Subjt:  PRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG------------------A

Query:  SKAKFSMAPKPHFYNPFSNPSAAC
        +K  FS++ KPHF+NPF+NPSAAC
Subjt:  SKAKFSMAPKPHFYNPFSNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 32.8e-4651.05Show/hide
Query:  RKIDVNK---TSDVGVDHHLYRQASPHSSAVSSFSGGRVKRERDLSS--------EEVELERVSSRV--SDEDEDGS-----NARKKLRLSKEQSALLEE
        R IDVN+   T  V V+      +SP+S+  S  SG + +RE   ++        E+ E+ER S  +    +DEDGS     ++RKKLRLSKEQ+ +LEE
Subjt:  RKIDVNK---TSDVGVDHHLYRQASPHSSAVSSFSGGRVKRERDLSS--------EEVELERVSSRV--SDEDEDGS-----NARKKLRLSKEQSALLEE

Query:  SFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQE
        +FK++STLNP                              KQK ALA+QLNLR RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE LTDENRRLQKE+ E
Subjt:  SFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQE

Query:  LKALKLAQPLYMHM-PAATLTMCPSCERVGGVGDGASKA
        L+ALKL+  LYMHM P  TLTMCPSCERV      +S A
Subjt:  LKALKLAQPLYMHM-PAATLTMCPSCERVGGVGDGASKA

AT4G16780.1 homeobox protein 29.7e-4754.03Show/hide
Query:  RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLS
        R IDVN+              S  +S VSS +G R +RE D   +        SR   +DEDG N+RKKLRLSK+QSA+LEE+FK +STLNP        
Subjt:  RKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLS

Query:  LSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAA
                              KQKQALA+QL LR RQVEVWFQNRRARTKLKQTEVDCEFL++CCE LT+ENRRLQKE+ EL+ALKL+   YMHM P  
Subjt:  LSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHM-PAA

Query:  TLTMCPSCERV
        TLTMCPSCE V
Subjt:  TLTMCPSCERV

AT4G37790.1 Homeobox-leucine zipper protein family3.0e-8057.19Show/hide
Query:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRVKRE
        MG DD   TGLVLGLGL    SP P   N  +KK   SS      +  +P LTL +SGE+Y+       +  G    + RQ S H S +SSFS GRVKRE
Subjt:  MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRVKRE

Query:  RDLS-------SEEVELERVSSRVSD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALAR
        R++S       +EE     V SRVSD  +DE+G +ARKKLRL+K+QSALLE++FK +STLNP                              KQKQALAR
Subjt:  RDLS-------SEEVELERVSSRVSD--EDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALAR

Query:  QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG---------ASKAK
        QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQ+LKALKL+QP YMHMPAATLTMCPSCER+GG G G          +K  
Subjt:  QLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDG---------ASKAK

Query:  FSMAPKPHFYNPFSNPSAAC
        FS+  KP FYNPF+NPSAAC
Subjt:  FSMAPKPHFYNPFSNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana9.7e-4757.3Show/hide
Query:  VSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSN--ARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQK
        + S+   R   +RD+     E+ER +SR S+ED D  N   RKKLRLSK+QSA LE+SFK++STLNP                              KQK
Subjt:  VSSFSGGRVKRERDLSSEEVELERVSSRVSDEDEDGSN--ARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQK

Query:  QALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV
         ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE+LK+CCE+LT+ENRRLQKE++EL+ LK + P YM +PA TLTMCPSCERV
Subjt:  QALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGATGATCTTTCTCAGACGGGCTTGGTCTTGGGGTTAGGGCTCGCAGCAGAAGCTTCTCCAAGGCCAGCAGCTGATAACCAGAGGTTGAAGAAGCCAGCTTG
CTCGAGTTTCGCCGCCTCCAGTTCAGTTGATTTTGAGCCTTGTTTGACTTTAGGTATTTCCGGCGAGACTTACCGGAAGATTGACGTGAACAAGACCTCCGACGTGGGCG
TTGATCATCATTTGTATCGCCAAGCTTCCCCTCATAGCAGCGCTGTTTCTTCCTTCTCCGGCGGTAGGGTTAAAAGGGAGAGAGACCTTAGCAGCGAAGAAGTGGAATTG
GAGAGAGTTTCTTCTAGAGTCAGCGATGAAGACGAAGATGGTTCTAATGCTAGAAAGAAGCTTAGGCTCTCCAAAGAACAGTCAGCTCTCTTGGAAGAAAGCTTCAAACA
AAACAGCACTCTCAATCCTGTAACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTATTTCCAATCATACAAATTCTAACAGATACTTCTCTGATTGGTACTTTTAACCAGA
AGCAAAAGCAGGCTTTAGCGAGACAGCTAAATCTGCGGCCACGACAAGTCGAAGTATGGTTTCAGAATAGGAGAGCCAGAACGAAACTGAAACAAACAGAAGTAGACTGC
GAGTTCTTGAAGAAGTGCTGCGAGACGCTGACAGACGAAAACAGAAGACTACAGAAGGAGCTACAAGAACTGAAGGCGCTAAAGCTCGCGCAGCCTCTTTACATGCACAT
GCCGGCGGCGACGCTGACGATGTGCCCGTCGTGCGAAAGGGTCGGCGGCGTCGGCGACGGAGCTTCCAAAGCCAAGTTTTCTATGGCTCCTAAGCCCCACTTTTACAACC
CCTTCTCCAATCCTTCCGCCGCTTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTTGATGATCTTTCTCAGACGGGCTTGGTCTTGGGGTTAGGGCTCGCAGCAGAAGCTTCTCCAAGGCCAGCAGCTGATAACCAGAGGTTGAAGAAGCCAGCTTG
CTCGAGTTTCGCCGCCTCCAGTTCAGTTGATTTTGAGCCTTGTTTGACTTTAGGTATTTCCGGCGAGACTTACCGGAAGATTGACGTGAACAAGACCTCCGACGTGGGCG
TTGATCATCATTTGTATCGCCAAGCTTCCCCTCATAGCAGCGCTGTTTCTTCCTTCTCCGGCGGTAGGGTTAAAAGGGAGAGAGACCTTAGCAGCGAAGAAGTGGAATTG
GAGAGAGTTTCTTCTAGAGTCAGCGATGAAGACGAAGATGGTTCTAATGCTAGAAAGAAGCTTAGGCTCTCCAAAGAACAGTCAGCTCTCTTGGAAGAAAGCTTCAAACA
AAACAGCACTCTCAATCCTGTAACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTTATTTCCAATCATACAAATTCTAACAGATACTTCTCTGATTGGTACTTTTAACCAGA
AGCAAAAGCAGGCTTTAGCGAGACAGCTAAATCTGCGGCCACGACAAGTCGAAGTATGGTTTCAGAATAGGAGAGCCAGAACGAAACTGAAACAAACAGAAGTAGACTGC
GAGTTCTTGAAGAAGTGCTGCGAGACGCTGACAGACGAAAACAGAAGACTACAGAAGGAGCTACAAGAACTGAAGGCGCTAAAGCTCGCGCAGCCTCTTTACATGCACAT
GCCGGCGGCGACGCTGACGATGTGCCCGTCGTGCGAAAGGGTCGGCGGCGTCGGCGACGGAGCTTCCAAAGCCAAGTTTTCTATGGCTCCTAAGCCCCACTTTTACAACC
CCTTCTCCAATCCTTCCGCCGCTTGTTAG
Protein sequenceShow/hide protein sequence
MGFDDLSQTGLVLGLGLAAEASPRPAADNQRLKKPACSSFAASSSVDFEPCLTLGISGETYRKIDVNKTSDVGVDHHLYRQASPHSSAVSSFSGGRVKRERDLSSEEVEL
ERVSSRVSDEDEDGSNARKKLRLSKEQSALLEESFKQNSTLNPVTLSLSLSLSLLFPIIQILTDTSLIGTFNQKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDC
EFLKKCCETLTDENRRLQKELQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGDGASKAKFSMAPKPHFYNPFSNPSAAC