; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg029852 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg029852
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionHomeobox-leucine zipper protein family
Genome locationscaffold6:11853768..11855638
RNA-Seq ExpressionSpg029852
SyntenySpg029852
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016020 - membrane (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7031313.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-11180.14Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGG
        MG DD+S T+LVLGLG+S EAASPIINN    KKK    + TSL+FEPCALTLGFS         DV+   +        YRQASPHSSAVCSSFSGGGG
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGG

Query:  GRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
        G +KRERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLK
Subjt:  GRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVG  GVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia]1.3e-11182.86Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGG
        MGFDD+S T LVLGLGLSE +  P  + + L KKPAPCSS SLDF+PC LTLGFS  +  +++ D +            YRQASPHSSAV SSFSGGGGG
Subjt:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGG

Query:  -RVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
         RVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLS+EQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  -RVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KCCETLTDENRRLQKE+QELKALKLAQPLYMHMPAATLTMCPSCERVG  G  DGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022943239.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata]2.8e-11178.2Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSA-------YTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCS
        MG DD+S T+LVLGLG+S EAASPIINN    KKK    + TSL+FEPCALTLGFS        + H   L                YRQASPHSSAVCS
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSA-------YTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCS

Query:  SFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SFSGGGGG +KRERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTE
Subjt:  SFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        VDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVG  GVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_023000993.1 homeobox-leucine zipper protein HAT22-like [Cucurbita maxima]1.3e-11180.14Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDP----YRQASPHSSAVCSSFSGG
        MG DD+S T+LVLGLG+S EAASPIINN  K        TSLDFEPCALTLGFS         DV+           P    YRQASPHSSAVCSSFSGG
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDP----YRQASPHSSAVCSSFSGG

Query:  GGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF
        GGG +KRERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEF
Subjt:  GGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF

Query:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        LKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVGGVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_023547039.1 homeobox-leucine zipper protein HAT22-like [Cucurbita pepo subsp. pepo]3.1e-11076.61Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIIN---NKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDP------------YRQASPH
        MG DD+S T+LVLGLG+S EAASPI+N   N  KKK    + TSL+FEPCALTLGFS                     VDP            YRQASPH
Subjt:  MGFDDISHTALVLGLGLSEEAASPIIN---NKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDP------------YRQASPH

Query:  SSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRART
        SSAVCSSFSGGGGG +KRERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRART
Subjt:  SSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRART

Query:  KLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        K+KQTEVDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVG  GVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  KLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

TrEMBL top hitse value%identityAlignment
A0A1S3BNL6 homeobox-leucine zipper protein HAT22-like3.5e-9973.52Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFS---AYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGG
        MGFDD S T LVLGLGLSE A        LKKKPAPCSS+SLDFEPC LTLGFS      HR       K+++ + V    YRQ SPHSSAVCSSFS   
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFS---AYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGG

Query:  GGRVKRERDLSSEEVDLERVSSRISDEDEDG-SNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF
         G+VKRERDLSSEEV+LERV  R+SDED+DG +NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTEVDCE 
Subjt:  GGRVKRERDLSSEEVDLERVSSRISDEDEDG-SNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF

Query:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV-----GGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        LKKCCETLTDENRRLQKEVQELKA+KLA+P+YM M  ATLT+CPSCERV     GGV  G+  SK KFSM P P FYNPFSNPSAAC
Subjt:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV-----GGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A2P5F2Y1 Octamer-binding transcription factor1.6e-9969.48Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCS--STSLDFEPCALTLGFSA---YTHRKQ---------LVDVNK-------------IVNSDV
        MGFDD+ +T LVLGLG +  A+S        +KP+  S  +T+  FEP +LTLG S    Y+             ++DVNK              VN++ 
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCS--STSLDFEPCALTLGFSA---YTHRKQ---------LVDVNK-------------IVNSDV

Query:  VAVDPYRQASPHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQ
          +D YRQASPHS+   SSFS  GGGRVKRERDLSSEE+++E+VSSRISDEDEDG N RKKLRL+KEQSALLEESFKQ+STLNPKQKQALARQLNLRPRQ
Subjt:  VAVDPYRQASPHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQ

Query:  VEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG---DGASKAKFSMAPKPHFYNP
        VEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKE+QELKALKL+QPLYMHMPAATLTMCPSCER+GGVGVG   DGASK+ FSMAPKPHFYNP
Subjt:  VEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG---DGASKAKFSMAPKPHFYNP

Query:  FSNPSAAC
        F+NPSAAC
Subjt:  FSNPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like6.1e-11282.86Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGG
        MGFDD+S T LVLGLGLSE +  P  + + L KKPAPCSS SLDF+PC LTLGFS  +  +++ D +            YRQASPHSSAV SSFSGGGGG
Subjt:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGG

Query:  -RVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
         RVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLS+EQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  -RVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KCCETLTDENRRLQKE+QELKALKLAQPLYMHMPAATLTMCPSCERVG  G  DGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like1.4e-11178.2Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSA-------YTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCS
        MG DD+S T+LVLGLG+S EAASPIINN    KKK    + TSL+FEPCALTLGFS        + H   L                YRQASPHSSAVCS
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSA-------YTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCS

Query:  SFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        SFSGGGGG +KRERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTE
Subjt:  SFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        VDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVG  GVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVG--GVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1KP81 homeobox-leucine zipper protein HAT22-like6.1e-11280.14Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDP----YRQASPHSSAVCSSFSGG
        MG DD+S T+LVLGLG+S EAASPIINN  K        TSLDFEPCALTLGFS         DV+           P    YRQASPHSSAVCSSFSGG
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDP----YRQASPHSSAVCSSFSGG

Query:  GGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF
        GGG +KRERDLSSEEV+LERV  R+SDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEF
Subjt:  GGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF

Query:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        LKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVGGVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX191.1e-5460.73Show/hide
Query:  PHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR
        P  S    S        VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL+KEQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNR
Subjt:  PHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR

Query:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG------VGVGDGASKAKF
        RARTKLKQTEVDCEFLK+CCETLT+ENRRLQ+E+QEL+ALK A                 P YM +PAATLT+CPSCERVGG      V   DG +KA  
Subjt:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG------VGVGDGASKAKF

Query:  SMAPKPHFYNPFSNPSAAC
              HF+NPF++ SAAC
Subjt:  SMAPKPHFYNPFSNPSAAC

A2YW03 Homeobox-leucine zipper protein HOX276.2e-5370.19Show/hide
Query:  HSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR
        H+ A      GGGGG               ER SSR SD+DE G++ RKKLRLSKEQSA LEESFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRAR
Subjt:  HSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR

Query:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV
        TKLKQTEVDCE+LK+CCETLT+ENRRL KE+ EL+ALK A+P YMH+PA TL+MCPSCERV
Subjt:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV

P46603 Homeobox-leucine zipper protein HAT97.8e-7258.25Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR
        MGFDD  +T LVLGLG      SPI NN          S+    EP +LTL  S          V  +  +D +     RQ S HS    SSFS   G  
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR

Query:  VKRERDLSSEEVDLERVSSR-ISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL
        VKRERD   E  + E ++ R ISD  EDE+G + RKKLRL+K+QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL
Subjt:  VKRERDLSSEEVDLERVSSR-ISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL

Query:  KKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDG----------------ASKAKFSMAPKPHFYNPFSNPSAAC
        KKCCETL DEN RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG G G+G                 +K  FS++ KPHF+NPF+NPSAAC
Subjt:  KKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDG----------------ASKAKFSMAPKPHFYNPFSNPSAAC

P46604 Homeobox-leucine zipper protein HAT223.1e-7659.18Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR
        MG DD  +T LVLGLGLS    +   N+ +KK  +      +  +P +LTL  S  ++        KI           RQ S HS    SSFS    GR
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR

Query:  VKRERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        VKRER++S       +EE     V SR+SD  +DE+G + RKKLRL+K+QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
Subjt:  VKRERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-------DGASKAKFSMAPKPHFYNPFSNPSAAC
        VDCEFLKKCCETLTDENRRLQKE+Q+LKALKL+QP YMHMPAATLTMCPSCER+GG GVG       +  +K  FS+  KP FYNPF+NPSAAC
Subjt:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-------DGASKAKFSMAPKPHFYNPFSNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX191.1e-5460.73Show/hide
Query:  PHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR
        P  S    S        VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL+KEQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNR
Subjt:  PHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRIS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNR

Query:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG------VGVGDGASKAKF
        RARTKLKQTEVDCEFLK+CCETLT+ENRRLQ+E+QEL+ALK A                 P YM +PAATLT+CPSCERVGG      V   DG +KA  
Subjt:  RARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG------VGVGDGASKAKF

Query:  SMAPKPHFYNPFSNPSAAC
              HF+NPF++ SAAC
Subjt:  SMAPKPHFYNPFSNPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family5.5e-7358.25Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR
        MGFDD  +T LVLGLG      SPI NN          S+    EP +LTL  S          V  +  +D +     RQ S HS    SSFS   G  
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR

Query:  VKRERDLSSEEVDLERVSSR-ISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL
        VKRERD   E  + E ++ R ISD  EDE+G + RKKLRL+K+QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL
Subjt:  VKRERDLSSEEVDLERVSSR-ISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL

Query:  KKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDG----------------ASKAKFSMAPKPHFYNPFSNPSAAC
        KKCCETL DEN RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG G G+G                 +K  FS++ KPHF+NPF+NPSAAC
Subjt:  KKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGDG----------------ASKAKFSMAPKPHFYNPFSNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 36.6e-5059.51Show/hide
Query:  VDVNKIVNSDVVAV-DPYRQASPHSSAVCSSFSGG-------------GGGRVKRERDLSSEEVDLERVSSRI--SDEDEDGS-----NTRKKLRLSKEQ
        +DVN+  ++ VV V D     S  +S V S  SG              GGGRV        E+ ++ER S  +    +DEDGS     ++RKKLRLSKEQ
Subjt:  VDVNKIVNSDVVAV-DPYRQASPHSSAVCSSFSGG-------------GGGRVKRERDLSSEEVDLERVSSRI--SDEDEDGS-----NTRKKLRLSKEQ

Query:  SALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCP
        + +LEE+FK++STLNPKQK ALA+QLNLR RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE LTDENRRLQKEV EL+ALKL+  LYMHM P  TLTMCP
Subjt:  SALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCP

Query:  SCERV
        SCERV
Subjt:  SCERV

AT4G16780.1 homeobox protein 21.1e-4960.87Show/hide
Query:  VDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQA
        +DVN+  ++     +    +SP+S+      S   G R +RE D   +        SR   +DEDG N+RKKLRLSK+QSA+LEE+FK +STLNPKQKQA
Subjt:  VDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQA

Query:  LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCPSCERV
        LA+QL LR RQVEVWFQNRRARTKLKQTEVDCEFL++CCE LT+ENRRLQKEV EL+ALKL+   YMHM P  TLTMCPSCE V
Subjt:  LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCPSCERV

AT4G37790.1 Homeobox-leucine zipper protein family2.2e-7759.18Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR
        MG DD  +T LVLGLGLS    +   N+ +KK  +      +  +P +LTL  S  ++        KI           RQ S HS    SSFS    GR
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGR

Query:  VKRERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
        VKRER++S       +EE     V SR+SD  +DE+G + RKKLRL+K+QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE
Subjt:  VKRERDLS-------SEEVDLERVSSRISD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTE

Query:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-------DGASKAKFSMAPKPHFYNPFSNPSAAC
        VDCEFLKKCCETLTDENRRLQKE+Q+LKALKL+QP YMHMPAATLTMCPSCER+GG GVG       +  +K  FS+  KP FYNPF+NPSAAC
Subjt:  VDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-------DGASKAKFSMAPKPHFYNPFSNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana1.2e-5165.91Show/hide
Query:  PYRQASPHSSAVCSSF------SGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSN--TRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNL
        P    SP  S V SSF         G  R   +RD+  E   +ER +SR S+ED D  N  TRKKLRLSK+QSA LE+SFK++STLNPKQK ALA+QLNL
Subjt:  PYRQASPHSSAVCSSF------SGGGGGRVKRERDLSSEEVDLERVSSRISDEDEDGSN--TRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNL

Query:  RPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV
        RPRQVEVWFQNRRARTKLKQTEVDCE+LK+CCE+LT+ENRRLQKEV+EL+ LK + P YM +PA TLTMCPSCERV
Subjt:  RPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGATGATATTTCTCATACAGCCTTGGTTTTGGGCTTAGGGCTCTCAGAAGAAGCTGCTTCTCCAATTATCAACAACAAGTTGAAGAAAAAGCCTGCTCCTTG
CTCCTCTACTTCACTTGATTTTGAGCCTTGTGCTTTGACTTTGGGATTCTCCGCCTACACTCACAGGAAACAACTCGTCGACGTTAACAAGATCGTCAACTCTGACGTCG
TCGCTGTCGATCCGTATCGCCAAGCTTCCCCTCATAGCAGCGCCGTTTGTTCTTCCTTCTCCGGTGGCGGCGGGGGTAGGGTTAAAAGGGAGAGAGACCTCAGCAGTGAA
GAAGTTGACTTGGAGAGGGTTTCTTCGAGAATCAGCGATGAAGATGAAGATGGTTCCAATACTAGAAAGAAGCTCAGGCTTTCTAAAGAACAATCCGCTCTCTTGGAAGA
GAGTTTCAAACAAAATAGCACTCTCAACCCGAAGCAAAAACAAGCCTTGGCGAGACAATTAAATCTACGGCCACGACAAGTTGAAGTATGGTTTCAGAATAGGAGAGCTC
GAACGAAACTGAAACAAACAGAAGTAGATTGTGAGTTCTTGAAGAAGTGTTGCGAGACGCTGACTGATGAAAACAGAAGACTACAAAAGGAGGTACAAGAATTGAAGGCA
CTGAAGCTGGCACAGCCTCTATACATGCACATGCCAGCAGCAACATTAACGATGTGTCCTTCATGCGAAAGGGTCGGCGGCGTCGGCGTCGGCGACGGCGCTTCCAAAGC
CAAATTTTCAATGGCTCCCAAGCCTCACTTTTACAACCCCTTCTCCAATCCTTCCGCCGCATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTTGATGATATTTCTCATACAGCCTTGGTTTTGGGCTTAGGGCTCTCAGAAGAAGCTGCTTCTCCAATTATCAACAACAAGTTGAAGAAAAAGCCTGCTCCTTG
CTCCTCTACTTCACTTGATTTTGAGCCTTGTGCTTTGACTTTGGGATTCTCCGCCTACACTCACAGGAAACAACTCGTCGACGTTAACAAGATCGTCAACTCTGACGTCG
TCGCTGTCGATCCGTATCGCCAAGCTTCCCCTCATAGCAGCGCCGTTTGTTCTTCCTTCTCCGGTGGCGGCGGGGGTAGGGTTAAAAGGGAGAGAGACCTCAGCAGTGAA
GAAGTTGACTTGGAGAGGGTTTCTTCGAGAATCAGCGATGAAGATGAAGATGGTTCCAATACTAGAAAGAAGCTCAGGCTTTCTAAAGAACAATCCGCTCTCTTGGAAGA
GAGTTTCAAACAAAATAGCACTCTCAACCCGAAGCAAAAACAAGCCTTGGCGAGACAATTAAATCTACGGCCACGACAAGTTGAAGTATGGTTTCAGAATAGGAGAGCTC
GAACGAAACTGAAACAAACAGAAGTAGATTGTGAGTTCTTGAAGAAGTGTTGCGAGACGCTGACTGATGAAAACAGAAGACTACAAAAGGAGGTACAAGAATTGAAGGCA
CTGAAGCTGGCACAGCCTCTATACATGCACATGCCAGCAGCAACATTAACGATGTGTCCTTCATGCGAAAGGGTCGGCGGCGTCGGCGTCGGCGACGGCGCTTCCAAAGC
CAAATTTTCAATGGCTCCCAAGCCTCACTTTTACAACCCCTTCTCCAATCCTTCCGCCGCATGTTAG
Protein sequenceShow/hide protein sequence
MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSAYTHRKQLVDVNKIVNSDVVAVDPYRQASPHSSAVCSSFSGGGGGRVKRERDLSSE
EVDLERVSSRISDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKA
LKLAQPLYMHMPAATLTMCPSCERVGGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC