; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018969 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018969
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionHomeobox-leucine zipper protein family
Genome locationchr5:37163796..37164833
RNA-Seq ExpressionLag0018969
SyntenyLag0018969
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600675.1 Homeobox-leucine zipper protein HAT22, partial [Cucurbita argyrosperma subsp. sororia]9.8e-11277.89Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP-----------YRQASPHSS
        MG DD+S T+LVLGLG+S EAASPIINN    KK     + TSL+FEPCALTLGFS D                   VDP           YRQASPHSS
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP-----------YRQASPHSS

Query:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK
        AVCSSFSGGG G  +KRERDLSSEEV+LERV  RVSDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK
Subjt:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK

Query:  LKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        +KQTEVDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVGGVGVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

KAG7031313.1 Homeobox-leucine zipper protein HAT22 [Cucurbita argyrosperma subsp. argyrosperma]5.2e-11378.57Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP-----------YRQASPHSS
        MG DD+S T+LVLGLG+S EAASPIINN    KKK    + TSL+FEPCALTLGFS D                   VDP           YRQASPHSS
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP-----------YRQASPHSS

Query:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK
        AVCSSFSGGG GG +KRERDLSSEEV+LERV  RVSDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK
Subjt:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTK

Query:  LKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        +KQTEVDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVGGVGVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  LKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022136472.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia]7.5e-11281.56Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCG
        MGFDD+S T LVLGLGLSE +  P  + + L KKPAPCSS SLDF+PC LTLGFS ++  +++ D +            YRQASPHSSAV SSFSGGG G
Subjt:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCG

Query:  GRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
         RVKRERDLSSEEVDLERVSSR+SDEDEDGSNTRKKLRLS+EQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  GRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KCCETLTDENRRLQKE+QELKALKLAQPLYMHMPAATLTMCPSCERVG    G  DGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_022943239.1 homeobox-leucine zipper protein HAT22-like [Cucurbita moschata]5.2e-11378.84Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP----------YRQASPHSSA
        MG DD+S T+LVLGLG+S EAASPIINN    KKK    + TSL+FEPCALTLGFS D                   VDP          YRQASPHSSA
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP----------YRQASPHSSA

Query:  VCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKL
        VCSSFSGGG GG +KRERDLSSEEV+LERV  RVSDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+
Subjt:  VCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKL

Query:  KQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KQTEVDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVGGVGVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  KQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

XP_023547039.1 homeobox-leucine zipper protein HAT22-like [Cucurbita pepo subsp. pepo]1.5e-11277.7Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIIN---NKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP------------YRQASPH
        MG DD+S T+LVLGLG+S EAASPI+N   N  KKK    + TSL+FEPCALTLGFS D                   VDP            YRQASPH
Subjt:  MGFDDISHTALVLGLGLSEEAASPIIN---NKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP------------YRQASPH

Query:  SSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR
        SSAVCSSFSGGG GG +KRERDLSSEEV+LERV  RVSDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRAR
Subjt:  SSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR

Query:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        TK+KQTEVDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVGGVGVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

TrEMBL top hitse value%identityAlignment
A0A1S3BNL6 homeobox-leucine zipper protein HAT22-like9.3e-10073.96Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFS---ADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGG
        MGFDD S T LVLGLGLSE A        LKKKPAPCSS+SLDFEPC LTLGFS    D HRK ++D   + +        YRQ SPHSSAVCSSFS   
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFS---ADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGG

Query:  CGGRVKRERDLSSEEVDLERVSSRVSDEDEDG-SNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE
          G+VKRERDLSSEEV+LERV  RVSDED+DG +NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LARQLNL PRQVEVWFQNRRARTK+KQTEVDCE
Subjt:  CGGRVKRERDLSSEEVDLERVSSRVSDEDEDG-SNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCE

Query:  FLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV---GGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
         LKKCCETLTDENRRLQKEVQELKA+KLA+P+YM M  ATLT+CPSCERV   G  GV  G+  SK KFSM P P FYNPFSNPSAAC
Subjt:  FLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV---GGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A2P5F2Y1 Octamer-binding transcription factor1.1e-10069.9Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCS--STSLDFEPCALTLGFSA---------DTHRK---QLVDVNK-------------IVNSDV
        MGFDD+ +T LVLGLG +  A+S        +KP+  S  +T+  FEP +LTLG S          +  RK    ++DVNK              VN++ 
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCS--STSLDFEPCALTLGFSA---------DTHRK---QLVDVNK-------------IVNSDV

Query:  IAVDPYRQASPHSSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPR
          +D YRQASPHS+   SSFSG   GGRVKRERDLSSEE+++E+VSSR+SDEDEDG N RKKLRL+KEQSALLEESFKQ+STLNPKQKQALARQLNLRPR
Subjt:  IAVDPYRQASPHSSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPR

Query:  QVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVG-DGASKAKFSMAPKPHFYN
        QVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKE+QELKALKL+QPLYMHMPAATLTMCPSCER+GGVGVGVG DGASK+ FSMAPKPHFYN
Subjt:  QVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVG-DGASKAKFSMAPKPHFYN

Query:  PFSNPSAAC
        PF+NPSAAC
Subjt:  PFSNPSAAC

A0A6J1C5L0 homeobox-leucine zipper protein HAT22-like3.6e-11281.56Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCG
        MGFDD+S T LVLGLGLSE +  P  + + L KKPAPCSS SLDF+PC LTLGFS ++  +++ D +            YRQASPHSSAV SSFSGGG G
Subjt:  MGFDDISHTALVLGLGLSEEAASPIIN-NKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCG

Query:  GRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
         RVKRERDLSSEEVDLERVSSR+SDEDEDGSNTRKKLRLS+EQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  GRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KCCETLTDENRRLQKE+QELKALKLAQPLYMHMPAATLTMCPSCERVG    G  DGASKAKFSMAPKPHFYNPFSNPSAAC
Subjt:  KCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1FR71 homeobox-leucine zipper protein HAT22-like2.5e-11378.84Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP----------YRQASPHSSA
        MG DD+S T+LVLGLG+S EAASPIINN    KKK    + TSL+FEPCALTLGFS D                   VDP          YRQASPHSSA
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINN--KLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDP----------YRQASPHSSA

Query:  VCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKL
        VCSSFSGGG GG +KRERDLSSEEV+LERV  RVSDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+
Subjt:  VCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKL

Query:  KQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        KQTEVDCEFLKKCCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVGGVGVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  KQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

A0A6J1KP81 homeobox-leucine zipper protein HAT22-like9.9e-11080.07Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG
        MG DD+S T+LVLGLG+S EAASPIINN  K        TSLDFEPCALTLGFS D         +       +    YRQASPHSSAVCSSFSGGG GG
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG

Query:  RVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKK
         +KRERDLSSEEV+LERV  RVSDEDEDG NTRKKLRLSK+QSALLEESFKQNSTLNPKQKQ LAR LNLRPRQVEVWFQNRRARTK+KQTEVDCEFLKK
Subjt:  RVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKK

Query:  CCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC
        CCETLT+ENRRLQKE+QELKALKLA PLYMHMPAATLTMCPSCERVG  GVGVGDGASK KFSMAPKPHFYNPFSNPSAAC
Subjt:  CCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC

SwissProt top hitse value%identityAlignment
A2XE76 Homeobox-leucine zipper protein HOX195.1e-5561.57Show/hide
Query:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR
        +V S   G      VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL+KEQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRAR
Subjt:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR

Query:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG----VGVGVGDGASKAKFSMA
        TKLKQTEVDCEFLK+CCETLT+ENRRLQ+E+QEL+ALK A                 P YM +PAATLT+CPSCERVGG      V   DG +KA     
Subjt:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG----VGVGVGDGASKAKFSMA

Query:  PKPHFYNPFSNPSAAC
           HF+NPF++ SAAC
Subjt:  PKPHFYNPFSNPSAAC

A2YW03 Homeobox-leucine zipper protein HOX272.4e-5263.59Show/hide
Query:  HSSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA
        H+ A      GGG G                ER SSR SD+DE G++ RKKLRLSKEQSA LEESFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRA
Subjt:  HSSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRA

Query:  RTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKP
        RTKLKQTEVDCE+LK+CCETLT+ENRRL KE+ EL+ALK A+P YMH+PA TL+MCPSCERV          A  A  S A  P
Subjt:  RTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKP

P46603 Homeobox-leucine zipper protein HAT92.5e-7358.72Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG
        MGFDD  +T LVLGLG      SPI NN          S+    EP +LTL  S D        V  +  +D +     RQ S HS    SSFS G    
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG

Query:  RVKRERDLSSEEVDLERVSSRV-SD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF
         VKRERD   E  + E ++ RV SD  EDE+G + RKKLRL+K+QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF
Subjt:  RVKRERDLSSEEVDLERVSSRV-SD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF

Query:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKAKFSMAPKPHFYNPFSNPSAAC
        LKKCCETL DEN RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG G G G G               +K  FS++ KPHF+NPF+NPSAAC
Subjt:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKAKFSMAPKPHFYNPFSNPSAAC

P46604 Homeobox-leucine zipper protein HAT223.7e-7759.66Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG
        MG DD  +T LVLGLGLS    +   N+ +KK  +      +  +P +LTL  S +++        KI           RQ S HS    SSFS     G
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG

Query:  RVKRERDLS-------SEEVDLERVSSRVSD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQT
        RVKRER++S       +EE     V SRVSD  +DE+G + RKKLRL+K+QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQT
Subjt:  RVKRERDLS-------SEEVDLERVSSRVSD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQT

Query:  EVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKAKFSMAPKPHFYNPFSNPSAAC
        EVDCEFLKKCCETLTDENRRLQKE+Q+LKALKL+QP YMHMPAATLTMCPSCER+GG GVG     V +  +K  FS+  KP FYNPF+NPSAAC
Subjt:  EVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKAKFSMAPKPHFYNPFSNPSAAC

Q8GRL4 Homeobox-leucine zipper protein HOX195.1e-5561.57Show/hide
Query:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR
        +V S   G      VKRER   +EE D ERVSS  +  D+D+DGS TRKKLRL+KEQSALLE+ F+++STLNPKQK ALA+QLNLRPRQVEVWFQNRRAR
Subjt:  AVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVS--DEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRAR

Query:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG----VGVGVGDGASKAKFSMA
        TKLKQTEVDCEFLK+CCETLT+ENRRLQ+E+QEL+ALK A                 P YM +PAATLT+CPSCERVGG      V   DG +KA     
Subjt:  TKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLA----------------QPLYMHMPAATLTMCPSCERVGG----VGVGVGDGASKAKFSMA

Query:  PKPHFYNPFSNPSAAC
           HF+NPF++ SAAC
Subjt:  PKPHFYNPFSNPSAAC

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family1.7e-7458.72Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG
        MGFDD  +T LVLGLG      SPI NN          S+    EP +LTL  S D        V  +  +D +     RQ S HS    SSFS G    
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG

Query:  RVKRERDLSSEEVDLERVSSRV-SD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF
         VKRERD   E  + E ++ RV SD  EDE+G + RKKLRL+K+QSALLEESFK +STLNPKQKQ LARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF
Subjt:  RVKRERDLSSEEVDLERVSSRV-SD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEF

Query:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKAKFSMAPKPHFYNPFSNPSAAC
        LKKCCETL DEN RLQKE+QELK LKL QP YMHMPA+TLT CPSCER+GG G G G G               +K  FS++ KPHF+NPF+NPSAAC
Subjt:  LKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDG--------------ASKAKFSMAPKPHFYNPFSNPSAAC

AT3G60390.1 homeobox-leucine zipper protein 31.1e-4959.11Show/hide
Query:  VDVNKIVNSDVIAV-DPYRQASPHSSAVCSSFSGGGCGGRVKRERDLSS----------EEVDLERVSSRV--SDEDEDGS-----NTRKKLRLSKEQSA
        +DVN+  ++ V+ V D     S  +S V S  SG       K ER+L +          E+ ++ER S  +    +DEDGS     ++RKKLRLSKEQ+ 
Subjt:  VDVNKIVNSDVIAV-DPYRQASPHSSAVCSSFSGGGCGGRVKRERDLSS----------EEVDLERVSSRV--SDEDEDGS-----NTRKKLRLSKEQSA

Query:  LLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCPSC
        +LEE+FK++STLNPKQK ALA+QLNLR RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE LTDENRRLQKEV EL+ALKL+  LYMHM P  TLTMCPSC
Subjt:  LLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCPSC

Query:  ERV
        ERV
Subjt:  ERV

AT4G16780.1 homeobox protein 26.7e-5060.54Show/hide
Query:  VDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQ
        +DVN+  ++     +    +SP+S+   S+      G R +RE D   +        SR   +DEDG N+RKKLRLSK+QSA+LEE+FK +STLNPKQKQ
Subjt:  VDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGGRVKRERDLSSEEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQ

Query:  ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCPSCERV
        ALA+QL LR RQVEVWFQNRRARTKLKQTEVDCEFL++CCE LT+ENRRLQKEV EL+ALKL+   YMHM P  TLTMCPSCE V
Subjt:  ALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHM-PAATLTMCPSCERV

AT4G37790.1 Homeobox-leucine zipper protein family2.6e-7859.66Show/hide
Query:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG
        MG DD  +T LVLGLGLS    +   N+ +KK  +      +  +P +LTL  S +++        KI           RQ S HS    SSFS     G
Subjt:  MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGG

Query:  RVKRERDLS-------SEEVDLERVSSRVSD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQT
        RVKRER++S       +EE     V SRVSD  +DE+G + RKKLRL+K+QSALLE++FK +STLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQT
Subjt:  RVKRERDLS-------SEEVDLERVSSRVSD--EDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQT

Query:  EVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKAKFSMAPKPHFYNPFSNPSAAC
        EVDCEFLKKCCETLTDENRRLQKE+Q+LKALKL+QP YMHMPAATLTMCPSCER+GG GVG     V +  +K  FS+  KP FYNPF+NPSAAC
Subjt:  EVDCEFLKKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERVGGVGVG-----VGDGASKAKFSMAPKPHFYNPFSNPSAAC

AT5G06710.1 homeobox from Arabidopsis thaliana7.1e-5272.3Show/hide
Query:  RVKRERDLSSEEVDLERVSSRVSDEDEDGSN--TRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL
        R   +RD+  E   +ER +SR S+ED D  N  TRKKLRLSK+QSA LE+SFK++STLNPKQK ALA+QLNLRPRQVEVWFQNRRARTKLKQTEVDCE+L
Subjt:  RVKRERDLSSEEVDLERVSSRVSDEDEDGSN--TRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFL

Query:  KKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV
        K+CCE+LT+ENRRLQKEV+EL+ LK + P YM +PA TLTMCPSCERV
Subjt:  KKCCETLTDENRRLQKEVQELKALKLAQPLYMHMPAATLTMCPSCERV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGATGATATTTCTCATACAGCCTTGGTTTTGGGCTTAGGGCTCTCAGAAGAAGCTGCCTCTCCAATTATCAACAACAAGTTGAAGAAAAAGCCTGCTCCTTG
CTCCTCTACTTCACTTGATTTTGAGCCTTGTGCTTTGACTTTGGGATTCTCCGCCGACACTCACCGGAAACAACTCGTCGACGTCAACAAGATCGTCAACTCCGACGTCA
TCGCTGTCGATCCGTATCGCCAAGCTTCCCCTCATAGCAGCGCCGTTTGTTCTTCCTTCTCCGGTGGCGGCTGCGGCGGTAGGGTTAAAAGGGAGAGAGACCTCAGCAGT
GAAGAAGTTGACTTGGAGAGGGTTTCTTCGAGAGTTAGTGATGAAGATGAAGATGGTTCCAATACTAGAAAGAAACTCAGGCTTTCTAAAGAACAATCCGCTCTCTTGGA
AGAGAGTTTCAAACAAAATAGCACTCTCAACCCGAAGCAAAAACAAGCCTTGGCGAGACAATTAAATCTACGGCCACGACAAGTTGAAGTATGGTTTCAGAATAGGAGAG
CTCGAACGAAACTGAAGCAAACAGAAGTAGATTGTGAGTTCTTGAAGAAGTGTTGCGAGACGCTGACAGATGAAAACAGAAGACTACAAAAGGAGGTACAAGAATTGAAG
GCACTGAAGCTGGCACAGCCTCTATACATGCACATGCCAGCAGCAACATTAACGATGTGTCCTTCATGCGAAAGGGTCGGCGGCGTCGGGGTCGGCGTCGGCGACGGCGC
TTCCAAAGCCAAATTTTCAATGGCTCCCAAGCCTCACTTTTACAACCCCTTCTCCAATCCTTCCGCCGCATGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTTGATGATATTTCTCATACAGCCTTGGTTTTGGGCTTAGGGCTCTCAGAAGAAGCTGCCTCTCCAATTATCAACAACAAGTTGAAGAAAAAGCCTGCTCCTTG
CTCCTCTACTTCACTTGATTTTGAGCCTTGTGCTTTGACTTTGGGATTCTCCGCCGACACTCACCGGAAACAACTCGTCGACGTCAACAAGATCGTCAACTCCGACGTCA
TCGCTGTCGATCCGTATCGCCAAGCTTCCCCTCATAGCAGCGCCGTTTGTTCTTCCTTCTCCGGTGGCGGCTGCGGCGGTAGGGTTAAAAGGGAGAGAGACCTCAGCAGT
GAAGAAGTTGACTTGGAGAGGGTTTCTTCGAGAGTTAGTGATGAAGATGAAGATGGTTCCAATACTAGAAAGAAACTCAGGCTTTCTAAAGAACAATCCGCTCTCTTGGA
AGAGAGTTTCAAACAAAATAGCACTCTCAACCCGAAGCAAAAACAAGCCTTGGCGAGACAATTAAATCTACGGCCACGACAAGTTGAAGTATGGTTTCAGAATAGGAGAG
CTCGAACGAAACTGAAGCAAACAGAAGTAGATTGTGAGTTCTTGAAGAAGTGTTGCGAGACGCTGACAGATGAAAACAGAAGACTACAAAAGGAGGTACAAGAATTGAAG
GCACTGAAGCTGGCACAGCCTCTATACATGCACATGCCAGCAGCAACATTAACGATGTGTCCTTCATGCGAAAGGGTCGGCGGCGTCGGGGTCGGCGTCGGCGACGGCGC
TTCCAAAGCCAAATTTTCAATGGCTCCCAAGCCTCACTTTTACAACCCCTTCTCCAATCCTTCCGCCGCATGTTAG
Protein sequenceShow/hide protein sequence
MGFDDISHTALVLGLGLSEEAASPIINNKLKKKPAPCSSTSLDFEPCALTLGFSADTHRKQLVDVNKIVNSDVIAVDPYRQASPHSSAVCSSFSGGGCGGRVKRERDLSS
EEVDLERVSSRVSDEDEDGSNTRKKLRLSKEQSALLEESFKQNSTLNPKQKQALARQLNLRPRQVEVWFQNRRARTKLKQTEVDCEFLKKCCETLTDENRRLQKEVQELK
ALKLAQPLYMHMPAATLTMCPSCERVGGVGVGVGDGASKAKFSMAPKPHFYNPFSNPSAAC